At 12:13 AM -0800 3/23/01, Rick McGowan wrote:
[snip]
>BTW, a bit off topic here but: I think it's high time that Project
>Gutenberg adopted some very clear character encoding guidelines now that
>they're expanding so widely. Or have they already adopted them and I've
>just missed the policy statement...? They're in for a real mess if they
>don't specify character encodings in a very controlled way.
>
> Rick
I suggested it recently when I volunteered for the Project, but the
plan so far is to leave these things up to the transcriber. If I have
anything to say about it, this policy will change. Since at the
moment I don't have anything to say about it, :-) I'm going to
define some formats and try to get everything converted into them.
Then we can all hope that less well supported formats and encodings
die out.
I intend to suggest HTML/UTF-8 and PDF, and I'm going to see about
setting up automated conversion scripts for any other formats that
come my way. The help of a few Unicoders to convert files and to
write Perl or other scripts would be of inestimable value.
Further suggestions welcomed.
--Ed Cherlin, President, CAUCE <http://www.cauge.org> "Everything should be made as simple as possible, __but no simpler__." Attributed to Albert Einstein
This archive was generated by hypermail 2.1.2 : Fri Jul 06 2001 - 00:17:15 EDT