In article <pan.2005.09.26.14.44.34.625071 DeleteThis @uselessemail.net>,
Proteus <proteus DeleteThis @uselessemail.net> wrote:
> Ok, fair answer. Then is there some utility or way to easily determine
> what type of char encoding a text file is in? I mean, if I have
> somefile.txt or somewebpage.html, how can I know what char encoding is
> embedded in the file? Is there some utility (hopefully in linux) to look
> at the type of encoding used?
No. Bits are just bits if there is no metadata that tells you the
encoding. In one text encoding a certain bit sequence might be a bullet
point and in another it might be the symbol for the British Pound. The
best a computer could do is the best a human can do: look at any
particular encoding and say it's *probably* wrong, but that doesn't get
you to what encoding is *definitely* right. You really need the author
to add that metadata if you want it to be clear.
>> Stay informed about: what char encoding are plain text files?