Re: Character sets and encodings confusion
  Home FAQ Contact Sign in
gnu.emacs.help only
 
Advanced search
POPULAR GROUPS

more...

 Up
Re: Character sets and encodings confusion         

Group: gnu.emacs.help · Group Profile
Author: Jason Rumney
Date: Jan 11, 2008 08:28

On 11 Jan, 14:26, "Otto Maddox" wrote:
> When I type `C-u C-x =' on the character `ВЈ', ...
> Why is the code point #x23? Should it not be #xA3 in Latin Alphabet 1?

The clue is in the following:
> charset: latin-iso8859-1
> (Right-Hand Part of Latin Alphabet 1 (ISO/IEC 8859-1): ISO-IR-100.)

Note that the latin-iso8859-1 charset only includes the Right-Hand
part (0x80-0xff).
> Because when you click on the #x23, the character list you get shows
> the code point as being #xA3, which is confusing.

It is confusing, but the table displayed is listed as the *coded*
charset, so it has the +0x80 transformation applied.
> Also, what are the first three numbers in parenthesis on the
> `character:' line?

They are the code-point in the internal encoding (emacs-mule in the
current version) in decimal, octal and hexadecimal.
no comments
diggit! del.icio.us! reddit!

RELATED THREADS
SubjectArticles qty Group
Setting permissions in User Security tab is reverting back to previous settingmicrosoft.public.windows.server.active_directory ·