man-pages/man5/charmap.5

130 lines
3.6 KiB
Groff
Raw Normal View History

2004-11-03 13:51:07 +00:00
.\" Hey emacs, this is -*- nroff -*-
.\"
.\" This file is part of locale(1) which displays the settings of the
2004-11-03 13:51:07 +00:00
.\" current locale.
.\" Copyright (C) 1994 Jochen Hein (Hein@Student.TU-Clausthal.de)
.\"
.\" This program is free software; you can redistribute it and/or modify
.\" it under the terms of the GNU General Public License as published by
.\" the Free Software Foundation; either version 2 of the License, or
.\" (at your option) any later version.
.\"
.\" This program is distributed in the hope that it will be useful,
.\" but WITHOUT ANY WARRANTY; without even the implied warranty of
.\" MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
.\" GNU General Public License for more details.
.\"
.\" You should have received a copy of the GNU General Public License
.\" along with this program; if not, write to the Free Software
.\" Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111, USA.
.\"
.TH CHARMAP 5 1994-11-28 "" "Linux User Manual"
2004-11-03 13:51:07 +00:00
.SH NAME
charmap \- character symbols to define character encodings
.SH DESCRIPTION
A character set description (charmap) defines a character set of
available characters and their encodings.
All supported character
sets should have the
2004-11-03 13:51:07 +00:00
.B portable character set
as a proper subset.
.\" Not true anymore:
.\" The portable character set is defined in the file
.\" .I /usr/lib/nls/charmap/POSIX
.\" .I /usr/share/i18n/charmap/POSIX
.\" for reference purposes.
2007-06-15 19:55:07 +00:00
.SS Syntax
2004-11-03 13:51:07 +00:00
The charmap file starts with a header, that may consist of the
following keywords:
.TP
.I <codeset>
is followed by the name of the codeset.
.TP
.I <mb_cur_max>
is followed by the max number of bytes for a multibyte-character.
Multibyte characters are currently not supported.
The default value
2004-11-03 13:51:07 +00:00
is 1.
.TP
.I <mb_cur_min>
is followed by the min number of bytes for a character.
This
value must be less or equal than
2005-07-19 15:36:19 +00:00
.BR mb_cur_max .
2004-11-03 13:51:07 +00:00
If not specified, it defaults to
2005-07-19 15:36:19 +00:00
.BR mb_cur_max .
2004-11-03 13:51:07 +00:00
.TP
.I <escape_char>
is followed by a character that should be used as the
escape-character for the rest of the file to mark characters that
should be interpreted in a special way.
It defaults to
2004-11-03 13:51:07 +00:00
the backslash (
.B \\\\
2004-11-03 13:51:07 +00:00
).
.TP
.I <comment_char>
is followed by a character that will be used as the
comment-character for the rest of the file.
It defaults to the
2004-11-03 13:51:07 +00:00
number sign (
.B #
).
.PP
The charmap-definition itself starts with the keyword
.B CHARMAP
in column 1.
The following lines may have one of the two following forms to
define the character-encodings:
.TP
.I <symbolic-name> <encoding> <comments>
This form defines exactly one character and its encoding.
2004-11-03 13:51:07 +00:00
.TP
.I <symbolic-name>...<symbolic-name> <encoding> <comments>
This form defines a couple of characters.
This is only useful for
multibyte-characters, which are currently not implemented.
2004-11-03 13:51:07 +00:00
.PP
The last line in a charmap-definition file must contain
.B END CHARMAP.
intro.1, time.1, _syscall.2, accept.2, brk.2, capget.2, faccessat.2, futimesat.2, gethostname.2, intro.2, ioprio_set.2, kill.2, madvise.2, mlock.2, mount.2, msgget.2, perf_event_open.2, poll.2, readv.2, sched_rr_get_interval.2, sched_setscheduler.2, select.2, select_tut.2, shmget.2, stat.2, timer_create.2, ustat.2, vfork.2, wait.2, alloca.3, atexit.3, catopen.3, crypt.3, dbopen.3, fenv.3, fopen.3, getaddrinfo_a.3, gethostbyname.3, getutent.3, intro.3, login.3, mallopt.3, queue.3, regex.3, sigpause.3, sigsetops.3, stdio.3, strftime.3, strptime.3, sysconf.3, ttyslot.3, undocumented.3, cciss.4, console_codes.4, hpsa.4, initrd.4, intro.4, rtc.4, st.4, tty_ioctl.4, wavelan.4, acct.5, charmap.5, dir_colors.5, hosts.5, intro.5, termcap.5, utmp.5, intro.6, armscii-8.7, boot.7, bootparam.7, capabilities.7, cpuset.7, credentials.7, ddp.7, epoll.7, glob.7, intro.7, ip.7, ipv6.7, iso_8859-1.7, iso_8859-10.7, iso_8859-11.7, iso_8859-13.7, iso_8859-14.7, iso_8859-15.7, iso_8859-16.7, iso_8859-2.7, iso_8859-3.7, iso_8859-4.7, iso_8859-5.7, iso_8859-6.7, iso_8859-7.7, iso_8859-8.7, iso_8859-9.7, koi8-r.7, koi8-u.7, man-pages.7, man.7, math_error.7, netlink.7, numa.7, packet.7, path_resolution.7, pipe.7, pthreads.7, raw.7, rtnetlink.7, signal.7, socket.7, spufs.7, svipc.7, tcp.7, time.7, udp.7, udplite.7, unicode.7, unix.7, uri.7, utf-8.7, x25.7, intro.8: Global fix: use consistent capitalization in .SS headings Capitalization in .SS sections across pages (and sometimes even within a single page) is wildly inconsistent. Make it consistent. Capitalize first word in heading, but otherwise use lower case, except where English usage (e.g., proper nouns) or programming language requirements (e.g., identifier names) dictate otherwise. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>
2013-02-24 14:08:38 +00:00
.SS "Symbolic names"
A
2004-11-03 13:51:07 +00:00
.B symbolic name
for a character contains only characters of the
2004-11-03 13:51:07 +00:00
.B portable character set.
The name itself is enclosed between angle brackets.
Characters following an
.B <escape_char>
2004-11-03 13:51:07 +00:00
are interpreted as itself; for example, the sequence
.B "<\\\\\\\\\\\\>>"
2004-11-03 13:51:07 +00:00
represents the symbolic name
.B "\\\\>"
2004-11-03 13:51:07 +00:00
enclosed in angle brackets.
intro.1, time.1, _syscall.2, accept.2, brk.2, capget.2, faccessat.2, futimesat.2, gethostname.2, intro.2, ioprio_set.2, kill.2, madvise.2, mlock.2, mount.2, msgget.2, perf_event_open.2, poll.2, readv.2, sched_rr_get_interval.2, sched_setscheduler.2, select.2, select_tut.2, shmget.2, stat.2, timer_create.2, ustat.2, vfork.2, wait.2, alloca.3, atexit.3, catopen.3, crypt.3, dbopen.3, fenv.3, fopen.3, getaddrinfo_a.3, gethostbyname.3, getutent.3, intro.3, login.3, mallopt.3, queue.3, regex.3, sigpause.3, sigsetops.3, stdio.3, strftime.3, strptime.3, sysconf.3, ttyslot.3, undocumented.3, cciss.4, console_codes.4, hpsa.4, initrd.4, intro.4, rtc.4, st.4, tty_ioctl.4, wavelan.4, acct.5, charmap.5, dir_colors.5, hosts.5, intro.5, termcap.5, utmp.5, intro.6, armscii-8.7, boot.7, bootparam.7, capabilities.7, cpuset.7, credentials.7, ddp.7, epoll.7, glob.7, intro.7, ip.7, ipv6.7, iso_8859-1.7, iso_8859-10.7, iso_8859-11.7, iso_8859-13.7, iso_8859-14.7, iso_8859-15.7, iso_8859-16.7, iso_8859-2.7, iso_8859-3.7, iso_8859-4.7, iso_8859-5.7, iso_8859-6.7, iso_8859-7.7, iso_8859-8.7, iso_8859-9.7, koi8-r.7, koi8-u.7, man-pages.7, man.7, math_error.7, netlink.7, numa.7, packet.7, path_resolution.7, pipe.7, pthreads.7, raw.7, rtnetlink.7, signal.7, socket.7, spufs.7, svipc.7, tcp.7, time.7, udp.7, udplite.7, unicode.7, unix.7, uri.7, utf-8.7, x25.7, intro.8: Global fix: use consistent capitalization in .SS headings Capitalization in .SS sections across pages (and sometimes even within a single page) is wildly inconsistent. Make it consistent. Capitalize first word in heading, but otherwise use lower case, except where English usage (e.g., proper nouns) or programming language requirements (e.g., identifier names) dictate otherwise. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>
2013-02-24 14:08:38 +00:00
.SS "Character encoding"
2004-11-03 13:51:07 +00:00
The
encoding may be in each of the following three forms:
.TP
.I <escape_char>d<number>
with a decimal number
.TP
.I <escape_char>x<number>
with a hexadecimal number
2004-11-03 13:51:07 +00:00
.TP
.I <escape_char><number>
2008-07-01 14:15:13 +00:00
with an octal number.
2006-03-23 02:13:08 +00:00
.\" FIXME comments
.\" FIXME char ... char
2004-11-03 13:51:07 +00:00
.SH FILES
.I /usr/share/i18n/charmaps/*
2007-05-16 03:40:19 +00:00
.\" .SH AUTHOR
.\" Jochen Hein (jochen.hein@delphi.central.de)
2004-11-03 13:51:07 +00:00
.SH "CONFORMING TO"
POSIX.2.
2004-11-03 13:51:07 +00:00
.SH "SEE ALSO"
.BR locale (1),
.BR localedef (1),
.BR localeconv (3),
.BR setlocale (3),
.BR locale (5)