diff --git a/man7/utf-8.7 b/man7/utf-8.7
index ad159ae5b..392d90fc8 100644
--- a/man7/utf-8.7
+++ b/man7/utf-8.7
@@ -27,7 +27,7 @@
 .\" 2001-05-11 Markus Kuhn
 .\" Update
 .\"
-.TH UTF-8 7 2001-05-11 "GNU" "Linux Programmer's Manual"
+.TH UTF-8 7 2012-04-30 "GNU" "Linux Programmer's Manual"
 .SH NAME
 UTF-8 \- an ASCII compatible multibyte Unicode encoding
 .SH DESCRIPTION
@@ -99,14 +99,14 @@ All possible 2^31 UCS codes can be encoded using
 .BR UTF-8 .
 .TP
 *
-The bytes 0xfe and 0xff are never used in the
+The bytes 0xc0, 0xc1, 0xfe and 0xff are never used in the
 .B UTF-8
 encoding.
 .TP
 *
 The first byte of a multibyte sequence which represents a single non-ASCII
 .B UCS
-character is always in the range 0xc0 to 0xfd and indicates how long
+character is always in the range 0xc2 to 0xfd and indicates how long
 this multibyte sequence is.
 All further bytes in a multibyte sequence
 are in the range 0x80 to 0xbf.
@@ -288,7 +288,7 @@ ways to represent these things in a non-shortest
 .B UTF-8
 encoding.
 .SS Standards
-ISO/IEC 10646-1:2000, Unicode 3.1, RFC\ 2279, Plan 9.
+ISO/IEC 10646-1:2000, Unicode 3.1, RFC\ 3629, Plan 9.
 .\" .SH AUTHOR
 .\" Markus Kuhn
 .SH "SEE ALSO"