man-pages/man3/mblen.3

82 lines
2.5 KiB
Groff
Raw Normal View History

2004-11-03 13:51:07 +00:00
.\" Copyright (c) Bruno Haible <haible@clisp.cons.org>
.\"
.\" This is free documentation; you can redistribute it and/or
.\" modify it under the terms of the GNU General Public License as
.\" published by the Free Software Foundation; either version 2 of
.\" the License, or (at your option) any later version.
.\"
.\" References consulted:
.\" GNU glibc-2 source code and manual
.\" Dinkumware C library reference http://www.dinkumware.com/
.\" OpenGroup's Single Unix specification http://www.UNIX-systems.org/online.html
.\" ISO/IEC 9899:1999
.\"
.TH MBLEN 3 1999-07-25 "GNU" "Linux Programmer's Manual"
.SH NAME
mblen \- determine number of bytes in next multibyte character
.SH SYNOPSIS
.nf
.B #include <stdlib.h>
.sp
.BI "int mblen(const char *" s ", size_t " n );
.fi
.SH DESCRIPTION
If \fIs\fP is not a NULL pointer, the
.BR mblen ()
function inspects at most
2004-11-03 13:51:07 +00:00
\fIn\fP bytes of the multibyte string starting at \fIs\fP and extracts the
next complete multibyte character.
It uses a static anonymous shift state only
known to the mblen function.
If the multibyte character is not the null wide
character, it returns the number of bytes that were consumed from \fIs\fP.
If the multibyte character is the null wide character, it returns 0.
2004-11-03 13:51:07 +00:00
.PP
If the \fIn\fP bytes starting at \fIs\fP do not contain a complete multibyte
character,
.BR mblen ()
returns \-1.
This can happen even if
2004-11-03 13:51:07 +00:00
\fIn\fP >= \fIMB_CUR_MAX\fP, if the multibyte string contains redundant shift
sequences.
.PP
If the multibyte string starting at \fIs\fP contains an invalid multibyte
sequence before the next complete character,
.BR mblen ()
2007-04-27 17:25:28 +00:00
also returns \-1.
2004-11-03 13:51:07 +00:00
.PP
If \fIs\fP is a NULL pointer, the
.BR mblen ()
function
2004-11-03 13:51:07 +00:00
.\" The Dinkumware doc and the Single Unix specification say this, but
.\" glibc doesn't implement this.
resets the shift state, only known to this function, to the initial state, and
2007-12-25 22:02:19 +00:00
returns nonzero if the encoding has nontrivial shift state, or zero if the
2004-11-03 13:51:07 +00:00
encoding is stateless.
.SH "RETURN VALUE"
The
.BR mblen ()
function returns the number of
bytes parsed from the multibyte
2004-11-03 13:51:07 +00:00
sequence starting at \fIs\fP, if a non-null wide character was recognized.
It returns 0, if a null wide character was recognized.
It returns \-1, if an
2004-11-03 13:51:07 +00:00
invalid multibyte sequence was encountered or if it couldn't parse a complete
multibyte character.
.SH "CONFORMING TO"
2006-08-03 13:57:30 +00:00
C99
2004-11-03 13:51:07 +00:00
.SH NOTES
The behavior of
.BR mblen ()
2007-06-22 18:25:23 +00:00
depends on the
.B LC_CTYPE
category of the
2004-11-03 13:51:07 +00:00
current locale.
.PP
The function
.BR mbrlen (3)
provides a better interface to the same
2004-11-03 13:51:07 +00:00
functionality.
2007-05-16 18:25:50 +00:00
.SH "SEE ALSO"
.BR mbrlen (3)