[comp.sys.next] Wide characters?

scott@sage.uchicago.edu (Scott Deerwester) (01/17/91)

I'm building an application that will use Unicode characters as the
native text representation standard.  And in stdlib.h are the
following tantalizing function declarations:

    extern int mblen(const char *s, size_t n);
    extern int mbtowc(wchar_t *pwc, const char *s, size_t n);
    extern int wctomb(char *s, wchar_t wchar);
    extern size_t mbstowcs(wchar_t *pwcs, const char *s, size_t n);
    extern size_t wcstombs(char *s, const wchar_t *pwcs, size_t n);

And in stddef.h is the following:

    typedef unsigned short wchar_t;

Looks a *whole* lot like the ANSI C ($31.95 for the book; ack)
definition, but I can't find a reference to these functions
*anywhere*!  I tried things like:

    find /usr/include -print | xargs grep wctomb

and:

    grep -l wctomb /usr/lib/lib*.a

No dice.  Anybody have a clue what these functions are supposed to do?
I'm *hoping* that they do some sort of mapping between something
vaguely Unicode-like and something vaguely ASCII-like.  (Hope springs
eternal...)  Does anybody know?
--
Scott Deerwester            | Internet: scott@tira.uchicago.edu  | ~{P;N,5B~}
Center for Information and  | Phone:    312-702-6948             |
   Language Studies         | 1100 E. 57th, CILS                 |
University of Chicago       | Chicago, IL 60637                  |

dennisg@kgw2.bwi.WEC.COM (Dennis Glatting) (01/18/91)

In article <SCOTT.91Jan16135908@sage.uchicago.edu>, scott@sage.uchicago.edu (Scott Deerwester) writes:
|> I'm building an application that will use Unicode characters as the
|> native text representation standard.  And in stdlib.h are the
|> following tantalizing function declarations:
|> 
|>     extern int mblen(const char *s, size_t n);
|>     extern int mbtowc(wchar_t *pwc, const char *s, size_t n);
|>     extern int wctomb(char *s, wchar_t wchar);
|>     extern size_t mbstowcs(wchar_t *pwcs, const char *s, size_t n);
|>     extern size_t wcstombs(char *s, const wchar_t *pwcs, size_t n);
|> 
|> And in stddef.h is the following:
|> 
|>     typedef unsigned short wchar_t;
|> 
|> Looks a *whole* lot like the ANSI C ($31.95 for the book; ack)
|> definition, but I can't find a reference to these functions
|> *anywhere*!  I tried things like:
|> 
|>     find /usr/include -print | xargs grep wctomb
|> 
|> and:
|> 
|>     grep -l wctomb /usr/lib/lib*.a
|> 
|> No dice.  Anybody have a clue what these functions are supposed to do?
|> I'm *hoping* that they do some sort of mapping between something
|> vaguely Unicode-like and something vaguely ASCII-like.  (Hope springs
|> eternal...)  Does anybody know?

see page 10-11 of the ARM




--
 ..!uunet!kgw2!dennisg  | Dennis P. Glatting
 dennisg@Xetron.COM     | X2NeXT developer
                        | And now a NeXT/C++ geek