cURL / Mailing Lists / curl-users / Single Mail

curl-users

Re: Japanese characters in URL

From: Sterling Hughes <sterling_at_designmultimedia.com>
Date: Wed, 9 May 2001 10:54:09 -0400 (EDT)

On Wed, 9 May 2001, SM wrote:

> At 10:06 07-05-2001 +0200, Daniel Stenberg wrote:
> >Now, how the heck do Japanese guys enter a Japanese URL in a command line? I
> >mean, what kind of byte-stream will be read from the argv[] array?
>
> The "locale" setting identifies the character set to be used and how the
> text should be handled. IIRC, JIS is one of the character sets used in
> Japan. The application (cURL) processing the input will have to handle the
> required conversions before passing the data over to libcurl.
>

There is also a plethora of other encodings besides Shift_JIS (such as
EUC-JP for one), correct?

> If the input uses a Double Byte Character Set (sometimes referred to as
> MultiByte Character Set), a lookup table can be used to do the conversion
> to 7-bit ASCII.
>

From what I understand this is a bitch ;)))

http://lxr.php.net/source/php4/ext/mbstring/

Is the PHP extension implementation of the Japanese multi-byte characters.

-Sterling
Received on 2001-05-10