Just one word. utf8everywhere.org

2014-04-25T14:33:52.045+02:00

> So, even if you have an 8-bit ASCII-codepaged...

2013-07-17T20:59:56.013+02:00

> So, even if you have an 8-bit ASCII-codepaged text, you cannot use it as UTF8.

You are conflating the ASCII _encoding_ with the 8-bit SBCS _format_.

ASCII is decidedly a 7-bit encoding, end of story. *No* 8-bit encoding (Latin1,Windows-1252,...) is synonymous with ASCII. To use the term "ASCII" when you really mean "any SBCS/MBCS encoding" does nothing but add to the confusion.

It is safe to say that for any encoding in common use today (UTF-xx, Latin1, SJIS, GBK, whatever), the first 128 characters of the encoding match the 128 characters of ASCII precisely. Anything beyond 0x7f is encoding-specific and simply cannot be represented as ASCII.

Comments on Qb's C++ blog: Unicode and your application (1 of n)

Just one word. utf8everywhere.org

> So, even if you have an 8-bit ASCII-codepaged...