Character encoding: the light comes on...

Apr 27, 2004

Well, after being given a Thai language file for the Forum, a Bulgarian language file for the Ringmaker, and throwing myself at character sets I couldn't read at all... I think I'm finally starting to get the hang of this character encoding business.

I mean, I understood it before, just not how the different sets worked together and how they were displayed when the current document uses the right encoding.

For instance, why a character would display correctly or incorrectly in UTF-8 was voodoo to me. :) Now I understand!

Anyway, I found a nice script to recode Bulgarian to UTF-8 at but I couldn't find the same for windows-874 (Thai).

So after much searching I asked on Usenet and what do you know? I got pointed to a Perl script that recoded Thai and from there it was a simple matter to translate it to PHP. If you'd like the function I came up with, you can download it from my PHP page.

The rush is finally over Orca Blog, now with RSS 2.0!

Comments closed

Recent posts

  1. Iguana no Musume / Iguana Girl Aug 2016
  2. What I'd Like To See In The Elder Scrolls VI - Part 2 Aug 2015
  3. What I'd Like To See In The Elder Scrolls VI Jul 2015
  4. Cyprus, and what capitalists want Mar 2013
  5. Let interest rates on housing rise Sep 2012
  6. Archive

Items of Interest

Webcomics Reading List

Good Eats

Twitter RSS 2.0 Valid XHTML 1.0! Copyright © 2018 Brian Huisman AKA GreyWyvern
ContactSite mapSearch