Switching a web site to UTF-8

From ShawnReevesWiki
Revision as of 11:05, 16 April 2013 by Shawn (talk | contribs) (Steps)
Jump to navigationJump to search

I'd enjoy a web site that had zero issues with characters rendered improperly, so I'm considering converting my web sites to use the UTF-8 character set for storage and presentation.

Steps

  1. Convert all the existing data in every text field of every table in the database. This is the toughest part, because data in a field may not match the declared character set of the field itself, so converting might mangle unexpected characters. It is mentioned in the tutorials below that one might convert the contents of a field to the declared character set of that field before converting it to UTF-8.
  2. Convert all the html pages to declare UTF-8 as their encoding. This is done in the META tag in the header.
  3. Convert all php scripts to use UTF-8 as the default character set.
  4. Convert all mysql connections and requests to use UTF-8.

Existing tutorials

Converting a MySQL database to UTF-8
http://www.drzycimski.com/programming/zend-framework/converting-a-mysql-database-to-utf-8/
Converting Database Character Sets << WordPress Codex
http://codex.wordpress.org/Converting_Database_Character_Sets