I recently rediscovered this strange behaviour in Python’s Unicode handling.—Evan

This is clearly incorrect. The UTF-8 BOM is specified in the Unicode Standard, version 4, section 15.9:

  In UTF-8, the BOM corresponds to the byte sequence <EF BB BF>.

I normally run into UTF-8 BOMs in files written by Windows applications when a text file is saved as UTF-8. I think that Notepad or WordPad does this, for example, and I think UltraEdit does the same thing. I know that Scintilla definitely does.—Evan
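
To make the byte sequence quoted above concrete, here is a minimal sketch using the standard library's `codecs.BOM_UTF8` constant. It assumes a current Python 3 interpreter, so the behaviour may not match the Python version the post was written about, and the sample text `hello` is made up for illustration.

```python
import codecs

# The UTF-8 BOM is the three bytes EF BB BF.
bom = codecs.BOM_UTF8

# Hypothetical sample: content as a Windows editor might save it, BOM first.
data = bom + "hello".encode("utf-8")

# The plain "utf-8" codec keeps the BOM as a leading U+FEFF character.
plain = data.decode("utf-8")

# The "utf-8-sig" codec drops a leading BOM if one is present.
stripped = data.decode("utf-8-sig")

print(bom.hex())
print(repr(plain))
print(repr(stripped))
```

With the plain `utf-8` codec the caller has to strip the leading U+FEFF itself. The `utf-8-sig` codec is one way to read files that may or may not start with a BOM.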