Sublime Forum

Problem with a Unicode character

#1

I opened a simple HTML file that is encoded in Unicode, and it is not displayed correctly. The file appears to be cut off.
I looked at the file in a hex viewer, and the problem seems to be the character BB DB (hex). If I delete this character, the file displays correctly.
I would like to upload an example. Is that possible?

0 Likes

#2

There is no such thing as a "Unicode encoding". Unicode is a “protocol” (a standard that assigns a code point to each character); UTF-8 is one encoding of it, and UTF-16LE is another. Can you try opening the file with a different encoding and see whether one of them displays it correctly?
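
To make the distinction concrete, here is a minimal Python sketch. The sample character is just an illustration I picked, not something taken from your file. It shows one code point producing different bytes under two encodings:

```python
# One Unicode code point: U+00E9 (LATIN SMALL LETTER E WITH ACUTE).
text = "\u00e9"

# The same code point serialized with two different encodings.
utf8 = text.encode("utf-8")
utf16le = text.encode("utf-16-le")

print(utf8.hex(" "))     # c3 a9
print(utf16le.hex(" "))  # e9 00

# Decoding each byte string with its own encoding gives back the same text.
same_text = utf8.decode("utf-8") == utf16le.decode("utf-16-le")
print(same_text)         # True
```

This is why the encoding chosen when opening a file matters: the bytes only make sense under the encoding they were written with.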

2 Likes

#3

You are correct. The encoding is UTF-16LE. I had already tried what you suggested (opening the file with another encoding), but it is still displayed incorrectly.
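
For reference, here is a small Python sketch of how the bytes BB DB from the first post read as UTF-16LE. The helper names (`utf16le_code_unit`, `decodes_cleanly`) and the sample byte strings are hypothetical, made up for illustration. Read little-endian, the two bytes form the code unit 0xDBBB, which falls in the high-surrogate range (0xD800 to 0xDBFF). A high surrogate is only valid when a low surrogate (0xDC00 to 0xDFFF) follows it. Whether that is what goes wrong in this file is an assumption, since the bytes that follow are not shown in the thread.

```python
import struct


def utf16le_code_unit(pair: bytes) -> int:
    """Interpret two bytes as one little-endian UTF-16 code unit."""
    (unit,) = struct.unpack("<H", pair)
    return unit


def is_high_surrogate(unit: int) -> bool:
    return 0xD800 <= unit <= 0xDBFF


def decodes_cleanly(data: bytes) -> bool:
    """Return True if the bytes are valid UTF-16LE under strict decoding."""
    try:
        data.decode("utf-16-le")
        return True
    except UnicodeDecodeError:
        return False


unit = utf16le_code_unit(bytes.fromhex("BB DB"))
print(hex(unit))                # 0xdbbb
print(is_high_surrogate(unit))  # True

# Hypothetical samples: the surrogate followed by "A" (unpaired) versus
# the surrogate followed by a low surrogate 0xDC00 (a valid pair).
unpaired = bytes.fromhex("BB DB 41 00")
paired = bytes.fromhex("BB DB 00 DC")
print(decodes_cleanly(unpaired))  # False
print(decodes_cleanly(paired))    # True
```

If the code unit after BB DB in the file is not a low surrogate, a strict decoder would reject the sequence, and an editor might react by truncating or misrendering the text.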

0 Likes

#4

Here are the first bytes of the file

0 Likes

#5

FF FE is the byte order mark for UTF-16LE, which gives a strong hint that it’s UTF-16LE. Not sure why you’re getting bad characters.
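
As a rough sketch of that check, the first bytes of a file can be compared against the known byte order marks. The function name `sniff_bom` is made up for illustration. It is a hint rather than proof because the UTF-32LE mark (FF FE 00 00) also starts with FF FE, so the longer marks are tested first.

```python
import codecs
from typing import Optional


def sniff_bom(data: bytes) -> Optional[str]:
    """Guess an encoding from a leading byte order mark, or return None."""
    # Longer marks first: the UTF-32LE BOM begins with the UTF-16LE BOM.
    candidates = [
        (codecs.BOM_UTF32_LE, "utf-32-le"),
        (codecs.BOM_UTF32_BE, "utf-32-be"),
        (codecs.BOM_UTF8, "utf-8-sig"),
        (codecs.BOM_UTF16_LE, "utf-16-le"),
        (codecs.BOM_UTF16_BE, "utf-16-be"),
    ]
    for bom, name in candidates:
        if data.startswith(bom):
            return name
    return None


# Hypothetical sample: FF FE followed by "<" encoded as UTF-16LE.
print(sniff_bom(b"\xff\xfe<\x00"))  # utf-16-le
```

To run it on a real file, read the first four bytes in binary mode (`open(path, "rb").read(4)`) and pass them in.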

0 Likes