I open a simple html file which is encoded in UNICODE. It is not well displayed. It seems that the file is cut.
I see the file with an hexadecimal viewer and I see the problem is with the character BB DB (hex). If I delete this character the file is well displayed.
I would like to upload an example, Is it possible?
Problem with an Unicode character
javif
#1
0 Likes
rwols
#2
There is no such thing as unicode encoding. Unicode is a “protocol”; UTF-8 is an encoding for that protocol, as are others like UTF-16LE. Can you try opening the file with another encoding and see if one displays correctly?
2 Likes
javif
#3
You are correct. The encoding is UTF-16LE. I have tried before, what you said (opening the file with another encoding) but it is also bad displayed.
0 Likes
rwols
#5
FF FE
is the byte order mark for UTF-16LE, which gives a strong hint that it’s UTF-16LE. Not sure why you’re getting bad characters.
0 Likes