In a long bilingual vocabulary file, certain letters are each represented by one of six “surrogate” letter-pairs marked with an ‘x’: cx, gx and so on. I am looking for a way to replace the six surrogate letter-pairs throughout the file with their Unicode counterparts.
At the moment I do this manually, using the Windows Find and Replace function, replacing each of the six letter-pairs throughout the file one at a time. I would be grateful for instructions on how to do this more speedily in Sublime Text if that is possible.
The six Unicode codes take the form &#nnn; where nnn is a three-digit decimal number (see below).
PLEASE NOTE: in the following list of the six surrogates and their Unicode counterparts, I have inserted a space between the &# and the decimal number so that the codes stay visible on-screen. Without the space, the forum converts each code to the letter it represents (“ampersand-hash-265-semicolon” becomes “ĉ”), but I need to keep the codes in their &#- form.
The x-surrogates and their Unicode counterparts:
cx &# 265;
gx &# 285;
hx &# 293;
jx &# 309;
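
To illustrate the kind of one-pass replacement being asked for, here is a minimal Python sketch that could be run as a standalone script outside Sublime Text. It is only an assumption about one possible approach, not a built-in Sublime Text feature. The names SURROGATES, PATTERN and replace_surrogates are made up for this example, and the mapping holds only the pairs listed above (written without the extra space); any further pairs would be added to the dictionary in the same way.

```python
import re

# Hypothetical mapping: only the surrogate pairs listed above.
# Further pairs can be added as extra dictionary entries.
SURROGATES = {
    "cx": "&#265;",
    "gx": "&#285;",
    "hx": "&#293;",
    "jx": "&#309;",
}

# One alternation pattern (cx|gx|hx|jx) so the text is scanned only once.
PATTERN = re.compile("|".join(map(re.escape, SURROGATES)))


def replace_surrogates(text: str) -> str:
    """Replace every listed x-surrogate with its &#nnn; code."""
    return PATTERN.sub(lambda match: SURROGATES[match.group(0)], text)


if __name__ == "__main__":
    sample = "cxapelo, gxi, hxoro, jxurnalo"
    print(replace_surrogates(sample))
```

The sketch is case-sensitive, so capitalised pairs such as Cx would need their own entries. It also replaces a pair wherever it occurs, including inside any word in the other language that happens to contain the same two letters, so the result is worth checking on a copy of the file first.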