VSCII (Vietnamese Standard Code for Information Interchange), also known as TCVN 5712, ISO-IR-180, .VN, ABC or simply the TCVN encodings, is a set of three closely related Vietnamese national standard character encodings for using the Vietnamese language with computers, developed by the TCVN Technical Committee on Information Technology (TCVN/TC1) and first adopted in 1993 (as TCVN 5712:1993).

It should not be confused with the similarly-named unofficial VISCII encoding, which was sometimes used by overseas Vietnamese speakers. VISCII was also intended to stand for Vietnamese Standard Code for Information Interchange, but is not related to VSCII.

VSCII (TCVN) was used extensively in the north of Vietnam, while VNI was popular in the south. Unicode and the Windows-1258 code page are now used for virtually all Vietnamese computer data,[citation needed] but legacy files or archived messages may need conversion.

Encodings

All three forms of VSCII keep the 95 printable characters of ASCII unmodified.

VSCII-3, also known as TCVN 5712-3, VN3 or simply TCVN3, includes the fewest assignments. It is an extended ASCII, because it keeps all 128 codes of ASCII unmodified. It does not reassign any of the C0 and C1 control codes. Compared to ASCII, it adds 75 characters:

  • 67 lowercase characters, allowing full lowercase support.
  • 7 uppercase characters, allowing uppercase support for the 29 base letters without tone marks.
  • The non-breaking space.

Tone marks on uppercase vowels is accomplished in TCVN3 by switching to an all-capital font.

VSCII-2, also known as TCVN 5712-2 and VN2, is a superset of VSCII-3. It is an extended ASCII, because it keeps all 128 codes of ASCII unmodified. It does not reassign any of the C0 and C1 control codes, making it conformant with ISO 2022 as a 96-set. Compared to VSCII-3, it adds (for a total of 96 non-ASCII characters):

  • 16 more uppercase characters with pre-composed tone marks (for a total of 23 non-ASCII uppercase characters)
  • 5 combining diacritics for tone marks, allowing other combinations of uppercase letters and tone marks to be represented. Combining marks follow the base letter as in VNI (rather than preceding them as in ANSEL).

VSCII-1, also known as TCVN 5712-1 and VN1, is an extension of VSCII-2, and is a modified ASCII, since it replaces 12 of the 33 control characters with precomposed characters. Compared to VSCII-2, it (for a total of 140 non-ASCII characters):

  • Adds 44 more pre-composed uppercase letters, bringing them to the same count as the lowercase
  • Does this by replacing 12 ASCII control characters and allocating 32 graphical characters to the C1 control area, breaking ISO 2022 compatibility

Conversion from VSCII-3 to VSCII-2 or VSCII-1 and conversion from VSCII-2 to VSCII-1 are not necessary, but can result in smaller files.

Conversion from VSCII-1 to VSCII-2 or VSCII-3 and conversion from VSCII-2 to VSCII-3 require expansion of some pre-composed characters.

Character set

VSCII-1
0123456789ABCDEF
0xNULÚ00DA1EE4ETX1EEA1EEC1EEEBELBSHTLFVTFFCRSOSI
1xDLE1EE81EF01EF21EF61EF8Ý00DD1EF4CANEMSUBESCFSGSRSUS
2xSP!"#$%&'()*+,-./
3x0123456789:;<=>?
4x@ABCDEFGHIJKLMNO
5xPQRSTUVWXYZ[\]^_
6x`abcdefghijklmno
7xpqrstuvwxyz{|}~DEL
8xÀ00C01EA2Ã00C3Á00C11EA01EB61EACÈ00C81EBA1EBCÉ00C91EB81EC6Ì00CC1EC8Ĩ0128
9xÍ00CD1ECAÒ00D21ECEÕ00D5Ó00D31ECC1ED81EDC1EDE1EE01EDA1EE2Ù00D91EE6Ũ0168
AxNBSPĂ0102Â00C2Ê00CAÔ00D4Ơ01A0Ư01AFĐ0110ă0103â00E2ê00EAô00F4ơ01A1ư01B0đ01111EB0
Bx◌̀0300◌̉0309◌̃0303◌́0301◌̣0323à00E01EA3ã00E3á00E11EA11EB21EB11EB31EB51EAF1EB4
Cx1EAE1EA61EA81EAA1EA41EC01EB71EA71EA91EAB1EA51EADè00E81EC21EBB1EBD
Dxé00E91EB91EC11EC31EC5ế1EBF1EC7ì00EC1EC91EC41EBE1ED2ĩ0129í00ED1ECBò00F2
Ex1ED41ECFõ00F5ó00F31ECD1ED31ED51ED71ED11ED91EDD1EDF1EE11EDB1EE3ù00F9
Fx1ED61EE7ũ0169ú00FA1EE51EEB1EED1EEF1EE91EF11EF31EF71EF9ý1EF51ED0

External links