This appendix lists the languages, territories, character sets, and other locale data supported by the Oracle server. It includes these topics:
[Download MS-DOS 7.10 CD Full Installation]. ISO image of MS-DOS 7.10 Full Installation: Images: None [Download MS-DOS 7.10 CD Full Installation] Comments.
You can obtain information about character sets, languages, territories, and linguistic sorts by querying the V$NLS_VALID_VALUES
dynamic performance view.
See Also:
Oracle Database Reference for more information about the data that can be returned by this viewLanguages in Table A-1 provide support for locale-sensitive information such as the following:
Day and month names and their abbreviations
Symbols for equivalent expressions for A.M., P.M., A.D., and B.C.
Default sorting sequence for character data when the ORDER BY
SQL clause is specified
Writing direction (left to right or right to left)
Affirmative and negative response strings (for example, YES
and NO
)
By using Unicode databases and datatypes, you can store, process, and retrieve data for almost all contemporary languages, including many that do not appear in Table A-1.
Table A-1 Oracle Supported Languages
Language Name | Language Abbreviation | Default Sort |
---|---|---|
AMERICAN | us | binary |
ARABIC | ar | ARABIC |
ASSAMESE | as | binary |
AZERBAIJANI | az | AZERBAIJANI |
BANGLA | bn | binary |
BRAZILIAN PORTUGUESE | ptb | WEST_EUROPEAN |
BULGARIAN | bg | BULGARIAN |
CANADIAN FRENCH | frc | CANADIAN FRENCH |
CATALAN | ca | CATALAN |
CROATIAN | hr | CROATIAN |
CYRILLIC KAZAKH | ckk | GENERIC_M |
CYRILLIC SERBIAN | csr | GENERIC_M |
CYRILLIC UZBEK | cuz | GENERIC_M |
CZECH | cs | CZECH |
DANISH | dk | DANISH |
DUTCH | nl | DUTCH |
EGYPTIAN | eg | ARABIC |
ENGLISH | gb | binary |
ESTONIAN | et | ESTONIAN |
FINNISH | sf | FINNISH |
FRENCH | f | FRENCH |
GERMAN DIN | din | GERMAN |
GERMAN | d | GERMAN |
GREEK | el | GREEK |
GUJARATI | gu | binary |
HEBREW | iw | HEBREW |
HINDI | hi | binary |
HUNGARIAN | hu | HUNGARIAN |
ICELANDIC | is | ICELANDIC |
INDONESIAN | in | INDONESIAN |
ITALIAN | i | WEST_EUROPEAN |
JAPANESE | ja | binary |
KANNADA | kn | binary |
KOREAN | ko | binary |
LATIN AMERICAN SPANISH | esa | SPANISH |
LATIN SERBIAN | lsr | binary |
LATIN UZBEK | luz | GENERIC_M |
LATVIAN | lv | LATVIAN |
LITHUANIAN | lt | LITHUANIAN |
MACEDONIAN | mk | binary |
MALAY | ms | MALAY |
MALAYALAM | ml | binary |
MARATHI | mr | binary |
MEXICAN SPANISH | esm | WEST_EUROPEAN |
NORWEGIAN | n | NORWEGIAN |
ORIYA | or | binary |
POLISH | pl | POLISH |
PORTUGUESE | pt | WEST_EUROPEAN |
PUNJABI | pa | binary |
ROMANIAN | ro | ROMANIAN |
RUSSIAN | ru | RUSSIAN |
SIMPLIFIED CHINESE | zhs | binary |
SLOVAK | sk | SLOVAK |
SLOVENIAN | sl | SLOVENIAN |
SPANISH | e | SPANISH |
SWEDISH | s | SWEDISH |
TAMIL | ta | binary |
TELUGU | te | binary |
THAI | th | THAI_DICTIONARY |
TRADITIONAL CHINESE | zht | binary |
TURKISH | tr | TURKISH |
UKRAINIAN | uk | UKRAINIAN |
VIETNAMESE | vn | VIETNAMESE |
Oracle error messages have been translated into the languages which are listed in Table A-2.
Table A-2 Oracle Supported Messages
Name | Abbreviation |
---|---|
ARABIC | ar |
BRAZILIAN PORTUGUESE | ptb |
CATALAN | ca |
CZECH | cs |
DANISH | dk |
DUTCH | nl |
FINNISH | sf |
FRENCH | f |
GERMAN | d |
GREEK | el |
HEBREW | iw |
HUNGARIAN | hu |
ITALIAN | i |
JAPANESE | ja |
KOREAN | ko |
NORWEGIAN | n |
POLISH | pl |
PORTUGUESE | pt |
ROMANIAN | ro |
RUSSIAN | ru |
SIMPLIFIED CHINESE | zhs |
SLOVAK | sk |
SPANISH | e |
SWEDISH | s |
THAI | th |
TRADITIONAL CHINESE | zht |
TURKISH | tr |
Table A-3 lists the territories supported by the Oracle server.
Table A-3 Oracle Supported Territories
Name | Name | Name |
---|---|---|
ALGERIA | GREECE | POLAND |
AMERICA | HONG KONG | PORTUGAL |
ARGENTINA | HUNGARY | PUERTO RICO |
AUSTRALIA | ICELAND | QATAR |
AUSTRIA | INDIA | ROMANIA |
AZERBAIJAN | INDONESIA | RUSSIA |
BAHRAIN | IRAQ | SAUDI ARABIA |
BANGLADESH | IRELAND | SERBIA AND MONTENEGRO |
BELGIUM | ISRAEL | SINGAPORE |
BRAZIL | ITALY | SLOVAKIA |
BULGARIA | JAPAN | SLOVENIA |
CANADA | JORDAN | SOMALIA |
CATALONIA | KAZAKHSTAN | SOUTH AFRICA |
CHILE | KOREA | SPAIN |
CHINA | KUWAIT | SUDAN |
COLOMBIA | LATVIA | SWEDEN |
COSTA RICA | LEBANON | SWITZERLAND |
CROATIA | LIBYA | SYRIA |
CYPRUS | LITHUANIA | TAIWAN |
CZECH REPUBLIC | LUXEMBOURG | THAILAND |
DENMARK | MALAYSIA | THE NETHERLANDS |
DJIBOUTI | MAURITANIA | TUNISIA |
ECUADOR | MEXICO | TURKEY |
EGYPT | MOROCCO | UKRAINE |
EL SALVADOR | NEW ZEALAND | UNITED ARAB EMIRATES |
ESTONIA | NICARAGUA | UNITED KINGDOM |
FINLAND | NORWAY | UZBEKISTAN |
FRANCE | OMAN | VENEZUELA |
FYR MACEDONIA | PANAMA | VIETNAM |
GUATEMALA | PERU | YEMEN |
GERMANY | PHILIPPINES |
Oracle-supported character sets are listed in the following sections according to three broad categories.
In addition, common character set subset/superset combinations are listed. Some character sets can only be used with certain data types. For example, the AL16UTF16 character set can only be used as an NCHAR
character set, and not as a database character set.
Also documented in the comment section are other unique features of the character set that may be important to users or your database administrator. For example, the information includes whether the character set supports the euro currency symbol, whether user-defined characters are supported, and whether the character set is a strict superset of ASCII. (You can use the CSALTER
script to migrate an existing database to a new character set, only if all of the schema data is a strict subset of the new character set.)
The following is the key for the comment column of the character set tables:
Oracle does not document individual code page layouts. For specific details about a particular character set, its character repertoire, and code point values, you can use Oracle Locale Builder. Otherwise, you should refer to the actual national, international, or vendor-specific standards.
Table A-4 lists the recommended and most commonly used ASCII-based Oracle database character sets. The list is ordered alphabetically within their respective language group.
Table A-4 Recommended ASCII Database Character Sets
Name | Description | Comments | |
---|---|---|---|
Asian | |||
JA16EUC | EUC 24-bit Japanese | MB, ASCII | |
JA16EUCTILDE | The same as JA16EUC except for the way that the wave dash and the tilde are mapped to and from Unicode. | MB, ASCII | |
JA16SJIS | Shift-JIS 16-bit Japanese | MB, ASCII, UDC | |
JA16SJISTILDE | The same as JA16SJIS except for the way that the wave dash and the tilde are mapped to and from Unicode. | MB, ASCII, UDC | |
KO16MSWIN949 | MS Windows Code Page 949 Korean | MB, ASCII, UDC | |
TH8TISASCII | Thai Industrial Standard 620-2533 - ASCII 8-bit | SB, ASCII, EURO | |
VN8MSWIN1258 | MS Windows Code Page 1258 8-bit Vietnamese | SB, ASCII, EURO | |
ZHS16GBK | GBK 16-bit Simplified Chinese | MB, ASCII, UDC | |
ZHT16HKSCS | MS Windows Code Page 950 with Hong Kong Supplementary Character Set HKSCS-2001 (character set conversion to and from Unicode is based on Unicode 3.0) | MB, ASCII, EURO | |
ZHT16MSWIN950 | MS Windows Code Page 950 Traditional Chinese | MB, ASCII, UDC | |
ZHT32EUC | EUC 32-bit Traditional Chinese | MB, ASCII | |
European | |||
BLT8ISO8859P13 | ISO 8859-13 Baltic | SB, ASCII | |
BLT8MSWIN1257 | MS Windows Code Page 1257 8-bit Baltic | SB, ASCII, EURO | |
CL8ISO8859P5 | ISO 8859-5 Latin/Cyrillic | SB, ASCII | |
CL8MSWIN1251 | MS Windows Code Page 1251 8-bit Latin/Cyrillic | SB, ASCII, EURO | |
EE8ISO8859P2 | ISO 8859-2 East European | SB, ASCII | |
EL8ISO8859P7 | ISO 8859-7 Latin/Greek | SB, ASCII, EURO | |
EL8MSWIN1253 | MS Windows Code Page 1253 8-bit Latin/Greek | SB, ASCII, EURO | |
EE8MSWIN1250 | MS Windows Code Page 1250 8-bit East European | SB, ASCII, EURO | |
NE8ISO8859P10 | ISO 8859-10 North European | SB, ASCII | |
NEE8ISO8859P4 | ISO 8859-4 North and North-East European | SB, ASCII | |
WE8ISO8859P15 | ISO 8859-15 West European | SB, ASCII, EURO | |
WE8MSWIN1252 | MS Windows Code Page 1252 8-bit West European | SB, ASCII, EURO | |
MiddleEastern | |||
AR8ISO8859P6 | ISO 8859-6 Latin/Arabic | SB, ASCII | |
AR8MSWIN1256 | MS Windows Code Page 1256 8-Bit Latin/Arabic | SB, ASCII, EURO | |
IW8ISO8859P8 | ISO 8859-8 Latin/Hebrew | SB, ASCII | |
IW8MSWIN1255 | MS Windows Code Page 1255 8-bit Latin/Hebrew | SB, ASCII, EURO | |
TR8MSWIN1254 | MS Windows Code Page 1254 8-bit Turkish | SB, ASCII, EURO | |
WE8ISO8859P9 | ISO 8859-9 West European & Turkish | SB, ASCII | |
Universal | |||
AL32UTF8 | Unicode 4.0 UTF-8 Universal character set | MB, ASCII, EURO |
Table A-5 lists the recommended and most commonly used EBCDIC-based Oracle database character sets. The list is ordered alphabetically within their respective language group.
Table A-5 Recommended EBCDIC Database Character Sets
Name | Description | Comments | |
---|---|---|---|
Asian | |||
JA16DBCS | IBM EBCDIC 16-bit Japanese | MB, UDC | |
JA16EBCDIC930 | IBM DBCS Code Page 290 16-bit Japanese | MB, UDC | |
KO16DBCS | IBM EBCDIC 16-bit Korean | MB, UDC | |
TH8TISEBCDICS | Thai Industrial Standard 620-2533-EBCDIC Server 8-bit | SB | |
European | |||
BLT8EBCDIC1112S | EBCDIC Code Page 1112 8-bit Server Baltic Multilingual | SB | |
CE8BS2000 | Siemens EBCDIC.DF.04 8-bit Central European | SB | |
CL8BS2000 | Siemens EBCDIC.EHC.LC 8-bit Cyrillic | SB | |
CL8EBCDIC1025R | EBCDIC Code Page 1025 Server 8-bit Cyrillic | SB | |
CL8EBCDIC1158R | EBCDIC Code Page 1158 Server 8-bit Cyrillic | SB | |
D8EBCDIC1141 | EBCDIC Code Page 1141 8-bit Austrian German | SB, EURO | |
DK8DBCDIC1142 | EBCDIC Code Page 1142 8-bit Danish | SB, EURO | |
EE8BS2000 | Siemens EBCDIC.DF.04 8-bit East European | SB | |
EE8EBCDIC870S | EBCDIC Code Page 870 Server 8-bit East European | SB | |
EL8EBCDIC423R | IBM EBCDIC Code Page 423 for RDBMS server-side | SB | |
EL8EBCDIC875R | EBCDIC Code Page 875 Server 8-bit Greek | SB | |
F8EBCDIC1147 | EBCDIC Code Page 1147 8-bit French | SB, EURO | |
I8EBCDIC1144 | EBCDIC Code Page 1144 8-bit Italian | SB, EURO | |
S8EBCDCI1143 | EBCDIC Code Page 1143 8-bit Swedish | SB, EURO | |
WE8BS2000 | Siemens EBCDIC.DF.04 8-bit West European | SB | |
WE8BS2000E | Siemens EBCDIC.DF.04 8-bit West European | SB, EURO | |
WE8BS2000L5 | Siemens EBCDIC.DF.L5 8-bit West European/Turkish | SB | |
WE8EBCDIC1047E | Latin 1/Open Systems 1047 | SB, EBCDIC, EURO | |
WE8EBCDIC1140 | EBCDIC Code Page 1140 8-bit West European | SB, EURO | |
WE8EBCDIC1145 | EBCDIC Code Page 1145 8-bit West European | SB, EURO | |
WE8DBCDIC1146 | EBCDIC Code Page 1146 8-bit West European | SB, EURO | |
WE8EBCDIC1148 | EBCDIC Code Page 1148 8-bit West European | SB, EURO | |
MiddleEastern | |||
AR8EBCDIC420S | EBCDIC Code Page 420 Server 8-bit Latin/Arabic | SB | |
IW8EBCDIC424S | EBCDIC Code Page 424 Server 8-bit Latin/Hebrew | SB | |
TR8EBCDIC1026S | EBCDIC Code Page 1026 Server 8-bit Turkish | SB |
Table A-6 lists the other ASCII-based Oracle character sets. The list is ordered alphabetically within their language groups.
Table A-6 Other ASCII Character Sets
Name | Description | Comments | |
---|---|---|---|
Asian | |||
BN8BSCII | Bangladesh National Code 8-bit BSCII | SB, ASCII | |
IN8ISCII | Multiple-Script Indian Standard 8-bit Latin/Indian Languages | SB, ASCII | |
JA16VMS | JVMS 16-bit Japanese | MB, ASCII | |
KO16KSC5601 | KSC5601 16-bit Korean | MB, ASCII | |
KO16KSCCS | KSCCS 16-bit Korean | MB, ASCII | |
TH8MACTHAIS | Mac Server 8-bit Latin/Thai | SB, ASCII | |
VN8VN3 | VN3 8-bit Vietnamese | SB, ASCII | |
ZHS16CGB231280 | CGB2312-80 16-bit Simplified Chinese | MB, ASCII | |
ZHT16BIG5 | BIG5 16-bit Traditional Chinese | MB, ASCII | |
ZHT16CCDC | HP CCDC 16-bit Traditional Chinese | MB, ASCII | |
ZHT16DBT | Taiwan Taxation 16-bit Traditional Chinese | MB, ASCII | |
ZHT16HKSCS31 | MS Windows Code Page 950 with Hong Kong Supplementary Character Set HKSCS-2001 (character set conversion to and from Unicode is based on Unicode 3.1) | MB, ASCII, EURO | |
ZHT32SOPS | SOPS 32-bit Traditional Chinese | MB, ASCII | |
ZHT32TRIS | TRIS 32-bit Traditional Chinese | MB, ASCII | |
MiddleEastern | |||
AR8ADOS710 | Arabic MS-DOS 710 Server 8-bit Latin/Arabic | SB, ASCII | |
AR8ADOS710T | Arabic MS-DOS 710 8-bit Latin/Arabic | SB | |
AR8ADOS720 | Arabic MS-DOS 720 Server 8-bit Latin/Arabic | SB, ASCII | |
AR8ADOS720T | Arabic MS-DOS 720 8-bit Latin/Arabic | SB | |
AR8APTEC715 | APTEC 715 Server 8-bit Latin/Arabic | SB, ASCII | |
AR8APTEC715T | APTEC 715 8-bit Latin/Arabic | SB | |
AR8ASMO708PLUS | ASMO 708 Plus 8-bit Latin/Arabic | SB, ASCII | |
AR8ASMO8X | ASMO Extended 708 8-bit Latin/Arabic | SB, ASCII | |
AR8HPARABIC8T | HP 8-bit Latin/Arabic | SB | |
AR8ISO8859P6 | ISO 8859-6 Latin/Arabic | SB, ASCII | |
AR8MUSSAD768 | Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic | SB, ASCII | |
AR8MUSSAD768T | Mussa'd Alarabi/2 768 8-bit Latin/Arabic | SB | |
AR8NAFITHA711 | Nafitha Enhanced 711 Server 8-bit Latin/Arabic | SB, ASCII | |
AR8NAFITHA711T | Nafitha Enhanced 711 8-bit Latin/Arabic | SB | |
AR8NAFITHA721 | Nafitha International 721 Server 8-bit Latin/Arabic | SB, ASCII | |
AR8NAFITHA721T | Nafitha International 721 8-bit Latin/Arabic | SB | |
AR8SAKHR706 | SAKHR 706 Server 8-bit Latin/Arabic | SB, ASCII | |
AR8SAKHR707 | SAKHR 707 Server 8-bit Latin/Arabic | SB, ASCII | |
AR8SAKHR707T | SAKHR 707 8-bit Latin/Arabic | SB | |
AR8XBASIC | XBASIC 8-bit Latin/Arabic | SB | |
AZ8ISO8859PE | ISO 8859-9 Latin Azerbaijani | SB, ASCII | |
IN8ISCII | Multiple-Script Indian Standard 8-bit Latin/Indian Languages | SB, ASCII | |
IW8MACHEBREW | Mac Client 8-bit Hebrew | SB | |
IW8PC1507 | IBM-PC Code Page 1507/862 8-bit Latin/Hebrew | SB, ASCII | |
LA8ISO6937 | ISO 6937 8-bit Coded Character Set for Text Communication | SB, ASCII | |
TR7DEC | DEC VT100 7-bit Turkish | SB | |
TR8DEC | DEC 8-bit Turkish | SB, ASCII | |
TR8PC857 | IBM-PC Code Page 857 8-bit Turkish | SB, ASCII | |
European | |||
AR8ARABICMAC | Mac Client 8-bit Latin/Arabic | SB | |
AR8ARABICMACS | Mac Server 8-bit Latin/Arabic | SB, ASCII | |
BG8MSWIN | MS Windows 8-bit Bulgarian Cyrillic | SB, ASCII | |
BG8PC437S | IBM-PC Code Page 437 8-bit (Bulgarian Modification) | SB, ASCII | |
BLT8CP921 | Latvian Standard LVS8-92(1) Windows/Unix 8-bit Baltic | SB, ASCII | |
BLT8PC775 | IBM-PC Code Page 775 8-bit Baltic | SB, ASCII | |
CDN8PC863 | IBM-PC Code Page 863 8-bit Canadian French | SB, ASCII | |
CEL8ISO8859P14 | ISO 8859-13 Celtic | SB, ASCII | |
CL8ISOIR111 | ISOIR111 Cyrillic | SB | |
CL8KOI8R | RELCOM Internet Standard 8-bit Latin/Cyrillic | SB, ASCII | |
CL8KOI8U | KOI8 Ukrainian Cyrillic | SB | |
CL8MACCYRILLICS | Mac Server 8-bit Latin/Cyrillic | SB, ASCII | |
EE8MACCES | Mac Server 8-bit Central European | SB, ASCII | |
EE8MACCROATIANS | Mac Server 8-bit Croatian | SB, ASCII | |
EE8PC852 | IBM-PC Code Page 852 8-bit East European | SB, ASCII | |
EL8DEC | DEC 8-bit Latin/Greek | SB | |
EL8MACGREEKS | Mac Server 8-bit Greek | SB, ASCII | |
EL8PC437S | IBM-PC Code Page 437 8-bit (Greek modification) | SB, ASCII | |
EL8PC851 | IBM-PC Code Page 851 8-bit Greek/Latin | SB, ASCII | |
EL8PC869 | IBM-PC Code Page 869 8-bit Greek/Latin | SB, ASCII | |
ET8MSWIN923 | MS Windows Code Page 923 8-bit Estonian | SB, ASCII | |
HU8ABMOD | Hungarian 8-bit Special AB Mod | SB, ASCII | |
HU8CWI2 | Hungarian 8-bit CWI-2 | SB, ASCII | |
IS8PC861 | IBM-PC Code Page 861 8-bit Icelandic | SB, ASCII | |
IW7IS960 | Israeli Standard 960 7-bit Latin/Hebrew | SB | |
IW8ISO8859P8 | ISO 8859-8 Latin/Hebrew | SB, ASCII | |
LA8ISO6937 | ISO 6937 8-bit Coded Character Set for Text Communication | SB, ASCII | |
LA8PASSPORT | German Government Printer 8-bit All-European Latin | SB, ASCII | |
LT8MSWIN921 | MS Windows Code Page 921 8-bit Lithuanian | SB, ASCII | |
LT8PC772 | IBM-PC Code Page 772 8-bit Lithuanian (Latin/Cyrillic) | SB, ASCII | |
LT8PC774 | IBM-PC Code Page 774 8-bit Lithuanian (Latin) | SB, ASCII | |
LV8PC8LR | Latvian Version IBM-PC Code Page 866 8-bit Latin/Cyrillic | SB, ASCII | |
LV8PC1117 | IBM-PC Code Page 1117 8-bit Latvian | SB, ASCII | |
LV8RST104090 | IBM-PC Alternative Code Page 8-bit Latvian (Latin/Cyrillic) | SB, ASCII | |
N8PC865 | IBM-PC Code Page 865 8-bit Norwegian | SB, ASCII | |
RU8BESTA | BESTA 8-bit Latin/Cyrillic | SB, ASCII | |
RU8PC855 | IBM-PC Code Page 855 8-bit Latin/Cyrillic | SB, ASCII | |
RU8PC866 | IBM-PC Code Page 866 8-bit Latin/Cyrillic | SB, ASCII | |
SE8ISO8859P3 | ISO 8859-3 South European | SB, ASCII | |
TR8MACTURKISH | Mac Client 8-bit Turkish | SB | |
TR8MACTURKISHS | Mac Server 8-bit Turkish | SB, ASCII | |
TR8PC857 | IBM-PC Code Page 857 8-bit Turkish | SB, ASCII | |
US7ASCII | ASCII 7-bit American | SB, ASCII | |
US8PC437 | IBM-PC Code Page 437 8-bit American | SB, ASCII | |
WE8DEC | DEC 8-bit West European | SB, ASCII | |
WE8DG | DG 8-bit West European | SB, ASCII | |
WE8ISO8859P1 | ISO 8859-1 West European | SB, ASCII | |
WE8MACROMAN8S | Mac Server 8-bit Extended Roman8 West European | SB, ASCII | |
WE8NCR4970 | NCR 4970 8-bit West European | SB, ASCII | |
WE8NEXTSTEP | NeXTSTEP PostScript 8-bit West European | SB, ASCII | |
WE8PC850 | IBM-PC Code Page 850 8-bit West European | SB, ASCII | |
WE8PC858 | IBM-PC Code Page 858 8-bit West European | SB, ASCII, EURO | |
WE8PC860 | IBM-PC Code Page 860 8-bit West European | SB, ASCII | |
WE8ROMAN8 | HP Roman8 8-bit West European | SB, ASCII | |
Universal | |||
UTF8 | Unicode 3.0 UTF-8 Universal character set, CESU-8 compliant | MB, ASCII, EURO |
Table A-7 lists the other EBCDIC-based Oracle character sets. The list is ordered alphabetically within their language groups.
Table A-7 Other EBCDIC Character Sets
Name | Description | Comments | |
---|---|---|---|
Asian | |||
TH8TISEBCDIC | Thai Industrial Standard 620-2533 - EBCDIC 8-bit | SB | |
ZHS16DBCS | IBM EBCDIC 16-bit Simplified Chinese | MB, UDC | |
ZHT16DBCS | IBM EBCDIC 16-bit Traditional Chinese | MB, UDC | |
MiddleEastern | |||
AR8EBCDICX | EBCDIC XBASIC Server 8-bit Latin/Arabic | SB | |
IW8EBCDIC424 | EBCDIC Code Page 424 8-bit Latin/Hebrew | SB | |
IW8EBCDIC1086 | EBCDIC Code Page 1086 8-bit Hebrew | SB | |
TR8EBCDIC1026 | EBCDIC Code Page 1026 8-bit Turkish | SB | |
WE8EBCDIC37C | EBCDIC Code Page 37 8-bit Oracle/c | SB | |
European | |||
BLT8EBCDIC1112 | EBCDIC Code Page 1112 8-bit Server Baltic Multilingual | SB | |
CL8EBCDIC1025 | EBCDIC Code Page 1025 8-bit Cyrillic | SB | |
CL8EBCDIC1025C | EBCDIC Code Page 1025 Client 8-bit Cyrillic | SB | |
CL8EBCDIC1025S | EBCDIC Code Page 1025 Server 8-bit Cyrillic | SB | |
CL8EBCDIC1025X | EBCDIC Code Page 1025 (Modified) 8-bit Cyrillic | SB | |
CL8EBCDIC1158 | EBCDIC Code Page 1158 8-bit Cyrillic | SB | |
D8BS2000 | Siemens 9750-62 EBCDIC 8-bit German | SB | |
D8EBCDIC273 | EBCDIC Code Page 273/1 8-bit Austrian German | SB | |
DK7SIEMENS9780X | Siemens 97801/97808 7-bit Danish | SB | |
DK8BS2000 | Siemens 9750-62 EBCDIC 8-bit Danish | SB | |
DK8EBCDIC277 | EBCDIC Code Page 277/1 8-bit Danish | SB | |
E8BS2000 | Siemens 9750-62 EBCDIC 8-bit Spanish | SB | |
EE8EBCDIC870 | EBCDIC Code Page 870 8-bit East European | SB | |
EE8EBCDIC870C | EBCDIC Code Page 870 Client 8-bit East European | SB | |
EL8EBCDIC875 | EBCDIC Code Page 875 8-bit Greek | SB | |
EL8GCOS7 | Bull EBCDIC GCOS7 8-bit Greek | SB | |
F8BS2000 | Siemens 9750-62 EBCDIC 8-bit French | SB | |
F8EBCDIC297 | EBCDIC Code Page 297 8-bit French | SB | |
I8EBCDIC280 | EBCDIC Code Page 280/1 8-bit Italian | SB | |
S8BS2000 | Siemens 9750-62 EBCDIC 8-bit Swedish | SB | |
S8EBCDIC278 | EBCDIC Code Page 278/1 8-bit Swedish | SB | |
US8ICL | ICL EBCDIC 8-bit American | SB | |
US8BS2000 | Siemens 9750-62 EBCDIC 8-bit American | SB | |
WE8EBCDIC924 | Latin 9 EBCDIC 924 | SB, EBCDIC | |
WE8EBCDIC37 | EBCDIC Code Page 37 8-bit West European | SB | |
WE8EBCDIC284 | EBCDIC Code Page 284 8-bit Latin American/Spanish | SB | |
WE8EBCDIC285 | EBCDIC Code Page 285 8-bit West European | SB | |
WE8EBCDIC1047 | EBCDIC Code Page 1047 8-bit West European | SB | |
WE8EBCDIC1140C | EBCDIC Code Page 1140 8-bit West European | SB, EURO | |
WE8EBCDIC1148C | EBCDIC Code Page 1148 Client 8-bit West European | SB, EURO | |
WE8EBCDIC500C | EBCDIC Code Page 500 8-bit Oracle/c | SB | |
WE8EBCDIC500 | EBCDIC Code Page 500 8-bit West European | SB | |
WE8EBCDIC871 | EBCDIC Code Page 871 8-bit Icelandic | SB | |
WE8ICL | ICL EBCDIC 8-bit West European | SB | |
WE8GCOS7 | Bull EBCDIC GCOS7 8-bit West European | SB | |
Universal | |||
UTFE | EBCDIC form of Unicode 3.0 UTF-8 Universal character set (UTF-EBCDIC) | MB, EURO |
Table A-8 lists the character sets that support the Euro symbol.
Table A-8 Character Sets that Support the Euro Symbol
Character Set Name | Hexadecimal Code Value of the Euro Symbol |
---|---|
AL16UTF16 | 20AC |
AL32UTF8 | E282AC |
AR8MSWIN1256 | 80 |
BLT8MSWIN1257 | 80 |
CL8EBCDIC1158 | E1 |
CL8EBCDIC1158R | 9F |
CL8MSWIN1251 | 88 |
D8EBCDIC1141 | 9F |
DK8EBCDIC1142 | 5A |
EE8MSWIN1250 | 80 |
EL8EBCDIC423R | FD |
EL8EBCDIC875R | DF |
EL8ISO8859P7 | A4 |
EL8MSWIN1253 | 80 |
F8EBCDIC1147 | 9F |
I8EBCDIC1144 | 9F |
IW8MSWIN1255 | 80 |
KO16KSC5601 | A2E6 |
KO16KSCCS | D9E6 |
KO16MSWIN949 | A2E6 |
S8EBCDIC1143 | 5A |
TH8TISASCII | 80 |
TR8MSWIN1254 | 80 |
UTF8 | E282AC |
UTFE | CA4653 |
VN8MSWIN1258 | 80 |
WE8BS2000E | 9F |
WE8EBCDIC1047E | 9F |
WE8EBCDIC1140 | 9F |
WE8EBCDIC1140C | 9F |
WE8EBCDIC1145 | 9F |
WE8EBCDIC1146 | 9F |
WE8EBCDIC1148 | 9F |
WE8EBCDIC1148C | 9F |
WE8EBCDIC924 | 9F |
WE8ISO8859P15 | A4 |
WE8MACROMAN8 | DB |
WE8MACROMAN8S | DB |
WE8MSWIN1252 | 80 |
WE8PC858 | DF |
ZHS32GB18030 | A2E3 |
ZHT16HKSCS | A3E1 |
ZHT16HKSCS31 | A3E1 |
ZHT16MSWIN950 | A3E1 |
Table A-9 lists the Oracle character sets that are supported as client-only character sets. The list is ordered alphabetically within their respective language groups.
Table A-9 Client-Only Character Sets
Name | Description | Comments | |
---|---|---|---|
Asian | |||
JA16EUCYEN | EUC 24-bit Japanese with ' mapped to the Japanese yen character | MB | |
JA16MACSJIS | Mac client Shift-JIS 16-bit Japanese | MB | |
JA16SJISYEN | Shift-JIS 16-bit Japanese with ' mapped to the Japanese yen character | MB, UDC | |
TH8MACTHAI | Mac Client 8-bit Latin/Thai | SB | |
ZHS32GB18030 | GB18030-2000 | MB, ASCII, EURO | |
ZHS16MACCGB231280 | Mac client CGB2312-80 16-bit Simplified Chinese | MB | |
European | |||
CH7DEC | DEC VT100 7-bit Swiss (German/French) | SB | |
CL8MACCYRILLIC | Mac Client 8-bit Latin/Cyrillic | SB | |
D7SIEMENS9780X | Siemens 97801/97808 7-bit German | SB | |
D7DEC | DEC VT100 7-bit German | SB | |
EEC8EUROASCI | EEC Targon 35 ASCI West European/Greek | SB | |
EEC8EUROPA3 | EEC EUROPA3 8-bit West European/Greek | SB | |
EE8MACCROATIAN | Mac Client 8-bit Croatian | SB | |
EE8MACCE | Mac Client 8-bit Central European | SB | |
EL8PC737 | IBM-PC Code Page 737 8-bit Greek/Latin | SB | |
EL8MACGREEK | Mac Client 8-bit Greek | SB | |
E7DEC | DEC VT100 7-bit Spanish | SB | |
E7SIEMENS9780X | Siemens 97801/97808 7-bit Spanish | SB | |
F7DEC | DEC VT100 7-bit French | SB | |
F7SIEMENS9780X | Siemens 97801/97808 7-bit French | SB | |
I7DEC | DEC VT100 7-bit Italian | SB | |
I7SIEMENS9780X | Siemens 97801/97808 7-bit Italian | SB | |
IS8MACICELANDICS | Mac Server 8-bit Icelandic | SB | |
IS8MACICELANDIC | Mac Client 8-bit Icelandic | SB | |
NL7DEC | DEC VT100 7-bit Dutch | SB | |
NDK7DEC | DEC VT100 7-bit Norwegian/Danish | SB | |
N7SIEMENS9780X | Siemens 97801/97808 7-bit Norwegian | SB | |
SF7DEC | DEC VT100 7-bit Finnish | SB | |
S7SIEMENS9780X | Siemens 97801/97808 7-bit Swedish | SB | |
S7DEC | DEC VT100 7-bit Swedish | SB | |
SF7ASCII | ASCII 7-bit Finnish | SB | |
TR7DEC | DEC VT100 7-bit Turkish | SB | |
WE8ISOICLUK | ICL special version ISO8859-1 | SB | |
WE8MACROMAN8 | Mac Client 8-bit Extended Roman8 West European | SB | |
WE8HP | HP LaserJet 8-bit West European | SB | |
YUG7ASCII | ASCII 7-bit Yugoslavian | SB | |
MiddleEastern | |||
AR8ARABICMAC | Mac Client 8-bit Latin/Arabic | SB | |
AR8ARABICMACT | Mac 8-bit Latin/Arabic | SB | |
AR8MUSSAD768 | Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic | SB, ASCII | |
IW7IS960 | Israeli Standard 960 7-bit Latin/Hebrew | SB | |
IW8MACHEBREW | Mac Client 8-bit Hebrew | SB | |
TR8MACTURKISH | Mac Client 8-bit Turkish | SB |
Table A-10 lists the Oracle character sets that provide universal language support. They attempt to support all languages of the world, including, but not limited to, Asian, European, and Middle Eastern languages.
Table A-10 Universal Character Sets
Name | Description | Comments |
---|---|---|
AL16UTF16 | Unicode 4.0 UTF-16 Universal character set | MB, EURO, FIXED |
AL32UTF8 | Unicode 4.0 UTF-8 Universal character set | MB, ASCII, EURO |
UTF8 | Unicode 3.0 UTF-8 Universal character set, CESU-8 compliant | MB, ASCII, EURO |
UTFE | EBCDIC form of Unicode 3.0 UTF-8 Universal character set (UTF-EBCDIC) | MB, EURO |
Note:
CESU-8 defines an encoding scheme for Unicode that is identical to UTF-8 except for its representation of supplementary characters. In CESU-8, supplementary characters are represented as six-byte sequences that result from the transformation of each UTF-16 surrogate code unit into an eight-bit form that is similar to the UTF-8 transformation, but without first converting the input surrogate pairs to a scalar value. See Unicode Technical Report #26.See Also:
Chapter 6, 'Supporting Multilingual Databases with Unicode'The following character set encodings are supported for conversion only. They cannot be used as the database or national character set:
You can use these character sets as the source_char_set
or dest_char_set
in the CONVERT
function.
See Oracle Database SQL Reference for more information about the CONVERT
function and 'The CONVERT Function'.
Table A-11 lists common subset/superset relationships.
Table A-11 Subset-Superset Pairs
Subset | Superset |
---|---|
AR8ADOS710 | AR8ADOS710T |
AR8ADOS720 | AR8ADOS720T |
AR8ADOS720T | AR8ADOS720 |
AR8APTEC715 | AR8APTEC715T |
AR8ARABICMACT | AR8ARABICMAC |
AR8ISO8859P6 | AR8ASMO708PLUS |
AR8ISO8859P6 | AR8ASMO8X |
AR8MUSSAD768 | AR8MUSSAD768T |
AR8MUSSAD768T | AR8MUSSAD768 |
AR8NAFITHA711 | AR8NAFITHA711T |
AR8NAFITHA721 | AR8NAFITHA721T |
AR8SAKHR707 | AR8SAKHR707T |
AR8SAKHR707T | AR8SAKHR707 |
BLT8CP921 | BLT8ISO8859P13 |
BLT8CP921 | LT8MSWIN921 |
D7DEC | D7SIEMENS9780X |
D7SIEMENS9780X | D7DEC |
DK7SIEMENS9780X | N7SIEMENS9780X |
I7DEC | I7SIEMENS9780X |
I7SIEMENS9780X | IW8EBCDIC424 |
IW8EBCDIC424 | IW8EBCDIC1086 |
KO16KSC5601 | KO16MSWIN949 |
LT8MSWIN921 | BLT8ISO8859P13 |
LT8MSWIN921 | BLT8CP921 |
N7SIEMENS9780X | DK7SIEMENS9780X |
US7ASCII | See Table A-12, 'US7ASCII Supersets'. |
UTF8 | AL32UTF8 |
WE8DEC | TR8DEC |
WE8DEC | WE8NCR4970 |
WE8ISO8859P1 | WE8MSWIN1252 |
WE8ISO8859P9 | TR8MSWIN1254 |
WE8NCR4970 | TR8DEC |
WE8NCR4970 | WE8DEC |
WE8PC850 | WE8PC858 |
US7ASCII is a special case because so many other character sets are supersets of it. Table A-12 lists supersets for US7ASCII.
Table A-12 US7ASCII Supersets
Supersets | Supersets | Supersets |
---|---|---|
AL32UTF8 | EE8ISO8859P2 | RU8BESTA |
AR8ADOS710 | EE8MACCES | RU8PC855 |
AR8ADOS710T | EE8MACCROATIANS | RU8PC866 |
AR8ADOS720 | EE8MSWIN1250 | SE8ISO8859P3 |
AR8ADOS720T | EE8PC852 | TH8MACTHAIS |
AR8APTEC715 | EL8DEC | TH8TISASCII |
AR8APTEC715T | EL8ISO8859P7 | TR8DEC |
AR8ARABICMACS | EL8MACGREEKS | TR8MACTURKISHS |
AR8ASMO708PLUS | EL8MSWIN1253 | TR8MSWIN1254 |
AR8ASMO8X | EL8PC437S | TR8PC857 |
AR8HPARABIC8T | EL8PC851 | US8PC437 |
AR8ISO8859P6 | EL8PC869 | UTF8 |
AR8MSWIN1256 | ET8MSWIN923 | VN8MSWIN1258 |
AR8MUSSAD768 | HU8ABMOD | VN8VN3 |
AR8MUSSAD768T | HU8CWI2 | WE8DEC |
AR8NAFITHA711 | IN8ISCII | WE8DG |
AR8NAFITHA711T | IS8PC861 | WE8ISO8859P1 |
AR8NAFITHA721 | IW8ISO8859P8 | WE8ISO8859P15 |
AR8NAFITHA721T | IW8MACHEBREWS | WE8ISO8859P9 |
AR8SAKHR706 | IW8MSWIN1255 | WE8MACROMAN8S |
AR8SAKHR707 | IW8PC1507 | WE8MSWIN1252 |
AR8SAKHR707T | JA16EUC | WE8NCR4970 |
AZ8ISO8859PE | JA16SJIS | WE8NEXTSTEP |
BG8MSWIN | JA16VMS | WE8PC850 |
BG8PC437S | KO16KSC5601 | WE8PC858 |
BLT8CP921 | KO16KSCCS | WE8PC860 |
BLT8ISO8859P13 | KO16MSWIN949 | WE8ROMAN8 |
BLT8MSWIN1257 | LA8ISO6937 | ZHS16CGB231280 |
BLT8PC775 | LA8PASSPORT | ZHS16GBK |
BN8BSCII | LT8MSWIN921 | ZHT16BIG5 |
CDN8PC863 | LT8PC772 | ZHT16CCDC |
CEL8ISO8859P14 | LT8PC774 | ZHT16DBT |
CL8ISO8859P5 | LV8PC1117 | ZHT16HKSCS |
CL8KOI8R | LV8PC8LR | ZHT16MSWIN950 |
CL8KOI8U | LV8RST104090 | ZHT32EUC |
CL8ISOIR111 | N8PC865 | ZHT32SOPS |
CL8MACCYRILLICS | NE8ISO8859P10 | ZHT32TRIS |
CL8MSWIN1251 | NEE8ISO8859P4 | ZHS32GB18030 |
Table A-13 displays the languages and character sets that are supported by the language and character set detection in the Character Set Scanner utilities (CSSCAN
and LCSSCAN
) and the Globalization Development Kit (GDK).
Each language has several character sets that can be detected.
When the binary values for a language match two or more encodings that have a subset/superset relationship, the subset character set is returned. For example, if the language is German and all characters are 7-bit, then US7ASCII is returned instead of WE8MSWIN1252, WE8ISO8859P15, or WE8ISO8859P1.
When the character set is determined to be UTF-8, the Oracle character set UTF8 is returned by default unless 4-byte characters (supplementary characters) are detected within the text. If 4-byte characters are detected, then the character set is reported as AL32UTF8.
Table A-13 Languages and Character Sets Supported by CSSCAN, LCSSCAN, and GDK
Language | Character Sets |
---|---|
Arabic | AL16UTF16, AL32UTF8, AR8ISO8859P6, AR8MSWIN1256, UTF8 |
Bulgarian | AL16UTF16, AL32UTF8, CL8ISO8859P5, CL8MSWIN1251, UTF8 |
Catalan | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
Croatian | AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8 |
Czech | AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8 |
Danish | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
Dutch | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
English | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
Estonian | AL16UTF16, AL32UTF8, NEE8IOS8859P4, UTF8 |
Finnish | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
French | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
German | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
Greek | AL16UTF16, AL32UTF8, EL8ISO8859P7, EL8MSWIN1253, UTF8 |
Hebrew | AL16UTF16, AL32UTF8, IW8ISO8859P8, IW8MSWIN1255, UTF8 |
Hungarian | AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8 |
Italian | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
Japanese | AL16UTF16, AL32UTF8, ISO2022-JP, JA16EUC, JA16SJIS, UTF8 |
Korean | AL16UTF16, AL32UTF8, ISO2022-KR, KO16KSC5601, KO16MSWIN949, UTF8 |
Malay | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
Norwegian | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
Polish | AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8 |
Portuguese | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
Romanian | AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8 |
Russian | AL16UTF16, AL32UTF8, CL8ISO8859P5, CL8KOI8R, CL8MSWIN1251, UTF8 |
Simplified Chinese | AL16UTF16, AL32UTF8, HZ-GB-2312, UTF8, ZHS16GBK, ZHS16CGB231280 |
Slovak | AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8 |
Spanish | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
Swedish | AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252 |
Thai | AL16UTF16, AL32UTF8, TH8TISASCII, UTF8 |
Traditional Chinese | AL16UTF16, AL32UTF8, UTF8, ZHT16MSWIN950 |
Turkish | AL16UTF16, AL32UTF8, TR8MSWIN1254, UTF8, WE8ISO8859P9 |
Oracle offers two kinds of linguistic sorts, monolingual and multilingual. In addition, monolingual sorts can be extended to handle special cases. These special cases (represented with a prefix X) typically mean that the characters are sorted differently from their ASCII values. For example, ch
and ll
are treated as a single character in XSPANISH.
All of the linguistic sorts can be also be performed as case-insensitive or accent-insensitive by appending _CI
or _AI
to the linguistic sort name.
Table A-14 lists the monolingual linguistic sorts supported by the Oracle server.
See Also:
Table A-1, 'Oracle Supported Languages' for a list of the default sort for each languageTable A-14 Monolingual Linguistic Sorts
Basic Name | Extended Name | Special Cases |
---|---|---|
ARABIC | - | - |
ARABIC_MATCH | - | - |
ARABIC_ABJ_SORT | - | - |
ARABIC_ABJ_MATCH | - | - |
ASCII7 | - | - |
AZERBAIJANI | XAZERBAIJANI | i, I, lowercase i without dot, uppercase I with dot |
BENGALI | - | - |
BIG5 | - | - |
BINARY | - | - |
BULGARIAN | - | - |
CATALAN | XCATALAN | æ, AE, ß |
CROATIAN | XCROATIAN | D, L, N, d, l, n, ß |
CZECH | XCZECH | ch, CH, Ch, ß |
CZECH_PUNCTUATION | XCZECH_PUNCTUATION | ch, CH, Ch, ß |
DANISH | XDANISH | A, ß, Å, å |
DUTCH | XDUTCH | ij, IJ |
EBCDIC | - | - |
EEC_EURO | - | - |
EEC_EUROPA3 | - | - |
ESTONIAN | - | - |
FINNISH | - | - |
FRENCH | XFRENCH | - |
GERMAN | XGERMAN | ß |
GERMAN_DIN | XGERMAN_DIN | ß, ä, ö, ü, Ä, Ö, Ü |
GBK | - | - |
GREEK | - | - |
HEBREW | - | - |
HKSCS | - | - |
HUNGARIAN | XHUNGARIAN | cs, gy, ny, sz, ty, zs, ß, CS, Cs, GY, Gy, NY, Ny, SZ, Sz, TY, Ty, ZS, Zs |
ICELANDIC | - | - |
INDONESIAN | - | - |
ITALIAN | - | - |
LATIN | - | - |
LATVIAN | - | - |
LITHUANIAN | - | - |
MALAY | - | - |
NORWEGIAN | - | - |
POLISH | - | - |
PUNCTUATION | XPUNCTUATION | - |
ROMANIAN | - | - |
RUSSIAN | - | - |
SLOVAK | XSLOVAK | dz, DZ, Dz, ß (caron) |
SLOVENIAN | XSLOVENIAN | ß |
SPANISH | XSPANISH | ch, ll, CH, Ch, LL, Ll |
SWEDISH | - | - |
SWISS | XSWISS | ß |
TURKISH | XTURKISH | æ, AE, ß |
UKRAINIAN | - | - |
UNICODE_BINARY | - | - |
VIETNAMESE | - | - |
WEST_EUROPEAN | XWEST_EUROPEAN | ß |
Table A-15 lists the multilingual linguistic sorts available in Oracle. All of them include GENERIC_M (an ISO standard for sorting Latin-based characters) as a base. Multilingual linguistic sorts are used for a specific primary language together with Latin-based characters. For example, KOREAN_M sorts Korean and Latin-based characters, but it does not collate Chinese, Thai, or Japanese characters.
Table A-15 Multilingual LInguistic Sorts
Sort Name | Description |
---|---|
CANADIAN_M | Canadian French sort supports reverse secondary, special expanding characters |
DANISH_M | Danish sort supports sorting uppercase characters before lowercase characters |
FRENCH_M | French sort supports reverse sort for secondary |
GENERIC_M | Generic sorting order which is based on ISO14651 and Unicode canonical equivalence rules but excluding compatible equivalence rules |
JAPANESE_M | Japanese sort supports SJIS character set order and EUC characters which are not included in SJIS |
KOREAN_M | Korean sort: Hangul characters are based on Unicode binary order. Hanja characters based on pronunciation order. All Hangul characters are before Hanja characters |
SPANISH_M | Traditional Spanish sort supports special contracting characters |
THAI_M | Thai sort supports swap characters for some vowels and consonants |
SCHINESE_RADICAL_M | Simplified Chinese sort based on radical as primary order and number of strokes order as secondary order |
SCHINESE_STROKE_M | Simplified Chinese sort uses number of strokes as primary order and radical as secondary order |
SCHINESE_PINYIN_M | Simplified Chinese PinYin sorting order |
TCHINESE_RADICAL_M | Traditional Chinese sort based on radical as primary order and number of strokes order as secondary order |
TCHINESE_STROKE_M | Traditional Chinese sort uses number of strokes as primary order and radical as secondary order. It supports supplementary characters. |
See Also:
Chapter 5, 'Linguistic Sorting and String Searching'By default, most territory definitions use the Gregorian calendar system. Table A-14 lists the other calendar systems supported by the Oracle server.
Table A-16 Supported Calendar Systems
Name | Default Date Format | Character Set Used For Default Date Format |
---|---|---|
Japanese Imperial | EEYYMMDD | JA16EUC |
ROC Official | EEyymmdd | ZHT32EUC |
Thai Buddha | dd month EE yyyy | TH8TISASCII |
Persian | DD Month YYYY | AR8ASMO8X |
Arabic Hijrah | DD Month YYYY | AR8ISO8859P6 |
English Hijrah | DD Month YYYY | AR8ISO8859P6 |
Figure A-1 shows how March 27, 1998 appears in Japanese Imperial.
Figure A-1 Japanese Imperial Example
Table A-17 shows the time zone names in the default time zone file that is supplied with the Oracle Database. The default time zone file is $ORACLE_HOME/oracore/zoneinfo/timezlrg.dat
. Oracle also supplies a smaller time zone file, $ORACLE_HOME/oracore/zoneinfo/timezone/dat
. See Chapter 4, 'Datetime Datatypes and Time Zone Support' for more information regarding time zone files.
Table A-17 Time Zone Names
Time Zone Name | Is It in the Smaller Time Zone File? | Time Zone Name | Is It in the Smaller Time Zone File? |
---|---|---|---|
Africa/Algiers | No | Australia/Perth | Yes |
Africa/Cairo | Yes | Australia/Queensland | Yes |
Africa/Casablanca | No | Australia/South | Yes |
Africa/Ceuta | No | Australia/Sydney | Yes |
Africa/Djibouti | No | Australia/Tasmania | Yes |
Africa/Freetown | No | Australia/Victoria | Yes |
Africa/Johannesburg | No | Australia/West | Yes |
Africa/Khartoum | No | Australia/Yancowinna | Yes |
Africa/Mogadishu | No | Brazil/Acre | Yes |
Africa/Nairobi | No | Brazil/DeNoronha | Yes |
Africa/Nouakchott | No | Brazil/East | Yes |
Africa/Tripoli | Yes | Brazil/West | Yes |
Africa/Tunis | No | CET | Yes |
Africa/Windhoek | No | CST | Yes |
America/Adak | Yes | CST6CDT | Yes |
America/Anchorage | Yes | Canada/Atlantic | Yes |
America/Anguilla | No | Canada/Central | Yes |
America/Araguaina | No | Canada/East-Saskatchewan | Yes |
America/Aruba | No | Canada/Eastern | Yes |
America/Asuncion | No | Canada/Mountain | Yes |
America/Atka | Yes | Canada/Newfoundland | Yes |
America/Belem | No | Canada/Pacific | Yes |
America/Boa_Vista | No | Canada/Saskatchewan | Yes |
America/Bogota | No | Canada/Yukon | Yes |
America/Boise | No | Chile/Continental | Yes |
America/Buenos_Aires | No | Chile/EasterIsland | Yes |
America/Cambridge_Bay | No | Cuba | Yes |
America/Cancun | No | EET | Yes |
America/Caracas | No | EST | Yes |
America/Cayenne | No | EST5EDT | Yes |
America/Cayman | No | Egypt | Yes |
America/Chicago | Yes | Eire | Yes |
America/Chihuahua | No | Etc/GMT | Yes |
America/Costa_Rica | No | Etc/GMT+0 | Yes |
America/Cuiaba | No | Etc/GMT+1 | Yes |
America/Curacao | No | Etc/GMT+10 | Yes |
America/Dawson | No | Etc/GMT+11 | Yes |
America/Dawson_Creek | No | Etc/GMT+12 | Yes |
America/Denver | Yes | Etc/GMT+2 | Yes |
America/Detroit | Yes | Etc/GMT+3 | Yes |
America/Edmonton | Yes | Etc/GMT+4 | Yes |
America/El_Salvador | No | Etc/GMT+5 | Yes |
America/Ensenada | Yes | Etc/GMT+6 | Yes |
America/Fort_Wayne | Yes | Etc/GMT+7 | Yes |
America/Fortaleza | No | Etc/GMT+8 | Yes |
America/Godthab | No | Etc/GMT+9 | Yes |
America/Goose_Bay | No | Etc/GMT-0 | Yes |
America/Grand_Turk | No | Etc/GMT-1 | Yes |
America/Guadeloupe | No | Etc/GMT-10 | Yes |
America/Guatemala | No | Etc/GMT-11 | Yes |
America/Guayaquil | No | - | - |
America/Halifax | Yes | Etc/GMT-12 | Yes |
America/Havana | Yes | Etc/GMT-13 | Yes |
America/Indiana/Indianapolis | Yes | Etc/GMT-2 | Yes |
America/Indiana/Knox | No | Etc/GMT-3 | Yes |
America/Indiana/Marengo | No | Etc/GMT-4 | Yes |
America/Indiana/Vevay | No | Etc/GMT-5 | Yes |
America/Indianapolis | Yes | Etc/GMT-6 | Yes |
America/Inuvik | No | Etc/GMT-7 | Yes |
America/Iqaluit | No | Etc/GMT-8 | Yes |
America/Jamaica | Yes | Etc/GMT-9 | Yes |
America/Juneau | No | Etc/GMT0 | Yes |
America/Knox_IN | No | Etc/Greenwich | Yes |
America/La_Paz | No | Europe/Amsterdam | No |
America/Lima | No | Europe/Athens | No |
America/Los_Angeles | Yes | Europe/Belfast | No |
America/Louisville | No | Europe/Belgrade | No |
America/Maceio | No | Europe/Berlin | No |
America/Managua | No | Europe/Bratislava | No |
America/Manaus | Yes | Europe/Brussels | No |
America/Martinique | No | Europe/Bucharest | No |
America/Mazatlan | Yes | Europe/Budapest | No |
America/Mexico_City | Yes | Europe/Copenhagen | No |
America/Miquelon | No | Europe/Dublin | Yes |
America/Montevideo | No | Europe/Gibraltar | No |
America/Montreal | Yes | Europe/Helsinki | No |
America/Montserrat | No | Europe/Istanbul | Yes |
America/New_York | Yes | Europe/Kaliningrad | No |
America/Nome | No | Europe/Kiev | No |
America/Noronha | Yes | Europe/Lisbon | Yes |
America/Panama | No | Europe/Ljubljana | No |
America/Phoenix | Yes | Europe/London | Yes |
America/Porto_Acre | No | Europe/Luxembourg | No |
America/Porto_Velho | No | Europe/Madrid | No |
America/Puerto_Rico | No | Europe/Minsk | No |
America/Rankin_Inlet | No | Europe/Monaco | No |
America/Regina | Yes | Europe/Moscow | Yes |
America/Rio_Branco | Yes | - | - |
America/Santiago | Yes | Europe/Oslo | No |
America/Sao_Paulo | Yes | Europe/Paris | No |
America/Scoresbysund | No | Europe/Prague | No |
America/Shiprock | Yes | Europe/Riga | No |
America/St_Johns | Yes | Europe/Rome | No |
America/St_Thomas | No | Europe/Samara | No |
America/Swift_Current | No | Europe/San_Marino | No |
America/Tegucigalpa | No | Europe/Sarajevo | No |
America/Thule | No | Europe/Simferopol | No |
America/Thunder_Bay | No | Europe/Skopje | No |
America/Tijuana | Yes | Europe/Sofia | No |
America/Tortola | No | Europe/Stockholm | No |
America/Vancouver | Yes | Europe/Tallinn | No |
America/Virgin | No | Europe/Tirane | No |
America/Whitehorse | Yes | Europe/Vatican | No |
America/Winnipeg | Yes | Europe/Vienna | No |
America/Yellowknife | No | Europe/Vilnius | No |
Arctic/Longyearbyen | No | Europe/Warsaw | Yes |
Asia/Aden | No | Europe/Zagreb | No |
Asia/Almaty | No | Europe/Zurich | No |
Asia/Amman | No | GB | Yes |
Asia/Anadyr | No | GB-Eire | Yes |
Asia/Aqtau | No | GMT | Yes |
Asia/Aqtobe | No | GMT+0 | Yes |
Asia/Baghdad | No | GMT-0 | Yes |
Asia/Bahrain | No | GMT0 | Yes |
Asia/Baku | No | Greenwich | Yes |
Asia/Bangkok | No | HST | Yes |
Asia/Beirut | No | Hongkong | Yes |
Asia/Bishkek | No | Iceland | Yes |
Asia/Calcutta | Yes | Indian/Chagos | No |
Asia/Chongqing | No | - | - |
Asia/Chungking | No | Indian/Christmas | No |
Asia/Dacca | No | Indian/Cocos | No |
Asia/Damascus | No | Indian/Mayotte | No |
Asia/Dhaka | No | - | - |
Asia/Dubai | No | Indian/Reunion | No |
Asia/Gaza | No | Iran | Yes |
Asia/Harbin | No | Israel | Yes |
Asia/Hong_Kong | Yes | Jamaica | Yes |
Asia/Irkutsk | No | Japan | Yes |
Asia/Istanbul | Yes | Kwajalein | Yes |
Asia/Jakarta | No | Libya | Yes |
Asia/Jayapura | No | MET | Yes |
Asia/Jerusalem | Yes | MST | Yes |
Asia/Kabul | No | MST7MDT | Yes |
Asia/Kamchatka | No | Mexico/BajaNorte | Yes |
Asia/Karachi | No | Mexico/BajaSur | Yes |
Asia/Kashgar | No | Mexico/General | Yes |
Asia/Krasnoyarsk | No | NZ | Yes |
Asia/Kuala_Lumpur | No | NZ-CHAT | Yes |
Asia/Kuching | No | Navajo | Yes |
Asia/Kuwait | No | PRC | Yes |
Asia/Macao | No | PST | Yes |
Asia/Macau | No | - | - |
Asia/Magadan | No | PST8PDT | Yes |
Asia/Manila | No | Pacific/Auckland | Yes |
Asia/Muscat | No | Pacific/Chatham | Yes |
Asia/Nicosia | No | Pacific/Easter | Yes |
Asia/Novosibirsk | No | Pacific/Fakaofo | No |
Asia/Omsk | No | Pacific/Fiji | No |
Asia/Qatar | No | Pacific/Gambier | No |
Asia/Rangoon | No | Pacific/Guam | No |
Asia/Riyadh | Yes | Pacific/Honolulu | Yes |
Asia/Saigon | No | Pacific/Johnston | No |
Asia/Seoul | Yes | Pacific/Kiritimati | No |
Asia/Shanghai | Yes | Pacific/Kwajalein | Yes |
Asia/Singapore | Yes | Pacific/Marquesas | No |
Asia/Taipei | Yes | Pacific/Midway | No |
Asia/Tashkent | No | Pacific/Niue | No |
Asia/Tbilisi | No | Pacific/Norfolk | No |
Asia/Tehran | Yes | Pacific/Noumea | No |
Asia/Tel_Aviv | Yes | Pacific/Pago_Pago | Yes |
Asia/Tokyo | Yes | Pacific/Pitcairn | No |
Asia/Ujung_Pandang | No | Pacific/Rarotonga | No |
Asia/Urumqi | No | Pacific/Saipan | No |
Asia/Vladivostok | No | Pacific/Samoa | Yes |
Asia/Yakutsk | No | Pacific/Tahiti | No |
Asia/Yekaterinburg | No | Pacific/Tongatapu | No |
Asia/Yerevan | No | Pacific/Wake | No |
Atlantic/Azores | No | Pacific/Wallis | No |
Atlantic/Bermuda | No | Poland | Yes |
Atlantic/Canary | No | Portugal | Yes |
Atlantic/Faeroe | No | ROC | Yes |
Atlantic/Madeira | No | ROK | Yes |
Atlantic/Reykjavik | Yes | Singapore | Yes |
Atlantic/St_Helena | No | Turkey | Yes |
Atlantic/Stanley | No | US/Alaska | Yes |
Australia/ACT | Yes | US/Aleutian | Yes |
Australia/Adelaide | Yes | US/Arizona | Yes |
Australia/Brisbane | Yes | US/Central | Yes |
Australia/Broken_Hill | Yes | US/East-Indiana | Yes |
Australia/Canberra | Yes | US/Eastern | Yes |
Australia/Darwin | Yes | US/Hawaii | Yes |
Australia/Hobart | Yes | US/Indiana-Starke | No |
Australia/LHI | Yes | US/Michigan | Yes |
Australia/Lindeman | Yes | US/Mountain | Yes |
Australia/Lord_Howe | Yes | US/Pacific | Yes |
Australia/Melbourne | Yes | US/Pacific-New | Yes |
Australia/NSW | Yes | US/Samoa | Yes |
Australia/North | Yes | UTC | No |
- | - | W-SU | Yes |
- | - | WET | Yes |
This section contains information about obsolete linguistic sorts, character sets, languages, and territories. The obsolete linguistic sort, language, and territory definitions are still available. However, they are supported for backward compatibility only; they may be desupported in a future release. You can obtain a listing of the obsolete character sets, languages, territories, and linguistic sorts for the current database release by querying the V$NLS_VALID_VALUES
view.
Table A-18 contains linguistic sorts that have been desupported in Oracle Database 10g.
Table A-18 Obsolete Linguistic Sorts in Oracle Database 10g
Obsolete Sort Name | Replacement Sort |
---|---|
THAI_TELEPHONE | THAI_M |
THAI_DICTIONARY | THAI_M |
CANADIAN FRENCH | CANADIAN_M |
JAPANESE | JAPANESE_M |
Table A-19 contains territories that have been desupported in Oracle Database 10g.
Table A-19 Obsolete Territories
Obsolete Territory Name | Replacement Territory |
---|---|
CIS | RUSSIA |
MACEDONIA | FYR MACEDONIA |
YUGOSLAVIA | SERBIA AND MONTENEGRO |
CZECHOSLOVAKIA | CZECH REPUBLIC or SLOVAKIA |
Table A-20 contains languages that have been desupported in Oracle Database 10g.
Table A-20 Obsolete Languages
Obsolete Language Name | Replacement Language |
---|---|
BENGALI | BANGLA |
Table A-21 lists the obsolete character sets. If you reference any of these character sets in your code, then replace them with their new name.
Table A-21 New Names for Obsolete Character Sets
Old Name | New Name |
---|---|
AL24UTFSS | UTF8, AL32UTF8 |
AR8MSAWIN | AR8MSWIN1256 |
CL8EBCDIC875S | CL8EBCDIC875R |
CL8MSWINDOW31 | CL8MSWIN1251 |
EL8EBCDIC875S | EL8EBCDIC875R |
JVMS | JA16VMS |
JEUC | JA16EUC |
SJIS | JA16SJIS |
JDBCS | JA16DBCS |
KSC5601 | KO16KSC5601 |
KDBCS | KO16DBCS |
CGB2312-80 | ZHS16CGB231280 |
CNS 11643-86 | ZHT32EUC |
JA16EUCFIXED | None. Replaced by new national character set. UTF8 and AL16UTF16. |
ZHS32EUCFIXED | None. Replaced by new national character set. UTF8 and AL16UTF16. |
ZHS16GBKFIXED | None. Replaced by new national character set. UTF8 and AL16UTF16. |
JA16DBCSFIXED | None. Replaced by new national character set. UTF8 and AL16UTF16. |
KO16DBCSFIXED | None. Replaced by new national character set. UTF8 and AL16UTF16. |
ZHS16DBCSFIXED | None. Replaced by new national character set. UTF8 and AL16UTF16. |
ZHS16CGB231280FIXED | None. Replaced by new national character set. UTF8 and AL16UTF16. |
ZHT16DBCSFIXED | None. Replaced by new national character set. UTF8 and AL16UTF16. |
KO16KSC5601FIXED | None. Replaced by new national character set. UTF8 and AL16UTF16. |
JA16SJISFIXED | None. Replaced by new national character set. UTF8 and AL16UTF16. |
ZHT16BIG5FIXED | None. Replaced by new national character set. UTF8 and AL16UTF16. |
ZHT32TRISFIXED | None. Replaced by new national character set. UTF8 and AL16UTF16. |
The Unicode Character Set AL24UTFFSS was desupported in Oracle9i. AL24UTFFSS was introduced in version 7 as the Unicode character set supporting UTF-8 encoding scheme based on the Unicode standard 1.1, which is now obsolete. In Oracle Database 10g, Oracle offers the Unicode database character set AL32UTF8, which is based on Unicode 4.0, and UTF8, which is based on Unicode 3.0.
The migration path for an existing AL24UTFFSS database is to upgrade to UTF8 prior to upgrading to Oracle9i. As with all migrations to a new database character set, Oracle Corporation recommends that you use the Character Set Scanner for data analysis before attempting to migrate your existing database character set to UTF8.
See Also:
Chapter 12, 'Character Set Scanner Utilities'Changes have been made to the content in some of the language and territory definition files in Oracle Database 10g. These updates are necessary to correct the legacy definitions which no longer meet the local conventions in some of the Oracle supported languages and territories. These changes include modifications to the currency symbols, month names, and group separators. One example is the local currency symbol for Brazil. This has been updated from Cr$
to R$
in Oracle Database 10g.
Please refer to the 'Oracle Language and Territory definition changes' table documented in the $ORACLE_HOME/nls/data/old/data_changes.html
file for a detailed list of the changes.
Oracle Database 10g customers should review their existing application code to make sure that the correct cultural conventions that are defined in Oracle Database 10g are being used. For customers who may not be able to make the necessary code changes to support their applications, Oracle offers Oracle9i locale definition files with Oracle Database 10g.
To revert back to the Oracle9i language and territory behavior, perform the following:
Shutdown the database.
Run the script cr9idata.pl
from the $ORACLE_HOME/nls/data/old
directory.
Set the ORA_NLS10 environment variable to the newly created $ORACLE_HOME/nls/data/9idata
directory.
Restart the database.
Steps 2 and 3 will need to be repeated for all 10g database clients that need to revert back to the Oracle9i definition files.
Oracle Corporation strongly recommends that customers use the Oracle Database 10g locale definition files; Oracle9i locale definition files will be desupported in a future release.
From Jamesonline:DOS-on-USB lets you install MS-DOS 7.1 on your USB memory key. After formatting your flash drive, you can install a full working version of MS-DOS to let you run games or system utilities. The best thing about having a DOS-bootable memory key is you can boot into it on any computer, just like a CD.
You'll find this utility great for times you need to do system maintenance, because you wont constantly have to burn a new CD, just copy the program to the USB Drive. With the DOS prompt, you can even install Windows 95 or 98. Recommended USB drive size: 16MB - 2GB. Version 2 features an auto launch for ease of the DOS operating system.