Building Worldwide Websites notes that
“By localizing a site into six languages other than English (Japanese, German, French, Spanish, Portuguese, and Swedish), a site can reach 90% of the online population”
(in 2001).
Values in the "Lang=" column in the table below are from the "Code for the representation of names of languages" used in
IETF RFC 1766 and
ISO 639:1988.
ISO 639-2, maintained by the US Library of Congress, names over 460 languages in English and French.
Version 1, first proposed in 1988, used 2 characters for 136 languages. The XML Cover Pages identified the language family for each code.
The US Library of Congress changed U.S. national standard Z39.52 (Codes for the Representation of Languages for Information Interchange) and the USMARC Code List for Languages to Version 2 of the international standard.
Version 2 uses Alpha-3 (three-character) codes and reached Draft International Standard in 1996. Where two codes are given, the first is the bibliographic code used for Universal Bibliographic Control (UNIMARC), and the second is the terminology code.
SGML-encoded correspondence between the two versions.
The paper version from ISO costs 162 CHF.
Version 3 includes more languages using terminology-code (not bibliographic) identifiers from ISO 639-2.
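The relationship between the 2-character codes and the two kinds of Alpha-3 code can be sketched as a small lookup. This is only an illustration: the entries are copied from the table on this page, and the names `ALPHA3` and `alpha3` are made up for the example rather than taken from any standard library.

```python
# Hypothetical lookup built from a few rows of the table on this page.
# Each value is (bibliographic code, terminology code); the two are the
# same when only one Alpha-3 code is listed.
ALPHA3 = {
    "en": ("eng", "eng"),
    "nl": ("dut", "nld"),
    "fr": ("fre", "fra"),
    "de": ("ger", "deu"),
    "zh": ("chi", "zho"),
    "ka": ("geo", "kat"),
    "hy": ("arm", "hye"),
}


def alpha3(code, kind="terminology"):
    """Return the Alpha-3 code for a 2-character language code.

    kind is "bibliographic" (first code listed) or "terminology" (second).
    Raises KeyError for a code that is not in this small sample.
    """
    bibliographic, terminology = ALPHA3[code.lower()]
    if kind == "bibliographic":
        return bibliographic
    if kind == "terminology":
        return terminology
    raise ValueError("kind must be 'bibliographic' or 'terminology'")


if __name__ == "__main__":
    print(alpha3("de"))                   # terminology code
    print(alpha3("de", "bibliographic"))  # bibliographic code
```

A real application would load the full code list instead of this hand-copied sample.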
Microsoft's International Word List defines the words and phrases that
either appear in the Microsoft Windows user interface or are used in
describing key concepts of the operating system.
The languages in this table are arranged in the 3 collections of 17 default language groups recognized by Microsoft Windows XP.
| Collection | Group | Lang= (2-char) | Lang= (Alpha-3) | LCID | English Name | Code Page | ISO-8859- | Bit |
|---|---|---|---|---|---|---|---|---|
| Basic (installed on all languages of the OS) | 1. Western Europe and United States | en | eng | 1033 | English Latin I | SBCS 1252 | 1 & 9 | 0 |
| | | da | dan | | | | | |
| | | nl | dut, nld | 1043 | | | | |
| | | fi | fin | 1035 | | | | |
| | | fr | fre, fra | 1036 | | | | |
| | | de | ger, deu | 1031, 66567 | | | | |
| | | it | ita | 1040 | | | | |
| | | la | lat | | Latin | | | |
| | | pt | por | 1046 | | | | |
| | | es | spa | 1027, 3082, 1034 | | | | |
| | | sv | swe | 1053 | | | | |
| | 2. Central (and Eastern) Europe (Latin 2) | hu, pl, cs, sr, et | | 1038, 1045, 1029, ?, 1061, 1058, 1062 | Serbian, Estonian, Ukrainian, Latvian | SBCS 1250 | 2 | 1 |
| | 3. "Windows" Baltic | ka, lt, sk, bg | | 66615, 1063, 1051, ?, 1050, ?, 1048, 1060 | Georgian, Lithuanian, Bulgarian, Croatian?, Belarusian?, Romanian? | SBCS 1257 | - | 7 |
| | 4. Greek | el | gre, ell | | | SBCS 1253 | 7 | 3 |
| | 5. Cyrillic | ru | rus | | | SBCS 1251 | 5 | 2 |
| | 6. Turkic | tr | tur | | | SBCS 1254 | 9 | 4 |
| East Asian (nicknamed "CJK") | 7. Japanese 日本語 | ja | jpn | 1041, 66577 | | DBCS 932 | 18 | |
| | 8. Korean 한국어 | ko | kor | 1042, 66578 | | DBCS 949 | 20 | |
| | 9. Simplified Chinese 中文 (简体) | zh_CN | chi, zho | 2052, 133124 | | DBCS 936 | 19 | |
| | 10. Traditional Chinese 中文 (繁體) | zh_TW | chi, zho | 197636, 1028 | | DBCS 950 | 21 | |
| Complex script (installed on Arabic and Hebrew localized OSes) | 11. Thai ไทย | th | tha | 1054 | Thai | SBCS 874 | 17 | |
| | 12. Hebrew | he | heb | | | SBCS 1255 | 8 | 5 |
| | 13. Arabic | ar | ara | | Arabic | SBCS 1256 | 6 | 6 |
| | 14. Vietnamese | vi | vie | 1066 | Vietnamese | SBCS 1258 | | |
| | 15. Indic | hi, ne, si | hin, nep, sin | | Hindi, Nepali, Sinhalese | | | |
| | 16. Georgian | ka | geo, kat | 66615 | | | | |
| | 17. Armenian | hy | arm, hye | | | | | |
| Others? | Persian | fa | | | | | | |
| | Norwegian | nn, nb | nno, nob | 1044 | Nynorsk | | | |
| | African? | zu, sw, so, su | | | Zulu, Swahili, Somali, Sundanese | | | |
| | Icelandic | | | 1039 | | | | |
| | FYRO Macedonian | | | 1071 | | | | |
ISO-8859-15 (Latin 9), published 1999-03-15, updates 8 characters in ISO-8859-1 (Latin 1).
These are noted in my list of characters with Entity Codes.
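The 8 changed positions can be listed by decoding every byte value under both encodings and keeping the ones that differ. This is a minimal sketch using Python's built-in `latin_1` and `iso8859_15` codecs; the function name `differing_bytes` is only illustrative.

```python
def differing_bytes(codec_a="latin_1", codec_b="iso8859_15"):
    """Return (byte value, char under codec_a, char under codec_b)
    for every single-byte value that the two codecs decode differently."""
    diffs = []
    for value in range(256):
        raw = bytes([value])
        char_a = raw.decode(codec_a)
        char_b = raw.decode(codec_b)
        if char_a != char_b:
            diffs.append((value, char_a, char_b))
    return diffs


if __name__ == "__main__":
    for value, latin1_char, latin9_char in differing_bytes():
        print(f"0x{value:02X}: {latin1_char!r} -> {latin9_char!r}")
```

The first difference reported is byte 0xA4, which is the currency sign in Latin 1 and the euro sign in Latin 9.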
The Bit column gives the ulCodePageRange bit setting in a font's OS/2 table.
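As an illustration of how such a bit field is read, the sketch below turns a ulCodePageRange value into a list of Windows code pages. It assumes the bit assignments for bits 0 through 8 as given in the OpenType OS/2 table specification and ignores the higher bits; `CODE_PAGE_BITS` and `code_pages_from_range` are names invented for this example.

```python
# Assumed bit assignments (bits 0-8 of ulCodePageRange1 only).
CODE_PAGE_BITS = {
    0: 1252,  # Latin 1
    1: 1250,  # Latin 2 (Central and Eastern Europe)
    2: 1251,  # Cyrillic
    3: 1253,  # Greek
    4: 1254,  # Turkish
    5: 1255,  # Hebrew
    6: 1256,  # Arabic
    7: 1257,  # Windows Baltic
    8: 1258,  # Vietnamese
}


def code_pages_from_range(ul_code_page_range1):
    """List the code pages whose bits are set, in ascending bit order."""
    return [
        code_page
        for bit, code_page in sorted(CODE_PAGE_BITS.items())
        if ul_code_page_range1 & (1 << bit)
    ]


if __name__ == "__main__":
    # A font that declares Latin 1 (bit 0) and Cyrillic (bit 2).
    print(code_pages_from_range(0b101))
```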
LCID = Locale ID = 2057 for UK English, 33280 for Binary Order, 66574 for Hungarian Technical
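The larger LCID values above (such as 66567 for German or 66574 for Hungarian Technical) can be read as an ordinary language ID with a sort ID stored in the higher bits. The sketch below assumes the usual Windows layout: the low 10 bits hold the primary language, the next 6 bits the sublanguage, and bits 16 to 19 the sort ID. The names `LcidParts` and `split_lcid` are illustrative, not part of any Windows API.

```python
from typing import NamedTuple


class LcidParts(NamedTuple):
    sort_id: int           # bits 16-19
    sublanguage: int       # bits 10-15
    primary_language: int  # bits 0-9


def split_lcid(lcid):
    """Split a numeric LCID into sort ID, sublanguage and primary language."""
    language_id = lcid & 0xFFFF
    return LcidParts(
        sort_id=(lcid >> 16) & 0xF,
        sublanguage=language_id >> 10,
        primary_language=language_id & 0x3FF,
    )


if __name__ == "__main__":
    for lcid in (1033, 2057, 1031, 66567):
        print(lcid, hex(lcid), split_lcid(lcid))
```

Read this way, 1033 (US English) and 2057 (UK English) share primary language 9 and differ only in sublanguage, while 66567 is 1031 (German) with sort ID 1.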
The Art and Science of Learning Languages (Oxford, England: Intellect Books, 1996)
by Amorey Gethin & Erik V. Gunnemark
| Aspect | American English | British English |
|---|---|---|
| Words | color organization minimize program | colour organisation minimise programme |
Resources on the English language
One of the earliest major works in English is Beowulf, an Old English epic poem whose surviving manuscript was written around 1000 A.D. It is about Scandinavians (Danes, Swedes, Finns), with Christian allusions.
Characters and places: Beowulf, of the Wægmunding family within the Geats clan (also referred to as Geatas), whose sword is named Nægling; Heorot, in Roskilde, Sjaelland; Scyld, a mythical Danish king; Hrothgar, King of the Danes during Beowulf's visit to Heorot; Eadgils, the Swedish King; Hengest, leader of the Healfdenes; and the giant Grendel.
Here is a translation into Modern English
THE PHAOMNNEAL PWEOR OF THE HMUAN MNID
Aoccdrnig to rscheearch at Cmabrigde Uinervtisy, it deosn't mttaer in waht oredr the ltteers in a wrod are, the olny iprmoatnt tihng is taht the frist and lsat ltteer be in the rghit pclae. The rset can be a taotl mses and you can sitll raed it wouthit porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the wrod as a wlohe.
Amzanig huh?
Hrad to blveiee taht I cluod aulaclty uesdnatnrd waht I was rdgnieg!
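The rule the passage describes (keep the first and last letter of each word in place and shuffle the rest) is easy to try out. The sketch below is one possible way to do it; `scramble_word` and `scramble_text` are names chosen for this example, and the optional `seed` only makes the shuffle repeatable.

```python
import random
import re


def scramble_word(word, rng):
    """Shuffle the interior letters of a word, keeping first and last fixed."""
    if len(word) <= 3:
        return word  # nothing to shuffle in words of up to three letters
    middle = list(word[1:-1])
    rng.shuffle(middle)
    return word[0] + "".join(middle) + word[-1]


def scramble_text(text, seed=None):
    """Scramble every run of ASCII letters in text; punctuation is untouched."""
    rng = random.Random(seed)
    return re.sub(r"[A-Za-z]+", lambda m: scramble_word(m.group(0), rng), text)


if __name__ == "__main__":
    print(scramble_text("According to research, reading scrambled words is possible."))
```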
Unlike the Hindu place-value number system commonly used today
(with 1, 2, 3, etc. and zero, which allow numbers to grow easily without limit),
each letter of the Classical Greek alphabet has a corresponding numeric value in Greek mathematics.
For example (from Revelation 13:16-18):
| Greek | Transliteration | English |
|---|---|---|
| Ιησους | | Jesus |
| Χριστος | Christos | Christ |
| Θεου | Theos | God's |
| Υιος | | Son |
| Σωτηρ | Soter | Savior |
| Numeric value | Symbol Upper | Symbol Lower | Name (English) | Symbol Keystroke | Unicode |
|---|---|---|---|---|---|
| 1 | Α | α | alpha | A, a | |
| 2 | Β | β | beta | B, b | |
| 3 | Γ | γ | gamma | G, g | |
| 4 | Δ | δ | delta | D, d | |
| 5 | Ε | ε | epsilon | E, e | |
| 6 | Ϛ | ϛ | stigma | - | 03DA, 03DB |
| 6 | Ϝ | ϝ | digamma | - | 03DC, 03DD |
| 7 | Ζ | ζ | zeta | Z, z | |
| 8 | Η | η | eta | H, h | |
| 9 | Θ | θ | theta | Q, q | |
| 10 | Ι | ι | iota | I, i | |
| 20 | Κ | κ | kappa | K, k | |
| 30 | Λ | λ | lambda | L, l | |
| 40 | Μ | μ | mu | M, m | |
| 50 | Ν | ν | nu | N, n | |
| 60 | Ξ | ξ | xi | X, x | |
| 70 | Ο | ο | omicron | O, o | |
| 80 | Π | π | pi | P, p | |
| 90 | Ϙ | ϙ | koppa | - | 03D8, 03D9 |
| 100 | Ρ | ρ | rho | R, r | |
| 200 | Σ | σ, ς | sigma | S, s, V | |
| 300 | Τ | τ | tau | T, t | |
| 400 | Υ | υ | upsilon | U, u | |
| 500 | Φ | φ | phi | F, f | |
| 600 | Χ | χ | chi | C, c | |
| 700 | Ψ | ψ | psi | Y, y | |
| 800 | Ω | ω | omega | W, w | |
| 900 | Ϡ | ϡ | sampi | - | 03E0, 03E1 |
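Because the system is additive rather than positional, the value of a word or numeral is simply the sum of its letters' values. The sketch below applies the numeric values from the table above to text typed in Unicode Greek (not Symbol-font keystrokes); the names `GREEK_VALUES` and `greek_value` are invented for this example, and accents and breathing marks are stripped before lookup.

```python
import unicodedata

# Letter values copied from the table above, keyed by lowercase letter.
GREEK_VALUES = {
    "α": 1, "β": 2, "γ": 3, "δ": 4, "ε": 5, "ϛ": 6, "ϝ": 6,
    "ζ": 7, "η": 8, "θ": 9, "ι": 10, "κ": 20, "λ": 30, "μ": 40,
    "ν": 50, "ξ": 60, "ο": 70, "π": 80, "ϙ": 90, "ρ": 100,
    "σ": 200, "ς": 200, "τ": 300, "υ": 400, "φ": 500, "χ": 600,
    "ψ": 700, "ω": 800, "ϡ": 900,
}


def greek_value(word):
    """Sum the letter values of a Greek word or numeral.

    Raises KeyError if the text contains anything other than Greek letters
    (after accents and breathing marks have been removed).
    """
    decomposed = unicodedata.normalize("NFD", word.lower())
    letters = [ch for ch in decomposed if not unicodedata.combining(ch)]
    return sum(GREEK_VALUES[ch] for ch in letters)


if __name__ == "__main__":
    print(greek_value("χξϛ"))     # 600 + 60 + 6 = 666
    print(greek_value("Ιησους"))  # 10 + 8 + 200 + 70 + 400 + 200 = 888
```

Note that the order of the letters does not affect the total, which is the key difference from place-value notation.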