Entry: Latin script in Unicode
Definition

  1. Table of characters

  2. See also

  3. References

The Unicode Standard encodes a large number of characters belonging to the Latin script. As of version 12.0, 1,366 characters in the following blocks are classified as belonging to the Latin script:

  • Basic Latin, 0000–007F. This block corresponds to ASCII.
  • Latin-1 Supplement, 0080–00FF
  • Latin Extended-A, 0100–017F
  • Latin Extended-B, 0180–024F
  • IPA Extensions, 0250–02AF
  • Spacing Modifier Letters, 02B0–02FF
  • Phonetic Extensions, 1D00–1D7F
  • Phonetic Extensions Supplement, 1D80–1DBF
  • Latin Extended Additional, 1E00–1EFF
  • Superscripts and Subscripts, 2070–209F
  • Letterlike Symbols, 2100–214F
  • Number Forms, 2150–218F
  • Latin Extended-C, 2C60–2C7F
  • Latin Extended-D, A720–A7FF
  • Latin Extended-E, AB30–AB6F
  • Alphabetic Presentation Forms (Latin ligatures), FB00–FB4F
  • Halfwidth and Fullwidth Forms, FF00–FFEF
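
The block list above amounts to a small lookup table. As a minimal sketch in Python, the ranges can be stored as tuples and searched linearly. The names `LATIN_BLOCKS` and `block_of` are illustrative and not part of any library, and a match only means the block contains some Latin-script characters, not that the code point itself has the Latin script property.

```python
# Blocks that contain Latin-script characters, copied from the list above.
LATIN_BLOCKS = [
    ("Basic Latin", 0x0000, 0x007F),
    ("Latin-1 Supplement", 0x0080, 0x00FF),
    ("Latin Extended-A", 0x0100, 0x017F),
    ("Latin Extended-B", 0x0180, 0x024F),
    ("IPA Extensions", 0x0250, 0x02AF),
    ("Spacing Modifier Letters", 0x02B0, 0x02FF),
    ("Phonetic Extensions", 0x1D00, 0x1D7F),
    ("Phonetic Extensions Supplement", 0x1D80, 0x1DBF),
    ("Latin Extended Additional", 0x1E00, 0x1EFF),
    ("Superscripts and Subscripts", 0x2070, 0x209F),
    ("Letterlike Symbols", 0x2100, 0x214F),
    ("Number Forms", 0x2150, 0x218F),
    ("Latin Extended-C", 0x2C60, 0x2C7F),
    ("Latin Extended-D", 0xA720, 0xA7FF),
    ("Latin Extended-E", 0xAB30, 0xAB6F),
    ("Alphabetic Presentation Forms", 0xFB00, 0xFB4F),
    ("Halfwidth and Fullwidth Forms", 0xFF00, 0xFFEF),
]


def block_of(ch):
    """Return the name of the listed block containing ch, or None."""
    cp = ord(ch)
    for name, first, last in LATIN_BLOCKS:
        if first <= cp <= last:
            return name
    return None


print(block_of("A"))       # a code point in the ASCII range
print(block_of("\u1ebf"))  # a Vietnamese letter
print(block_of("\u03a9"))  # Greek capital omega, outside every listed block
```

A linear scan is enough for seventeen ranges; a sorted list with binary search would be the usual choice for the full set of Unicode blocks.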

In addition, a number of Latin-like characters are encoded in the Currency Symbols, Control Pictures, CJK Compatibility, Enclosed Alphanumerics, Enclosed CJK Letters and Months, Mathematical Alphanumeric Symbols, and Enclosed Alphanumeric Supplement blocks. Although these characters look like Latin letters, their script property is Common, so they do not belong to the Latin script in Unicode terms. Lisu also consists almost entirely of Latin letterforms but has its own script property.
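
The distinction between appearance and script property can be sketched in Python. The standard library does not expose the script property, so the example below parses a few lines laid out in the style of the Unicode Character Database file Scripts.txt. The `SAMPLE` text is a hand-written excerpt for illustration, not a verbatim copy of that file, and `parse_scripts` and `script_of` are hypothetical helper names.

```python
# Hand-written excerpt in the layout of Scripts.txt (illustrative, not verbatim).
SAMPLE = """\
0041..005A    ; Latin   # LATIN CAPITAL LETTER A..Z
0061..007A    ; Latin   # LATIN SMALL LETTER A..Z
24B6..24E9    ; Common  # CIRCLED LATIN CAPITAL LETTER A..SMALL LETTER Z
A4D0..A4F7    ; Lisu    # LISU LETTER BA..LISU LETTER OU
"""


def parse_scripts(text):
    """Parse 'XXXX..YYYY ; Script' or 'XXXX ; Script' lines into tuples."""
    ranges = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        span, script = (part.strip() for part in line.split(";"))
        first, _, last = span.partition("..")
        ranges.append((int(first, 16), int(last or first, 16), script))
    return ranges


def script_of(ch, ranges):
    """Return the script for ch, or None if the excerpt does not cover it."""
    cp = ord(ch)
    for first, last, script in ranges:
        if first <= cp <= last:
            return script
    return None


RANGES = parse_scripts(SAMPLE)
for ch in ("A", "\u24b6", "\ua4d0"):
    print(f"U+{ord(ch):04X}", script_of(ch, RANGES))
```

Here U+24B6 (a circled capital A) resolves to Common and U+A4D0 (a Lisu letter shaped like a Latin B) resolves to Lisu, even though both resemble Latin letters.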

The extended ranges mainly contain precomposed letters with diacritics, which may be equivalently encoded with combining diacritics, as well as some ligatures. They include characters used in the orthographies of various African languages (including click symbols in Latin Extended-B) and in the Vietnamese alphabet (Latin Extended Additional). Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista).[1]
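
The equivalence between a precomposed letter and a base letter followed by combining diacritics can be checked with Python's standard `unicodedata` module. This is a small sketch using one Vietnamese letter from Latin Extended Additional as the example; `code_points` is an illustrative helper name.

```python
import unicodedata


def code_points(text):
    """Return the code points of text in U+XXXX notation."""
    return [f"U+{ord(ch):04X}" for ch in text]


precomposed = "\u1ebf"  # LATIN SMALL LETTER E WITH CIRCUMFLEX AND ACUTE
# NFD splits the letter into a base letter plus combining marks.
decomposed = unicodedata.normalize("NFD", precomposed)

print(code_points(precomposed))
print(code_points(decomposed))
# NFC recombines the sequence into the single precomposed code point.
print(unicodedata.normalize("NFC", decomposed) == precomposed)
```

The decomposed form is the base letter e (U+0065) followed by the combining circumflex (U+0302) and the combining acute accent (U+0301), and normalizing back to NFC yields the original single code point.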

Table of characters

In this table those characters with the Unicode script property of Latin are highlighted in colour, indicating the version of Unicode they were introduced in. Reserved code points (which may be assigned as characters at a future date) have a grey background. All characters that do not belong to the Latin script have a white background (and the version of Unicode they were introduced in is therefore not indicated).

Legend: Unicode version
Unicode 1.0    Unicode 5.0
Unicode 1.1    Unicode 5.1
Unicode 2.0    Unicode 5.2
Unicode 2.1    Unicode 6.0
Unicode 3.0    Unicode 6.1
Unicode 3.1    Unicode 7.0
Unicode 3.2    Unicode 8.0
Unicode 4.0    Unicode 9.0
Unicode 4.1    Unicode 11.0
               Unicode 12.0
Not Latin script    Reserved
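U+    0 1 2 3 4 5 6 7 8 9 A B C D E F    Block    #
0040  @ A B C D E F G H I J K L M N O    C0 Controls and Basic Latin, 0000–007F (identical to ASCII)    52
0050  P Q R S T U V W X Y Z [ \ ] ^ _
0060  ` a b c d e f g h i j k l m n o
0070  p q r s t u v w x y z { | } ~ DEL
00A0  NBSP ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ SHY ® ¯    C1 Controls and Latin-1 Supplement, 0080–00FF (identical to ISO/IEC 8859-1)    64
00B0  ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
00C0  À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
00D0  Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
00E0  à á â ã ä å æ ç è é ê ë ì í î ï
00F0  ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
0100  Ā ā Ă ă Ą ą Ć ć Ĉ ĉ Ċ ċ Č č Ď ď    Latin Extended-A, 0100–017F    128
0110  Đ đ Ē ē Ĕ ĕ Ė ė Ę ę Ě ě Ĝ ĝ Ğ ğ
0120  Ġ ġ Ģ ģ Ĥ ĥ Ħ ħ Ĩ ĩ Ī ī Ĭ ĭ Į į
0130  İ ı Ĳ ĳ Ĵ ĵ Ķ ķ ĸ Ĺ ĺ Ļ ļ Ľ ľ Ŀ
0140  ŀ Ł ł Ń ń Ņ ņ Ň ň ŉ Ŋ ŋ Ō ō Ŏ ŏ
0150  Ő ő Œ œ Ŕ ŕ Ŗ ŗ Ř ř Ś ś Ŝ ŝ Ş ş
0160  Š š Ţ ţ Ť ť Ŧ ŧ Ũ ũ Ū ū Ŭ ŭ Ů ů
0170  Ű ű Ų ų Ŵ ŵ Ŷ ŷ Ÿ Ź ź Ż ż Ž ž ſ
0180  ƀ Ɓ Ƃ ƃ Ƅ ƅ Ɔ Ƈ ƈ Ɖ Ɗ Ƌ ƌ ƍ Ǝ Ə    Latin Extended-B, 0180–024F    208
0190  Ɛ Ƒ ƒ Ɠ Ɣ ƕ Ɩ Ɨ Ƙ ƙ ƚ ƛ Ɯ Ɲ ƞ Ɵ
01A0  Ơ ơ Ƣ ƣ Ƥ ƥ Ʀ Ƨ ƨ Ʃ ƪ ƫ Ƭ ƭ Ʈ Ư
01B0  ư Ʊ Ʋ Ƴ ƴ Ƶ ƶ Ʒ Ƹ ƹ ƺ ƻ Ƽ ƽ ƾ ƿ
01C0  ǀ ǁ ǂ ǃ Ǆ ǅ ǆ Ǉ ǈ ǉ Ǌ ǋ ǌ Ǎ ǎ Ǐ
01D0  ǐ Ǒ ǒ Ǔ ǔ Ǖ ǖ Ǘ ǘ Ǚ ǚ Ǜ ǜ ǝ Ǟ ǟ
01E0  Ǡ ǡ Ǣ ǣ Ǥ ǥ Ǧ ǧ Ǩ ǩ Ǫ ǫ Ǭ ǭ Ǯ ǯ
01F0  ǰ Ǳ ǲ ǳ Ǵ ǵ Ƕ Ƿ Ǹ ǹ Ǻ ǻ Ǽ ǽ Ǿ ǿ
0200  Ȁ ȁ Ȃ ȃ Ȅ ȅ Ȇ ȇ Ȉ ȉ Ȋ ȋ Ȍ ȍ Ȏ ȏ
0210  Ȑ ȑ Ȓ ȓ Ȕ ȕ Ȗ ȗ Ș ș Ț ț Ȝ ȝ Ȟ ȟ
0220  Ƞ ȡ Ȣ ȣ Ȥ ȥ Ȧ ȧ Ȩ ȩ Ȫ ȫ Ȭ ȭ Ȯ ȯ
0230  Ȱ ȱ Ȳ ȳ ȴ ȵ ȶ ȷ ȸ ȹ Ⱥ Ȼ ȼ Ƚ Ⱦ ȿ
0240  ɀ Ɂ ɂ Ƀ Ʉ Ʌ Ɇ ɇ Ɉ ɉ Ɋ ɋ Ɍ ɍ Ɏ ɏ
0250  ɐ ɑ ɒ ɓ ɔ ɕ ɖ ɗ ɘ ə ɚ ɛ ɜ ɝ ɞ ɟ    IPA Extensions, 0250–02AF    96
0260  ɠ ɡ ɢ ɣ ɤ ɥ ɦ ɧ ɨ ɩ ɪ ɫ ɬ ɭ ɮ ɯ
0270  ɰ ɱ ɲ ɳ ɴ ɵ ɶ ɷ ɸ ɹ ɺ ɻ ɼ ɽ ɾ ɿ
0280  ʀ ʁ ʂ ʃ ʄ ʅ ʆ ʇ ʈ ʉ ʊ ʋ ʌ ʍ ʎ ʏ
0290  ʐ ʑ ʒ ʓ ʔ ʕ ʖ ʗ ʘ ʙ ʚ ʛ ʜ ʝ ʞ ʟ
02A0  ʠ ʡ ʢ ʣ ʤ ʥ ʦ ʧ ʨ ʩ ʪ ʫ ʬ ʭ ʮ ʯ
02B0  ʰ ʱ ʲ ʳ ʴ ʵ ʶ ʷ ʸ ʹ ʺ ʻ ʼ ʽ ʾ ʿ    Spacing Modifier Letters, 02B0–02FF    14
02E0  ˠ ˡ ˢ ˣ ˤ ˥ ˦ ˧ ˨ ˩ ˪ ˫ ˬ ˭ ˮ ˯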
1D00    Phonetic Extensions, 1D00–1D7F    111
1D10
1D20
1D30ᴿ
1D40
1D50
1D60
1D70ᵿ
1D80    Phonetic Extensions Supplement, 1D80–1DBF    63
1D90
1DA0
1DB0ᶿ
1E00    Latin Extended Additional, 1E00–1EFF    256
1E10
1E20
1E30ḿ
1E40
1E50
1E60
1E70ṿ
1E80
1E90
1EA0
1EB0ế
1EC0
1ED0
1EE0
1EF0ỿ
2070    Superscripts and Subscripts, 2070–209F    15
2090  
2120    Ω    Letterlike Symbols, 2100–214F    4
2130
2140
2160    Number Forms, 2150–218F    41
2170
2180      
2C60    Latin Extended-C, 2C60–2C7F    32
2C70Ɀ
A720    Latin Extended-D, A720–A7FF    169
A730
A740
A750
A760
A770
A780
A790
A7A0
A7B0
A7C0           
A7F0       
AB30    ꬿ    Latin Extended-E, AB30–AB6F    54
AB40
AB50
AB60        
FB00    Alphabetic Presentation Forms, FB00–FB4F    7
FF20    Halfwidth and Fullwidth Forms (fullwidth Latin letters), FF00–FFEF    52
FF30_
FF40
FF50
Total characters: 1,366
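
The total can be cross-checked against the per-block counts in the table. The short Python sketch below simply sums them; the dictionary name `LATIN_COUNTS` is illustrative, and the figures are the ones given in the table for Unicode 12.0.

```python
# Number of Latin-script characters per block, as listed in the table.
LATIN_COUNTS = {
    "C0 Controls and Basic Latin": 52,
    "C1 Controls and Latin-1 Supplement": 64,
    "Latin Extended-A": 128,
    "Latin Extended-B": 208,
    "IPA Extensions": 96,
    "Spacing Modifier Letters": 14,
    "Phonetic Extensions": 111,
    "Phonetic Extensions Supplement": 63,
    "Latin Extended Additional": 256,
    "Superscripts and Subscripts": 15,
    "Letterlike Symbols": 4,
    "Number Forms": 41,
    "Latin Extended-C": 32,
    "Latin Extended-D": 169,
    "Latin Extended-E": 54,
    "Alphabetic Presentation Forms": 7,
    "Halfwidth and Fullwidth Forms": 52,
}

total = sum(LATIN_COUNTS.values())
print(total)  # 1366, matching the stated total
```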

See also

  • Universal Character Set characters
  • Letterlike Symbols (Unicode block)
  • List of Latin-script letters

References

1. Everson, Michael; Dicklberger, Alois; Pentzlin, Karl; Wandl-Vogt, Eveline (2011-06-02). "Revised proposal to encode "Teuthonista" phonetic characters in the UCS". https://www.unicode.org/L2/L2011/11202-n4081-teuthonista.pdf