请输入您要查询的百科知识:

 

词条 Phonetic symbols in Unicode
释义

  1. Phonetic scripts

  2. From IPA to Unicode

     Consonants   Vowels    Diacritics  

  3. Unicode blocks

  4. From Unicode blocks to scripts

     IPA Extensions (U+0250–02AF)  Spacing Modifier Letters (U+02B0–02FF)  Phonetic Extensions (U+1D00–1D7F)  Phonetic Extensions Supplement (U+1D80–1DBF)  Modifier Tone Letters (U+A700–A71F)  Superscripts and Subscripts (U+2070–209F) 

  5. Fonts support for IPA

  6. Input by selection from a screen

  7. See also

  8. References

  9. External links

{{Refimprove|date=January 2016}}{{ SpecialChars
| special = phonetic symbols
| fix = Help:Special_characters
| characters = phonetic symbols
}}

Unicode supports several phonetic scripts and notations through the existing writing systems and the addition of extra blocks with phonetic characters. These phonetic extras are derived of an existing script, usually Latin, Greek or Cyrillic. In Unicode there is no "IPA script". Apart from IPA, extensions to the IPA and obsolete and nonstandard IPA symbols, these blocks also contain characters from the Uralic Phonetic Alphabet and the Americanist Phonetic Alphabet.

Phonetic scripts

The International Phonetic Alphabet (IPA) makes use of letters from other writing systems as most phonetic scripts do. IPA notably uses Latin, Greek and Cyrillic characters. Combining diacritics also adds meaning to the phonetic text. Finally, these phonetic alphabets make use of modifier letters, that are specially constructed for the phonetic meaning. A "modifier letter" is strictly intended not as an independent grapheme but as a modification of the preceding character[1] resulting in a distinct grapheme, notably in the context of the International Phonetic Alphabet. For example, ʰ should not occur on its own but modifies the preceding or following symbol. Thus, {{IPA|tʰ}} is a single IPA symbol, distinct from {{IPA|t}}. In practice, however, several of these "modifier letters" are also used as full graphemes, e.g. ʿ as transliterating Semitic ayin or Hawaiian okina, or ˚ transliterating Abkhaz ә.

From IPA to Unicode

{{Main|International Phonetic Alphabet}}

Consonants

The following tables indicates the Unicode code point sequences for phonemes as used in the International Phonetic Alphabet. A bold code point indicates that the Unicode chart provides an application note such as "voiced retroflex lateral" for {{unichar|026D|LATIN SMALL LETTER L WITH RETROFLEX HOOK|html=|size=100%}}. An entry in bold italics indicates the character name itself refers to a phoneme such as {{unichar|0298|LATIN LETTER BILABIAL CLICK|html=|size=100%}}

BilabialLabiodentalDentalAlveolarPostalveolarRetroflexLabialized palatalPostalveolar-velar
Plosivep}} 0070b}} 0062p̪}} 0070 032Ab̪}} 0062 032At̪}} 0074 032Ad̪}} 0064 032At}}
0074
d}} 0064ʈ}} 0288ɖ}} 0256
Implosiveɓ̥}} 0253 0325ɓ}} 0253ɗ̪}} 0257 032Aɗ}} 0257 *
Ejectivepʼ}} 0070 02BCt̪ʼ}} 0074 032A 02BCtʼ}} 0074 02BCʈʼ}} 0288 02BC
Nasalm̥}} 006D 0325m}} 006Dɱ̊}} 0271 030Aɱ}} 0271n̪̊}} 006E 032A 030An̪}} 006E 032An̥}} 006E 0325n}} 006Eɳ̊}} 0273 030Aɳ}} 0273
Trillʙ}} 0299r̥}} 0072 0325r}} 0072 *
Tap or Flapⱱ̟}} 2C71 031Fⱱ}} 2C71ɾ}} 027Eɽ}} 027D
Lateral flapɺ}} 027A *
Fricativeɸ}} 0278β}} 03B2f}}
0066
v}} 0076θ}} 03B8ð}} 00F0s}} 0073z}} 007Aʃ}} 0283ʒ}} 0292ʂ}} 0282ʐ}} 0290ɧ}} 0267
Lateral fricativeɬ}} 026Cɮ}} 026Eꞎ}} A78E
Ejective fricativesʼ}} 0073 02BCʃʼ}} 0283 02BC
Ejective lateral fricativeɬʼ}} 026C 02BC
Percussive{{IPA|ʬ}}
02AC
{{IPA|ʭ}}
02AD
Approximantβ̞̊ }} 03B2 031E 030Aβ̞}} 03B2 031Eʋ̥}} 028B 0325ʋ}} 028Bð̞}} 00F0 031Eɹ̥}} 0279 0325ɹ}} 0279ɻ̊}} 027B 030Aɻ}} 027Bɥ̊}} 0265 030Aɥ}} 0265
Lateral approximantl̥}} 006C 0325l}} 006Cɭ}} 026D
Click consonant{{IPA link|ʘ}}
0298
{{IPA link|ǀ}}
01C0
{{IPA link|ǃ}}
01C3
{{IPA link|ǃ}} / {{IPA link|ǂ}}
01C3 / 01C2
Lateral click*{{IPA link|ǁ}}
01C1
Alveolo-palatalPalatalLabial-velarVelarUvularPharyngealEpiglottalGlottal
Plosiveȶ}} 0236ȡ}} 0221c}} 0063ɟ}} 025Fk͡p}} 006B 0361 0070ɡ͡b}} 0261 0361 0062k}} 006Bɡ}} 0261q}} 0071ɢ}} 0262ʡ}} 02A1ʔ}} 0294
Implosiveʄ}} 0284ɠ}} 0260ʛ}} 029B
Ejectivecʼ}} 0063 02BCkʼ}} 006B 02BCqʼ}} 0071 02BC
Nasalȵ}} 0235ɲ}} 0272ŋ͡m}} 014B 0361 006Dŋ}} 014Bɴ}} 0274
Trillʀ}} 0280 *
Tap or Flap *
Lateral flap * *
Fricativeɕ}} 0255ʑ}} 0291ç}} 0063 0327ʝ}} 029Dx}} 0078ɣ}} 0263χ}} 03C7ʁ}} 0281ħ}} 0127ʕ}} 0295ʜ}} 029Cʢ}} 02A2h}} 0068ɦ}} 0266
Approximantj}} 006Aʍ}} 028Dw}} 0077ɰ}} 0270
Lateral approximantȴ}} 0234ʎ}} 028Eʟ}} 029F

Vowels

The following figures depict the phonetic vowels and their Unicode / UCS code points. Vowels appearing in pairs in the figure to the right indicate rounded and unrounded variations respectively. Again, characters with Unicode names referring to phonemes are indicated by bold text. Those with explicit application notes are indicated by bold italic text. Those from borrowed unchanged from another script (Latin, Greek or Cyrillic) are indicated by italics.

Unicode code points for phonetic vowels
This table represents the phonetic vowel trapezium
Before and after a bullet are the unrounded{{·}}rounded vowels
Close{{IPA link|i}}{{·}}{{IPA link|y}}
0069 0079
{{IPA link|ɨ}}{{·}}{{IPA link|ʉ}}
0268 0289
{{IPA link|ɯ}}{{·}}{{IPA link|u}}
026F 0075
Near-close{{IPA link|ɪ}}{{·}}{{IPA link|ʏ}}
026A 028F
{{IPA link|ɪ̈ }}{{·}}{{IPA link|ʊ̈ }}
026A 0308{{·}}028A 0308
{{IPA|{{·}}ʊ}}
028A
Close-mid{{IPA link|e}}{{·}}{{IPA link|ø}}
0065 00F8
{{IPA link|ɘ}}{{·}}{{IPA link|ɵ}}
0258 0275
{{IPA link|ɤ}}{{·}}{{IPA link|o}}
0264 006F
Mid{{IPA link|ə}}
0259
Open-mid{{IPA link|ɛ}}{{·}}{{IPA link|œ}}
025B 0153
{{IPA link|ɜ}}{{·}}{{IPA link|ɞ}}
025C 025E
{{IPA link|ʌ}}{{·}}{{IPA link|ɔ}}
028C 0254
Near-open{{IPA link|æ}}{{·}}
00E6
{{IPA link|ɐ}}
0250
Open{{IPA link|a}}{{·}}{{IPA link|ɶ}}
0061 0276
{{IPA link|ɑ}}{{·}}{{IPA link|ɒ}}
0251 0252
Vowel length marker{{IPA link|ː}}
02D0

Diacritics

DiacriticFunctionHexDiacriticFunctionHexDiacriticFunctionHex
ModifierCombiningModifierCombiningModifierCombining
˳Voiceless0x02F30x0325̤Breathy Voiced0x0324͏̪Dental0x032A
ˬVoiced0x02EC0x032C˷Creaky Voiced0x02F70x0330˽Apical0x02FD0x033A
ʰAspirated0x02B0͏̼Linguolabial0x033C͏̻Laminal0x033B
̹More Rounded0x0339ʷLabialized0x02B7̃Nasalized0x0303
͏̜Less Rounded0x031CʲPalatalized0x02B2Nasal release0x207F
˖Advanced0x02D60x031FˠVelarized0x02E0ˡLateral release0x02E1
ˍRetracted0x02CD0x320ˤPharyngealized0x02E4˺No audible release0x02FA0x031A
̈Centralized0x0308̴Velarized or Pharyngealized0x0334
˟Mid-Centralized0x02DF0x033D˔Raised0x02D40x031D
ˌSyllabic0x02CC0x0329˕Lowered0x02D50x031E
͏̯Non-syllabic0x032F͏̘Advanced Tongue Root0x0318
˞Rhoticity0x02DE͏̙Retracted Tongue Root0x0319

Unicode blocks

  • Basic Latin (0020–007E), IPA example: Open front unrounded vowel (0061)
  • Latin-1 Supplement (00A0–00FF), IPA example: Near-open front unrounded vowel (00E6)
  • Latin Extended-A (0100–017F), IPA example: Voiceless pharyngeal fricative (0127)
  • Latin Extended-B (0180–024F), IPA example: Tenuis dental click (01C0 0287)
  • IPA Extensions (0250–02AF), IPA example: Near-open central vowel (0250)
  • Spacing Modifier Letters (02B0–02FF), IPA example: Palatal ejective (0063 02BC)
  • Combining Diacritical Marks (0300–036F), IPA example: Near-close central unrounded vowel (026A 0308)
  • Greek and Coptic (0370–03FF), IPA example: Voiced bilabial fricative (03B2)
  • Combining Diacritical Marks Supplement (1DC0–1DFF), IPA example: Rising-falling contour tone (1DC8)
  • General Punctuation (2000–206F), IPA example: Linking (absence of a break) (203F)
  • Superscripts and Subscripts (2070–209F), IPA example: Nasal release (207F)
  • Arrows (2190–21FF), IPA example: Global rise (2197)
  • Latin Extended-C (2C60–2C7F), IPA example: Labiodental flap (2C71)
  • Modifier Tone Letters (A700–A71F), IPA example: Upstep (A71B)
  • Phonetic Extensions (1D00–1D7F)
  • Phonetic Extensions Supplement (1D80–1DBF)

From Unicode blocks to scripts

Phonetical scripts are encoded in six Unicode blocks.

IPA Extensions (U+0250–02AF)

{{Distinguish|Extensions to the IPA}}{{Main|IPA Extensions (Unicode block)}}{{Unicode chart IPA Extensions}}

Spacing Modifier Letters (U+02B0–02FF)

The characters in the "Spacing Modifier Letters" block are intended as forming a unity with the preceding letter (which they "modify"). E.g. the character {{unichar|02B0|MODIFIER LETTER SMALL H}} isn't intended simply as a superscript h (h), but as the mark of aspiration placed after the letter being aspirated, as in {{IPA|pʰ}} "aspirated voiceless bilabial plosive". The block contains:

  • Latin superscript modifier letters: (U+02B0–U+02B8): ʰ aspiration; ʱ breathy voice, murmured; ʲ palatalization; ʳ, ʴ, ʵ, ʶ r-coloring or r-offglides; ʷ labialization; ʸ palatalization, Americanist usage for U+02B2
  • Miscellaneous phonetic modifiers: (U+02B9–U+02D7): ʹ ʺ ʻ ʼ ʽ ʾ ʿ ˀ ˁ ˂ ˃ ˄ ˅ ˆ ˇ ˈ ˉ ˊ ˋ ˌ ˍ ˎ ˏ ː ˑ ˒ ˓ ˔ ˕ ˖ ˗
  • Spacing clones of diacritics: (U+02D8–U+02DD): ˘ breve; ˙ dot above; ˚ ring above; ˛ ogonek; ˜ small tilde; ˝ double acute accent
  • Additions based on 1989 IPA: (U+02DE–U+02E4): ˞ ˟ ˠ ˡ ˢ ˣ ˤ
  • Tone letters: (U+02E5–U+02E9): ˥ ˦ ˧ ˨ ˩
  • Extended Bopomofo tone marks: {{unichar|02EA|modifier letter yin departing tone mark}}; {{unichar|02EB|modifier letter yang departing tone mark}}
  • IPA modifiers: {{unichar|02EC|MODIFIER LETTER VOICING}}, unaspirated
  • Other modifier letters: {{unichar|02EE|modifier letter double apostrophe|nlink=Modifier letter double apostrophe}} for Nenets
  • Uralic Phonetic Alphabet (UPA) modifiers: (U+02EF–U+02FF): ˯ ˰ ˱ ˲ ˳ ˴ ˵ ˶ ˷ ˸ ˹ ˺ ˻ ˼ ˽ ˾ ˿
{{Unicode chart Spacing Modifier Letters}}

Phonetic Extensions (U+1D00–1D7F)

This block, together with Phonetic Extensions Supplement below, contains:

  • Small capitals "ɢ ɪ ɴ ɶ ʀ ʏ ʙ ʜ ʟ"
  • Turned small letters "ɐ ɥ ɯ ɹ ɺ ɻ ʇ ʌ ʍ ʎ ʞ ʮ ʯ"
  • Extra small capitals "ʁ ʛ ᴀ ᴁ ᴃ ᴄ ᴅ ᴆ ᴇ ᴊ ᴋ ᴌ ᴍ ᴎ ᴏ ᴐ ᴘ ᴙ ᴚ ᴛ ᴜ ᴠ ᴡ ᴢ ᴣ ᴦ ᴧ ᴨ ᴩ ᴪ"
  • Letters with palatal hooks "ƫ ᶀ ᶁ ᶂ ᶃ ᶄ ᶅ ᶆ ᶇ ᶈ ᶉ ᶊ ᶋ ᶌ ᶍ ᶎ ᶪ ᶵ"
  • Letters with retroflex hooks "ᶏ ᶐ ᶒ ᶓ ᶔ ᶕ ᶖ ᶗ ᶘ ᶙ ᶚ ᶩ ᶯ ᶼ"
{{Unicode chart Phonetic Extensions}}

Phonetic Extensions Supplement (U+1D80–1DBF)

{{Unicode chart Phonetic Extensions Supplement}}

Modifier Tone Letters (U+A700–A71F)

{{Unicode chart Modifier Tone Letters}}

Superscripts and Subscripts (U+2070–209F)

{{Unicode chart Superscripts and Subscripts}}

Fonts support for IPA

IPA font support is increasing, and is now included in several fonts such as the Times New Roman versions that come with various recent computer operating systems. Diacritics are not always properly rendered, however. IPA fonts that are freely available online include Gentium, several from the SIL (such as Charis SIL, and Doulos SIL), DejaVu Sans, and TITUS Cyberbit, which are all freely available; as well as commercial typefaces such as Brill, available from Brill Publishers, and Lucida Sans Unicode and Arial Unicode MS, shipping with various Microsoft products. These all include several ranges of characters in addition to the IPA. Modern Web browsers generally do not need any configuration to display these symbols, provided that a font capable of doing so is available to the operating system.

Input by selection from a screen

Further Information: Unicode input#Selection from a screen

Many systems provide a way to select Unicode characters visually. ISO/IEC 14755 refers to this as a screen-selection entry method.

Microsoft Windows has provided a Unicode version of the Character Map program (find it by hitting {{keypress|Win|R}} then type charmap then hit {{keypress|Enter}}) since version NT 4.0 – appearing in the consumer edition since XP. This is limited to characters in the Basic Multilingual Plane (BMP). Characters are searchable by Unicode character name, and the table can be limited to a particular code block. More advanced third-party tools of the same type are also available (a notable freeware example is BabelMap).

macOS provides a "character palette" with much the same functionality, along with searching by related characters, glyph tables in a font, etc. It can be [https://web.archive.org/web/20110109234035/http://www.apple.com/pro/techniques/glyphspalette/ enabled] in the input menu in the menu bar under System Preferences → International → Input Menu (or System Preferences → Language and Text → Input Sources) or can be viewed under Edit → Emoji & Symbols in many programs.

Equivalent tools – such as gucharmap (GNOME) or kcharselect (KDE) – exist on most Linux desktop environments.

See also

  • Unicode symbols
  • Universal Character Set characters
  • Latin script in Unicode
  • IPA

References

1. ^{{cite web|url=http://everything2.com/index.pl?node=Spacing%20Modifier%20Letters |title=Spacing modifier letters |website=Everything2.com |date=2002-08-29 |accessdate=2016-01-23}}

External links

  • Links to PDFs of Unicode codes for several phonetic symbol sets
{{Unicode navigation}}{{IPA navigation}}

2 : Phonetics|Unicode blocks

随便看

 

开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。

 

Copyright © 2023 OENC.NET All Rights Reserved
京ICP备2021023879号 更新时间:2024/9/22 4:14:01