请输入您要查询的百科知识:

 

词条 Cedilla
释义

  1. Origin

  2. C

  3. S

  4. Latvian

  5. Marshallese

  6. Other diacritics

  7. French

  8. Romanian

  9. Gagauz

  10. Encodings

  11. References

  12. External links

{{Redirect|Cedille|the record label|Cedille Records}}{{distinguish|Sedilia}}{{Diacritical marks|̧}}{{Letters with cedilla}}

A cedilla ({{IPAc-en|s|ᵻ|ˈ|d|ɪ|l|ə}} {{respell|si|DIL|ə}}; from Spanish), also known as cedilha (from Portuguese) or cédille (from French), is a hook or tail ( ¸ ) added under certain letters as a diacritical mark to modify their pronunciation. In Catalan, French, and Portuguese, it is used only under the c, and the entire letter is called respectively c trencada (i.e. "broken C"), c cédille, and c cedilhado (or c cedilha, colloquially).

Origin

The tail originated in Spain as the bottom half of a miniature cursive z. The word "cedilla" is the diminutive of the Old Spanish name for this letter, ceda (zeta).[1] Modern Spanish and Galician no longer use this diacritic, although it is used in Portuguese,[2] Catalan, Occitan, and French, which gives English the alternative spellings of cedille, from French "cédille", and the Portuguese form cedilha. An obsolete spelling of cedilla is cerilla.[2] The earliest use in English cited by the Oxford English Dictionary[2] is a 1599 Spanish-English dictionary and grammar.[3] Chambers’ Cyclopædia[4] is cited for the printer-trade variant ceceril in use in 1738.[2] The main use in English is not universal and applies to loan words from French and Portuguese such as "façade", "limaçon" and "cachaça" (often typed "facade", "limacon" and "cachaca" because of lack of ç keys on Anglophone keyboards).

With the advent of modernism,{{vague|date=May 2017}} the calligraphic nature of the cedilla was thought somewhat jarring on sans-serif typefaces, and so some designers instead substituted a comma design, which could be made bolder and more compatible with the style of the text.{{efn|Fonts with this design include Akzidenz-Grotesk and Helvetica, especially the Neue Haas Grotesk digitisation.[5][6][7][8][9]}} This can add to confusion as the use of commas as opposed to cedillas varies by language.

C

{{Main|Ç}}

The most frequent character with cedilla is "ç" ("c" with cedilla, as in façade). It was first used for the sound of the voiceless alveolar affricate {{IPA|/ts/}} in old Spanish and stems from the Visigothic form of the letter "z" (ꝣ), whose upper loop was lengthened and reinterpreted as a "c", whereas its lower loop became the diminished appendage, the cedilla.

It represents the "soft" sound {{IPA|/s/}}, the voiceless alveolar sibilant, where a "c" would normally represent the "hard" sound {{IPA|/k/}} (before "a", "o", "u", or at the end of a word) in English and in certain Romance languages such as Catalan, Galician, French (where ç appears in the name of the language itself, français), Ligurian, Occitan, and Portuguese. In Occitan, Friulian and Catalan ç can also be found at the beginning of a word (Çubran, ço) or at the end (braç).

It represents the voiceless postalveolar affricate {{IPA|/tʃ/}} (as in English "church") in Albanian, Azerbaijani, Crimean Tatar, Friulian, Kurdish, Tatar, Turkish (as in çiçek, çam, çekirdek, Çorum), and Turkmen. It is also sometimes used this way in Manx, to distinguish it from the velar fricative.

In the International Phonetic Alphabet, ⟨ç⟩ represents the voiceless palatal fricative.

S

{{Main|Ş}}

The character "ş" represents the voiceless postalveolar fricative {{IPA|/ʃ/}} (as in "show") in several languages, including many belonging to the Turkic languages, and included as a separate letter in their alphabets:

  • Turkish
    • For example, it is used in Turkish words and names like Eskişehir, Şımarık, Hasan Şaş, Rüştü Reçber etc.
  • Azerbaijani
  • Crimean Tatar
  • Gagauz
  • Tatar
  • Turkmen
  • Romanian (substitution use when S-comma [Ș] was missing from pre-3.0 Unicode standards, and older standards, still frequent, but an error)
  • Kurdish
  • In HTML character entity references Ş and ş can be used.

Latvian

Comparatively, some consider the diacritics on the Latvian consonants "ģ", "ķ", "ļ", "ņ", and formerly "ŗ" to be cedillas. Although their Adobe glyph names are commas, their names in the Unicode Standard are "g", "k", "l", "n", and "r" with a cedilla. The letters were introduced to the Unicode standard before 1992, and their names cannot be altered. The uppercase equivalent "Ģ" sometimes has a regular cedilla.

Marshallese

Four letters in Marshallese have cedillas: <{{lang|mh|ļ}} {{lang|mh|m̧}} {{lang|mh|ņ}} {{lang|mh|o̧}}>. In standard printed text they are always cedillas, and their omission or the substitution of comma below and dot below diacritics are nonstandard.{{citation needed|reason=Several documents use a detached cedilla, the Marshallese language commission recommended a (non specific) diacritic below. The Marshallese Language Orthography (Standard Spelling) Act of 2010 applies the rules of the MED (1979), but it is not clear what was meant by cedilla, if there is only one correct shape for the cedilla or if the cedilla can have different shapes as it the case in many languages where it can be both detached or attached depending on the font style.|date=June 2013}}

{{As of|2011}}, many font rendering engines do not display any of these properly, for two reasons:
  • "{{lang|mh|ļ}}" and "{{lang|mh|ņ}}" usually do not display properly at all, because of the use of the cedilla in Latvian. Unicode has precombined glyphs for these letters, but most quality fonts display them with comma below diacritics to accommodate the expectations of Latvian orthography. This is considered nonstandard in Marshallese. The use of a zero-width non-joiner between the letter and the diacritic can alleviate this problem: "{{lang|mh|l‌̧}}" and "{{lang|mh|n‌̧}}" may display properly, but may not; see below.
  • "{{lang|mh|m̧}}" and "{{lang|mh|o̧}}" do not currently exist in Unicode as precombined glyphs, and must be encoded as the plain Latin letters "{{lang|mh|m}}" and "{{lang|mh|o}}" with the combining cedilla diacritic. Most Unicode fonts issued with Windows do not display combining diacritics properly, showing them too far to the right of the letter, as with Tahoma ("" and "") and Times New Roman ("" and ""). This mostly affects "{{lang|mh|m̧}}", and may or may not affect "{{lang|mh|o̧}}". But some common Unicode fonts like Arial Unicode MS ("" and ""), Cambria ("" and "") and Lucida Sans Unicode ("" and "") do not have this problem. When "{{lang|mh|m̧}}" is properly displayed, the cedilla is either underneath the center of the letter, or is underneath the right-most leg of the letter, but is always directly underneath the letter wherever it is positioned.

Because of these font display issues, it is not uncommon to find nonstandard ad hoc substitutes for these letters. The online version of the Marshallese-English Dictionary (the only complete Marshallese dictionary in existence) displays the letters with dot below diacritics, all of which do exist as precombined glyphs in Unicode: "{{lang|mh|ḷ}}", "{{lang|mh|ṃ}}", "{{lang|mh|ṇ}}" and "{{lang|mh|ọ}}". The first three exist in the International Alphabet of Sanskrit Transliteration, and "{{lang|mh|ọ}}" exists in the Vietnamese alphabet, and both of these systems are supported by the most recent versions of common fonts like Arial, Courier New, Tahoma and Times New Roman. This sidesteps most of the Marshallese text display issues associated with the cedilla, but is still inappropriate for polished standard text.

Other diacritics

Languages such as Romanian add a comma (virgula) to some letters, such as ș, which looks like a cedilla, but is more precisely a diacritical comma. This is particularly confusing with letters which can take either diacritic: for example, the consonant {{IPA|/ʃ/}} is written as "ş" in Turkish but "ș" in Romanian, and Romanian writers will sometimes use the former instead of the latter because of insufficient font or character-set support.

The Polish letters "ą" and "ę" and Lithuanian letters "ą", "ę", "į", and "ų" are not made with the cedilla either, but with the unrelated ogonek diacritic.

French

In 1868, Ambroise Firmin-Didot suggested in his book Observations sur l'orthographe, ou ortografie, française (Observations on French Spelling) that French phonetics could be better regularized by adding a cedilla beneath the letter "t" in some words. For example, the suffix -tion this letter is usually not pronounced as (or close to) {{IPA|/t/}} in either French or English, but respectively as {{IPA|/sjɔ̃/}} and {{IPA|/ʃən/}}. It has to be distinctly learned that in words such as French diplomatie (but not diplomatique) and English action it is pronounced {{IPA|/s/}} and {{IPA|/ʃ/}}, respectively (but not in active in either language). A similar effect occurs with other prefixes or within words also in French and English, such as partial where t is pronounced {{IPA|/s/}} and {{IPA|/ʃ/}} respectively. Firmin-Didot surmised that a new character could be added to French orthography. A similar letter, the t-comma, does exist in Romanian, but it has a comma accent, not a cedilla one.

Romanian

The Unicode characters for Ţ (T with cedilla) and Ş (S with cedilla) were wrongly implemented in Windows Romanian. In Windows 7, Microsoft corrected the error by replacing T-cedilla with T-comma (Ț) and S-cedilla with S-comma (Ș).

Gagauz

Gagauz uses Ţ (T with cedilla), one of the few languages to do so, and Ş (S with cedilla). Besides being present in some Gagauz orthographies, T with Cedilla exists as part of the General Alphabet of Cameroon Languages, in the Kabyle dialect of the Berber language, and possibly elsewhere.

Encodings

Unicode provides precomposed characters for some Latin letters with cedillas. Others can be formed using the cedilla combining character.

Unicode and HTML Codes for Cedillas
Description Letter Unicode HTML
Cedilla (spacing) ¸ U+00B8 &cedil; or &#184;
Combining cedilla ◌̧ U+0327 &#807;
C with cedilla Ç
ç
U+00C7
U+00E7
&Ccedil; or &#199;
&ccedil; or &#231;
C with cedilla and acute accent
U+1E08
U+1E09
&#7688;
&#7689;
Combining small c with cedilla
(medieval superscript diacritic)[10]
◌ᷗ U+1DD7 &#7639;
D with cedilla
U+1E10
U+1E11
&#7696;
&#7697;
E with cedilla Ȩ
ȩ
U+0228
U+0229
&#552;
&#553;
E with cedilla and breve
U+1E1C
U+1E1D
&#7708;
&#7709;
G with cedilla Ģ
ģ
U+0122
U+0123
&#290;
&#291;
H with cedilla
U+1E28
U+1E29
&#7720;
&#7721;
K with cedilla Ķ
ķ
U+0136
U+0137
&#310;
&#311;
L with cedilla Ļ
ļ
U+013B
U+013C
&#315;
&#316;
N with cedilla Ņ
ņ
U+0145
U+0146
&#325;
&#326;
R with cedilla Ŗ
ŗ
U+0156
U+0157
&#342;
&#343;
S with cedilla Ş
ş
U+015E
U+015F
&#350;
&#351;
T with cedilla Ţ
ţ
U+0162
U+0163
&#354;
&#355;

References

1. ^For cedilla being the diminutive of ceda, see definition of cedilla, Diccionario de la lengua española, 22nd edition, Real Academia Española {{es icon}}, which can be seen in context by accessing the site of the Real Academia and searching for cedilla. (This was accessed 27 July 2006.)
2. ^{{OED|cedilla}}
3. ^Minsheu, John (1599) Percyvall's (R.) Dictionarie in Spanish and English (as enlarged by J. Minsheu) Edm. Bollifant, London, {{OCLC|3497853}}
4. ^Chambers, Ephraim (1738) Cyclopædia; or, an universal dictionary of arts and sciences (2nd ed.) {{OCLC|221356381}}
5. ^{{cite web|last1=Jacquerye|first1=Denis Moyogo|title=Cedilla Comma|url=https://www.unicode.org/L2/L2013/13155r-cedilla-comma.pdf|publisher=Unicode Consortium|accessdate=3 July 2015}}
6. ^{{cite web|title=Neue Haas Grotesk|url=http://www.fontbureau.com/nhg/|publisher=The Font Bureau, Inc.|page=Introduction}}
7. ^{{cite web|url=http://www.linotype.com/6598/neuehaasgrotesk.html |title=Neue Haas Grotesk - Font News |publisher=Linotype.com |date= |accessdate=2013-09-21}}
8. ^{{cite web|url=http://www.christianschwartz.com/haasgrotesk.shtml |title=Schwartzco Inc |publisher=Christianschwartz.com |date= |accessdate=2013-09-21}}
9. ^{{cite web|title=Akzidenz Grotesk Buch|url=http://www.fonts.com/font/berthold/akzidenz-grotesk-bq/collection-volume|publisher=Berthold/Monotype|accessdate=3 July 2015|deadurl=yes|archiveurl=https://web.archive.org/web/20150704103208/http://www.fonts.com/font/berthold/akzidenz-grotesk-bq/collection-volume|archivedate=4 July 2015|df=}}
10. ^{{cite web | url=http://std.dkuug.dk/JTC1/SC2/WG2/docs/n3027.pdf | title=N3027: Proposal to add medievalist characters to the UCS | publisher=ISO/IEC JTC1/SC2/WG2 | date=2006-01-30 }}
{{notelist}}

External links

  • ScriptSource — Positioning the traditional cedilla
  • Diacritics Project—All you need to design a font with correct accents
  • Keyboard Help—Learn how to make world language accent marks and other diacriticals on a computer
{{Latin script||cedilla}}

2 : Latin-script diacritics|Turkish language

随便看

 

开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。

 

Copyright © 2023 OENC.NET All Rights Reserved
京ICP备2021023879号 更新时间:2024/11/11 21:57:57