词条 | Chinese character description language | |||||||||||||||||||||||||||||||||||||||
释义 |
CDLChinese Character Description Language is a font technology, based on XML, co-created by Tom Bishop and Richard Cook for the Wenlin Institute , designed for describing any CJK character, but suitable for describing any glyph. This XML-based declarative language actually defines the stroke order of each component (a subunit of the glyph similar to a radical, but not necessarily bearing the semantic significance of a true radical), as well as assembly of previously defined components to build up ever more complex characters. Many of these components are characters in their own right, in addition to serving as building-block components. The background looks like a square of 128 pixels on each side. In this background:
Accordingly, a set of less than 50 strokes[1] allow one to construct a set of about 1,000 components[2] which may in turn be embedded within tens of thousands of characters' descriptions[2]. A change in the shape of one of the 50 basic strokes is implicitly applied within each character that embeds that stroke. Likewise, a change to a component is implicitly applied within each and all characters whose assemblage uses that component[2]. T. Bishop and R. Cook explain this as follows: {{quote|The stroke count of one character is generally related to the stroke counts of other characters. Most characters are built from components, and as long as the stroke counts of those components are defined, there is rarely any difficulty in adding them together to obtain the combined stroke count. Therefore, if a standard defines the strokes of a few thousand characters, it implicitly defines the strokes of many thousands of additional characters.[3]}}As of spring 2003, over 50,000 Chinese characters had been described via CDL. As of 26 February 2013, 86,416 Chinese characters had been described via CDL.[4] HanGlyphA character description language intended for supplying missing rare characters in documents (addressing the Chinese equivalent of the gaiji problem).[5] Documents can contain markup for missing characters, which will automatically trigger the generation of small fonts to provide the characters. The language itself is a simple postfix notation describing strokes and ways to combine them. The prototype software uses Metapost to render the characters and embed them in LaTeX documents. The language was presented by Wai Wong in 1997,[6] and papers about its implementation in Metapost and LaTeX appeared at TeX user group conferences in 2003.[7][8] Ideographic Description Sequences{{main|Ideographic Description Characters (Unicode block)}}Chapter 12 of the Unicode specification[9] defines a syntax for "Ideographic Description Sequences" (IDSes) intended for use in describing characters not included in the standard in terms of combinations of components that do have code points. Twelve special characters in the range U+2FF0 to U+2FFB act as prefix operators to combine other characters or sequences to form larger characters.
For example, the Sawndip character "" (encoded in CJK Unified Ideographs Extension F as U+2DA21 𭨡) can be described as "⿰書史". These sequences differ from some other character description languages in that they do not include detailed information about the locations and shapes of strokes. They do not, by themselves, provide enough information for an actual rendering of a character being described. However, these sequences are useful in describing to the reader a character that is not directly printable, either because it is absent in a given font, or is absent from the Unicode standard altogether. These sequences may incidentally be useful for dictionary lookup purposes, as a sort of rough input method for queries. Unicode's specification for these sequences is based on the characters and syntax of the earlier GBK standard. The IDSgrep free software package by Matthew Skala[10][11] extends Unicode's IDS syntax to include additional features for dictionary lookup; it is capable of converting KanjiVG's database to its own extended IDS format, or of searching EIDS files generated by the related Tsukurimashou font family. KanjiVG{{Expand section|date=December 2009}}KanjiVG is a free (CC-by-sa-3.0) Japanese character description language (intended to eventually expand to Chinese as well) based on SVG and a wiki system of edition. SCMLIn 2007, Structural Character Modeling Language was proposed as a different kind of XML-based Chinese-character description language whose positioning is not based on a numerical grid, as CDL and HanGlyph are. The known database of characters whose strokes and components are encoded in SCML is for demonstration-of-principle only; no known effort exists to attempt to encode, say, all of Unicode's CJK characters in SCML. See also
Notes1. ^Bishop & Cook 2013-12-31:p2 2. ^1 2 Bishop & Cook 2013-12-31:p9 3. ^{{Harvnb|Bishop, Tom|Cook, Richard|2003-10-31|pp=8–9, point n⁰12}} 4. ^Wenlin Institute webpage for CDL 5. ^{{cite web|url=http://www.hanglyph.com/en/index.shtml|title=HanGlyph|accessdate=17 February 2012}} 6. ^{{cite journal|last=Wong|first=Wai|title=HanGlyph – a Chinese Character Description Language|journal=Proceedings of the Seventeenth International Conference on Computer Processing of Oriental Languages, Hong Kong|date=April 1997}} 7. ^{{cite journal|last=Yiu|first=Candy L. K.|author2=Wai Wong|title=Chinese Character Synthesis using METAPOST|journal=Proceedings of the 24th Annual Meeting and Conference of the TeX User Group, Hawaii, U.S.A.|date=July 2003}} 8. ^{{cite journal|last=Wong|first=Wai |author2=Candy L. K. Yiu |author3=Kelvin, C. F. Ng |title=Typesetting Rare Chinese Characters in LaTeX|journal=Proceedings of the 14th European TeX Conference, Brest, France|date=June 2003}} 9. ^[https://www.unicode.org/versions/Unicode6.0.0/ch12.pdf] 10. ^ 11. ^{{cite journal|last1=Skala|first1=Matthew|title=A Structural Query System for Han Characters|journal=International Journal of Asian Language Processing|date=2015|volume=23|issue=2|pages=127-159|url=http://colips.org/journals/volume23/23.2.4_idsgrep-article-final.pdf}} External links
|last1=Bishop |first1=Tom |last2=Cook |first2=Richard|year=2003|url=https://www.unicode.org/L2/L2003/03420-cdl-strokes.pdf}}
|last1=Bishop |first1=Tom |last2=Cook |first2=Richard |date=2003-10-31|url=https://www.unicode.org/L2/L2003/03404-cdl-spec.pdf}}
2 : Chinese characters|XML |
|||||||||||||||||||||||||||||||||||||||
随便看 |
|
开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。