Zero-width space

  1. Usage
       Blacklisting in URLs
  2. Encoding
  3. See also
  4. References
       Citations
       Sources

The zero-width space (ZWSP) is a non-printing character used in computerized typesetting to indicate word boundaries to text processing systems when using scripts that do not use explicit spacing, or after characters (such as the slash) that are not followed by a visible space but after which there may nevertheless be a line break. Normally, it is not a visible separation, but it may expand in passages that are fully justified.[1]

Usage

In HTML pages, the zero-width space can be used as a potential line-break in long words as an alternative to the <wbr> element. However, the zero-width space is not supported in some web browsers such as old versions of Internet Explorer (versions 6 and earlier).[2]
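As a minimal sketch of this use, the following Python function escapes a string for HTML and emits the numeric character reference for the zero-width space after each slash, so that a browser may break the line there. The function name `breakable_html` and its `break_after` parameter are hypothetical names chosen for this illustration.

```python
import html

ZWSP_REF = "&#8203;"  # numeric character reference for U+200B


def breakable_html(text, break_after="/"):
    """Escape text for HTML and allow a line break after each listed character.

    Hypothetical helper for illustration only.
    """
    pieces = []
    for ch in text:
        pieces.append(html.escape(ch))
        if ch in break_after:
            # Offer a break opportunity without adding any visible space.
            pieces.append(ZWSP_REF)
    return "".join(pieces)


print(breakable_html("docs/guides/typography"))
# prints: docs/&#8203;guides/&#8203;typography
```

Writing the character as a reference keeps it visible in the page source, which is easier to maintain than a literal invisible character.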

To show the effect of the zero-width space, the following words have been separated with zero-width spaces:

Lorem​Ipsum​Dolor​Sit​Amet​Consectetur​Adipiscing​Elit​Sed​Do​Eiusmod​Tempor​Incididunt​Ut​Labore​Et​Dolore​Magna​Aliqua​Ut​Enim​Ad​Minim​Veniam​Quis​Nostrud​Exercitation​Ullamco​Laboris​Nisi​Ut​Aliquip​Ex​Ea​Commodo​Consequat​Duis​Aute​Irure​Dolor​In​Reprehenderit​In​Voluptate​Velit​Esse​Cillum​Dolore​Eu​Fugiat​Nulla​Pariatur​Excepteur​Sint​Occaecat​Cupidatat​Non​Proident​Sunt​In​Culpa​Qui​Officia​Deserunt​Mollit​Anim​Id​Est​Laborum

And the following words are not separated with these spaces:

LoremIpsumDolorSitAmetConsecteturAdipiscingElitSedDoEiusmodTemporIncididuntUtLaboreEtDoloreMagnaAliquaUtEnimAdMinimVeniamQuisNostrudExercitationUllamcoLaborisNisiUtAliquipExEaCommodoConsequatDuisAuteIrureDolorInReprehenderitInVoluptateVelitEsseCillumDoloreEuFugiatNullaPariaturExcepteurSintOccaecatCupidatatNonProidentSuntInCulpaQuiOfficiaDeseruntMollitAnimIdEstLaborum

On browsers supporting zero-width spaces, resizing the window will re-break the first text only at word boundaries, while the second text will not be broken at all.
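A text like the first one can be derived from the second by inserting U+200B at each word boundary. The sketch below assumes that a boundary is any position where a lowercase letter is followed by an uppercase letter, which holds for the sample above but not for text in general. The function name is hypothetical.

```python
import re

ZWSP = "\u200b"  # ZERO WIDTH SPACE


def add_break_opportunities(run_together):
    """Insert U+200B wherever a lowercase letter is followed by an uppercase one."""
    return re.sub(r"(?<=[a-z])(?=[A-Z])", ZWSP, run_together)


plain = "LoremIpsumDolor"
breakable = add_break_opportunities(plain)

# The two strings look the same when displayed but differ in length.
print(len(plain), len(breakable))  # prints: 15 17
print(breakable.split(ZWSP))       # prints: ['Lorem', 'Ipsum', 'Dolor']
```

Removing every U+200B from the result gives back the original string, so the transformation is reversible.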

Blacklisting in URLs

ICANN rules prohibit domain names from including non-displayed characters such as zero-width space, and most browsers blacklist their use within domain names, because they can be used to create a homograph attack, where a malicious URL is visually indistinguishable from a legitimate one.[3][4]
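The following sketch illustrates why such characters are a concern: two host names can compare as different strings while looking identical on screen. The `INVISIBLE` set is an assumption made for this example; it is not ICANN's rule set or any browser's actual blacklist, and `find_invisible` is a hypothetical helper.

```python
# Assumption: a small illustrative set of non-displayed characters,
# not the list used by ICANN or by any particular browser.
INVISIBLE = {
    "\u200b",  # ZERO WIDTH SPACE
    "\u200c",  # ZERO WIDTH NON-JOINER
    "\u200d",  # ZERO WIDTH JOINER
    "\u2060",  # WORD JOINER
    "\ufeff",  # ZERO WIDTH NO-BREAK SPACE
}


def find_invisible(hostname):
    """Return (index, code point) pairs for each invisible character found."""
    return [
        (i, f"U+{ord(ch):04X}")
        for i, ch in enumerate(hostname)
        if ch in INVISIBLE
    ]


legitimate = "example.com"
spoofed = "exam\u200bple.com"  # renders the same as the legitimate name

print(spoofed == legitimate)       # prints: False
print(find_invisible(spoofed))     # prints: [(4, 'U+200B')]
print(find_invisible(legitimate))  # prints: []
```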

Encoding

The zero-width space character is encoded in Unicode as U+200B ZERO WIDTH SPACE (HTML &#8203;).[5]
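These values can be checked with Python's standard library, as in the short sketch below. The UTF-8 byte sequence shown is not stated in the text above; it follows from the standard UTF-8 encoding of the code point.

```python
import html
import unicodedata

zwsp = "\u200b"

print(f"U+{ord(zwsp):04X}")     # prints: U+200B
print(ord(zwsp))                # prints: 8203 (the decimal value used in the HTML reference)
print(unicodedata.name(zwsp))   # prints: ZERO WIDTH SPACE
print(zwsp.encode("utf-8"))     # prints: b'\xe2\x80\x8b'

# The HTML numeric character reference decodes to the same character.
decoded = html.unescape("&#8203;")
print(decoded == zwsp)          # prints: True
```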

The TeX representation is \hskip0pt; the LaTeX representation is \hspace{0pt};[6] and the groff representation is \:.[7]

Its semantics and HTML implementation are similar to those of the soft hyphen, except that a soft hyphen displays a hyphen where the line is broken.

See also

  • Soft hyphen
  • Word divider
  • Word joiner
  • Word wrapping
  • Zero-width joiner
  • Zero-width non-joiner
  • Space (punctuation)
  • Whitespace character - includes table comparing various space-like characters

References

Citations

1. The Unicode Standard 6.1, p. 366. https://www.unicode.org/versions/Unicode6.1.0/
2. "Alex Dunae » Better Web Typography with Spaces and Hyphens". http://dunae.ca/2009/better-web-typography-with-spaces-and-hyphens/. Archived at https://web.archive.org/web/20101214223741/http://dunae.ca/2009/better-web-typography-with-spaces-and-hyphens/ on December 14, 2010. Retrieved December 3, 2009.
3. "Network.IDN.blacklist_chars - mozillaZine". http://kb.mozillazine.org/Network.IDN.blacklist_chars. Retrieved 2018-02-07.
4. "Unicode Character 'ZERO WIDTH SPACE' - FileFormat.Info". https://www.fileformat.info/info/unicode/char/200b/index.htm. Retrieved 2018-02-07.
5. "General Punctuation – Unicode". https://www.unicode.org/charts/PDF/U2000.pdf. Retrieved 2013-07-20.
6. "The LaTeX Companion. Chapter 3: Basic Formatting Tools". http://latex-project.org/guides/tlc2-ch3.pdf. Retrieved 2014-02-08.
7. "groff(7) – Linux manual page". http://man7.org/linux/man-pages/man7/groff.7.html. Retrieved 2014-02-08.

Sources

  • Unicode Consortium, "Special Areas and Format Characters" (Chapter 16), The Unicode Standard, Version 5.2.
  • Victor H. Mair, Yongquan Liu, Characters and computers, IOS Press, 1991.

