“Czech National Corpus”的意思、由来-开放百科全书

The Czech National Corpus (CNC) (Czech : Český národní korpus) is a large electronic corpus of the Czech language. A corpus (plural corpora) or text corpus is a large and structured set of texts in electronic form used for linguistic research.

Description

The Czech National Corpus is a large electronic corpus of written and spoken Czech. The Institute of the Czech National Corpus (ICNC) in Faculty of Arts at Charles University in Prague oversees the development of the CNC, including its use in teaching, and advancing the field of the corpus linguistics.^[1] The ICNC collaborates with over 200 researchers and students (mainly for spoken and parallel data acquisition), 270 publishers (as text providers), and other similar research projects.

About the Corpora

References

1. ^{{cite web |title=Institute of the Czech National Corpus |url=https://www.ff.cuni.cz/home/about/organisation/institute-of-the-czech-national-corpus/ |website=Institute of the Czech National Corpus |accessdate=8 January 2019}}
2. ^{{cite web |last1=Křen |first1=Michal |title=Recent Developments in the Czech National Corpus |url=https://ids-pub.bsz-bw.de/frontdoor/deliver/index/docId/3826/file/K%C5%99en_Recent_developments_in_the_czech_national_corpus_2015.pdf |website=Publication Server of the Institute for German Language |accessdate=8 January 2019}}
3. ^{{cite journal |last1=M. Hnátková, M. Křen, P. Procházka, and H. Skoumalová. |title=The SYN-series corpora of written Czech |journal=Proceedings of LREC2014 |date=2014 |page=160–164 |url=https://pdfs.semanticscholar.org/5f6b/cbe42e55694dd414cab6aa0f82c5885e9175.pdf |accessdate=9 January 2019}}
4. ^{{cite journal |last1=L. Válková, M. Waclawičová, and M. Křen. |title=Balanced data repository of spontaneous spoken Czech |journal=Proceedings of LREC2012 |date=2012 |page=3345–3349 |url=https://ids-pub.bsz-bw.de/frontdoor/deliver/index/docId/3826/file/K%C5%99en_Recent_developments_in_the_czech_national_corpus_2015.pdf |accessdate=9 January 2019}}
5. ^{{cite journal |last1=F. Čermák and A. Rosen |title=The case of InterCorp, a multilingual parallel corpus |journal=International Journal of Corpus Linguistics |date=2012 |volume=13 |issue=3 |pages=411– 427 |url=http://ucnk.korpus.cz/doc/the_case_of_intercorp.pdf |accessdate=9 January 2019}}
6. ^{{cite journal |last1=K. Kučera and M. Stluka. |title=Corpus of 19th century Czech texts: Problems and solutions |journal=Proceedings of LREC2014 |date=2014 |pages=165–168 |url=http://www.lrec-conf.org/proceedings/lrec2014/pdf/300_Paper.pdf |accessdate=9 January 2019}}

Description

About the Corpora

References

External links