词条 | Draft:Spark NLP | ||||||||||||||||||||||||||||||||||||||||
释义 |
| title = Spark NLP | name = Spark NLP | author = John Snow Labs | released = October 2017[1] | latest release version = 2.0 | latest release date = {{Start date and age|2019|03|df=yes}} | repo = {{URL|https://github.com/JohnSnowLabs/spark-nlp}} | status = active | programming language = Python, Scala | operating system = Linux, Windows, macOS, OS X | platform = cross-platform | genre = Natural language processing | license = Apache licence | website = {{URL|https://www.johnsnowlabs.com/spark-nlp/}} }} Spark NLP[2][3][4][5][6] is an open-source text processing library built on top of Apache Spark and its Spark ML library. It's goal is to provide an API for NLP annotations allowing a scalable approach within a distributed large scale environment. Main FeaturesSeveral annotators are provided out of the box for both Python and Scala:
Pipelines
Pre-trained modelsEnglish
Italian
French
See also
Licence InformationThe library is under Apache 2.0 license, written in Scala with no dependencies on other NLP or ML libraries, and designed to natively extend the Spark ML Pipeline API. Spark NLP is available for free download on GitHub https://github.com/johnsnowlabs/spark-nlp. References1. ^{{cite web |last1=Talby |first1=David |title=Introducing the Natural Language Processing Library for Apache Spark |url=https://databricks.com/blog/2017/10/19/introducing-natural-language-processing-library-apache-spark.html |website=databricks.com |publisher=databricks |accessdate=29 March 2019}} Category:SoftwareCategory:Open-source artificial intelligence{{AFC submission|||ts=20190329085310|u=Dia.trambitas|ns=118}}2. ^{{Cite web|url=https://insidebigdata.com/2018/09/03/use-nlp-extract-unstructured-medical-data-text/|title=The Use of NLP to Extract Unstructured Medical Data From Text|last=Team|first=Editorial|date=2018-09-04|website=insideBIGDATA|language=en-US|access-date=2019-03-29}} 3. ^{{Cite web|url=https://startupbeat.com/john-snow-labs-natural-language-understanding-software-gets-state-of-the-art-recognition-in-three-industry-events/30699/|title=John Snow Labs' Natural Language Understanding Software Gets "State of the Art" Recognition in Three Industry Events|date=2018-07-19|website=StartUp Beat|language=en-US|access-date=2019-03-29}} 4. ^{{Cite web|url=https://www.oreilly.com/ideas/comparing-production-grade-nlp-libraries-running-spark-nlp-and-spacy-pipelines|title=Comparing production-grade NLP libraries: Running Spark-NLP and spaCy pipelines|last=Ellafi|first=Saif Addin|date=2018-02-28|website=O'Reilly Media|language=en|access-date=2019-03-29}} 5. ^{{Cite web|url=https://www.oreilly.com/ideas/comparing-production-grade-nlp-libraries-accuracy-performance-and-scalability|title=Comparing production-grade NLP libraries: Accuracy, performance, and scalability|last=Ellafi|first=Saif Addin|date=2018-02-28|website=O'Reilly Media|language=en|access-date=2019-03-29}} 6. ^{{Cite web|url=https://www.i-programmer.info/news/80-java/11251-spark-gets-nlp-library.html|title=Spark Gets NLP Library|last=Ewbank|first=Kay|date=|website=www.i-programmer.info|archive-url=|archive-date=|dead-url=|access-date=}} |
||||||||||||||||||||||||||||||||||||||||
随便看 |
|
开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。