“Draft:GetData.IO”的意思、由来-开放百科全书

词条

Draft:GetData.IO

释义

References
External links

{{Infobox organization
| name = GetData.IO
| logo =
| type = organization
| traded_as =
| predecessor =
| successor =
| founder =
| defunct =
| fate =
| area_served = Worldwide
| key_people =
| genre =
| products =
| production =
| purpose = Semantic Web, Open Web, Data Web, Data Mining, Web APIs, Web Scraping, Web Crawling
| revenue =
| operating_income =
| net_income =
| aum =
| assets =
| equity =
| owner =
| num_employees =
| parent =
| divisions =
| subsid =
| footnotes =
| intl =
| caption =
| foundation =
| location_city = San Mateo, California
| location_country = United States
| locations =
| homepage = [https://GetData.IO/ GetData.IO]
}}

GetDataIO is a project to turn the Web into a fully functional Giant Graph Database. The project was started in 2013 when early members of the community realized, while there were tons of data out there on the web, they were not structured in the format that could be easily re-utilized for machine learning purposes. This symptom has been attributed largely to the failure in execution on the vision of the data web^[1] envisioned by Tim Berners-Lee. Supporters of the project believes this problem will only become increasingly chronic as need for machine learning grows more prevalent while worldwide engineering shortage continues to persist^[2].

The project aims to bring about the evolution of the existing Web to a Semantic Web^[3] as was described by Berners-Lee, Hendler, and Lassila in The 2001 Scientific American article. It aims to achieve this mission by defining a Turing complete Semantic query language^[4] and building an underlying decentralized ODBC engine.

Supporters have been creating and maintaining recipes, written in JSON, to describe data scattered across clusters of web pages. These recipes are then interpreted by the ODBC engine which then turns these clusters of webpages into database tables accessible via public APIs. Coverage for the Web has been steadily growing as numbers of community members and recipes maintained increase. The project heavily relies upon web scraping techniques to extract data from web pages as well as crowd sourcing to bring about the next phase of the Web's evolution.

References

1. ^{{cite news |url=https://www.bloomberg.com/news/articles/2007-04-09/q-and-a-with-tim-berners-leebusinessweek-business-news-stock-market-and-financial-advice|title=Q&A with Tim Berners-Lee, Special Report |author=|date=|publisher=Businessweek |accessdate=14 April 2018}}
2. ^{{cite web|url=https://www.theverge.com/2017/12/5/16737224/global-ai-talent-shortfall-tencent-report|title=Tencent says there are only 300,000 AI engineers worldwide, but millions are needed|author=James Vincent|date=5 December 2017|publisher=The Verge}}
3. ^{{cite web |accessdate=March 13, 2008 |url=https://pdfs.semanticscholar.org/566c/1c6bd366b4c9e07fc37eb372771690d5ba31.pdf |title=The Semantic Web |publisher=Scientific American |date=May 17, 2001 |author=Berners-Lee, Tim }}
4. ^{{cite web |accessdate=July 20, 2018 |url=https://getdata.io/docs/semantic-query-language/api |title=The Semantic Query Language for the Web |date=July 20, 2018 |author=Gary Teh}}

External links

{{Official website|https://getdata.io/}}

{{Semantic Web|http://www.w3.org/standards/semanticweb/}}Category:Applied machine learningCategory:Data miningCategory:Web scrapingCategory:Web crawlersCategory:Web archiving

随便看

开放百科全书收录14589846条英语、德语、日语等多语种百科知识，基本涵盖了大多数领域的百科知识，是一部内容自由、开放的电子版国际百科全书。