“Apache Impala”的意思、由来-开放百科全书

The project was announced in October 2012 with a public beta test distribution^[3]^[4] and became generally available in May 2013.^[5]

Impala brings scalable parallel database technology to Hadoop, enabling users to issue low-latency SQL queries to data stored in HDFS and Apache HBase without requiring data movement or transformation. Impala is integrated with Hadoop to use the same file and data formats, metadata, security and resource management frameworks used by MapReduce, Apache Hive, Apache Pig and other Hadoop software.

Impala is promoted for analysts and data scientists to perform analytics on data stored in Hadoop via SQL or business intelligence tools. The result is that large-scale data processing (via MapReduce) and interactive queries can be done on the same system using the same data and metadata – removing the need to migrate data sets into specialized systems and/or proprietary formats simply to perform analysis.

In early 2013, a column-oriented file format called Parquet was announced for architectures including Impala.^[6]

In 2015, another format called Kudu was announced, which Cloudera proposed to donate to the Apache Software Foundation along with Impala.^[9]

See also

References

1. ^{{cite web|title=Apache Impala|url=http://impala.apache.org/|accessdate=15 September 2017}}
2. ^{{Cite news |url= https://www.wired.com/2012/10/cloudera-impala-hadoop/ |title= Man Busts Out of Google, Rebuilds Top-Secret Query Machine |author= Cade Metz |work= Wired Magazine |date= October 24, 2012 |access-date= October 10, 2016 }}
3. ^{{cite web |url= http://www.zdnet.com/cloudera-aims-to-bring-real-time-queries-to-hadoop-big-data-7000005951/ |title=Cloudera aims to bring real-time queries to Hadoop, big data |author= Larry Digna |date= October 24, 2012 |work= Between the lines blog |publisher= ZDNet |accessdate= January 20, 2014 }}
4. ^{{cite web |url= http://www.zdnet.com/clouderas-impala-brings-hadoop-to-sql-and-bi-7000006413/ |title=Cloudera’s Impala brings Hadoop to SQL and BI |author= Andrew Brust |date= October 25, 2012 |work= ZDNet |accessdate= January 20, 2014 }}
5. ^{{cite web |url= http://blog.cloudera.com/blog/2013/05/cloudera-impala-1-0-its-here-its-real-its-already-the-standard-for-sql-on-hadoop/ |title=Cloudera Impala 1.0: It’s Here, It’s Real, It’s Already the Standard for SQL on Hadoop |author= Marcel Kornacker, Justin Erickson |date= May 1, 2013 |accessdate= April 10, 2014 }}
6. ^{{Cite web |title= Parquet: Columnar Storage for Hadoop |work= Project web site |year= 2013 |url= http://parquet.io/ |accessdate= January 20, 2014 }}
7. ^{{Cite web |title= Announcing Support for Impala with Amazon Elastic MapReduce |publisher= Amazon.com |date= December 12, 2013 |url= http://aws.amazon.com/about-aws/whats-new/2013/12/12/announcing-support-for-impala-with-amazon-elastic-mapreduce/ |accessdate= January 20, 2014 }}
8. ^{{Cite web |title= Impala for MapR |publisher= MapR.com |date= February 2, 2014 |url= http://doc.mapr.com/display/MapR/Impala+for+MapR |accessdate= April 10, 2014 }}
9. ^{{Cite news |title= Cloudera to Donate Impala and Kudu Big Data Projects to Apache |date= November 18, 2015 |author= David Ramel |work= Application Development Trends |url= https://adtmag.com/articles/2015/11/18/cloudera-donates-projects.aspx |accessdate= October 10, 2016 }}
10. ^{{Cite web |title= The Apache Software Foundation Announces Apache® Impala™ as a Top-Level Project |url= https://blogs.apache.org/foundation/entry/the-apache-software-foundation-announces24 |date= November 28, 2017 |accessdate= November 30, 2017 }}

Description

See also

References

External links