Apache CarbonData
Apache CarbonData is a free and open-source column-oriented data storage format in the Apache Hadoop ecosystem. It is similar to other columnar storage file formats available in Hadoop, namely RCFile and ORC, and is compatible with most data processing frameworks in the Hadoop environment. It provides efficient data compression and encoding schemes, with improved performance when handling complex data in bulk.
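As a rough illustration of why a column-oriented layout pairs well with compression and encoding, the following Python sketch pivots a few row-oriented records into columns and run-length encodes one of them. This is a generic example and does not represent CarbonData's actual on-disk format or encoders; the sample data and the helper names `to_columns`, `rle_encode`, and `rle_decode` are hypothetical.

```python
from itertools import groupby

# Hypothetical sample records (country, status, amount); not CarbonData data.
rows = [
    ("IN", "ok", 10),
    ("IN", "ok", 12),
    ("IN", "fail", 7),
    ("CN", "ok", 7),
    ("CN", "ok", 7),
]


def to_columns(records):
    """Pivot row-oriented records into one list per column."""
    return [list(column) for column in zip(*records)]


def rle_encode(values):
    """Run-length encode a column as (value, run_length) pairs."""
    return [(value, len(list(run))) for value, run in groupby(values)]


def rle_decode(pairs):
    """Expand (value, run_length) pairs back into the original column."""
    return [value for value, count in pairs for _ in range(count)]


country, status, amount = to_columns(rows)

# Values of the same column sit next to each other, so repeated values
# collapse into short runs.
encoded_country = rle_encode(country)
print(encoded_country)  # [('IN', 3), ('CN', 2)]
```

Storing each column contiguously places similar values side by side, which is what makes simple schemes such as run-length or dictionary encoding effective; a row-oriented layout would interleave unrelated values and break up those runs.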
Initial release | 2013[1] |
---|---|
Stable release | 1.6.0 / 28 August 2019[2] |
Operating system | Cross-platform |
Type | Database management system |
License | Apache License 2.0 |
Website | carbondata |
History
CarbonData was first developed at Huawei in 2013. The project was submitted to the Apache Incubator in June 2016 and had its first official release two months later. It won top honors in the Big Data category of the BlackDuck 2016 Open Source Rookies of the Year. Since May 1, 2017, Apache CarbonData has been a top-level project of the Apache Software Foundation (ASF).[1]
See also
- Pig (programming tool)
- Apache Hive
- Apache Impala
- Apache Drill
- Apache Kudu
- Apache Spark
- Apache Thrift
- Apache Parquet
- Presto (SQL query engine)
References
- The Apache Software Foundation (May 1, 2017). "The Apache Software Foundation Announces Apache® CarbonData™ as a Top-Level Project". GlobeNewswire News Room.
- "Releases - CarbonData - Apache Software Foundation". cwiki.apache.org.