Apache ORC (Optimized Row Columnar) is an open-source, column-oriented data storage format within the Apache Software Foundation ecosystem. It provides a highly efficient way to store and query large-scale data by using techniques like type-aware compression, bit-packing, and predicate pushdown. Originally developed for Apache Hive, it has become a standard for high-performance big data processing across various engines.
Brand Authority Index (BAI): 82/100
Archetype: Incumbent
Category: Data Infrastructure