Apache ORC

Apache ORC (Optimized Row Columnar) is an open-source, column-oriented data storage format maintained as an Apache Software Foundation project. It stores and queries large datasets efficiently by using techniques such as type-aware compression, bit-packing, and predicate pushdown. Originally developed for Apache Hive, it is now widely supported by other big data processing engines, including Apache Spark and Trino.
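To make predicate pushdown concrete, here is a toy sketch in plain Python. It is not ORC's reader or file layout; it only illustrates the general idea that a columnar format can keep minimum and maximum statistics per chunk of rows and skip chunks that cannot satisfy a filter. The function names `build_stripes` and `scan_greater_than`, the dictionary layout, and the chunk size are all hypothetical and chosen for illustration.

```python
def build_stripes(values, stripe_size):
    """Split one column into fixed-size chunks and record min/max per chunk."""
    stripes = []
    for start in range(0, len(values), stripe_size):
        chunk = values[start:start + stripe_size]
        stripes.append({"min": min(chunk), "max": max(chunk), "rows": chunk})
    return stripes


def scan_greater_than(stripes, threshold):
    """Return values above the threshold and the number of chunks actually read."""
    matches, chunks_read = [], 0
    for stripe in stripes:
        if stripe["max"] <= threshold:
            # The statistics prove no row in this chunk can match, so skip it.
            continue
        chunks_read += 1
        matches.extend(v for v in stripe["rows"] if v > threshold)
    return matches, chunks_read


# Hypothetical data: one integer column with 100 rows, 10 rows per chunk.
stripes = build_stripes(list(range(100)), stripe_size=10)
matches, chunks_read = scan_greater_than(stripes, threshold=84)
print(f"read {chunks_read} of {len(stripes)} chunks, {len(matches)} matching rows")
```

With sorted or clustered data, most chunks are skipped without being decoded, which is where the query-time savings in this sketch come from.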

Brand Authority Index (BAI): 82/100

Archetype: Incumbent

Category: Data Infrastructure

https://optimly.ai/brand/apache-orc