ORC is a self-describing type-aware columnar file format designed for Hadoop workloads. It is optimized for large streaming reads, but wi...
Последний релиз: nullBenchmarks for comparing ORC, Parquet, and Avro performance.
Последний релиз: мая 08, 2017The core reader and writer for ORC files. Uses the vectorized column batch for the in memory representation.
Последний релиз: nullORC is a self-describing type-aware columnar file format designed for Hadoop workloads. It is optimized for large streaming reads, ...
Последний релиз: nullAn implementation of Hadoop's mapred and mapreduce input and output formats for ORC files. They use the core reader and writer, but present the...
Последний релиз: nullA shim layer for supporting various versions of Hadoop dynamically. This module uses a higher version of Hadoop so that we can create shims ...
Последний релиз: null