cudf::io::parquet::ColumnChunk Struct Reference

Thrift-derived struct describing a chunk of data for a particular column.

#include <parquet_schema.hpp>

Public Attributes

std::string file_path = ""
 
int64_t file_offset = 0
 Deprecated: Byte offset in file_path to the ColumnMetaData.
 
ColumnChunkMetaData meta_data
 
int64_t offset_index_offset = 0
 File offset of ColumnChunk's OffsetIndex.
 
int32_t offset_index_length = 0
 Size of ColumnChunk's OffsetIndex, in bytes.
 
int64_t column_index_offset = 0
 File offset of ColumnChunk's ColumnIndex.
 
int32_t column_index_length = 0
 Size of ColumnChunk's ColumnIndex, in bytes.
 
int schema_idx = -1
 Index in flattened schema (derived from path_in_schema)
 
std::optional< OffsetIndex > offset_index
 OffsetIndex for this column chunk
 
std::optional< ColumnIndex > column_index
 ColumnIndex for this column chunk
 

Detailed Description

Thrift-derived struct describing a chunk of data for a particular column.

Each column chunk lives in a particular row group and is guaranteed to be contiguous in the file. Missing or corrupted chunks can be skipped during reading.

Definition at line 763 of file parquet_schema.hpp.
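
The offset/length pairs listed under Public Attributes can be read as byte ranges within the file. The following is a minimal sketch of that arithmetic, not cudf code: `chunk_index_fields`, `byte_range`, `make_range`, `offset_index_range`, and `column_index_range` are hypothetical names introduced for illustration, and treating a non-positive length as "index not written" is an assumption based on the documented defaults of 0.

```cpp
#include <cstdint>
#include <optional>

// Hypothetical stand-in for the index-location members of ColumnChunk.
// This is not the cudf definition; it only mirrors the four documented fields.
struct chunk_index_fields {
  std::int64_t offset_index_offset = 0;  // file offset of the OffsetIndex
  std::int32_t offset_index_length = 0;  // size of the OffsetIndex, in bytes
  std::int64_t column_index_offset = 0;  // file offset of the ColumnIndex
  std::int32_t column_index_length = 0;  // size of the ColumnIndex, in bytes
};

// Hypothetical half-open byte range [begin, end) within the file.
struct byte_range {
  std::int64_t begin;
  std::int64_t end;
};

// Assumption: a negative offset or a non-positive length means no index is present.
inline std::optional<byte_range> make_range(std::int64_t offset, std::int32_t length)
{
  if (offset < 0 || length <= 0) { return std::nullopt; }
  return byte_range{offset, offset + static_cast<std::int64_t>(length)};
}

// Byte range of the OffsetIndex, if one appears to have been written.
inline std::optional<byte_range> offset_index_range(chunk_index_fields const& c)
{
  return make_range(c.offset_index_offset, c.offset_index_length);
}

// Byte range of the ColumnIndex, if one appears to have been written.
inline std::optional<byte_range> column_index_range(chunk_index_fields const& c)
{
  return make_range(c.column_index_offset, c.column_index_length);
}
```

A caller would check the returned optional before reading any bytes, so that a default-initialized chunk is never treated as having an index at offset 0.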

Member Data Documentation

◆ file_path

std::string cudf::io::parquet::ColumnChunk::file_path = ""

File where the column data is stored. If not set, the data is assumed to be in the same file as the metadata. This path is relative to the current file.

Definition at line 766 of file parquet_schema.hpp.
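
The fallback rule described for file_path can be sketched as a small path-resolution helper. This is an illustration only and does not show how cudf itself handles the field: `resolve_chunk_file` is a hypothetical function, and reading "relative to the current file" as "relative to the directory containing the metadata file" is an assumption.

```cpp
#include <filesystem>
#include <string>

// Hypothetical helper, not part of cudf.
// Returns the file expected to hold the column data for a chunk.
inline std::filesystem::path resolve_chunk_file(std::filesystem::path const& metadata_file,
                                                std::string const& file_path)
{
  // An empty file_path means the data lives in the same file as the metadata.
  if (file_path.empty()) { return metadata_file; }

  // Assumption: a non-empty file_path is resolved against the directory
  // that contains the metadata file.
  return metadata_file.parent_path() / std::filesystem::path{file_path};
}
```

For example, under these assumptions a metadata file at `/data/_metadata` with file_path set to `part-0.parquet` resolves to `/data/part-0.parquet`, while an empty file_path resolves to `/data/_metadata` itself.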

◆ meta_data

ColumnChunkMetaData cudf::io::parquet::ColumnChunk::meta_data

Column metadata for this chunk. Some writers may also replicate this at the location pointed to by file_path/file_offset.

Definition at line 771 of file parquet_schema.hpp.


The documentation for this struct was generated from the following file: