Add ORC and Avro file format support to Druid's Iceberg input source

### Description

Component: extensions-contrib/druid-iceberg-extensions

Druid's Iceberg input source (druid-iceberg-extensions) currently only supports reading Iceberg tables stored in Parquet format. 

IcebergNativeRecordReader hardcodes Parquet.read() + GenericParquetReaders for all reads:

### Motivation

This was flaged while working on v2 spec support https://github.com/apache/druid/pull/19266#discussion_r3259155168 

dependecies : iceberg-orc, orc-core, iceberg-avro, and avro are absent 


### References:

•  IcebergNativeRecordReader.java — current Parquet-only implementation
•  IcebergFileTaskInputSource.java — serialisation boundary between coordinator and worker
•  Iceberg API: org.apache.iceberg.data.GenericDeleteFilter (public, already on classpath)
•  Iceberg API: org.apache.iceberg.data.orc.GenericOrcReader, org.apache.iceberg.data.avro.GenericAvroReader
•  PR #19266 — added Iceberg V2 delete support (Parquet only); this is the follow-up

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ORC and Avro file format support to Druid's Iceberg input source #19472

Description

Motivation

References:

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Add ORC and Avro file format support to Druid's Iceberg input source #19472

Description

Description

Motivation

References:

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions