Skip to content

Compress Parquet #1189

@joelostlund

Description

@joelostlund

I am trying to compress using the parquet implementation in Scio but nothing seems to happen. Tried adding both to core-site.xml and in code but none of them help.

<property>
    <name>parquet.compression</name>
    <value>org.apache.hadoop.io.compress.GzipCodec</value>
    <description>
    </description>
  </property>

ParquetOutputFormat.setCompression(Job.getInstance(), CompressionCodecName.GZIP)

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions