Skip to content

Files

Latest commit

Jul 22, 2025
a5ee599 · Jul 22, 2025

History

History

cloud_export_to_parquet

Cloud Export to parquet sample

This is an example workflow to convert exported file from proto to parquet file. The workflow is an hourly schedule.

Please make sure your python version is 3.10 or above. For this sample, run:

uv sync --group=cloud-export-to-parquet

Before you start, please modify workflow input in create_schedule.py with your s3 bucket and namespace. Also make sure you've the right AWS permission set up in your environment to allow this workflow read and write to your s3 bucket.

To run, first see README.md for prerequisites. Then, run the following from the root directory to start the worker:

uv run cloud_export_to_parquet/run_worker.py

This will start the worker. Then, in another terminal, run the following to execute the schedule:

uv run cloud_export_to_parquet/create_schedule.py

The workflow should convert exported file in your input s3 bucket to parquet in your specified location.