Skip to main content

3 docs tagged with "pyarrow"

View All Tags

Load Parquet Data from S3 to Arrow Table

I have a Parquet dataset stored in AWS S3 and want to access it in a Metaflow flow. How can I read one or several Parquet files at once from a flow and use them in an Arrow table?