write_parquet() / write_csv_arrow() cannot stream a dataset object back to S3 #30615

Description

@asfimport

It seems we cannot pass a dataset object opened with open_dataset() to write_parquet() or write_csv_arrow(). I realize that a dataset is a multi-file representation while these two writers each produce a single file, but since both represent a single table with a single schema, it seems like this should still work. (It also looks like write_csv_arrow() gives us more control over formatting details than write_dataset() does.)
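A minimal sketch of the situation described above, not a confirmed reproduction. It assumes local temporary directories in place of the S3 locations, uses the built-in mtcars data as a stand-in table, and assumes dplyr is installed alongside arrow. All paths and variable names are hypothetical. The direct write_parquet() call is wrapped in try() because its outcome may differ between arrow versions.

```r
library(arrow)

# Hypothetical local paths standing in for the S3 locations in the report.
src_dir  <- file.path(tempdir(), "example_dataset")
out_file <- file.path(tempdir(), "single_file.parquet")
out_dir  <- file.path(tempdir(), "rewritten_dataset")

# Build a small multi-file dataset so there is something to open.
write_dataset(mtcars, src_dir, format = "parquet", partitioning = "cyl")
ds <- open_dataset(src_dir)

# The call described in this issue. Wrapped in try() so the script
# continues either way; the result is not asserted here.
res <- try(write_parquet(ds, out_file), silent = TRUE)

# Possible workaround 1 (assumption: the data fits in memory):
# materialize the dataset first, then write a single file.
# This does not stream.
tbl <- dplyr::collect(ds)
write_parquet(tbl, out_file)

# Possible workaround 2: write_dataset() accepts the dataset object
# directly, at the cost of the formatting control mentioned above.
write_dataset(ds, out_dir, format = "parquet")
```

The first workaround loads the whole table into memory before writing, so it does not address the streaming use case in the title; the second keeps the data out of memory but goes through write_dataset() rather than the single-file writers.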

Reporter: Carl Boettiger / @cboettig

Note: This issue was originally created as ARROW-15104. Please see the migration documentation for further details.
