[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[jira] [Created] (ARROW-2677) Missing pyarrow ZSTD support for Parquet

William Lee created ARROW-2677:

             Summary: Missing pyarrow ZSTD support for Parquet
                 Key: ARROW-2677
                 URL: https://issues.apache.org/jira/browse/ARROW-2677
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Python
    Affects Versions: 0.9.0
            Reporter: William Lee

It seems like in the python/pyarrow/_parquet.pyx file there's a check for the supported compression scheme:
cdef int check_compression_name(name) except -1:
    if name.upper() not in ['NONE', 'SNAPPY', 'GZIP', 'LZO', 'BROTLI', 'LZ4']:
        raise ArrowException("Unsupported compression: " + name)
    return 0{code}
which does not include ZSTD (Zstandard).  From my understanding it should be supported in the underlying c++ library already.  Is it possible to add that support?


This message was sent by Atlassian JIRA