geoparquet-io¶
Fast I/O and transformation tools for GeoParquet files, powered by DuckDB and PyArrow.
- One interface for conversion, sorting, partitioning, and spatial indexing.
- CLI and Python API with full type hints.
- Unix pipes with Arrow IPC streaming—no intermediate files.
- Read/write to S3, GCS, Azure, HTTPS via DuckDB and obstore.
- Automatic Hilbert sorting, ZSTD compression, bbox columns.
- Add H3, S2, A5, quadkey, KD-tree spatial indices.
- GeoParquet 1.1 and 2.0 support, including Parquet geometry types.
Installation¶
uv tool install geoparquet-io # CLI (recommended)
uv add geoparquet-io # Python library
Or with pip:
pip install geoparquet-io
Documentation¶
Full documentation is available on the website.
Head to Getting Started to dig in.
Development¶
git clone https://github.com/geoparquet/geoparquet-io.git
cd geoparquet-io
uv sync --all-extras
uv run pytest
See Contributing Guide for details.