Skip to content

geoparquet-io

PyPI version Tests codecov Python 3.10+ License

Fast I/O and transformation tools for GeoParquet files, powered by DuckDB and PyArrow.

  • One interface for conversion, sorting, partitioning, and spatial indexing.
  • CLI and Python API with full type hints.
  • Unix pipes with Arrow IPC streaming—no intermediate files.
  • Read/write to S3, GCS, Azure, HTTPS via DuckDB and obstore.
  • Automatic Hilbert sorting, ZSTD compression, bbox columns.
  • Add H3, S2, A5, quadkey, KD-tree spatial indices.
  • GeoParquet 1.1 and 2.0 support, including Parquet geometry types.

Installation

uv tool install geoparquet-io   # CLI (recommended)
uv add geoparquet-io            # Python library

Or with pip:

pip install geoparquet-io

Documentation

Full documentation is available on the website.

Head to Getting Started to dig in.

Development

git clone https://github.com/geoparquet/geoparquet-io.git
cd geoparquet-io
uv sync --all-extras
uv run pytest

See Contributing Guide for details.

License

Apache 2.0