Best practice for creating GeoParquet files: https://github.com/opengeospatial/geoparquet/blob/main/format-specs/distributing-geoparquet.md#best-practices-for-distributing-geoparquet see the TL;DR:
Use zstd for compression, and set the compression level to 15.
Be sure to include the bbox covering, and use GeoParquet version 1.1.
Spatially order the data within the file.
Set the maximum row group size between 50,000 and 150,000 per row.
If the data is larger than ~2 gigabytes consider spatially partitioning the files.
Use STAC Metadata metadata to describe the data.
https://cloudnativegeo.org/blog/2025/02/geoparquet-2.0-going-native/
GeoParquet 2.0 vs Parquet: https://rednegra.net/blog/20250925-parquet-with-geometry-type-is-not-geoparquet/
https://bsky.app/profile/developmentseed.org/post/3lygffozqv22y
nested columns are possible!
Arrow vs Parquet
“spatial indexing”
https://cloudnativegeo.org/blog/2025/02/geoparquet-2.0-going-native/
https://github.com/jorisvandenbossche/2025-cloudscaping-geoparquet-workshop?tab=readme-ov-file
Best practice: https://github.com/opengeospatial/geoparquet/blob/main/format-specs/distributing-geoparquet.md
A podcast: https://mapscaping.com/podcast/geoparquet-for-beginners/