The open source core for scientific data
Adopt the open source standard for organizing and collaborating on array data in the cloud.
Trusted by leading institutions and ground-breaking innovators

The Python stack for scientific data
Earthmover was founded by Dr. Ryan Abernathey and Dr. Joe Hamman, who bring deep expertise in climate science, remote sensing, and cloud-native analytics. As core maintainers and contributors to Xarray, Zarr, and the Pangeo Project, they have helped define how the scientific community works with multidimensional data in the cloud.
Maintain the rich context of tensor data with Xarray
Earthmover helps maintain Xarray, the leading Python package for working with labeled multidimensional array data. The Earthmover platform integrates with Xarray and adds a user-friendly data catalog to unify data organization and facilitate data discovery.
xarray.dev
Flexible, performant storage and access with Zarr
The Zarr data format and Python library enable efficient storage and fast data access through chunked, compressed storage for multidimensional arrays. The Earthmover team helps maintain Zarr and recently announced Zarr 3.
The Earthmover platform extends Zarr's efficient data access by empowering applications and users to query data fast at any scale via WMS, OGC EDR, or OPeNDAP.
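To make "chunked, compressed storage" concrete, here is a minimal sketch of the idea using only the Python standard library. It is not the Zarr API or its on-disk format; the `CHUNK` size, the `write_chunked` and `read_value` helpers, and the dictionary used as a store are all hypothetical names chosen for illustration.

```python
import struct
import zlib

CHUNK = 2  # hypothetical chunk edge length: chunks are at most CHUNK x CHUNK


def write_chunked(grid):
    """Split a 2-D grid of floats into compressed chunks keyed by chunk index."""
    store = {}
    rows, cols = len(grid), len(grid[0])
    for r0 in range(0, rows, CHUNK):
        for c0 in range(0, cols, CHUNK):
            block = [grid[r][c0:c0 + CHUNK] for r in range(r0, min(r0 + CHUNK, rows))]
            flat = [v for row in block for v in row]
            raw = struct.pack(f"<{len(flat)}d", *flat)
            # Keep the chunk width so edge chunks can be indexed correctly.
            store[(r0 // CHUNK, c0 // CHUNK)] = (len(block[0]), zlib.compress(raw))
    return store


def read_value(store, r, c):
    """Read one element by decompressing only the chunk that contains it."""
    width, blob = store[(r // CHUNK, c // CHUNK)]
    raw = zlib.decompress(blob)
    values = struct.unpack(f"<{len(raw) // 8}d", raw)
    return values[(r % CHUNK) * width + (c % CHUNK)]


# A 4 x 5 grid where each value encodes its own position (row * 10 + col).
grid = [[float(r * 10 + c) for c in range(5)] for r in range(4)]
store = write_chunked(grid)
value = read_value(store, 3, 4)
```

Because each chunk is stored and compressed independently, a reader touches only the chunks that overlap its request. That property is what makes this layout a good fit for object storage, where each chunk can be fetched with a separate request.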
zarr.dev
Evolve data safely with Icechunk
The Earthmover team created and open-sourced Icechunk, a transactional storage engine for Zarr data that handles the version control and seamless updating needs of geospatial and weather data workflows. Icechunk delivers 10x performance improvements over other cloud storage libraries.
The Earthmover platform harnesses the performance of Icechunk and adds data governance and usage monitoring to help teams reduce costs, maintenance, and risk.
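The version control idea can be illustrated with a toy snapshot store. This is only a sketch of the general concept (atomic commits that produce immutable, readable snapshots), not Icechunk's API or storage design; the `VersionedStore` class and the `temp/...` keys are hypothetical.

```python
class VersionedStore:
    """Toy snapshot store: each commit is an immutable mapping of key -> bytes."""

    def __init__(self):
        self._snapshots = [{}]  # snapshot 0 is the empty store

    def commit(self, changes):
        """Apply all changes together as a new snapshot and return its id."""
        new = dict(self._snapshots[-1])  # copy, so earlier snapshots never change
        new.update(changes)
        self._snapshots.append(new)
        return len(self._snapshots) - 1

    def read(self, key, snapshot=None):
        """Read a key from the given snapshot (the latest one by default)."""
        snap = self._snapshots[-1 if snapshot is None else snapshot]
        return snap[key]


store = VersionedStore()
v1 = store.commit({"temp/0.0": b"day-1", "temp/0.1": b"day-1"})
v2 = store.commit({"temp/0.0": b"day-2"})  # update a single chunk

latest = store.read("temp/0.0")
previous = store.read("temp/0.0", snapshot=v1)
```

Readers pinned to `v1` keep seeing a consistent view while new data lands in `v2`, which is the property that matters when a dataset is updated while others are reading it.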
icechunk.io
"The Pangeo project has fundamentally changed how the geoscience community works with big data. Earthmover is taking that vision further by making these tools accessible to everyone."
Dr. Ryan Abernathey
Co-founder & CTO, Earthmover
Pangeo Project
The Pangeo Project is a community for open, reproducible, scalable geoscience supported by NASA and the National Science Foundation. As key contributors within this community, we are defining the reference architecture for cloud-native multidimensional data to accelerate collaboration and reproducible open science.
pangeo.io
Build smarter with expert guidance
Accelerate your roadmap with expert guidance from climate scientists and data engineers building the leading open source solutions and defining the modern workflow for multidimensional data.