Announcing the Earthmover Data Marketplace: Subscribe to ARCO (analysis-ready, cloud-optimized) datasets from ECMWF, NOAA, and more.

The open source core for scientific data

Adopt the open source standard for organizing and collaborating on array data in the cloud.

Trusted by leading institutions and ground-breaking innovators

Overview

The Python stack for scientific data

Earthmover was founded by Dr. Ryan Abernathey and Dr. Joe Hamman, who bring deep expertise in climate science, remote sensing, and cloud-native analytics. As core maintainers and contributors to Xarray, Zarr, and the Pangeo Project, they have helped define how the scientific community works with multidimensional data in the cloud.

Xarray

Maintain the rich context of tensor data with Xarray

Earthmover helps maintain Xarray, the leading Python package for working with labeled multidimensional array data. The Earthmover platform integrates with Xarray and adds a user-friendly data catalog to unify data organization and facilitate data discovery.
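
As a minimal sketch of what "labeled" means in practice, the example below builds a small Xarray DataArray with named dimensions and coordinates, then selects and reduces by name instead of by integer position. The variable name, grid values, and units are illustrative assumptions, not data from any Earthmover catalog.

```python
import numpy as np
import xarray as xr

# Hypothetical air temperature values on a tiny time x lat x lon grid.
temps = np.arange(24, dtype="float64").reshape(2, 3, 4)

da = xr.DataArray(
    temps,
    dims=("time", "lat", "lon"),
    coords={
        "time": np.array(["2024-01-01", "2024-01-02"], dtype="datetime64[ns]"),
        "lat": [10.0, 20.0, 30.0],
        "lon": [0.0, 90.0, 180.0, 270.0],
    },
    name="air_temperature",
    attrs={"units": "K"},
)

# Select by coordinate label rather than by integer index.
point = da.sel(lat=20.0, lon=90.0)

# Reduce over a named dimension; the lat/lon labels are preserved.
time_mean = da.mean(dim="time")
```

Here `point` is a one-dimensional series along `time`, and `time_mean` keeps its `lat` and `lon` coordinates, so later steps can keep referring to the data by name.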

xarray.dev
Zarr

Flexible, performant storage and access with Zarr

The Zarr data format and Python library enable efficient storage of, and fast access to, multidimensional arrays through chunking and compression. The Earthmover team helps maintain Zarr and recently announced Zarr 3.

The Earthmover platform extends Zarr's efficient data access, letting applications and users query data quickly at any scale via WMS, OGC EDR, or OPeNDAP.

zarr.dev
Icechunk

Evolve data safely with Icechunk

The Earthmover team created and open-sourced Icechunk, a transactional storage engine for Zarr data that handles the version control and seamless updates required by geospatial and weather data workflows. Icechunk delivers 10x performance improvements over other cloud storage libraries.

The Earthmover platform harnesses the performance of Icechunk and adds data governance and usage monitoring to help teams reduce costs, maintenance, and risk.

icechunk.io
"The Pangeo project has fundamentally changed how the geoscience community works with big data. Earthmover is taking that vision further by making these tools accessible to everyone."

Dr. Ryan Abernathey

Co-founder & CTO, Earthmover

Pangeo Project

The Pangeo Project is a community for open, reproducible, scalable geoscience, supported by NASA and the National Science Foundation. As key contributors within this community, we are defining the reference architecture for cloud-native multidimensional data to accelerate collaboration and reproducible open science.

pangeo.io
Learn

Build smarter with expert guidance

Accelerate your roadmap with expert guidance from climate scientists and data engineers building the leading open source solutions and defining the modern workflow for multidimensional data.

Looking to accelerate your roadmap? Deploy our managed platform instead of building a bespoke solution.