Announcing the Earthmover Data Marketplace: Subscribe to ARCO datasets from ECMWF, NOAA, and more. Explore the marketplace .

the earthmover platform

Streamline multidimensional data workflows, reduce DevOps complexity, and avoid bottlenecks as datasets scale.

Talk to an expert
Overview

Modern architecture for tensor data

The Earthmover team does more than provide a data platform. As leaders in the open source ecosystem, we're defining a new cloud-native standard for working with tensor data to accelerate scientific research, earth system AI modeling, and data-driven application development.

Earthmover platform architecture
arraylake

Unify data management with Arraylake

Transform scattered datasets into a single source of truth, while retaining the native multidimensional structure of your data.

Turn-key cloud platform for data management
01

Turn-key cloud platform for data management

Harness the power of cloud computing and collaboration for data cubes without building and maintaining custom cloud infrastructure. Bring your own object storage or let us manage it for you.

High performance data loading
02

High performance data loading

Combine performance and flexibility with chunked, compressed array-based storage for performant training of machine learning models. Zero-copy ingestion of existing Zarr, NetCDF, HDF, GRIB, and TIFF data.

User-friendly data catalog
03

User-friendly data catalog

Streamline data organization through a central, unified catalog of all your array-based data assets.

Robust data governance
04

Robust data governance

Easily manage access with rich permission structures and audit your data with immutable data references.

flux

Accelerate data delivery with Flux

Enable seamless data exploration and accelerate data product development and delivery with a turn-key, high-performance gateway to your data.

01

Explore data at the speed of thought

Empower analysts to query and visualize multidimensional data in the cloud using their preferred tools, without waiting to download locally.

02

Prototype and iterate faster

Quickly explore and iterate on a dataset to develop your product, service, or AI/ML model and then build applications that query your data via API using WMS, EDR, or OPeNDAP.

03

Remove the complexity of Data Delivery

Remove the operational complexity of data product delivery. Flux enables a wide array of scalable delivery options offering massive savings in engineering, infrastructure, and maintenance.

Supported standards-compliant endpoints

Web mapping service (WMS)

Web mapping service (WMS)

Explore, query, and integrate maps layers, data, and metadata in MapboxGL, QGIS, and Leaflet via web mapping service (WMS) integration.

Read more →
Environmental data retrieval (EDR)

Environmental data retrieval (EDR)

Retrieve consistent, well-formatted JSON and CSV data from Arraylake using the EDR API developed by the Open Geospatial Consortium (OGC).

Read more →
Network data access protocol (OPeNDAP)

Network data access protocol (OPeNDAP)

Query subsets and aggregates of array data in a range of data formats (NetCDF, HDF, GRIB) without downloading entire files.

Read more →
open source leadership

Open-source core

Built on the Icechunk/Zarr open-source data storage engine, with Arraylake, enterprises avoid vendor lock-in, maintain data sovereignty and control, and ensure seamless data portability through adherence to open standards.

Earthmover founders Dr. Ryan Abernathey and Dr. Joe Hamman, have spent their careers doing cutting-edge research in climate science, remote sensing, and cloud-native data analytics. They maintain numerous critical open-source scientific Python packages, including Xarray, Zarr, and the Pangeo Project. Xarray and Zarr together have over 4,000 stars on Github and are used by teams at NVIDIA, NOAA, Google, Microsoft, and more.

Learn more about our open source leadership
Learn

Build smarter with expert guidance

Accelerate your roadmap with expert guidance from climate scientists and data engineers building the leading open source solutions and defining the modern workflow for multidimensional data.

Want to learn more? Book a demo or join our mailing list to stay up to date with new releases.