Overview

ArcGIS GeoAnalytics Engine delivers spatial analysis to your big data by extending Apache Spark with ready-to-use SQL functions and analysis tools. ArcGIS GeoAnalytics Engine includes a Spark plugin and a Python library, geoanalytics, that is used to drive analysis.

Key features

  • Fully integrated with Apache Spark—Process spatial data at scale with functions developed and tested by Esri, the global leader in GIS, location intelligence, and mapping.
  • Easy to use—Build spatially-enabled big data pipelines with an intuitive Python API that extends PySpark.
  • 100+ spatial SQL functions—Create geometries, test spatial relationships, and more using Python or SQL syntax.
  • Powerful analysis tools—Run common spatiotemporal and statistical analysis workflows with only a few lines of code.
  • Automatic spatial indexing—Perform optimized spatial joins and other operations immediately.
  • Read from and write to common data sources—Load and save data from shapefiles, feature services, and vector tiles.
  • Cloud-native—Tested and ready to install on Databricks, Amazon EMR, Azure Synapse, and Google Cloud Dataproc.

Where to start

For more information see the ArcGIS GeoAnalytics Engine product page.

To learn more about bringing ArcGIS GeoAnalytics Engine into your Spark environment, see Install and set up and Licensing and Authorization.

Once you have GeoAnalytics Engine installed, see Get started for a quick tutorial that introduces some basic features and capabilities of the geoanalytics library.

Your browser is no longer supported. Please upgrade your browser for the best experience. See our browser deprecation post for more details.