Install and set up
GeoAnalytics Engine can be installed on a personal computer, a standalone Spark cluster, or a managed Spark service in the cloud.
Installing on a personal computer
Apache Spark supports a local deployment mode that is useful for testing in a shell or notebook before using resources on a larger Spark cluster. This deployment mode lets you run PySpark code using your personal computer's resources as a single-node cluster.
See this guide for instructions on using GeoAnalytics Engine in Spark local mode.
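As a rough illustration of local mode, the sketch below starts a Spark session that runs entirely on the local machine. It assumes PySpark is already installed. The helper names `local_master` and `make_local_session` and the application name are hypothetical, chosen only for this example. It does not configure GeoAnalytics Engine itself; that step is covered by the guide referenced above.

```python
def local_master(cores=None):
    """Build a Spark master URL for local mode.

    "local[*]" uses all available cores; "local[N]" uses N worker threads.
    """
    if cores is None:
        return "local[*]"
    if cores < 1:
        raise ValueError("cores must be a positive integer")
    return f"local[{cores}]"


def make_local_session(app_name="geoanalytics-local-test", cores=None):
    """Create (or reuse) a SparkSession in local mode.

    PySpark is imported here so that the URL helper above can be used
    without a Spark installation. The app name is a placeholder.
    """
    from pyspark.sql import SparkSession

    return (
        SparkSession.builder
        .master(local_master(cores))
        .appName(app_name)
        .getOrCreate()
    )
```

Calling `make_local_session(cores=2)` in a shell or notebook would start Spark with two local worker threads, which is usually enough to check that code runs before moving to a larger cluster.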
Installing on a Spark standalone cluster
For large datasets, a cluster or managed Spark service lets you scale out compute resources across multiple machines. Spark cluster mode allows you to configure Apache Spark on any number of nodes in a cluster of machines that you deploy. See this guide for instructions on using GeoAnalytics Engine in Spark cluster mode.
Installing on a managed Spark service in the cloud
GeoAnalytics Engine supports use with the following managed Spark services:
Within each service, you can deploy customized Spark clusters and PySpark notebooks. The advantages of deploying a Spark cluster in the cloud include a low startup cost, the ability to deploy and shut down resources quickly, and the option to scale resources up or down as needed.
Dependencies
GeoAnalytics Engine extends Spark and thus requires Spark and its dependencies to be installed prior to using the API. The table below summarizes which versions of Spark and its dependencies are supported by each version of GeoAnalytics Engine.
| GeoAnalytics Engine | Spark | Python | Java | Scala |
|---|---|---|---|---|
| 1.0.x | 3.0.1-3.2.x | 3.7+ | 8/11 | 2.12 |
Support for new versions of Spark or its dependencies may be added with any minor release, while support for older versions may be dropped with any major release. For more information, see Versioning policy. Managed Spark services hosted in the cloud are often pre-configured with Spark dependencies and ready to use. See the install guide for each cloud provider for the list of runtimes supported by GeoAnalytics Engine.
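Before installing, it can help to check an environment against the supported versions listed above. The following is a minimal sketch, not part of the GeoAnalytics Engine API. The function names are hypothetical, and the ranges are transcribed from the 1.0.x row of the table (Spark 3.0.1 through 3.2.x, Python 3.7 or later).

```python
import sys

# Ranges transcribed from the 1.0.x row of the dependency table.
MIN_SPARK = (3, 0, 1)        # lowest supported Spark release
MAX_SPARK_MINOR = (3, 2)     # any 3.2.x patch release is accepted
MIN_PYTHON = (3, 7)


def parse_version(text):
    """Turn a string such as '3.1.2' into (3, 1, 2).

    Non-numeric suffixes (for example '-SNAPSHOT') are ignored.
    """
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if ch.isdigit():
                digits += ch
            else:
                break
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def spark_supported(spark_version):
    """Return True if the Spark version falls within 3.0.1 to 3.2.x."""
    v = parse_version(spark_version)
    v = v + (0,) * (3 - len(v))  # pad '3.1' to (3, 1, 0)
    return MIN_SPARK <= v[:3] and v[:2] <= MAX_SPARK_MINOR


def python_supported(version_info=sys.version_info):
    """Return True if the Python version is 3.7 or later."""
    return tuple(version_info[:2]) >= MIN_PYTHON
```

In an environment where PySpark is installed, the Spark version string could be taken from `pyspark.__version__` and passed to `spark_supported`. Java and Scala versions are not checked in this sketch.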