Skip To Content ArcGIS for Developers Sign In Dashboard

Display scenes in augmented reality

ArcGIS Runtime provides support augmented reality through the AR Toolkit.

AR Toolkit is open source, so you can use it as is or modify its components to meet your needs. You can keep your changes proprietary or share them with the open source community.

AR experiences can be thought of in terms of three common patterns: tabletop, flyover, and world-scale. Runtime supports all three, and the rest of this document describes how to implement each.

  • Flyover – Flyover AR is a kind of AR scenario that allows you to explore a scene using your device as a window into the virtual world. A typical flyover AR scenario will start with the scene’s virtual camera positioned over an area of interest. You can walk around and reorient the device to focus on specific content in the scene.
  • Tabletop – A kind of AR scenario where scene content is anchored to a physical surface, as if it were a 3D-printed model. You can walk around the tabletop and view the scene from different angles.
  • World-scale – A kind of AR scenario where scene content is rendered exactly where it would be in the physical world. This is used in scenarios ranging from viewing hidden infrastructure to displaying waypoints for navigation. In AR, the real world, rather than a basemap, provides the context for your GIS data.

Enable your app for AR using AR Toolkit

  1. See the toolkit repo on GitHub for the latest instructions for installing: Toolkit repo on GitHub
  2. Add an ArcGISArView to your app.
  3. Configure privacy and permissions.
  4. Now you're ready to add tabletop AR , add flyover AR , or add world-scale AR to your app.

Add an AR view to your app

ArcGISArView uses an underlying ARKit or ARCore view and an ArcGIS Runtime SceneView . Use the following on ArcGISArView to configure AR:

  • translationFactor - controls the relationship between physical device position changes and changes in the position of the scene view's camera. This is useful for tabletop and flyover AR.
  • originCamera - controls the initial position of the scene view's camera. When position tracking is started, ArcGISArView transforms the scene view camera's position using a transformation matrix provided by ARKit or ARCore. Once the origin camera is set, the manipulation of the scene view's camera is handled automatically.
  • setInitialTransformation – takes a point on the screen, finds the surface represented by that point, and applies a transformation such that the origin camera is pinned to the location represented by that point. This is useful for pinning content to a surface, which is needed for tabletop AR.
In addition to AR Toolkit features, you'll need to use the following ArcGIS Runtime features provided by the underlying scene view when creating AR experiences:
  • Scene view space effect control — Disable rendering the 'starry sky' effect to display scene content on top of a camera feed.
  • Scene view atmosphere effect control — Disable rendering the atmosphere effect to avoid obscuring rendered content.
  • Surface transparency — Hide the ground when rendering world-scale AR because the camera feed, not the basemap, is providing context for your GIS content. You can use a semitransparent surface to calibrate your position in world-scale AR.
  • Scene view navigation constraint — By default, scene views constrain the camera to being above the ground. You should disable this feature to enable users to use world-scale AR underground (for example, while in a basement). The navigation constraint will interfere with tabletop AR if the user attempts to look at the scene from below.
To use ArcGISArView , first add it to the view, then configure the lifecycle methods to start and stop tracking as needed.

import Esri.ArcGISArToolkit 1.0

Item {

    ArcGISArView {
        id: arcGISArView
        anchors.fill: parent
        sceneView: view
        tracking: true

    SceneView {
        id: view
        anchors.fill: parent

        Rectangle {
            anchors {
                bottom: view.attributionTop
                horizontalCenter: parent.horizontalCenter
                margins: 5
            width: childrenRect.width
            height: childrenRect.height
            color: "#88ffffff" // transparent white
            radius: 5
            visible: model.dialogVisible
            Text {
                anchors.centerIn: parent
                padding: 2
                font.bold: true
                text: qsTr("Touch screen to place the tabletop scene...")

    // Declare the C++ instance which creates the scene etc. and supply the view
    DisplayScenesInTabletopARSample {
        id: model
        arcGISArView: arcGISArView
        sceneView: view

Configure privacy and permissions

Before you can use augmented reality, you'll need to request location and camera permissions.

On iOS, ensure the following properties are set in info.plist :

  • Privacy - Camera Usage Description
  • Privacy - Location When In Use Usage Description

The deployment target should be set to a supported version of iOS (see System requirements for details).

On Android, you'll need to request location permissions before using ARCore. Ensure that the following permissions are specified in AndoidManifest.xml :


When starting the AR experience, ensure that the user has granted permissions. See Qt documentation for details.

Note that the device must support ARCore for ArcGISArView to work on Android. Google maintains a list of supported devices. ARCore is a separate installable component delivered via Google Play.

<!-- Indicates that app requires ARCore ("AR Required"). Causes Google
         Play Store to download and install ARCore along with the app.
         For an "AR Optional" app, specify "optional" instead of "required".
<application ...>
    <meta-data android:name="" android:value="required" />

The following declaration (outside of the application element) will ensure that the app only displays in the Play Store if the device supports ARCore:

<!-- Indicates that app requires ARCore ("AR Required"). Ensures app is only
         visible in the Google Play Store on devices that support ARCore.
         For "AR Optional" apps remove this line.
<application ...>
    <uses-feature android:name="" android:required="true" />

If you’d like to restrict your app to installing only on devices that support ARKit, add arkit to the required device capabilities section of info.plist :


Once you have installed the toolkit, configured your app to meet privacy requirements, requested location permissions, and added an ArcGISArView to your app, you can begin implementing your AR experience.

Understand Common AR Patterns

There are many AR scenarios you can achieve with Runtime. This SDK recognizes the following common patterns for AR:

  • Flyover – Flyover AR is a kind of AR scenario that allows you to explore a scene using your device as a window into the virtual world. A typical flyover AR scenario will start with the scene’s virtual camera positioned over an area of interest. You can walk around and reorient the device to focus on specific content in the scene.
  • Tabletop – A kind of AR scenario where scene content is anchored to a physical surface, as if it were a 3D-printed model. You can walk around the tabletop and view the scene from different angles.
  • World-scale – A kind of AR scenario where scene content is rendered exactly where it would be in the physical world. This is used in scenarios ranging from viewing hidden infrastructure to displaying waypoints for navigation. In AR, the real world, rather than a basemap, provides the context for your GIS data.

Each experience is built using some combination of the features in Runtime and the toolkit and some basic behavioral assumptions.

AR pattern Origin camera Translation factor Scene view Base surface

Flyover AR

Above the tallest content in the scene

A large value to enable rapid traversal; 0 to restrict movement

Space effect:





Tabletop AR

On the ground at the center or lowest point on the scene

Based on the size of the target content and the physical table

Space effect:





World-scale AR

At the same location as the physical device camera

1, to keep virtual content in sync with real-world environment

Space effect:




Optional for calibration


While the table describes the settings for the common AR scenarios in general, you have freedom to define your own configurations and build scenarios not explicitly mentioned in this guide.

Add tabletop AR to your app

Tabletop AR allows you to use your device to interact with scenes as if they were 3D-printed models sitting on your desk. You could, for example, use tabletop AR to virtually explore a proposed development without needing to create a physical model.


See Learn more below for links to additional resources.

Configure content for tabletop AR

The goal of a tabletop AR experience is to create the illusion of scene content sitting on your desk. Consider the following guidelines when creating content for your tabletop experience :

  • Keep scene content focused; ideally a single geographic feature or a small area of a city.
  • Ensure that scene content looks OK from below; your users should be able to look at the scene from an y angle.
  • If you’re not using a pre-defined scene (you allow users to load arbitrary scenes from ArcGIS Online, in other words), define a clipping distance to create a realistic tabletop experience.

Implement tabletop AR

Tabletop AR often allows users to place scene content on a physical surface (e.g. top of a desk) of their choice. Once the content is placed, it stays anchored to the surface as the user moves around it.

  1. Create an ArcGISArView and add it to the view
  2. Once the user has tapped a point, call setInitialTransformation . The AR toolkit will use the native platform’s plane detection to position the virtual camera relative to the plane. If the result is true, the transformation has been set successfully and you can place the scene.

    // Set the initial transformation using the point clicked on the screen
    const QPoint screenPoint = event.localPos().toPoint();

  3. Create and display the scene. For demonstration purposes, this code uses the Philadelphia mobile scene package because it is particularly well-suited for tabletop display. You can download that .mspk and add it to your project to make the code below work. Alternatively, you can use any scene for tabletop mapping, but be sure to define a clipping distance for a proper tabletop experience.

    // Create scene package and connect to signals
    void DisplayScenesInTabletopAR::createScenePackage(const QString& path)
      m_scenePackage = new MobileScenePackage(path, this);
      connect(m_scenePackage, &MobileScenePackage::doneLoading, this, &DisplayScenesInTabletopAR::packageLoaded);
      connect(m_permissionsHelper, &PermissionsHelper::requestFilesystemAccessCompleted, this, [this]()
      if (!m_permissionsHelper->fileSystemAccessGranted())

    // Slot for handling when the package loads
    void DisplayScenesInTabletopAR::packageLoaded(const Error& e)
      if (!e.isEmpty())
        qDebug() << QString("Package load error: %1 %2").arg(e.message(), e.additionalMessage());
      if (m_scenePackage->scenes().isEmpty())
      // Get the first scene
      m_scene = m_scenePackage->scenes().at(0);
      // Create a camera at the bottom and center of the scene.
      // This camera is the point at which the scene is pinned to the real-world surface.
      m_originCamera = Camera(39.95787000283599, -75.16996728256345, 8.813445091247559, 0, 90, 0);

  4. Find an anchor point in the scene. You can use a known value, a user-selected value, or a computed value. For simplicity, this example uses a known value. Place the origin camera at that point. Set the navigation constraint on the scene’s base surface to None .

    // Set the origin camera.
    // Enable subsurface navigation. This allows you to look at the scene from below.

  5. Set the translation factor on the ArcGIS AR view so that the whole scene can be viewed by moving around it. A useful formula for determining this value is translation factor = virtual content width / desired physical content width. The desired physical content width is the size of the physical table while virtual content width is the real-world size of the scene content; both measurements should be in meters. You can set the virtual content width by setting a clipping distance.

    // Set the translation factor based on the scene content width and desired physical size.

Guidelines for tabletop AR user experience

To create an optimal tabletop mapping experience:

  • Set expectations about lighting before starting the experience. AR doesn't work well in low light.
  • Don't allow users to view arbitrary content in AR. Use curated content that has been designed.
  • Keep scene content focused; this should ideally be a single geographic feature or a small area of a city.
  • Ensure that the scene looks good from below; your users may attempt to look at the scene from any angle.
  • When using scenes that aren’t confined to a small area, set a clipping distance to ensure that content is limited to a small area on the physical surface.

Learn more

Add flyover AR to your app

Flyover AR displays a scene while using the movement of the physical device to move the scene view camera. For example, you can walk around while holding up your device as a window into the scene. Unlike other AR experiences, the camera feed is not displayed to the user , making flyover more similar to a traditional virtual reality (VR) experience.

Flyover is the simplest AR scenario to implement, as there is only a loose relationship between the physical world and the rendered virtual world. With flyover, you can imagine your device as a window into the virtual scene.


See Learn more below for links to additional resources.

Implement flyover AR

  1. Create the AR view and add it to the UI.
  2. Create the scene, add any content, then display it. This example uses an integrated mesh layer of an area on the US-Mexico border.

    ExploreScenesInFlyoverAR::ExploreScenesInFlyoverAR(QObject* parent /* = nullptr */):
      m_scene(new Scene(Basemap::imagery(this), this))
      // create a new elevation source from Terrain3D REST service
      ArcGISTiledElevationSource* elevationSource = new ArcGISTiledElevationSource(
            QUrl(""), this);
      // add the elevation source to the scene to display elevation
      // create the integrated mesh layer
      const QUrl meshLyrUrl("");
      m_integratedMeshLayer = new IntegratedMeshLayer(meshLyrUrl, this);
      // add the layer to the scene

  3. Place the origin camera above the content you want the user to explore, ideally in the center. Typically, you’ll want to place the origin camera above the highest point in your scene. Constrain navigation to stay above the scene's base surface.

    Continued from above ...

    connect(m_integratedMeshLayer, &IntegratedMeshLayer::doneLoading, this, [this](Error e)
      if (!e.isEmpty())
        qDebug() << e.code() << e.message() << " - " << e.additionalMessage();
      // Enable subsurface navigation. This allows you to look at the scene from below.
      const Point centerPoint = m_integratedMeshLayer->fullExtent().center();
      // Start with the camera at the center of the mesh layer.
      m_originCamera = Camera(centerPoint.y(), centerPoint.x(), 250, 0, 90, 0);

  4. Set the translation factor to allow rapid traversal of the scene. The translation factor defines the relationship between physical device movement and virtual camera movement. To create a more immersive experience, set the space effect on the scene view to Stars and the atmosphere effect to Realistic .

    Continued from above ...

    // Set the translation factor to enable rapid movement through the scene.
    // Enable atmosphere and space effects for a more immersive experience.
    emit arcGISArViewChanged();
    emit sceneViewChanged();

Guidelines for flyover AR

Consider the following guidelines for creating high-quality flyover AR experiences:

  • Set expectations about the environment before starting the AR experience. Flyover AR doesn't work well in low light or in tight, constrained spaces.
  • Start the AR experience in the center of the area of interest for your scene. Users are likely to move around freely, including rotating and looking behind where they originally start.

Learn more

Add world-scale AR to your app

A world-scale AR experience is defined by the following characteristics:

      • The scene camera is positioned to precisely match the position and orientation of the device’s physical camera
      • Scene content is placed in the context of the real world by matching the scene view’s virtual camera position and orientation to that of the physical device camera.
      • Context aids, like the basemap, are hidden; the camera feed provides real-world context.

      Some example use cases of world-scale AR include:

      • Visualizing hidden infrastructure, like sewers, water mains, and telecom conduits.
      • Maintaining context while performing rapid data collection for a survey.
      • Visualizing a route line while navigating.

      See Learn more below for links to additional resources.

      Configure content for world-scale AR

      The goal of a world-scale AR experience is to create the illusion that your GIS content is physically present in the world around you. There are several requirements for content that will be used for world-scale AR that go beyond what is typically required for 2D mapping.

      • Ensure that all data has an accurate elevation (or Z) value. For dynamically generated graphics (for example, route results) use an elevation surface to add elevation.
      • Use an elevation source in your scene to ensure all content is placed accurately relative to the user.
      • Don't use 3D symbology that closely matches the exact shape of the feature it represents. For example, do not use a generic tree model to represent tree features or a fire hydrant to represent fire hydrant features. Generic symbology won’t capture the unique geometry of actual real-world objects and will highlight minor inaccuracies in position.
      • Consider how you present content that would otherwise be obscured in the real world, as the parallax effect can make that content appear to move unnaturally. For example, underground pipes will ‘float’ relative to the surface, even though they are at a fixed point underground. Have a plan to educate users, or consider adding visual guides, like lines drawn to link the hidden feature to the obscuring surface (for example, the ground).
      • By default, ArcGIS Runtime renders content over a large distance, which can be problematic when you are trying to view a limited subset of nearby features (just the pipes in your building, not for the entire campus, for example). You can use the clipping distance to limit the area over which scene content renders.

            Understand location tracking options for world-scale AR

            There are a few strategies for determining the device’s position in the world and maintaining that position over time:

            • Use the device’s location data source (for example, GPS) to acquire an initial position and make further position updates using ARKit and ARCore only.
            • Use the location data source continuously.

            With continuous updates, the origina camera is set every time the location data source provides a new update. With a one-time update, the origin camera is set only once.

            There are benefits and drawbacks to each approach that you should consider when designing your AR experience:

            • One-time update
              • Advantage: ARKit/ARCore tracking is more precise than most location data sources.
              • Advantage: Content stays convincingly pinned to its real-world position, with minimal drifting or jumping.
              • Disadvantage: Error accumulates the further you venture from where you start the experience.
            • Continuous update
              • Advantage: Works over a larger area than ARKit or ARCore.
              • Disadvantage: Visualized content will jump as you move through the world and the device’s location is updated (as infrequently as once per second rather than ARKit’s 60 times per second).
              • Disadvantage: Because the origin camera is constantly being reset, you can’t use panning to manually correct position errors.

            You don’t need to make a binary choice between approaches for your app. Your app can use continuous updates while the user moves through larger areas, then switch to a primarily ARKit or ARCore-driven experience when you need greater precision.

            With ArcGISArView , the choice of location strategy is specified in the call to startTracking. To change the location update mode, stop tracking and then resume tracking with the desired mode.


            In the samples, continuous update mode is referred to as Roaming and one-time update mode is referred to as Local.

            Implement world-scale AR

            1. Create an ArcGISArView and add it to the view.
            2. Configure the ArcGISArView  with a location data source. The location data source provides location information for the device. The AR scene view uses the location data source to place the virtual scene camera close to the location of the physical device’s camera.

              m_arcGISArView->setLocationDataSource(new LocationDataSource(m_arcGISArView));

            3. Configure the scene for AR by setting the space and atmosphere effects and adding an elevation source, then display it.

              // add elevation source
              const QUrl elevationSourceUrl("");
              const QList<ElevationSource*> sources = { new ArcGISTiledElevationSource(elevationSourceUrl, m_scene) };
              auto* baseSurface = new Surface(sources, m_scene);
              BackgroundGrid grid;
              // set the space and atmosphere effects

            4. Start tracking using one of two location update modes, continuous or once only.

              // Continuous update mode
              // One-time mode

            5. Provide a calibration UI to allow your users to correct heading, elevation, and location errors.

            Learn more

            Guidelines for world-scale AR

            Consider the following guidelines to help you overcome common obstacles to creating a useful world-scale AR experience:

            • Create a calibration experience that works for your users’ needs in the environment they will be working in. See Enable calibration for world-scale AR for details.
            • Educate your users on what to expect from world-scale experiences, including calibration and parallax issues.
            • Consider how and where your users will use your AR experience. Due to limitations in GNSS and compass accuracy, your users will need to calibrate their initial starting position. Ensure that your users have the tools they need to calibrate their position, whether that is a known landmark, a fixed calibration point, or comparison to known reference features. See Enable calibration for world-scale AR for details.
            • Consider encouraging users to use high-accuracy positioning devices if practical.

            Enable calibration for world-scale AR

            World-scale AR depends on a close match between the positions and orientations of the device’s physical camera and the scene view’s virtual camera. Any error in the device’s position or orientation will degrade the experience. Consider each of the following key properties as common sources of error:

            • Heading – Usually determined using a magnetometer (compass) on the device
            • Elevation/Altitude (Z) – Usually determined using GPS/GNSS or a barometer
            • Position (X,Y) – usually determined using GPS/GNSS, cell triangulation, or beacons

            The following examples illustrate these errors by showing a semitransparent basemap for comparison with the ground truth provided by the camera:

              Design a calibration workflow

              There are many ways to calibrate position and heading. In most scenarios, you’ll need to provide one or more points of comparison between scene content and the real-world ground truth. Consider the following options for allowing the user to visually calibrate the position:

              • Align the imagery on the basemap with the camera feed.
              • Align a known calibration feature with its real-world equivalent (for example, a previously recorded tree feature).
              • Define a start point and heading and direct the user.

              Consider the following UI options for allowing the user to adjust the calibration:

              • Display sliders for orientation and elevation adjustment.
              • Use 'joystick' sliders, where the further from the center the slider moves, the faster the adjustment goes.
              • Use an image placed in a known position in conjunction with ARKit / ARCore image detection to automatically determine the device's position.

              Explicitly plan for calibration when designing your AR experiences. Consider how and where your users will use your app. Not all calibration workflows are appropriate for all locations or use cases.


              As described in Understand two location tracking options for world-scale AR , you can configure the AR view to reset the device’s position using the location data source continuously or just once. If you would like to manually adjust position or altitude while using continuous location updates, you will need to implement a custom location data source.

              Identify real-world and in-scene objects

              ArcGIS Runtime scene views have two methods for determining the location in a scene that corresponds to a point on the device's screen:

                • screenToBaseSurface – ignores non-surface content, like 3D buildings
                • screenToLocation – includes non-surface content

                ArcGISArView has arScreenToLocation, which:

                1. Performs a hit test using ARKit / ARCore to find a real- world plane.
                2. Applies a transformation to determine the physical position of that plane relative to the known position of the device's camera.
                3. Returns the real- world position of the tapped plane.

                You can use  to enable field data collection workflows where users tap to identify real-world objects in the camera feed as detected by ARKit / ARCore . The position of the tapped object will be more accurate than using the device's location, as you might with a typical field data collection process.

                Manage vertical space in world-scale AR

                Accurate positioning is particularly important to world-scale AR; even small errors can break the perception that the virtual content is anchored in the real world. Unlike 2D mapping, Z values are important. And unlike traditional 3D experiences, you need to know the position of the user’s device.

                Be aware of the following common Z-value challenges that you’re likely to encounter while building AR experiences:

                • Many kinds of Z values – Android and iOS devices differ in how they represent altitude/elevation/Z values.
                • Imprecise altitude – Altitude/Elevation is the least precise measurement offered by GPS/GNSS. In testing, we found devices reported elevations that were anywhere between 10 and 100 above or below the true value, even under ideal conditions.

                Many kinds of Z values

                Just as there are many ways to represent position using X and Y values, there are many ways to represent Z values. GPS devices tend to use two primary reference systems for altitude/elevation:

                • WGS84 – Height Above Ellipsoid (HAE)
                • Orthometric – Height Above Mean Sea Level (MSL)

                The full depth of the differences between these two references is beyond the scope of this topic, but do keep in mind the following facts:

                  • Android devices return elevations in HAE, while iOS devices return altitude in MSL.
                  • It is not trivial to convert between HAE and MSL; MSL is based on a measurement of the Earth’s gravitational field. There are many models, and you may not know which model was used to when generating data.
                  • Esri’s world elevation service uses orthometric altitudes.
                  • The difference between MSL and HAE varies by location and can be on the order of tens of meters. For example, at Esri’s HQ in Redlands, California, the MSL altitude is about 30 meters higher than the HAE elevation.

                  It is important that you understand how your Z values are defined to ensure that data is placed correctly in the scene. For example, the Esri world elevation service uses MSL for its Z values. If you set the origin camera using an HAE Z value, you could be tens of meters off from the desired location.

                  To gain a deeper understanding of these issues, see ArcUser: Mean Sea Level, GPS, and the Geoid.

                        Learn more

                        Visualize planes and features detected by ARKit and ARCore

                        Some workflows, like tapping to place a tabletop scene or collecting a feature, rely on ARKit / ARCore features that detect planes. Plane visualization is particularly useful for two common scenarios:

                        • Visualization provides feedback to users, so they know which surfaces the app has detected and can interact with
                        • Visualization is useful while developing and testing your app

                        ARKit / ARCore can be configured to render planes.

                        // Set whether planes should be visible.
                        void CppArExample::showPlanes(bool visible)
                          if (visible)
                            m_arcGISArView->setPlaneColor(QColor(255, 0, 0, 10 ));