Mapping the 2019 Novel Coronavirus Pandemic

According to WHO, 2019 Novel Corona Virus (COVID-19) is a virus (more specifically, a coronavirus) identified as the cause of an outbreak of respiratory illness, which was unknown before the outbreak began in Wuhan, China, in December 2019 [1]. Early on, the disease demonstrated an animal-to-person spread, then a person-to-person spread. Infections with COVID-19, were reported in a growing number of international locations, including the United States". The United States reported the first confirmed instance of person-to-person spread with this virus on January 30, 2020 [2].

This notebook shows how to use the ArcGIS API for Python to monitor the spread of COVID-19 as it became a pandemic (updated as of June 28th, 2021).

1. Import Data

Esri provides an open-to-public and free-to-share feature layer that contains the most up-to-date COVID-19 cases covering China, the United States, Canada, Australia (at province/state level), and the rest of the world (at country level, represented by either the country centroids or their capitals). Data sources are WHO, US CDC, China NHC, ECDC, and DXY. The China data is updated automatically at least once per hour, and non-China data is updating manually. The data source repo that this layer referenced from, is created and maintained by the Center for Systems Science and Engineering (CSSE) at the Johns Hopkins University, and can be viewed here. In this notebook, we will use the feature layer supported by Esri Living Atlas team and JHU Data Services, and provide a different perspective in viewing the global maps of COVID-19 via the use of ArcGIS API for Python.

NOTE: "Since COVID-19 is continuously evolving, the sample reflects data as of June 28th, 2021 (or if you are looking at a previously published version of this notebook, the data and maps were fetched/made as of July 4th, 2020). Running this notebook at a later date might reflect a different result, but the overall steps should hold good."

DISCLAIMER: "This notebook is for the purpose of illustrating an analytical process using Jupyter Notebooks and ArcGIS API for Python, and should not be used as medical or epidemiological advice."

Necessary Imports

from io import BytesIO
import requests
import pandas as pd
from arcgis.features import FeatureLayer
from arcgis.gis import GIS
from arcgis.mapping import WebMap
"""
# if you are using arcgis api for python with version 1.8.0 or above,
# make sure that the pandas version>=1.0,
# if not, use `pip install --upgrade pandas>=1` to upgrade.
"""
pd.__version__
'1.2.3'

Get data for the analysis

Query the source feature layer

The dashboard item contributed to the public by Esri and JHU CSSE, is accessible through here:

gis = GIS(profile="your_online_profile")
item = gis.content.search("Coronavirus_2019_nCoV_Cases owner:CSSE_covid19", outside_org=True)[0]
item
Coronavirus COVID-19 (2019-nCoV)
This ArcGIS Dashboard created by the CSSE Team at Johns Hopkins contains the most up-to-date coronavirus COVID-19 (2019-nCoV) cases and latest trend plot.Dashboard by CSSE_covid19
Last Modified: June 03, 2021
0 comments, 301,342,595 views

Through the API Explorer provided along with the dashboard product, we can easily fetch the source URL for the Feature Service containing daily updated COVID-19 statistics, which can then be used to create a FeatureLayer object good for querying and visualizing.

src_url = "https://services1.arcgis.com/0MSEUqKaxRlEPj5g/arcgis/rest/services/Coronavirus_2019_nCoV_Cases/FeatureServer/1"
fl = FeatureLayer(url=src_url)
df_global = fl.query(where="1=1",
                     return_geometry=True,
                     as_df=True)

As stated in the dashboard, the source data can be grouped into:

  • A. Countries or regions of which data are collected at province/state level, e.g. China, the United States, Canada, Australia;
  • B. Countries or regions for the rest of the world of which data collected at country level, and shape represented by either the country centroids or their capitals;
  • C. Cruise Ships with confirmed COVID-19 cases.
Group A

Let us first take a look at how many countries are within group A, that Country_Region and Province_State are not null or NAN.

df_global[~pd.isnull(df_global['Province_State'])].groupby('Country_Region').sum(numeric_only=True)[['Confirmed', 'Recovered', 'Deaths']]
ConfirmedRecoveredDeaths
Country_Region
Australia3052823781910
Belgium1082476025160
Brazil1842059816160826513474
Canada1421009138562326188
Chile1547103148441032298
China103727983314847
Colombia41587163854457104678
Denmark8227981
France128701641021069
Germany3734597361901090777
India3027933129309607396730
Italy42580694073435127472
Japan79707976320814659
Mexico25057921990610232564
Netherlands17115302729018024
New Zealand100
Pakistan95565790120122231
Peru20481152007201191899
Russia54087444911752131671
Spain379264215037680789
Sweden1088896014619
US336315320604030
Ukraine2298478222138054517
United Kingdom477128915724128367

Each country/region in Group A, has more than 1 feature, as what we have seen below from the query() results.

fset_usa = fl.query(where="Country_Region='US'")
fset_usa
<FeatureSet> 60 features
fset_china = fl.query(where="Country_Region='China'")
fset_china
<FeatureSet> 34 features
fl.query(where="Country_Region='Denmark'")
<FeatureSet> 3 features
Group C

Group C contains cruise ships across the globe with reported cases:

df_cruise_ships = fl.query(where="Province_State='Diamond Princess' or \
                                  Province_State='Grand Princess' or \
                                  Country_Region='MS Zaandam' or \
                                  Country_Region='Diamond Princess'",
                           as_df=True)
df_cruise_ships[["Province_State", "Country_Region", "Last_Update", "Confirmed", "Recovered", "Deaths"]]
Province_StateCountry_RegionLast_UpdateConfirmedRecoveredDeaths
0Diamond PrincessCanada2020-07-05 00:34:01001
1Grand PrincessCanada2020-07-05 00:34:0113130
2NoneDiamond Princess2020-07-05 00:34:0171265113
3NoneMS Zaandam2020-07-05 00:34:01902
4Grand PrincessUS2020-07-05 00:34:0110303
5Diamond PrincessUS2020-07-05 00:34:014900
Group B

In the df_global, other than the 22 countries (Australia, Canada, China, etc.) in Group A, and those cruise ships in Group C, all other countries/regions fall into Group B, e.g. Thailand. The major difference between Group A and Group B is that the latter contains one and only feature per country.

fl.query(where="Country_Region='Thailand'")
<FeatureSet> 1 features

Query the reference feature layers

Because the geo-information provided by the dashboard contains only the coordinates representing the centroid of each country/region, the feature layer can only be rendered as points on Map. If this is what you want, you can now skip the rest of section 1, and jump right onto section 2.

On the other hand, if you want to visualize the confirmed/death/recovered cases per country/region as polygons, in other words as a choropleth map, please read along:

First, we need to access the feature service that contains geometry/shape info for all provinces in Mainland China, and merge with the COVID-19 DataFrame.

Access the reference feature layer of China
provinces_item = gis.content.get("0f57da7f853c4a1aa5b2e048ff8655d2")
provinces_item
Merged_China_Province_Boundaries_2018
Feature layer generated from Merge LayersFeature Layer Collection by api_data_owner
Last Modified: July 04, 2020
0 comments, 15 views
provinces_flayer = provinces_item.layers[0]
provinces_df = provinces_flayer.query(as_df=True)
provinces_df.columns
Index(['OBJECTID', 'ID', 'NAME', 'AREA', 'TOTPOP_CY', 'ISO_CODE', 'ISO_SUB',
       'ISO2_CC', 'ISO3_CC', 'Shape__Area', 'Shape__Length', 'ID_1',
       'sourceCountry', 'ENRICH_FID', 'aggregationMethod',
       'populationToPolygonSizeRating', 'apportionmentConfidence', 'HasData',
       'TOTPOP_CY_1', 'YEAR2018', 'SHAPE'],
      dtype='object')
tmp = provinces_df.sort_values('NAME', ascending=True)
provinces_df = tmp.drop_duplicates(subset='NAME', keep='last')
provinces_df.shape
(31, 21)
DataFrame Merging for China Dataset

The subsets of dataframe being created in the previous section now needs to be merged with feature services which have geographic information (e.g. geometries, shape, or longitude/latitude) in order to provide location and geometries required for geographic mapping. First, let's acquire the geometries from feature services existing on living atlas or arcgis online organization to represent the geographic information needed of overlap_rows_china.

df_china = fset_china.sdf[['Province_State', 'Confirmed', 'Recovered', 'Deaths']]
df_china = df_china.assign(NAME = df_china["Province_State"])
df_china.head()
Province_StateConfirmedRecoveredDeathsNAME
0Anhui10049986Anhui
1Beijing107810549Beijing
2Chongqing5985916Chongqing
3Fujian6726201Fujian
4Gansu1951922Gansu

Because the names are inconsistent between the two data sources (e.g. provinces represented differently in df_china['Province_State'] and provinces_df['NAME"]), the replace_value_in_column method is declared below to edit the records and unify the column names.

def replace_value_in_column(data_frame, l_value, r_value, column_name = 'NAME'):
    data_frame.loc[data_frame[column_name] == l_value, column_name] = r_value

replace_value_in_column(df_china, 'Guangxi', 'Guangxi Zhuang Autonomous Region')
replace_value_in_column(df_china, 'Inner Mongolia', 'Inner Mongolia Autonomous Region')
replace_value_in_column(df_china, 'Ningxia', 'Ningxia Hui Autonomous Region')
replace_value_in_column(df_china, 'Tibet', 'Tibet Autonomous Region')

Now the two DataFrame objects have got unified column names, we can go ahead to use a single function in Pandas called merge as an entry point to perform in-memory standard database join operations (similar to that of relational databases such as SQL), and its syntax is shown here -

pd.merge(left, right, how='inner', on=None, left_on=None, right_on=None,
         left_index=False, right_index=False, sort=True)

Note that, the how argument specifies how to merge (a.k.a. how to determine which keys are to be included in the resulting table). If a key combination does not appear in either the left or the right tables, the values in the joined table will be NA. The table below then shows a summary of the how options and their SQL equivalent names −

Merge MethodSQL EquivalentDescription
leftLEFT OUTER JOINUse keys from left object
rightRIGHT OUTER JOINUse keys from right object
outerFULL OUTER JOINUse union of keys
innerINNER JOINUse intersection of keys

In this case, we will be calling merge() with how='inner' to perform the inner join of the two DataFrame objects on the index field "NAME" and only to keep the intersection of keys.

cols_2 = ['NAME', 'AREA', 'TOTPOP_CY','SHAPE']
overlap_rows_china = pd.merge(left = provinces_df[cols_2], right = df_china, 
                              how='inner', on = 'NAME')
overlap_rows_china.head()
NAMEAREATOTPOP_CYSHAPEProvince_StateConfirmedRecoveredDeaths
0Anhui140139.79598761795934{'rings': [[[116.36710167, 34.643320083], [116...Anhui10049986
1Beijing16535.63027622237467{'rings': [[[116.647890001, 41.0513740000001],...Beijing107810549
2Chongqing82390.60069130365055{'rings': [[[108.50186348, 32.20025444], [108....Chongqing5985916
3Fujian121789.66683338659398{'rings': [[[118.426801682, 28.2953205110001],...Fujian6726201
4Gansu401138.39458126268236{'rings': [[[97.1721363070001, 42.793840408000...Gansu1951922
cols_2 = ['NAME', 'AREA', 'TOTPOP_CY','SHAPE','Shape__Area', 'Shape__Length']
overlap_rows_china = pd.merge(left = provinces_df[cols_2], right = df_china, how='inner',
                        on = 'NAME')
overlap_rows_china.head()
NAMEAREATOTPOP_CYSHAPEShape__AreaShape__LengthProvince_StateConfirmedRecoveredDeaths
0Anhui140139.79598761795934{'rings': [[[116.36710167, 34.643320083], [116...13.35290432.965496Anhui10049986
1Beijing16535.63027622237467{'rings': [[[116.647890001, 41.0513740000001],...NaNNaNBeijing107810549
2Chongqing82390.60069130365055{'rings': [[[108.50186348, 32.20025444], [108....7.70849230.308633Chongqing5985916
3Fujian121789.66683338659398{'rings': [[[118.426801682, 28.2953205110001],...10.98910554.634113Fujian6726201
4Gansu401138.39458126268236{'rings': [[[97.1721363070001, 42.793840408000...41.08570380.438096Gansu1951922

As shown in overlap_rows_china, each province/state in China is now merged while retaining the SHAPE column, and is ready to be rendered as polygons.

Access the reference feature layer of the United States

Next, we need to access the feature service that contains geometry/shape info for all states in the U. S., and merge with the DataFrame depicting COVID-19 statistics.

us_states_item = gis.content.get('99fd67933e754a1181cc755146be21ca')
us_states_item
USA States (Generalized)
USA States (Generalized) provides 2017 boundaries for the States of the United States in the 50 states and the District of Columbia.Feature Layer Collection by esri_dm
Last Modified: June 21, 2021
0 comments, 466,257,725 views
us_states_flayer = us_states_item.layers[0]
us_states_df = us_states_flayer.query(as_df=True)
us_states_df.columns
Index(['FID', 'STATE_NAME', 'STATE_FIPS', 'SUB_REGION', 'STATE_ABBR',
       'POPULATION', 'POP_SQMI', 'POP2010', 'POP10_SQMI', 'WHITE', 'BLACK',
       'AMERI_ES', 'ASIAN', 'HAWN_PI', 'HISPANIC', 'OTHER', 'MULT_RACE',
       'MALES', 'FEMALES', 'AGE_UNDER5', 'AGE_5_9', 'AGE_10_14', 'AGE_15_19',
       'AGE_20_24', 'AGE_25_34', 'AGE_35_44', 'AGE_45_54', 'AGE_55_64',
       'AGE_65_74', 'AGE_75_84', 'AGE_85_UP', 'MED_AGE', 'MED_AGE_M',
       'MED_AGE_F', 'HOUSEHOLDS', 'AVE_HH_SZ', 'HSEHLD_1_M', 'HSEHLD_1_F',
       'MARHH_CHD', 'MARHH_NO_C', 'MHH_CHILD', 'FHH_CHILD', 'FAMILIES',
       'AVE_FAM_SZ', 'HSE_UNITS', 'VACANT', 'OWNER_OCC', 'RENTER_OCC',
       'NO_FARMS12', 'AVE_SIZE12', 'CROP_ACR12', 'AVE_SALE12', 'SQMI',
       'Shape__Area', 'Shape__Length', 'GlobalID', 'SHAPE'],
      dtype='object')
DataFrame Merging for U.S. Dataset
df_usa = fset_usa.sdf[['Province_State', 'Confirmed', 'Recovered', 'Deaths']]
df_usa = df_usa.assign(STATE_NAME = df_usa["Province_State"])
df_usa.head()
Province_StateConfirmedRecoveredDeathsSTATE_NAME
0Mississippi32113807401Mississippi
1Grand Princess10303Grand Princess
2Oklahoma45757107384Oklahoma
3Delaware10968201694Delaware
4Minnesota60521807680Minnesota
cols_4 = ['STATE_NAME','SHAPE']
overlap_rows_usa = pd.merge(left = us_states_df[cols_4], right = df_usa, 
                            how='inner', on = 'STATE_NAME')
overlap_rows_usa.head()
STATE_NAMESHAPEProvince_StateConfirmedRecoveredDeaths
0Alaska{'rings': [[[-17959594.8053098, 8122953.575198...Alaska710350374
1California{'rings': [[[-13543710.3257494, 4603367.827345...California3816296063593
2Hawaii{'rings': [[[-17819334.303422, 2512026.7784964...Hawaii376470516
3Idaho{'rings': [[[-13027307.5891034, 5415905.134774...Idaho19471902145
4Nevada{'rings': [[[-13263990.1054907, 4637763.931898...Nevada33161405667
Access the reference feature layer of world countries
countries_item = gis.content.get('2b93b06dc0dc4e809d3c8db5cb96ba69')
countries_item
World Countries (Generalized)
World Countries (Generalized) provides optimized country boundaries with useful name and country code attributes. Current to January 2020.Feature Layer Collection by esri_dm
Last Modified: April 14, 2021
1 comments, 34,158,950 views
countries_flayer = countries_item.layers[0]
countries_df = countries_flayer.query(as_df=True)
countries_df.columns
Index(['FID', 'COUNTRY', 'ISO', 'COUNTRYAFF', 'AFF_ISO', 'Shape__Area',
       'Shape__Length', 'SHAPE'],
      dtype='object')
DataFrame Merging for global Dataset

The df_global has listed its Country_Region column with their current best-known names in English, while the countries_df uses their currently best-known equivalents, and this difference in naming countries has created a problem for the merge() operation to understand if the two countries listed in two DataFrame objects are the same. We need to hence run the following cell in order to make country names consistent between the two DataFrames to be merged.

df_global.loc[df_global['Country_Region']=='US', 'Country_Region'] = 'United States'
df_global.loc[df_global['Country_Region']=='Korea, South', 'Country_Region'] = 'South Korea'
df_global.loc[df_global['Country_Region']=='Korea, North', 'Country_Region'] = 'North Korea'
df_global.loc[df_global['Country_Region']=='Russia', 'Country_Region'] = 'Russian Federation'
df_global.loc[df_global['Country_Region']=='Czechia', 'Country_Region'] = 'Czech Republic'
List the top 10 countries with largest numbers

With df_global ready, we can now sort countries or regions by their numbers of confirmed/recovered/death cases, through usage of groupby(), and sort_values().

# sorted by # of confirmed cases
df_global_sum = df_global.groupby('Country_Region').sum(numeric_only=True)[['Confirmed', 'Recovered', 'Deaths']]
df_global_sum_c = df_global_sum.sort_values(by = ['Confirmed'], ascending = False)
df_global_sum_c.head(10)
ConfirmedRecoveredDeaths
Country_Region
United States336315320604030
India3027933129309607396730
Brazil1842059816160826513474
France5832225404395111130
Turkey5414310528055849634
Russian Federation54087444911752131671
United Kingdom477128915724128367
Argentina4405247402751092568
Italy42580694073435127472
Colombia41587163854457104678
# sorted by death tolls
df_global_sum_d = df_global_sum.sort_values(by = ['Deaths'], ascending = False)
df_global_sum_d.head(10)
ConfirmedRecoveredDeaths
Country_Region
United States336315320604030
Brazil1842059816160826513474
India3027933129309607396730
Mexico25057921990610232564
Peru20481152007201191899
Russian Federation54087444911752131671
United Kingdom477128915724128367
Italy42580694073435127472
France5832225404395111130
Colombia41587163854457104678
Joining the COVID-19 stats and world countries DataFrames
world_merged1 = pd.merge(df_global_sum_c, countries_df[['COUNTRY', 'SHAPE']], 
                         left_index=True, right_on='COUNTRY',
                         how="left")
world_merged1[['COUNTRY', 'Confirmed','Deaths', 'Recovered']].head(10)
COUNTRYConfirmedDeathsRecovered
154.0United States336315326040300
193.0India3027933139673029309607
20.0Brazil1842059851347416160826
155.0France5832225111130404395
151.0Turkey5414310496345280558
246.0Russian Federation54087441316714911752
89.0United Kingdom477128912836715724
14.0Argentina4405247925684027510
170.0Italy42580691274724073435
37.0Colombia41587161046783854457

Now, each country/region in world_merged1 is now merged with the SHAPE column, and is ready to be rendered as polygons.

2. Map the COVID-19 cases in China

Next, let us start visualizing the following scenarios targeting at China:

  • Confirmed cases rendered as points, and polygons
  • Death cases rendered as points, and polygons
  • Recovered cases rendered as points, and polygons

Map the confirmed COVID-19 cases in China

We can either call the add_layer() function to add the specified layer or item (i.e. the FeatureLayer object created as fl) to the map widget, and set the visualization options to be using ClassedSizeRenderer, or plot the derived SeDF on the map view directly with further descriptions such as how to renderer spatial data using symbol and color palette (Here, the SeDF is the derivative of the merged DataFrame, which now contains a SHAPE column that we can use to plot in the Map widget as polygons).

Display confirmed cases in China as points

map1 = gis.map('China', zoomlevel=4)
map1
map1.add_layer(fl,   { "type": "FeatureLayer",
                       "renderer":"ClassedSizeRenderer",
                       "field_name":"Confirmed"})
map1.zoom = 3
map1.legend = True

Display confirmed cases in China as polygons

map1b = gis.map('China')
map1b
map1b.clear_graphics()
overlap_rows_china.spatial.plot(  kind='map', map_widget=map1b,
                                  renderer_type='c',  # for class breaks renderer
                                  method='esriClassifyNaturalBreaks',  # classification algorithm
                                  class_count=4,  # choose the number of classes
                                  col='Confirmed',  # numeric column to classify
                                  cmap='inferno',  # color map to pick colors from for each class
                                  alpha=0.7  # specify opacity
                                 )
True
map1b.zoom = 4
map1b.legend=True

The Map view above (map1b) displays the number of confirmed cases per province in Mainland China. Orange polygons refer to provinces with number of confirmed cases in the range of [45423, 68134], and black polygons represent those in the range of [1, 22712].

Also, we can save the MapView object into a Web Map item for the purpose of future references and modifications.

map1b.save({'title':'Confirmed COVID-19 Cases in China',
            'snippet':'Map created using Python API showing confirmed COVID-19 cases in China',
            'tags':['automation', 'COVID19', 'world health', 'python']})
Confirmed COVID-19 Cases in China
Map created using Python API showing confirmed COVID-19 cases in ChinaWeb Map by arcgis_python
Last Modified: July 04, 2020
0 comments, 0 views

For example, we can browse the web map in the browser, change its symbology to different color maps in the configuration pane, then visualize it again with different looks here.

map1b_item = gis.content.search('Confirmed COVID-19 Cases in China')[0]
WebMap(map1b_item)

Map the deaths caused by COVID-19 in China

Display death cases in China as points

map2 = gis.map('China', zoomlevel=4)
map2
map2.add_layer(fl,   { "type": "FeatureLayer",
                       "renderer":"ClassedSizeRenderer",
                       "field_name":"Deaths"})
map2.legend = True

Display death cases in China as polygons

map2b = gis.map('China')
map2b
map2b = gis.map('China')
map2b
map2b.clear_graphics()
overlap_rows_china.spatial.plot(  kind='map', map_widget=map2b,
                                  renderer_type='c',  # for class breaks renderer
                                  method='esriClassifyNaturalBreaks',  # classification algorithm
                                  class_count=4,  # choose the number of classes
                                  col='Deaths',  # numeric column to classify
                                  cmap='inferno',  # color map to pick colors from for each class
                                  alpha=0.7  # specify opacity
                                 )
True
map2b.zoom = 4
map2b.legend = True

Using the same approach, we can then map the number of death cases per province in Mainland China. With legend displayed, map2b shows us orange polygons refer to provinces with number of death cases in the range of [3008, 4512], and black polygons represent those in the range of [0, 1504].

Similarly, we can create an additional deliverable - the Web Map Item created on the active GIS - and then browse the web map in the browser, change its symbology to different color maps in the configuration pane, and/or visualize it again with different looks here.

map2b_item = map2b.save({'title':'COVID-19 Death Cases in China',
                         'snippet':'Map created using Python API showing COVID-19 death cases in China',
                         'tags':['automation', 'COVID19', 'world health', 'python']})
WebMap(map2b_item)

Map the recovered COVID-19 cases in China

Display the recovered cases in China as points

map3 = gis.map('China', zoomlevel=4)
map3
map3.add_layer(fl,   { "type": "FeatureLayer",
                       "renderer":"ClassedSizeRenderer",
                       "field_name":"Recovered"})
map3.legend = True

Display the recovered cases in China as polygons

map3b = gis.map('China')
map3b
map3b.clear_graphics()
overlap_rows_china.spatial.plot(  kind='map', map_widget=map3b,
                                  renderer_type='c',  # for class breaks renderer
                                  method='esriClassifyNaturalBreaks',  # classification algorithm
                                  class_count=4,  # choose the number of classes
                                  col='Recovered',  # numeric column to classify
                                  cmap='inferno',  # color map to pick colors from for each class
                                  alpha=0.7  # specify opacity
                                 )
True
map3b.zoom = 4
map3b.legend = True

Same here in map3b, with legend displayed, we can tell orange polygons refer to provinces with number of recovered cases in the range of [42411, 63616], and black polygons represent those in the range of [1, 21206].

Based on the sets of maps plotted above, the Hubei province has topped all provinces in China no matter when we are looking at the confirmed cases, or the recovered/death tolls.

Again as how we create an additional deliverable - the Web Map Item - on the active GIS in previous two sections, we can then browse the web map in the browser, change its symbology to different color maps in the configuration pane, and/or visualize it again with different looks.

map3b_item = map3b.save({'title':'COVID-19 Recovered Cases in China',
                         'snippet':'Map created using Python API showing COVID-19 recovered cases in China',
                         'tags':['automation', 'COVID19', 'world health', 'python']})
WebMap(map3b_item)

3. Map the COVID-19 cases in the U.S.

Similar to how we map the scenarios in China, visualizing these scenarios targeting at the United States can be divided into these sub-sections:

  • Confirmed cases rendered as points, and polygons
  • Death cases rendered as points, and polygons

Map the confirmed COVID-19 cases in the U. S.

There are two ways to map the COVID-19 cases:

  • call the add_layer() function to add the specified layer or item (i.e. the FeatureLayer object created as fl) to the map widget, and set the visualization options to be using ClassedSizeRenderer,
  • plot the derived SeDF on the map view directly with further descriptions such as how to renderer spatial data using symbol and color palette (Here, the SeDF is the derivative of the merged DataFrame, which now contains a SHAPE column that we can use to plot in the Map widget as polygons).

We are going to show how the confirmed cases in the U.S. look like in both ways:

Map the confirmed cases in the U.S. as points

map4 = gis.map("US", zoomlevel=4)
map4
map4.add_layer(fl, { "type": "FeatureLayer",
                     "renderer":"ClassedSizeRenderer",
                     "field_name":"Confirmed"})
map4.legend = True

Map the confirmed cases in the U.S. as polygons

map4a = gis.map("US")
map4a
map4a.clear_graphics()
overlap_rows_usa.spatial.plot( kind='map', map_widget=map4a,
                               renderer_type='c',  # for class breaks renderer
                               class_count=4,  # choose the number of classes
                               method='esriClassifyEqualInterval',  # classification algorithm
                               col='Confirmed',  # numeric column to classify
                               cmap='inferno',  # color map to pick colors from for each class
                               alpha=0.7  # specify opacity
                              )
True
map4a.zoom = 4
map4a.legend = True

Map the COVID-19 deaths in the U. S.

Map the death cases in the U.S. as points

map4b = gis.map("US")
map4b
map4b.add_layer(fl, { "type": "FeatureLayer",
                     "renderer":"ClassedSizeRenderer",
                     "field_name":"Deaths"})
map4b.zoom=4
map4b.legend=True

Map the death cases in the U.S. as polygons

map4c = gis.map("US")
map4c
map4c.clear_graphics()
overlap_rows_usa.spatial.plot( kind='map', map_widget=map4c,
                               renderer_type='c',  # for class breaks renderer
                               class_count=4,  # choose the number of classes
                               method='esriClassifyEqualInterval',  # classification algorithm
                               col='Deaths',  # numeric column to classify
                               cmap='inferno',  # color map to pick colors from for each class
                               alpha=0.7  # specify opacity
                              )
True
map4c.zoom = 4
map4c.legend = True

4. Map the COVID-19 cases globally

Now we have learned how to map COVID-19 cases in China, and the United States, let's take a look at how to map across the globe. For illustration purpose, the rest of this section will only talk about mapping the confirmed cases globally, yet the workflow is almost the same to map deaths and recovered cases globally.

Map the confirmed COVID-19 cases globally

We are going to walk through how the confirmed cases across the globe can be mapped in the Map widget:

  • call the add_layer() function to add the specified layer or item (i.e. the FeatureLayer object created as fl) to the map widget, and set the visualization options to be using ClassedSizeRenderer,
  • plot the derived SeDF on the map view directly with further descriptions such as how to renderer spatial data using symbol and color palette (Here, the SeDF is the derivative of the merged DataFrame, which now contains a SHAPE column that we can use to plot in the Map widget as polygons).

Map the global confirmed cases as points

map5a = gis.map("Italy")
map5a
map5a.add_layer(fl, { "type": "FeatureLayer",
                     "renderer":"ClassedSizeRenderer",
                     "field_name":"Confirmed"})
map5a.legend=True
map5a.zoom = 4

Map the global confirmed cases as polygons

map5b = gis.map("Italy")
map5b
world_merged1.spatial.plot(  kind='map', map_widget=map5b,
                             renderer_type='c',  # for class breaks renderer
                             method='esriClassifyStandardDeviation',  # classification algorithm
                             class_count=7,  # choose the number of classes
                             col='Confirmed',  # numeric column to classify
                             cmap='inferno',  # color map to pick colors from for each class
                             alpha=0.7  # specify opacity
                            )
True
map5b.zoom = 4
map5b.legend = True

When map5b is created centering Brazil, the legend added along shows that yellow polygons (the 2nd class in the colormap) indicate that the countries are of number of confirmed cases ranging within (28026277, 33631532], and purple polygons represent those ranging between (5605255, 11210511].

5. What's next?

This notebook demonstrates accessing the corona viruses statistics, tabularizing them in DataFrames, and finally mapping the outbreak in different regions or counties. Next step, you can refer to Part 2 notebook for analyzing and charting time-series of the 2019 Novel Coronavirus.

References

[1] https://www.cdc.gov/coronavirus/2019-ncov/about/index.html

[2] https://www.cdc.gov/coronavirus/2019-nCoV/

Your browser is no longer supported. Please upgrade your browser for the best experience. See our browser deprecation post for more details.