Flood inundation mapping and monitoring using SAR data and deep learning | ArcGIS API for Python

🔬 Data Science

🥠 Deep Learning and pixel classification

Introduction

Flooding is one of the most frequent and costly natural disasters. Floods often strike without warning and can occur when large volumes of rain fall in a short time, causing flash floods. Flood mapping is typically performed using the following methods:

Aerial observations
Ground surveys

However, when flooding is widespread, these methods become prohibitively expensive and time consuming. Furthermore, aerial observation and optical imagery can often prove difficult, if not impossible, due to obstructive weather conditions. During flooding conditions, clouds can prevent the use of optical satellite imagery for visualization and analysis. In these instances, synthetic-aperture radar (SAR) allows us to penetrate through clouds and hazy atmospheric conditions to continuously observe and map flooding.

In 2019, severe flooding occurred in the Midwest of the United States. The event, also known as the Great Flood of 2019, affected 14 million people across multiple states. In this analysis, we will perform flood mapping and infrastructural inundation mapping of the St. Peters region of Missouri, which was one of the areas affected by the flood.

Necessary imports

import os
import datetime
from pathlib import Path

from arcgis.gis import GIS
from arcgis.learn import prepare_data, UnetClassifier
from arcgis.raster import Raster
from arcgis.raster.analytics import convert_raster_to_feature
from arcgis.features.manage_data import overlay_layers
from arcgis.features.analysis import dissolve_boundaries

Connect to your GIS

from arcgis import GIS
gis = GIS('home')

Export training data

Here, we convert the Sentinel-1 GRD VH polarization band to a 3 band raster using Export Raster. Under the Render Settings section, once Use Renderer is checked, Force RGB will be enabled.

The resulting raster is generated from the Sentinel-1 GRD VH imagery using a traditional histogram thresholding technique. The raster contains two classes: permanent waterbodies and flood water. This raster will be used as the Classified Raster in the Export Training Data For Deep Learning tool.
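The source does not say which histogram thresholding method was used. As a minimal sketch of the general idea, the example below applies Otsu's method (one common thresholding technique, chosen here only for illustration) to synthetic backscatter values. The `otsu_threshold` helper, the bin count, and the dB values are assumptions and are not taken from the Sentinel-1 scene.

```python
import random

def otsu_threshold(values, bins=64):
    """Return the histogram cut that maximizes between-class variance (Otsu's method)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    hist = [0] * bins
    for v in values:
        hist[min(int((v - lo) / width), bins - 1)] += 1
    centers = [lo + (i + 0.5) * width for i in range(bins)]
    total = len(values)
    sum_all = sum(c * h for c, h in zip(centers, hist))
    best_threshold, best_variance = lo, -1.0
    count_low, sum_low = 0, 0.0
    for i in range(bins - 1):
        count_low += hist[i]
        sum_low += centers[i] * hist[i]
        count_high = total - count_low
        if count_low == 0 or count_high == 0:
            continue
        mean_low = sum_low / count_low
        mean_high = (sum_all - sum_low) / count_high
        variance = count_low * count_high * (mean_low - mean_high) ** 2
        if variance > best_variance:
            best_variance = variance
            best_threshold = lo + (i + 1) * width  # upper edge of bin i
    return best_threshold

# Synthetic VH backscatter in dB (assumed values): water is darker than land.
random.seed(0)
water = [random.gauss(-24.0, 1.5) for _ in range(2000)]
land = [random.gauss(-14.0, 2.0) for _ in range(8000)]
pixels = water + land

threshold = otsu_threshold(pixels)
water_mask = [v < threshold for v in pixels]  # True where a pixel is classed as water
print(round(threshold, 2), sum(water_mask))
```

Pixels darker than the threshold are labelled as water. Separating permanent waterbodies from flood water would need additional information, such as a pre-flood image, which this sketch does not cover.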

input_raster = gis.content.get("6bb57dc7e31c4acaaf14eef61cd62d92")
input_raster

raster_for_training_data_flood_inundation

Tiled Imagery Layer by api_data_owner
Last Modified: October 22, 2024
0 comments, 0 views

The feature layer contains two classes: 1 = Permanent Waterbodies and 2 = Flood Water. The feature layer will be used as the Input Feature Class in the Export Training Data For Deep Learning tool.

label_raster = gis.content.get("a71ae4b6846b49979575e8cea1526f0c")
label_raster

flood_label_2classes

Feature Layer Collection by api_data_owner
Last Modified: November 10, 2024
0 comments, 2 views

The polygon feature class will be used as Input Mask Polygons in the Export Training Data For Deep Learning tool to delineate the area where image chips will be created.

aoi = gis.content.get("ac25dcee70a740ffbdd1d7a107604c23")
aoi

flood_aoi_mask

Feature Layer Collection by api_data_owner
Last Modified: November 10, 2024
0 comments, 2 views

The Export Training Data For Deep Learning tool is used to prepare training data for training a deep learning model. The tool is available in both ArcGIS Pro and ArcGIS Enterprise.

Next, we will use Jupyter Notebooks. Documentation on how to install and set up the necessary environment is available here.

Model training

Get training data

We have already exported the data, and it can be directly downloaded using the following steps:

training_data = gis.content.get('c4f58fd8e21743d69c82a93b30c8b873')
training_data

flood_inundation_mapping_using_sar_data_and_deep_learning
Flood inundation mapping using sar data and deep learning

Image Collection by api_data_owner
Last Modified: February 09, 2022
0 comments, 194 views

filepath = training_data.download(file_name=training_data.name)

import zipfile
with zipfile.ZipFile(filepath, 'r') as zip_ref:
    zip_ref.extractall(Path(filepath).parent)

data_path = Path(os.path.splitext(filepath)[0])

Prepare data

The prepare_data function takes a training data path as input and creates a fast.ai databunch with specified parameters, like transformation, batch size, split percentage, etc.

data = prepare_data(data_path, batch_size=4, chip_size=400)
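To illustrate what a split percentage means, the sketch below randomly holds out a fraction of chip names for validation. It is plain Python and does not reproduce the arcgis.learn internals; the chip file names, the 10% fraction, and the `split_chips` helper are all hypothetical.

```python
import random

def split_chips(chip_names, val_fraction=0.1, seed=42):
    """Shuffle chip names and hold out a fraction of them for validation."""
    rng = random.Random(seed)
    shuffled = list(chip_names)
    rng.shuffle(shuffled)
    n_val = int(round(len(shuffled) * val_fraction))
    return shuffled[n_val:], shuffled[:n_val]

# Hypothetical chip names standing in for the exported image chips.
chips = [f"chip_{i:04d}.tif" for i in range(200)]
train_chips, val_chips = split_chips(chips)
print(len(train_chips), len(val_chips))
```

The validation chips are never used to update the model weights, which is why the valid_loss column during training gives a fairer picture of performance than train_loss.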

Visualize training data

To get a sense of what the training data looks like, the arcgis.learn.show_batch() method will randomly select a few training chips and visualize them.

data.show_batch(rows=3)

Load model architecture

arcgis.learn provides the UnetClassifier model for per pixel classification that is based on a pretrained convnet, like ResNet, that acts as the backbone. More details about UnetClassifier can be found here.

# Create U-Net Model
unet = UnetClassifier(data, backbone='resnet34')

unet.unfreeze()

Train the model

lr = unet.lr_find()
lr

7.585775750291836e-05

We are using the suggested learning rate above to train the model for an initial 100 epochs.

unet.fit(100, lr=lr)

epoch	train_loss	valid_loss	accuracy	dice	time
0	0.315196	0.238061	0.916650	0.140753	03:18
1	0.257032	0.206839	0.927303	0.201196	03:21
2	0.215311	0.177408	0.935796	0.196791	03:19
3	0.200612	0.173998	0.936045	0.223597	03:22
4	0.163658	0.147535	0.944854	0.245653	03:18
5	0.172517	0.157920	0.939964	0.317370	03:19
6	0.161908	0.150001	0.943904	0.318771	03:18
7	0.144725	0.143262	0.948632	0.318215	03:18
8	0.127185	0.132418	0.950279	0.308088	03:20
9	0.160415	0.128531	0.953066	0.297973	03:20
10	0.130305	0.140115	0.953595	0.308917	03:19
11	0.140588	0.179590	0.930879	0.307942	03:20
12	0.100787	0.113572	0.957843	0.349602	03:21
13	0.138424	0.121252	0.955557	0.351324	03:19
14	0.143793	0.130152	0.947390	0.388204	03:22
15	0.136836	0.103604	0.961964	0.474613	03:20
16	0.117982	0.154733	0.936899	0.407125	03:18
17	0.125900	0.113143	0.955233	0.446174	03:20
18	0.160453	0.177447	0.928937	0.322310	03:22
19	0.100883	0.111882	0.957361	0.369451	03:21
20	0.124519	0.107814	0.962253	0.494565	03:18
21	0.142026	0.115047	0.959814	0.485063	03:20
22	0.132374	0.112865	0.958643	0.323485	03:20
23	0.105031	0.104781	0.963123	0.424728	03:19
24	0.130210	0.099988	0.961954	0.485959	03:18
25	0.111906	0.137084	0.948183	0.536301	03:18
26	0.114745	0.090877	0.965811	0.468203	03:19
27	0.112998	0.098488	0.964209	0.461653	03:19
28	0.107765	0.135637	0.942996	0.464132	03:19
29	0.122544	0.113617	0.958053	0.447598	03:18
30	0.126803	0.154085	0.947610	0.483636	03:18
31	0.116606	0.099966	0.964711	0.363412	03:17
32	0.101885	0.113951	0.957733	0.551356	03:16
33	0.106304	0.160295	0.949341	0.509041	03:16
34	0.108699	0.091748	0.966640	0.557177	03:17
35	0.086422	0.099116	0.963997	0.571465	03:17
36	0.088052	0.102253	0.961857	0.556534	03:16
37	0.086453	0.114716	0.958480	0.554985	03:17
38	0.097134	0.108769	0.960515	0.513498	03:16
39	0.087108	0.091782	0.965696	0.568486	03:16
40	0.093878	0.091828	0.965848	0.475024	03:17
41	0.091784	0.093702	0.966600	0.572884	03:17
42	0.107427	0.093918	0.965950	0.496693	03:16
43	0.100025	0.090305	0.966608	0.560310	03:17
44	0.098903	0.147281	0.948750	0.381766	03:16
45	0.085758	0.093694	0.963931	0.453228	03:17
46	0.091247	0.084099	0.968554	0.584905	03:17
47	0.098983	0.091277	0.966115	0.563792	03:17
48	0.085295	0.086128	0.968817	0.588143	03:17
49	0.092595	0.090168	0.966302	0.571030	03:17
50	0.078332	0.097215	0.967454	0.560382	03:18
51	0.081438	0.118282	0.956715	0.533550	03:18
52	0.083343	0.090773	0.968466	0.565028	03:17
53	0.092242	0.101065	0.965892	0.597515	03:16
54	0.082083	0.078399	0.971373	0.588886	03:17
55	0.087049	0.085211	0.968528	0.566625	03:17
56	0.070234	0.090124	0.968084	0.586053	03:17
57	0.080417	0.080767	0.970334	0.519328	03:17
58	0.096029	0.076735	0.970688	0.540244	03:17
59	0.064478	0.079238	0.970172	0.590601	03:17
60	0.085882	0.073671	0.972408	0.596750	03:18
61	0.067137	0.078994	0.970387	0.580007	03:17
62	0.097056	0.081760	0.968576	0.594298	03:17
63	0.081772	0.076046	0.971833	0.601046	03:17
64	0.080576	0.073024	0.972750	0.539293	03:17
65	0.061883	0.072185	0.973761	0.577910	03:17
66	0.059233	0.083739	0.968916	0.612876	03:17
67	0.074872	0.088879	0.966927	0.590858	03:17
68	0.067955	0.083563	0.967476	0.600336	03:18
69	0.072453	0.075802	0.972161	0.536534	03:17
70	0.060471	0.076625	0.971866	0.565876	03:18
71	0.065586	0.077575	0.969901	0.584906	03:16
72	0.068726	0.072682	0.973562	0.582242	03:17
73	0.067402	0.071703	0.973451	0.609991	03:18
74	0.071588	0.073427	0.972869	0.557851	03:16
75	0.062566	0.073257	0.972422	0.605447	03:18
76	0.069558	0.073604	0.971288	0.605160	03:16
77	0.066573	0.068060	0.973893	0.598024	03:16
78	0.063839	0.068609	0.974868	0.604081	03:17
79	0.051357	0.077050	0.972586	0.597544	03:16
80	0.060609	0.065068	0.975528	0.621720	03:17
81	0.052754	0.069642	0.974368	0.616671	03:17
82	0.061674	0.068864	0.973790	0.618299	03:17
83	0.066033	0.067208	0.974869	0.611925	03:17
84	0.061642	0.066778	0.975008	0.611326	03:18
85	0.056743	0.067824	0.974236	0.619976	03:16
86	0.058290	0.067946	0.974201	0.623902	03:17
87	0.065600	0.065537	0.975292	0.617071	03:18
88	0.069902	0.066557	0.974751	0.617559	03:18
89	0.060525	0.064784	0.975515	0.612438	03:17
90	0.059213	0.069837	0.973789	0.617353	03:17
91	0.062706	0.064319	0.975835	0.622603	03:17
92	0.061580	0.068217	0.974225	0.611768	03:18
93	0.061281	0.065149	0.975796	0.617887	03:17
94	0.064799	0.065163	0.975561	0.624698	03:17
95	0.052594	0.066613	0.975016	0.613010	03:16
96	0.052637	0.064985	0.975628	0.614651	03:17
97	0.059724	0.065538	0.975418	0.603437	03:17
98	0.052752	0.068193	0.974102	0.598650	03:17
99	0.059329	0.068712	0.973762	0.616763	03:17
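The dice column in the table above reports a Dice coefficient, which measures the overlap between predicted and ground-truth pixels. As a hedged illustration of the metric itself (arcgis.learn may average over classes or handle background pixels differently), the sketch below computes Dice for one class from two small, made-up label lists. The `dice_coefficient` helper and the sample labels are assumptions.

```python
def dice_coefficient(truth, pred, target_class):
    """Dice = 2 * |A and B| / (|A| + |B|) for one class over flattened labels."""
    truth_hits = [t == target_class for t in truth]
    pred_hits = [p == target_class for p in pred]
    intersection = sum(t and p for t, p in zip(truth_hits, pred_hits))
    denominator = sum(truth_hits) + sum(pred_hits)
    # If the class is absent from both inputs, treat the match as perfect.
    return 2.0 * intersection / denominator if denominator else 1.0

# Made-up labels: 0 = background, 1 = permanent waterbodies, 2 = flood water.
truth = [0, 0, 2, 2, 2, 1, 1, 0]
pred = [0, 2, 2, 2, 0, 1, 0, 0]

flood_dice = dice_coefficient(truth, pred, target_class=2)
water_dice = dice_coefficient(truth, pred, target_class=1)
print(flood_dice, water_dice)
```

Unlike overall accuracy, Dice is not inflated by a large background class, which may explain why the dice values in the table sit well below the accuracy values.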

We have trained the model for a further 300 epochs to improve model performance. For the sake of time, the cell below is commented out.

# unet.fit(300)

Visualize results in validation set

It's a good practice to see results of the model vis-a-vis ground truth. The code below picks random samples and shows us ground truths and model predictions, side by side. This enables us to preview the results of the model within the notebook.

unet.show_results(rows=2, alpha=0.9)

Evaluate model performance

unet.accuracy()

0.9758347868919373

Since this segmentation task has two classes (1 = permanent waterbodies and 2 = flood water), we need to assess accuracy for each class. To achieve this, ArcGIS API for Python provides the per_class_metrics function, which calculates the precision, recall, and F1 score for each class.

unet.per_class_metrics()

	NoData	1	2
precision	0.988426	0.864601	0.889424
recall	0.988093	0.839055	0.896257
f1	0.988259	0.851636	0.892827
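The F1 score is the harmonic mean of precision and recall. As a quick consistency check, the sketch below recomputes F1 from the precision and recall values in the table above and compares the result with the reported f1 row. The `f1_score` helper is written for this illustration and is not part of arcgis.learn.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2.0 * precision * recall / (precision + recall)

# (precision, recall, reported f1) copied from the per_class_metrics output.
reported = {
    "NoData": (0.988426, 0.988093, 0.988259),
    "1": (0.864601, 0.839055, 0.851636),
    "2": (0.889424, 0.896257, 0.892827),
}

differences = {}
for name, (precision, recall, f1_reported) in reported.items():
    f1_computed = f1_score(precision, recall)
    differences[name] = abs(f1_computed - f1_reported)
    print(name, round(f1_computed, 6), f1_reported)
```

The recomputed values agree with the reported ones to within rounding, and they show that flood water (class 2) is detected somewhat more reliably than permanent waterbodies (class 1).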

Save the model

We will save the model that we trained as a 'Deep Learning Package' ('.dlpk' format). A Deep Learning Package is the standard format used to deploy deep learning models on the ArcGIS platform.

We will use the save() method to save the trained model. By default, it will be saved to the 'models' sub-folder within the training data folder.

unet.save('flood_model_2024', publish=True)

Computing model metrics...
Published DLPK Item Id: 442a9884b7d847f58c59d5ce5d4f6f88

WindowsPath('~/AppData/Local/Temp/flood_inundation_mapping_using_sar_data_and_deep_learning/models/flood_model_2024')

Model inferencing

Using ArcGIS Pro, we can use the trained model on a test image/area to classify permanent waterbodies and flood inundated areas in the SAR satellite image.

After training the UnetClassifier model and saving the weights for classifying images, we can use the Classify Pixels Using Deep Learning tool available in ArcGIS Pro and ArcGIS Enterprise for inferencing.

flood_model = gis.content.get('bc6740215622464a9a707dc12b705858')
flood_model

flood_model_2024

Deep Learning Package by api_data_owner
Last Modified: November 11, 2024
0 comments, 0 views

raster_for_inferencing = gis.content.get('05a5c9ffd2044d2583ec1ae2a5712d54')
raster_for_inferencing

import arcpy

# Run inference on the GPU, then save the classified raster to the project geodatabase
with arcpy.EnvManager(processorType="GPU"):
    out_classified_raster = arcpy.ia.ClassifyPixelsUsingDeepLearning(
        "sentinel1_3band_inference_raster",
        "https://deldev.maps.arcgis.com/sharing/rest/content/items/86d5806943024257a8a15fe17296b19b",
        "padding 100;batch_size 8;predict_background True;tile_size 400",
        "PROCESS_AS_MOSAICKED_IMAGE",
        None,
    )
    out_classified_raster.save(r"C:\Users\shi10484\Documents\ArcGIS\Projects\flood2\flood2.gdb\inferenced_results")
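The third argument in the call above packs the model arguments into a single semicolon-separated string of name-value pairs. The helper below is a hypothetical illustration (it is not part of arcpy) that splits such a string into a dictionary, which can make the individual settings easier to inspect or log.

```python
def parse_model_arguments(arguments):
    """Split a 'name value;name value' string into a dict of strings."""
    parsed = {}
    for pair in arguments.split(";"):
        pair = pair.strip()
        if not pair:
            continue
        name, value = pair.split(" ", 1)
        parsed[name] = value.strip()
    return parsed

model_arguments = parse_model_arguments(
    "padding 100;batch_size 8;predict_background True;tile_size 400"
)
for name, value in model_arguments.items():
    print(f"{name}: {value}")
```

Note that the tile_size of 400 matches the chip_size of 400 passed to prepare_data during training.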

Results visualization

The classified output raster is generated using ArcGIS Pro. The output raster is published on the portal for visualization.

sar_ras2 = gis.content.get('427cd9a47eb544c59c1e965a56e72550')
inf_ras2 = gis.content.get('a7f2c8f23aa448d28fc14ec99b325ca8')

sar_ras2, inf_ras2

(<Item title:"st_peters_sar_composite" type:Tiled Imagery Layer owner:demos_deldev>,
 <Item title:"st_peters_inferenced_raster" type:Tiled Imagery Layer owner:demos_deldev>)

from arcgis.raster.functions import colormap
inf_cmap2 = colormap(inf_ras2.layers[0], colormap=[[1, 7, 42, 108],[2, 0, 206, 209]])
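Each colormap entry above follows the order [pixel value, red, green, blue]. The sketch below converts the same two entries into hex color codes and pairs them with class names, which can be handy when building a legend. The `to_hex` helper and the `class_names` mapping are written for this illustration only.

```python
# Colormap entries copied from the cell above: [pixel value, red, green, blue].
flood_colormap = [[1, 7, 42, 108], [2, 0, 206, 209]]
class_names = {1: "Permanent waterbodies", 2: "Flood water"}

def to_hex(red, green, blue):
    """Format an RGB triplet (0-255 each) as a hex color string."""
    return f"#{red:02x}{green:02x}{blue:02x}"

legend = {
    class_names[value]: to_hex(red, green, blue)
    for value, red, green, blue in flood_colormap
}
print(legend)
```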

Create map widgets

Three map widgets are created to show the true color imagery, the Sentinel-1 imagery, and the model predictions for the same area.

m1 = gis.map('St Peters, USA', 11)
m1.basemap.basemap = 'satellite'
m2 = gis.map('St Peters, USA', 11)
m2.content.add(sar_ras2)
m2.zoom_to_layer(sar_ras2)
m3 = gis.map()
m3.content.add(sar_ras2)
m3.content.add(inf_cmap2)

Set the map layout

from ipywidgets import HBox, VBox, Label, Layout

m1.sync_navigation(m2)
m2.sync_navigation(m3)

HBox and VBox are used to set the layout of the map widgets.

m1.layout = Layout(flex = '1 1', padding = '10px')
m2.layout = Layout(flex = '1 1', padding = '10px')
m3.layout = Layout(flex = '1 1', padding = '10px')

# Create VBoxes for each map and label
box1 = VBox([Label("True Colour Imagery"), m1], layout=Layout(width='33%'))
box2 = VBox([Label("Sentinel-1 Imagery"), m2], layout=Layout(width='33%'))
box3 = VBox([Label("Predictions"), m3], layout=Layout(width='33%'))

Flood inundation mapping

The resulting predictions are provided as a map for better visualization. The results show the spatial distribution of flood water in the Midwestern US during the 2019 floods. Sentinel-1 VV imagery from May 2019 is used for the analysis. In the map widgets, it can be seen that the trained UnetClassifier model is able to identify permanent waterbodies and flood water, as well as differentiate between the two. The deep blue color represents permanent waterbodies and the cyan color represents flood water.

# Place the VBoxes side by side using an HBox
hbox = HBox([box1, box2, box3])

# Display the HBox
hbox

m1.basemap.basemap = 'arcgis-imagery'

Three map widgets were created. The left widget displays natural color high-resolution satellite imagery acquired prior to flooding, the middle widget displays the Sentinel-1 imagery acquired during the flood event, and the right widget displays the predictions of the trained UnetClassifier model. In the maps, the city of St. Louis can be seen near where the Missouri River and the Mississippi River converge. The model is able to identify river channels and differentiate them from flood water. The true color imagery can be used to visually assess the model's accuracy.

Estimation of flood inundated area (sq. km)

The pixel size of the raster is required to calculate the area of flood inundated areas. We will use the <raster>.properties.pixelSizeX and <raster>.properties.pixelSizeY properties to find the pixel size of the raster.

## Cellsize
ras2_cellsize_x = inf_ras2.layers[0].properties.pixelSizeX
ras2_cellsize_y = inf_ras2.layers[0].properties.pixelSizeY
print(ras2_cellsize_x, ras2_cellsize_y)

14.3472971497467 14.3472971497467

To calculate the area of land under flood water, we will use the <raster>.attribute_table() function to find the count of pixels per flood water class.

inf_ras2.layers[0].attribute_table()

{'fields': [{'name': 'Value',
   'type': 'esriFieldTypeInteger',
   'alias': 'Value',
   'sqlType': 'sqlTypeOther',
   'domain': None,
   'defaultValue': None},
  {'name': 'Count',
   'type': 'esriFieldTypeDouble',
   'alias': 'Count',
   'sqlType': 'sqlTypeOther',
   'domain': None,
   'defaultValue': None},
  {'name': 'Class',
   'type': 'esriFieldTypeString',
   'alias': 'Class',
   'sqlType': 'sqlTypeOther',
   'length': 1,
   'domain': None,
   'defaultValue': None}],
 'features': [{'attributes': {'Value': 1, 'Count': 1822630, 'Class': '1'}},
  {'attributes': {'Value': 2, 'Count': 5219135, 'Class': '2'}}]}

This study requires the area of land under flood water in square kilometers. The raster uses a projected coordinate system (EPSG:3857) whose linear unit is the meter, so the pixel sizes above are in meters.

## area in square kilometers
area_ras2_flood_water = (5219135*(ras2_cellsize_x*ras2_cellsize_y)/1000000)
area_ras2_flood_water

1074.3325074571271
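The calculation above hard-codes the pixel count for class 2. As a minimal sketch, the same arithmetic can be applied to every class by reading the counts from the attribute table. The `class_areas_sq_km` helper is hypothetical; the table below is a trimmed copy of the attribute_table() output shown earlier, and the cell sizes are the printed pixel sizes.

```python
def class_areas_sq_km(attribute_table, cell_x_m, cell_y_m):
    """Return {class value: area in sq. km} from pixel counts and cell size in meters."""
    pixel_area_sq_m = cell_x_m * cell_y_m
    return {
        feature["attributes"]["Value"]:
            feature["attributes"]["Count"] * pixel_area_sq_m / 1_000_000
        for feature in attribute_table["features"]
    }

# Trimmed copy of the attribute table output (only the 'features' part is needed).
table = {
    "features": [
        {"attributes": {"Value": 1, "Count": 1822630, "Class": "1"}},
        {"attributes": {"Value": 2, "Count": 5219135, "Class": "2"}},
    ]
}

areas = class_areas_sq_km(table, 14.3472971497467, 14.3472971497467)
for value, area in areas.items():
    print(value, round(area, 2))
```

Class 2 reproduces the flood water area computed above, and class 1 gives the corresponding area of permanent waterbodies.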

Infrastructural inundation assessment

The inferenced raster will be used to assess the infrastructure inundated by flood water.

flood_raster = Raster("https://tiledimageservices6.arcgis.com/SMX5BErCXLM7eDtY/arcgis/rest/services/st_louis_flood_water/ImageServer",
                      gis=gis,
                      engine="image_server")
flood_raster

<arcgis.raster._layer.Raster at 0x29303b37a90>

The LULC raster for St. Louis is generated using the Land Cover Classification (Sentinel-2) pretrained model and is used to assess the inundated area per LULC class.

lulc_raster = Raster("https://tiledimageservices6.arcgis.com/SMX5BErCXLM7eDtY/arcgis/rest/services/lulc_st_louis/ImageServer",
                        gis=gis,
                        engine="image_server")
lulc_raster

<arcgis.raster._layer.Raster at 0x29303806010>

The flood raster will be converted to polygons using convert_raster_to_feature. We then use the import_data function to create a new feature layer containing the flood water polygons.

flood_poly = convert_raster_to_feature(flood_raster, 
                                    field='Value', 
                                    output_type='Polygon', 
                                    simplify=False, 
                                    output_name='flood_st_louis_poly'+str(datetime.datetime.now().microsecond), 
                                    gis=gis)

## Create dataframe from feature layer and get water polygons
dfm1 = flood_poly.layers[0].query('gridcode=2').sdf 

## Convert dataframe to feature layer
flood_poly = gis.content.import_data(dfm1, title='flood_water_poly'+str(datetime.datetime.now().microsecond))

Next, the LULC raster will be converted to polygons using convert_raster_to_feature. After getting the LULC polygons, we remove NODATA and Water polygons and use the import_data function to create a new feature layer containing the correct LULC polygons.

lulc_poly = convert_raster_to_feature(lulc_raster, 
                                    field='Value', 
                                    output_type='Polygon', 
                                    simplify=False, 
                                    output_name='lulc_st_louis_poly'+str(datetime.datetime.now().microsecond), 
                                    gis=gis)

## Create dataframe from feature layer and keep LULC polygons, excluding NODATA and Water
dfm2 = lulc_poly.layers[0].query('gridcode > 0 And gridcode < 5').sdf 

## Convert dataframe to feature layer
lulc_polygon = gis.content.import_data(dfm2, title='lulc_poly'+str(datetime.datetime.now().microsecond))

To get the LULC classes for the flood inundated areas, we will use the overlay_layers function.

inundated_lulc = overlay_layers(lulc_polygon.layers[0], 
                                flood_poly.layers[0], 
                                output_name='inundated_lulc'+str(datetime.datetime.now().microsecond),
                                gis=gis)

{"cost": 72.43}

After getting the LULC classes for the flood inundated areas, we will dissolve the polygons on the basis of gridcode. The output feature layer will have the combined area of each class in square miles.

lulc_dissolve = dissolve_boundaries(inundated_lulc, 
                               dissolve_fields=['gridcode'], 
                               output_name='dissolved_lulc'+str(datetime.datetime.now().microsecond),
                               gis=gis,
                               multi_part_features=True)

{"cost": 9.685}

The resulting feature layer has a column for the area per class, but the corresponding LULC class name is missing. We will add the class names to the dataframe using the code below.

dfm4 = lulc_dissolve.layers[0].query().sdf
lulc_classes = ['Artificial surfaces', 'Agricultural areas', 'Forest and semi natural areas', 'Wetlands']
dfm4['LULC_Classes'] = lulc_classes
dfm5 = dfm4[['gridcode','AnalysisArea', 'LULC_Classes']].copy()
dfm5.rename(columns={'AnalysisArea': 'Area_in_square_miles'}, inplace=True)
dfm5

	gridcode	Area_in_square_miles	LULC_Classes
0	1	7.469743	Artificial surfaces
1	2	189.190674	Agricultural areas
2	3	5.561292	Forest and semi natural areas
3	4	14.224226	Wetlands
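The table above reports areas in square miles, while the flood water estimate earlier in the notebook is in square kilometers. The sketch below converts the tabulated values to square kilometers so the two can be read in the same unit. The values are copied from the table; the variable names are chosen for this illustration.

```python
# Exact conversion factor: one mile is 1.609344 km, so one sq. mile is 1.609344 ** 2 sq. km.
SQ_KM_PER_SQ_MILE = 2.589988110336

# (gridcode, area in square miles, LULC class) copied from the table above.
inundated_lulc_rows = [
    (1, 7.469743, "Artificial surfaces"),
    (2, 189.190674, "Agricultural areas"),
    (3, 5.561292, "Forest and semi natural areas"),
    (4, 14.224226, "Wetlands"),
]

area_sq_km = {
    lulc_class: sq_miles * SQ_KM_PER_SQ_MILE
    for _, sq_miles, lulc_class in inundated_lulc_rows
}
total_sq_km = sum(area_sq_km.values())

for lulc_class, area in area_sq_km.items():
    print(f"{lulc_class}: {area:.2f} sq. km")
print(f"Total: {total_sq_km:.2f} sq. km")
```

Agricultural areas account for most of the inundated land in this assessment.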

Conclusion

In this notebook, we have demonstrated how to use a UnetClassifier model with ArcGIS API for Python to extract flood water and permanent waterbodies. In Part 1, we covered how Sentinel-1 SAR data can be used for flood inundation mapping and monitoring. This process involved preparing the input data, training a pixel-based classification model, visualizing the results, generating accuracy metrics, and running inference on a test raster. Finally, in Part 2, we estimated the flood-inundated area in square kilometers and carried out an infrastructural inundation assessment.