GEDI_L4A Subset and Visualize

Authors: Chuck Daniels (Development Seed), Jamison French (Development Seed), Anish Bhusal (UAH), Sumant Jha (MSFC/USRA), Alex Mandel (Development Seed)

Date: November 14, 2022

Description: In this tutorial, we will use a GeoJSON to create an area of interest (AOI) and use it in MAAP’s GEDI Subsetter. We will then visualize the output file created by the subsetter by plotting elevation contours.

Run This Notebook

To access and run this tutorial within MAAP’s Algorithm Development Environment (ADE), please refer to the “Getting started with the MAAP” section of our documentation.

Disclaimer: it is highly recommended to run a tutorial within MAAP’s ADE, which already includes packages specific to MAAP, such as maap-py. Running the tutorial outside of the MAAP ADE may lead to errors.

About the Data

GEDI L4A Footprint Level Aboveground Biomass Density, Version 2

This dataset provides Global Ecosystem Dynamics Investigation (GEDI) Level 4 (L4) data, which has the purpose of providing mean aboveground biomass density (AGBD) and consists of the GEDI_L4A and GEDI_L4B collections. GEDI L4A contains predictions of AGBD and estimates of the prediction standard error. GEDI is attached to the International Space Station (ISS) and collects data globally between 51.6° N and 51.6° S latitudes at the highest resolution and densest sampling of any light detection and ranging (lidar) instrument in orbit to date; specifically, GEDI L4A data has a spatial resolution of 25m. Source: GEDI_L4A Version 2 User Guide

Additional Resources

GEDI_L4A Version 2 Data Set Landing Page
The GEDI Website
Earthdata Search
GEDI Subsetter DOI: 10.5281/zenodo.10019412

[Optional] Install Python Packages

This notebook contains some cells marked as optional, meaning that you can use this notebook without necessarily running such cells.

However, if you do wish to run the optional cells, you must install the following Python packages, which might not already be installed in your environment:

geopandas: for reading your AOI (GeoJson file), as well as for reading the job output (GeoPackage file containing the subset)
contextily: for visually verifying your AOI
backoff: for repeatedly polling the job status (after submission) until the job has been completed (either successfully or not)
folium: for visualizing your data on a Leaflet map
geojsoncontour: for converting your matplotlib contour plots to geojson

[ ]:

# Uncomment the following lines to install these packages if you haven't already.
# !pip install geopandas
# !pip install contextily
# !pip install backoff
# !pip install folium
# !pip install geojsoncontour

A job can be submitted without these packages, but installing them in order to run the optional cells may make it more convenient for you to visually verify both your AOI and the subset output produced by your job.

Obtain Username

[6]:

from maap.maap import MAAP

maap = MAAP(maap_host="api.maap-project.org")
username = maap.profile.account_info()["username"]
username

[6]:

'smk0033'

Define the Area of Interest

You may use either a publicly available GeoJSON file for your AOI, such as those available at geoBoundaries, or you may create a custom GeoJSON file for your AOI. The following 2 subsections cover both cases.

Using a geoBoundary GeoJSON File

If your AOI is a publicly available geoBoundary, you can obtain the URL for the GeoJSON file using the function below. You simply need to supply an ISO3 value and a level. To find the appropriate ISO3 and level values, see the table on the geoBoundaries site.

[7]:

import requests


def get_geo_boundary_url(iso3: str, level: int) -> str:
    response = requests.get(
        f"https://www.geoboundaries.org/api/current/gbOpen/{iso3}/ADM{level}"
    )
    response.raise_for_status()
    return response.json()["gjDownloadURL"]


# If using a geoBoundary, uncomment the following assignment, supply
# appropriate values for `<iso3>` and `<level>`, then run this cell.

# Example (Gabon level 0): get_geo_boundary("GAB", 0)

# aoi = get_geo_boundary_url("<iso3>", <level>)

Using a Custom GeoJSON File

Alternatively, you can make your own GeoJSON file for your AOI and place it within your my-public-bucket folder within the ADE.

Based upon where you place your GeoJSON file under my-public-bucket, you can construct the URL for a job’s aoi input value.

For example, if the relative path of your AOI GeoJSON file under my-public-bucket is path/to/my-aoi.geojson (avoid using whitespace in the path and filename), the URL you would supply as the value of a job’s aoi input would be the following (where {username} is replaced with your username as output from the previous section):

f"https://maap-ops-workspace.s3.amazonaws.com/shared/{username}/path/to/my-aoi.geojson"`

If this is the case, use the cell below.

[8]:

#aoi = f"https://maap-ops-workspace.s3.amazonaws.com/shared/{username}/langtang_np.geojson"

#for your convenience you can use this geoJSON file but if you have your own geojson, use the commented link as example format
aoi = f"https://maap-ops-workspace.s3.amazonaws.com/shared/anisbhsl/langtang_np.geojson"

This example uses the AOI of Gosaikunda Lake region inside Langtang National Park. You can also create your own GeoJSON file for your AOI using sites like geojson.io

Gosaikunda

[Optional] Visually Verify your AOI

If you want to visually verify your AOI before proceeding, you may run the following cell, if you have the geopandas and contextily Python packages installed.

[9]:

try:
    import geopandas as gpd
    import contextily as ctx
except:
    print(
        "If you wish to visually verify your AOI, "
        "you must install the `geopandas` and `contextily` packages."
    )
else:
    aoi_gdf = gpd.read_file(aoi)
    aoi_epsg4326 = aoi_gdf.to_crs(epsg=4326)
    ax = aoi_epsg4326.plot(figsize=(10, 5), alpha=0.3, edgecolor="red")
    ctx.add_basemap(ax, crs=4326)

../../_images/science_GEDI_GEDI_L4A_15_0.png

Submit a Job

When supplying input values for a GEDI subsetting job, to use the default value for a field (where indicated), use a dash ("-") as the input value.

aoi (required): URL to a GeoJSON file representing your area of interest, as explained above.
doi: Digital Object Identifier (DOI) of the GEDI collection to subset, or a logical name representing such a DOI. Valid logical names: L1B, L2A, L2B, L4A
columns: Comma-separated list of column names to include in the output file.
query: Query expression for subsetting the rows in the output file.
limit: Maximum number of GEDI granule data files to download (among those that intersect the specified AOI). (Default: 10000)

It is recommended to use maap-dps-worker-32gb queues when submitting a job with a large aoi.

In addition, the version argument should be updated to match the most recent version of the GEDI Subsetter.

[10]:

inputs = dict(
    aoi=aoi,
    doi="L4A",
    lat="lat_lowestmode",
    lon="lon_lowestmode",
    beams="coverage",
    columns="agbd, agbd_se, sensitivity, geolocation/sensitivity_a2, elev_lowestmode",
    query="l2_quality_flag == 1 and l4_quality_flag == 1 and sensitivity > 0.95 and `geolocation/sensitivity_a2` > 0.95",
    limit=10,
    output="gedi_subset.gpkg"
)

result = maap.submitJob(
    identifier="gedi-subset",
    algo_id="gedi-subset",
    version="0.12.0",
    queue="maap-dps-worker-32gb",
    **inputs,
)

job_id = result.id
job_id or result

[10]:

'5090fbf1-1418-433b-817b-b6f4d982171b'

Get the Job’s Output File

Now that the job has been submitted, we can use the job_id to check the job status until the job has been completed.

[ ]:

from urllib.parse import urlparse

def job_status_for(job_id: str) -> str:
    return maap.getJobStatus(job_id)

def job_result_for(job_id: str) -> str:
    return maap.getJobResult(job_id)[0]

def to_job_output_dir(job_result_url: str) -> str:
    return f"/home/jovyan/my-private-bucket/{job_result_url.split(f'/{username}/')[1]}"

If you have installed the backoff Python package, running the following cell will automatically repeatedly check your job’s status until the job has been completed. Otherwise, you will have to manually repeatedly rerun the following cell until the output is either 'Succeeded' or 'Failed'.

[14]:

try:
    import backoff
except:
    job_status = job_status_for(job_id)
else:
    # Check job status every 2 minutes
    @backoff.on_predicate(
        backoff.constant,
        lambda status: status not in ["Deleted", "Succeeded", "Failed"],
        interval=120,
    )
    def wait_for_job(job_id: str) -> str:
        return job_status_for(job_id)

    job_status = wait_for_job(job_id)

job_status

INFO:backoff:Backing off wait_for_job(...) for 1.0s (Accepted)
INFO:backoff:Backing off wait_for_job(...) for 50.7s (Accepted)
INFO:backoff:Backing off wait_for_job(...) for 11.9s (Accepted)
INFO:backoff:Backing off wait_for_job(...) for 111.0s (Accepted)
INFO:backoff:Backing off wait_for_job(...) for 78.4s (Running)
INFO:backoff:Backing off wait_for_job(...) for 37.8s (Running)

[14]:

'Succeeded'

[3]:

assert job_status == "Succeeded", (
    job_result_for(job_id)
    if job_status == "Failed"
    else f"Job {job_id} has not yet completed ({job_status}). Rerun the prior cell."
)

output_url = job_result_for(job_id)
output_dir = to_job_output_dir(output_url)
output_file = f"{output_dir}/gedi_subset.gpkg"
print(f"Your subset results are in the file {output_file}")

Your subset results are in the file /home/jovyan/my-private-bucket/dps_output/gedi-subset/0.12.0/gedi-subset/2025/07/31/21/41/26/947077/gedi_subset.gpkg/gedi_subset.gpkg

[Optional] Visually Verify the Results

If you installed the geopandas Python package, you can visually verify the output file by running the following cell.

[16]:

try:
    import geopandas as gpd
    import matplotlib.pyplot as plt
except:
    print(
        "If you wish to visually verify your output file, "
        "you must install the `geopandas` package."
    )
else:
    gedi_gdf = gpd.read_file(output_file)
    print(gedi_gdf.head())
    sensitivity_colors = plt.colormaps.get_cmap("viridis_r")
    gedi_gdf.plot(markersize = 0.1)

                                            filename        agbd   agbd_se  \
0  GEDI04_A_2020076133303_O07134_02_T00054_02_002...  425.904724  3.443850
1  GEDI04_A_2020076133303_O07134_02_T00054_02_002...   70.760147  3.006401
2  GEDI04_A_2020076133303_O07134_02_T00054_02_002...   10.279377  2.986123
3  GEDI04_A_2020076133303_O07134_02_T00054_02_002...  108.709862  3.040855
4  GEDI04_A_2020076133303_O07134_02_T00054_02_002...  107.700874  3.039851

   sensitivity  geolocation/sensitivity_a2  elev_lowestmode  \
0     0.952703                    0.983446      4061.177002
1     0.958067                    0.983591      4282.283203
2     0.952969                    0.979552      4271.901855
3     0.952656                    0.978301      4245.281738
4     0.957419                    0.984516      4397.378418

                    geometry
0  POINT (85.38437 28.03722)
1  POINT (85.38864 28.04138)
2  POINT (85.38903 28.04176)
3  POINT (85.38943 28.04215)
4  POINT (85.39292 28.04556)

../../_images/science_GEDI_GEDI_L4A_24_1.png

Generate Contour Lines

Create a lat, lon mesh grid with elevation as a depth parameter. As shown in the plot above, the lines don’t seem smooth. So we can apply linear or cubic interpolation to smoothen those missing points.

[17]:

geometry = gedi_gdf["geometry"]
elevation=gedi_gdf["elev_lowestmode"]

[18]:

lon = geometry.x
lat = geometry.y

[19]:

import numpy as np

x=np.linspace(min(lon), max(lon), 1000)
y=np.linspace(min(lat), max(lat), 1000)

[20]:

from scipy.interpolate import griddata

x_mesh, y_mesh = np.meshgrid(x,y)

You may experiment with nearest, linear, and cubic interpolation methods to see which gives more smooth results.

[21]:

#grid the elevation
z_mesh = griddata((lon, lat), elevation, (x_mesh, y_mesh), method='linear')

[22]:

colors=['blue','royalblue', 'navy','pink',  'mediumpurple',  'darkorchid',  'plum',  'm', 'mediumvioletred', 'palevioletred', 'crimson',
         'magenta','pink','red','yellow','orange', 'brown','green', 'darkgreen']
levels=len(colors)
contourf = plt.contourf(x_mesh, y_mesh, z_mesh, levels, alpha=0.5,  colors=colors, linestyles='None', vmin=elevation.min(), vmax=elevation.max())

../../_images/science_GEDI_GEDI_L4A_32_0.png

ContourPlot

Now we need to plot this contour into an interactive map for better visualization.

Plot the Contour Lines in Folium

You may need to install geojsoncontour, mapclassify, and folium, if you don’t already have them installed. We need to convert this contourf into geoJSON format.

[23]:

import folium
from folium import plugins
import branca
import geojsoncontour

[24]:

geojson = geojsoncontour.contourf_to_geojson(
    contourf=contourf,
    min_angle_deg=3.0,
    ndigits=5,
    stroke_width=1,
    unit='ft',
    fill_opacity=0.1,
)

[25]:

#create map view
m = folium.Map([lat.mean(), lon.mean()], zoom_start=12, tiles="OpenStreetMap")

folium.GeoJson(
    geojson,
    style_function=lambda x:{
        'color': x['properties']['stroke'],
        'weight':    x['properties']['stroke-width'],
        'fillColor': x['properties']['fill'],
        'opacity':   0.5,
    }
).add_to(m)

cm = branca.colormap.LinearColormap(colors, vmin=elevation.min(), vmax=elevation.max()).to_step(levels)
cm.caption='Elevation (in m)'
m.add_child(cm)

#legend
plugins.Fullscreen(position='topright', force_separate_button=True).add_to(m)

[25]:

<folium.plugins.fullscreen.Fullscreen at 0x7fd538d53d70>

[ ]:

Now you have an interactive visualization of a contour plot.

ContourMAP