Cloud Native Geospatial Outreach Day
7:00 UTC
00:00 PDT
Intro Session: Introduction to Geospatial Raster Data
Sara Safavi, Planet
A gentle introduction to foundational geospatial data concepts, with a focus on raster data specifically. No prior knowledge is necessary, this talk will assume you are starting from scratch. Facilitated by Alex Leith, and will transition immediately to the next session (same zoom connection)
7:30 UTC
00:30 PDT
Intro Session: Getting Started with Digital Earth Africa
Alex Leith, Digital Earth Africa
This talk will cover an introduction to Digital Earth Africa, including our software and data ecosystems, and will dive into a brief training course on doing data science in our Jupyter environment.
You'll learn how to view our Cloud Optimised GeoTIFFs on a web map, how to analyse Earth observation data in Jupyter, using the Open Data Cube, and about how you can use our Sentinel-2 analysis ready data directly in your applications.
8:00 UTC
1:00 PDT
Intro Session: STAC Intro and Q&A
Matthias Mohr, OpenEO
Overview of the STAC Specification. Will give background on the effort, the motivations behind it, and dig into each of the 'sub-specs': Item, Catalog, Collection and STAC API. The session will then be an open forum to ask any question about STAC, to help get everyone up to speed.
11:30 UTC
4:30 PDT
Intro Session: Sentinel-Hub Intro and COG
Grega Milcinski & Max Kampen, Sinergise
-Sentinel Hub - overall intro (Grega Milcinski, Sentinel Hub) - 20 minutes
-COG@Sentinel Hub (Max Kampen, Sentinel Hub) - 20 minutes, pre-recorded
-Q&A (Grega Milcinski, Sentinel Hub)
13:30 UTC
6:30 PDT
Cloud Native Geospatial Overview & Sponsor Remarks
Welcome & Opening Remarks - Chris (Planet) + Bruno (Microsoft)
Data Labeling Contest Details - Hamed Alemohammad (Radiant Earth Foundation)
Cloud Native Geospatial Overview - Chris Holmes
Planetary Computer + AI for Earth - Bruno Sánchez-Andrade Nuño (Microsoft)
14:15 UTC
7:15 PDT
Lightning Talks Round 1
Digital Earth Africa: using Earth observation data to address Africa's development challenges
Fang Yuan, GeoScience Australia & Digital Earth Africa
I will present a brief overview of Digital Earth Africa, including what we aim to do, why free and easy to use Earth observation data and information products are important, and how this continental scale effort is powered by STAC/COG.
STAC & COG at Arturo
Jeff Albrecht, Arturo
Brief overview of why/how we use STAC at Arturo and the STAC related tools and catalogs we have built.
The AWS Open Data Program
Zac Flamig, Amazon Web Services
We will describe the AWS Open Data Program including current datasets available in cloud optimized formats.
Overview of STAC at Maxar
Caitlin Mahanna, Maxar
Why Maxar chose to use STAC and how we have implemented it.
NASA's Common Metadata Repository (CMR) STAC Interface
Drew Daniel, Element 84
A recent STAC implementation that exposes and proxies NASA's CMR geospatial data.
Zarr: chunked, compressed, multidimensional arrays
Anderson Banihirwe, UCAR/Pangeo
This talk will give a brief introduction to zarr's functionality including compression, data storage and parallel computing. This talk will also demonstrate the integration of zarr with data discovery/cataloging tools such as intake.
Enabling STAC and pixel-based access services to Sentinel-5P
Simone Mantovani, MEEO
In this talk we will present the current status and future plans of Sentinel-5P Level-2 and Level-3 datasets available in the AWS Open Data Registry: native data format are collected in Frankfurt region close to other Copernicus datasets to support the development of cross-satellite applications, while Cloud Optimised Geotiff products, including daily mosaics (Level-3) products, are generated to boost the exploitation of Ozone (O3), Nitrogen Dioxide (NO2), Sulphur Dioxide (SO2), Carbon Monoxide (CO), Methane (CH4), Formaldehyde (HCHO) and Aerosol products.
User interfaces to support data exploitation will be presented, including STAC and ADAM Explorer / adamapi.
Introduction to DotNetStac
Emmanuel Mathot, Terradue
I shall present DotNetStac library with a practical how to using small quick examples. I shall complete it's usage in a more complex project of a command line tool to read and navigate in STAC catalogs
COG with GDAL
Even Rouault, Spatialys
Quick overview of COG file organization and how GDAL uses it, presentation of the new COG driver of GDAL 3.1
ESA FedEO STAC Implementation
Yves Coene, Spacebel/ESA
How the STAC specifications are being exploited in addition to OGC API Features interfaces in the ESA ESE-ERGO Project to allow access to ESA FedEO and EOCAT assets simultaneously accessible using OGC OpenSearch interfaces with Atom and GeoJSON responses.
Exploring COGs in the browser with geotiff.js and COG-Explorer
Fabian Schindler, EOX
This lightning talk will detail the capabilities of remote data exploration of potentially huge COG files using the geotiff.js library and the COG-Explorer Browser Application
Sentinel Hub STAC API
Rok Mocnik, Sinergise
How we have implemented STAC API to provide access to Sentinel Hub's catalogue of global archives of Sentinel-1,2,3,5p, Landsat-8, MODIS and other datasets, technical information on data and its storage. We will also discuss deviations from stac-spec due to implementation details.
Applying STAC to Planetary Images
Robin Fergason, USGS
Differences (and similarities) between terrestrial spatial data and planetary data, unique challenges to planetary data, how STAC is helping us meet or goals for cloud-based data release of derived data products.
OGC + Cloud Native Geospatial
Scott Simmons, Open Geospatial Consortium
OGC standards and practices must work in the Cloud. Extensive work in OGC's Innovation Program has been performed to assess suitability of existing standards or identify requirements for new standards for Cloud-Native operations. OGC recognizes that the geospatial community is evolving tailored specifications to work most effectively in the cloud and proposes means to endorse those specifications as part of the community standards baseline.
Intro Sessions Round 2 (Parallel Sessions)
16:00 UTC
9:00 PDT
Intro Session Track 1: Data Labeling with Groundwork (training for the contest)
Niki LaGrone & Joe Morrison, Azavea
Along with the STAC sprint, Radiant Earth is hosting labeling competition--this talk will walk the audience through how to sign up, how to get started labeling, and how entrants will be judged.
16:00 UTC
9:00 PDT
Intro Session Track 2: Intro to STAC API
Phil Varner, Astraea
Principles and patterns for querying imagery from a STAC API-compliant web service
16:00 UTC
9:00 PDT
Intro Session Track 3: Creating STACs with PySTAC
Rob Emanuele, Azavea
In this beginner session we'll walk through creating a STAC of open data using PySTAC in a Jupyter notebook.
Intro Sessions Round 3 (Parallel Sessions)
16:40 UTC
9:40 PDT
Intro Session Track 1: Machine Learning and Satellite Imagery overview
Dave Luo, Anthropocene Labs
A beginner-friendly session (no geospatial or ML background needed) that walks through a common ML-for-mapping workflow (mapping building footprints from overhead imagery) with emphasis on what/why/how labeled data is important, defining key geospatial ML terms and concepts, and how training datasets are enhanced by using COG and STAC. As an example, we'll also introduce the Open Cities AI Challenge dataset of drone imagery (as COGs) and building footprint labels across 10+ African cities organized in a STAC.
16:40 UTC
9:40 PDT
Intro Session Track 2: COG's and STAC with Titiler & arturo-stac-api
Jeff Albrecht, Arturo & Vincent Sarago, Development Seed
Titiler is a modern cloud optimized geotiff tile server with lots of cool features. Arturo-stac-api is a STAC-compliant API with an OGC Tiles/Titiler integration to render map tiles from the catalog. Both projects are backed by FastAPI and part of a larger goal to make it easier to configure and deploy cloud native geospatial architecture.
16:40 UTC
9:40 PDT
Intro Session Track 3: Cirrus - Open-Source Geospatial Pipeline for STAC
Matthew Hanson, Element84
A recently open-sourced project, Cirrus, is an AWS pipeline for scaling up processing and publishing of STAC metadata and assets. Use Cirrus to just publish STAC Items, or convert some of the assets to COGs while mirroring the others as is.
Intro Sessions Round 4 (Parallel Sessions)
17:20 UTC
10:20 PDT
Intro Track 1 Session: Browsing and Downloading Assets from STAC APIs with the QGIS STAC Plugin
Kevin Booth, Radiant Earth Foundation
In this talk users will learn how to install the QGIS STAC Plugin, configure the plugin for a STAC API, browse collections within the APIs, and download assets from those APIs.
17:20 UTC
10:20 PDT
Intro Session Track 2: Getting Started with Franklin
James Santucci, Azavea
This intro session will walk people through getting started with Franklin in a more hands on and more thorough way than is currently available in the documentation.
Topics to cover will be:
- starting the Franklin service
- getting data into Franklin
- STAC transactions
- serving raster tiles
- validation and fixing items with transactions
- talking to Franklin with PySTAC
17:20 UTC
10:20 PDT
Intro Session Track 3: STAC Validation
James Banting, SparkGeo & Matthias Mohr, OpenEO
In this talk, we will go over the different tools you can use to validate STAC items
18:00 UTC
11:00 PDT
Lightning Talks Round 2
SparkGeo and STAC
James Banting, SparkGeo
In this Lightning Talk, we will describe how and why we embraced STAC early on in the development. We will discuss some of the projects we were able to integrate STAC into and why we choose to go that route.
Introduction to TileDB
Norman Barker, TileDB
The Google Earth Engine Data Catalog
Kurt Schwehr, Google
Earth Engine is the combination of a large data catalog with a planetary scale computation platform to analyze and visualize Earth data on Google's infrastructure. The system currently has 35PB of open Earth data in over 550 collections with many of those collections continually updating. Earth Engine’s mission is to expand our knowledge of the planet and use the system to power solutions to global challenges. Combining data and computation, scientists, researchers, and developers can detect changes, maps trends, and quantify differences across the planet. Users of the system must be able to find and use that massive data collection for Earth Engine to be effective. We have developed an internal catalog format based on ProtoBufs expressed as YAML. Each collection in the catalog contains basic metadata, a thumbnail view, and sample code to get users started working with the collection. While this format has worked well so far, we would like to switch to the SpatioTemporal Asset Catalog (STAC) format to enable several goals: 1) User contributions to the public catalog need catalog entries. 2) User contributed improvements to existing catalog entries. 3) Take advantage of existing STAC tools to help enable exploration of the Earth Engine catalog. 4) User owned and maintained (public or private) catalogs in cloud storage. 5) Enable more interaction between Earth Engine and systems that are outside of the core Google infrastructure.
STAC Index
Matthias Mohr, OpenEO
A quick look into STAC Index, the list of STAC catalogs, APIs and tools that offers a free STAC Browser for all datasets
How cloud-based data warehouses will change the industry at $5 per TeraByte
Javier de la Torre, Carto
Cloud Native Geospatial has started on the raster world, but vector is as important. The cloud-first approach to vector these days come from the next generation of data warehouses. Google Bigquery, Snowflake, Azure Synapse and AWS Athena are some of the players on the space competing. The proposal for storage-computing separation, multitenancy and scalability will redefine how spatial data infrastructures get build on the future. CARTO is been applying this into its Data Observatory and in this talk we will talk about it.
How Astraea Uses COGs and STAC API to Support Geospatial Data Science
Phil Varner, Astraea
This talk will be about Astraea's use of standards-based formats and interfaces, such as COG and STAC API, to allow fast and interoperable access to satellite imagery, allowing data scientists to more easily query, explore, and analyze large imagery collections.
CBERS on AWS
Frederico Liporace, Kepleres
I'll briefly describe the two stacks developed to ingest CBERS data into AWS: one responsible for data transfer and COG generation and the other responsible for indexing the content using STAC.Most of the developed software is open, I'll provide links and discuss the architecture.
Attendees will learn about this experience and the online resources available.
Adoption of Remote Sensing in Sub-saharan Africa
Stephen Korir, Data Driven Agriculture
Adoption of Remote Sensing in subsaharan Africa, and the bottlenecks that is still slowing down it's full adoption
The Needle in the STAC
Marten Hogeweg, ESRI
Building on our experience in data discovery in the context of spatial data infrastructures, we have learned that common standards-based dataset metadata is not suitable for all types of content, including specifically large scale collections of earth observation data. Using our open source Geoportal Server we have built an index that addresses these challenges with large collections of content. In this talk you will hear about these developments and see them in action.
Radiant MLHub - A Cloud-Native ML Commons for Geospatial Data
Hamed Alemohammad, Radiant Earth Foundation
A high-level overview of Radiant MLHub mission, and STAC catalog to develop a ML commons for Earth observation.
Leveraging Cloud-Optimized GeoTIFFs to Enable New Methods in Web Mapping
Kyle Barron, Unfolded
The future of geospatial data visualization is in the browser. WebGL enables fast client-side satellite imagery processing. But users are far from the source data, and direct access to large collections of Cloud Optimized GeoTIFFs (e.g. Landsat 8 on AWS) would incur unacceptable latency. With COGs, it's possible to quickly combine a mosaic of imagery on demand into tiles ready for consumption in the browser. I'll provide an overview of my recent open-source work to enable fast dynamic image tiling, and show examples of its potential in the browser.
Radio-Occultation is not an occult science
Leo Thomas, Development Seed
Creating a STAC catalog out of radio-occultation data is an interesting and unique challenge. I want to discuss some of the successes and failures we've encountered in doing so, and I hope to provide an interesting and off-the-beaten path example of STAC in action.
Edge Compute: Cool Stuff You Can Do With COGs in the Browser
Daniel Dufour, GeoSurge
This talk will go over cool stuff you can do with GeoRasterLayer, a plugin for easily visualizing Cloud Optimized GeoTIFFs on a Leaflet web map.
Intake-STAC: Interactive catalog analysis with Python
Scott Henderson, University of Washington / Pangeo Project
A common pattern for geospatial analysis goes like this: 1) search and discover data. 2) download… and wait. 3) open and analyze with your favorite programming language. What if you could skip the downloading and go straight to interactive analysis in Python? You can with “Intake-STAC”, a Python package designed to simplify geospatial data management! Tune in for quick example using Cloud-hosted public data.
Raster Data Tiles from COG’s using Numpy and OpenLayers
Dan "Ducky" Little, Planet
Planet has developed an approach for working with raster data in the browser with OpenLayers. A brief demo of how it's used, how it works, and implementation details will be covered.
Pointclouds and STAC
Chris Helm, Pixel8
Introduction to the STAC Pointcloud Extension and the STAC-ification of the USGS 3DEP Lidar Dataset.
STAC + protobufs at Near Space Labs
David Raleigh, Near Space Labs
Why we chose to use STAC for all our infrastructure from day one at Near Space Labs. How we use a protobuf version of STAC and gRPC for our internal services and for our public API
2020 NASA SpaceApps Challenge and OceanHackweek: Cloud Optimized GeoTiffs and Zarr on AWS PDS
Aimee Barciauskas, Development Seed
NASA's Earth Observation Division supported the Covid-19 SpaceApps Challenge (https://covid19.spaceappschallenge.org/) through sharing data to AWS Public Datasets Program. Working with NASA stakeholders and science advisors, we produced Cloud-Optimized Geotiffs for MODIS Aerosol Optical Depth, Vegetation Indices and OMI NO2 products. To support ocean scientists and UW OceanHackweek, we worked in a similar fashion to produce MUR SST in Zarr.
These products enable users to explore trends in these environmental variables. During this talk, we share important factors that went into our process of delivering COGs and Zarr stores and what should be replicated or improved on in the future.
resto - a stac search engine for geospatialized data
Jérôme Gasperi
Presentation of resto server and rocket client ecosystem
pygeoapi STAC updates
Tom Kralidis, Environment and Climate Change Canada
This talk will provide an overview of pygeoapi and discuss recent STAC updates and enhancements.
Intro Sessions Round 5 (Parallel Sessions)
20:00 UTC
13:00 PDT
Intro Session Track 1: Introduction to TileDB
Norman Barker, TileDB
The talk will give an intro to TileDB and its use as a cloud native universal data engine for geospatial data from point clouds to hyperspectral!
20:00 UTC
13:00 PDT
Intro Session Track 2: Introduction to Multi-Scale Ultra High Resolution (MUR) Sea Surface Temperature (SST) Zarr on AWS PDS
Aimee Barciauskas, Development Seed
This session will introduce the MUR SST Dataset the Zarr data format. The MUR SST dataset is stored in Zarr on AWS PDS. Sea Surface Temperature is a key climate change variable and is being used to study how ocean temperature can impact other environmental factors such as biodiversity. This session will demo the dataset itself, as it was used during OceanHackweek to study marine heat wave variability. We will discuss the benefits of the Zarr storage format when compared with traditional file stores and cover how different Zarr chunk shapes can impact query performance.
Intro Sessions Round 6 (Parallel Sessions)
20:40 UTC
13:40 PDT
Intro Session Track 1: Packing Your Geospatial Data Science Toolkit
Ash Hoover, Planet
Intro to open source tools for geospatial data analysis and visualization
20:40 UTC
13:40 PDT
Intro Session Track 2: Intake-STAC: Interactive catalog analysis with Python
Scott Henderson, University of Washington / Pangeo Project
Inake-STAC is an open source Python package for discovering, exploring, and loading spatio-temporal datasets. It facilitates working with geospatial data by providing default visualizations and functions for directly loading data into Python objects like Xarray Datasets or Pandas Dataframes. Using intake-STAC allows you to stop worrying about file paths and urls and instead focus on analysis. You'll also learn how Cloud-optimized Geotiffs (COGs), Xarray, and Dask work in a complimentary way to parallelize computations on raster data.
Intro Sessions Round 7 (Parallel Sessions)
21:20 UTC
14:20 PDT
Intro Session Track 1: STAC & COG Intro + Q&A
Chris Holmes, Planet / Radiant Earth Foundation
This session will dig deeper into how COG & STAC work. It will go into each of the 'sub-specs': Item, Catalog, Collection and STAC API. And it will provide a deeper dive on how COG's are set up. The session will then be an open forum to ask any question about STAC or COG, so if you still have questions at the end of the day come here and get them answered! It will be similar to Matthias's session at the beginning of the day.
21:20 UTC
14:20 PDT
Intro Session Track 2: Radiant MLHub - A Cloud-Native ML Commons for Geospatial Data
Hamed Alemohammad, Radiant Earth Foundation
I will provide a brief overview of Radiant MLHub catalog and datasets. Then present a live demo of how users can use our STAC API to search and access training datasets. I will use the LandCoverNet as an example, share the notebook in advance so participants can follow the demo live.