Controlled Vocabulary#
This page defines the controlled vocabulary for eoio output datasets. All readers must produce datasets that conform to this specification to ensure consistency across missions and products, and to maximise compliance with community standards.
Standards Alignment#
eoio datasets are aligned with two established conventions:
- CF Conventions (CF-1.8)
Used for coordinate systems, variable naming attributes, units (UDUNITS-2), grid mapping variables, and core metadata attributes.
- ACDD (Attribute Convention for Data Discovery)
Used for dataset discovery metadata: platform, instrument, processing level, and related discovery fields.
All eoio output datasets must declare:
ds.attrs["Conventions"] = "CF-1.8"
Dataset (Global) Attributes#
CF / ACDD Required Attributes#
The following attributes are required by CF or ACDD and must be present in every eoio output dataset.
Attribute |
Description |
Example |
|---|---|---|
|
CF version string |
|
|
Short human-readable dataset title |
|
|
Organisation producing the dataset |
|
|
Upstream product filename or identifier |
|
|
Audit trail of modifications; append, never overwrite |
|
|
References to product documentation |
|
|
Platform canonical token (see Controlled Values — Platform) |
|
|
Instrument canonical token (see Controlled Values — Instrument) |
|
|
Processing level token (see Controlled Values — Processing Level) |
|
eoio-Specific Required Attributes#
In addition to the CF / ACDD attributes, the following eoio-specific attributes are required.
Attribute |
Description |
Example |
|---|---|---|
|
Full upstream product identifier |
|
|
Stable collection identifier |
|
|
Product or algorithm version string |
|
|
Normalised processing level token (same controlled values as |
|
Controlled Values — Platform#
The platform attribute must use one of the following canonical tokens.
Token |
Notes |
|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
If the platform you are adding is not listed here, add it to this table as part of your pull request.
Controlled Values — Instrument#
The instrument attribute must use one of the following canonical tokens.
Token |
Notes |
|---|---|
|
Multispectral Instrument (Sentinel-2) |
|
Ocean and Land Colour Instrument (Sentinel-3) |
|
Sea and Land Surface Temperature Radiometer (Sentinel-3) |
|
Operational Land Imager + Thermal Infrared Sensor (Landsat 8/9) |
|
Meteosat Visible and Infrared Imager |
|
Spinning Enhanced Visible and Infrared Imager |
|
PlanetScope 8-band instrument |
Controlled Values — Processing Level#
The processing_level (and product_level) attribute must use one of these tokens.
Token |
Meaning |
|---|---|
|
Raw instrument data |
|
Calibrated, geolocated radiances |
|
Geometrically corrected, top-of-atmosphere reflectance |
|
Atmospherically corrected surface reflectance (sensor-specific algorithm) |
|
Surface-level product (generic) |
|
Gridded / composited product |
Variable Attributes#
CF-Aligned Variable Attributes#
Each measurement, auxiliary, and mask variable should carry the following CF-compliant attributes where applicable.
Attribute |
Description |
|---|---|
|
CF Standard Name Table entry where one exists. Case-sensitive, no whitespace. |
|
Human-readable description, used for plot labels and documentation. |
|
UDUNITS-2 compliant string. A variable with a |
|
Space-separated list of coordinate variables associated with the variable. |
|
Name of the grid-mapping variable (e.g. |
|
Space-separated list of associated ancillary variables (e.g. uncertainty). |
|
Optional free-text note. |
See the CF Conventions documentation for complete descriptions of acceptable values.
eoio-Specific Variable Attributes#
The following additional attributes are used by eoio for classification and interoperability.
Attribute |
Description |
Allowed values |
|---|---|---|
|
Normalised measurand classification |
|
|
Nominal spatial resolution; pattern |
e.g. |
|
Variable geometry classification |
|
Dimension Naming#
Canonical Dimensions#
Dimension name |
When to use |
|---|---|
|
Temporal dimension |
|
Only when coordinates are true 1-D geographic arrays |
|
Projected grid dimensions; coordinate variables must carry CF
|
|
Spectral band index dimension |
Multi-Resolution Grids#
Where a single dataset contains variables at multiple spatial resolutions,
use the pattern x_<resolution> / y_<resolution>:
Examples |
Notes |
|---|---|
|
10 m resolution (e.g. Sentinel-2 high-res bands) |
|
20 m resolution |
|
60 m resolution |
|
300 m resolution (e.g. Sentinel-3 OLCI) |
|
5 km resolution (e.g. MSG / SEVIRI) |
Each dimension coordinate variable must carry CF-compliant standard_name
and units attributes.