
Commit d72a4f1

Merge branch 'main' into isic-archive
2 parents 4fe145f + b2d4982 commit d72a4f1

20 files changed: +348 -30 lines changed

datasets/1000-genomes.yaml

+3
Original file line number | Diff line number | Diff line change
@@ -20,6 +20,9 @@ Resources:
2020
DataAtWork:
2121
Tutorials:
2222
Tools & Applications:
23+
- Title: Examine genomic variation across populations with AWS
24+
URL: https://aws.amazon.com/blogs/industries/examine-genomic-variation-across-populations-with-aws/
25+
AuthorName: Konstantinos Tzouvanas
2326
Publications:
2427
- Title: Exploratory data analysis of genomic datasets using ADAM and Mango with Apache Spark on Amazon EMR
2528
URL: https://aws.amazon.com/blogs/big-data/exploratory-data-analysis-of-genomic-datasets-using-adam-and-mango-with-apache-spark-on-amazon-emr/

datasets/africa-field-boundary-labels.yaml

+23 -7
@@ -1,12 +1,15 @@
11
Name: A region-wide, multi-year set of crop field boundary labels for Africa
22
Description: >
33
Crop field boundaries digitized in Planet imagery collected across Africa
4-
between 2017 and 2023, developed by [Farmerline](https://farmerline.co/),
5-
[Spatial Collective](https://spatialcollective.com/), and the
6-
[Agricultural Impacts Research Group](https://agroimpacts.info/) at
7-
[Clark University](https://www.clarku.edu/), with support from the
8-
[Lacuna Fund](https://lacunafund.org/)
9-
Documentation: "https://github.com/agroimpacts/lacunalabels/"
4+
between 2017 and 2023, developed by [Farmerline](https://farmerline.co/), [Spatial Collective](https://spatialcollective.com/),
5+
and the [Agricultural Impacts Research Group](https://agroimpacts.info/) at [Clark University](https://www.clarku.edu/), with support from the
6+
[Lacuna Fund](https://lacunafund.org/) ([Estes et al, 2024](https://arxiv.org/abs/2412.18483); [Wussah et al. (2023)](https://zenodo.org/records/11060871)). This dataset has been
7+
further supplemented by additional labels collected primarily for
8+
for 2018 over a subset of countries, which provide an example of their
9+
application in training and validating a CNN-based cropland mapping model
10+
[(Khallaghi et al. 2025)](https://www.mdpi.com/2072-4292/17/3/474).
11+
Documentation: Information on the primary dataset can be found [here](https://github.com/agroimpacts/lacunalabels/).
12+
Documentation for added labels is available [here](https://github.com/agroimpacts/cnn-generalization-enhancement).
1013
1114
ManagedBy: "[The Agricultural Impacts Research Group](https://agroimpacts.info/)"
1215
UpdateFrequency: "Updated versions of the dataset are added as they are developed"
@@ -19,10 +22,14 @@ Tags:
1922
- labeled
2023
License: "[Planet NICFI participant license agreement](https://assets.planet.com/docs/Planet_ParticipantLicenseAgreement_NICFI.pdf)"
2124
Resources:
22-
- Description: Field boundary labels and corresponding Planet images
25+
- Description: Field boundaries and corresponding Planet images
2326
ARN: arn:aws:s3:::africa-field-boundary-labels
2427
Region: us-west-2
2528
Type: S3 Bucket
29+
- Description: '[Additional rasterized field labels and corresponding Planet images](https://www.mdpi.com/2072-4292/17/3/474)'
30+
ARN: arn:aws:s3:::africa-field-boundary-labels/extra
31+
Region: us-west-2
32+
Type: S3 Bucket
2633
DataAtWork:
2734
Tutorials:
2835
- Title: Instructions on data access and label-making demonstration notebook
@@ -31,6 +38,12 @@ DataAtWork:
3138
AuthorName: Lyndon Estes
3239
AuthorURL: https://github.com/ldemaz
3340
Publications:
41+
- Title: Generalization enhancement strategies to enable cross-year cropland mapping with convolutional neural networks trained using historical samples
42+
URL: https://www.mdpi.com/2072-4292/17/3/474
43+
AuthorName: Khallaghi et al. (2025)
44+
- Title: A region-wide, multi-year set of crop field boundary labels for Africa
45+
URL: https://arxiv.org/abs/2412.18483
46+
AuthorName: Estes et al. (2024)
3447
- Title: A region-wide, multi-year set of crop field boundary labels for Africa
3548
URL: https://zenodo.org/records/11060871
3649
AuthorName: Wussah et al. (2023)
@@ -43,3 +56,6 @@ DataAtWork:
4356
- Title: A platform for crowdsourcing the creation of representative, accurate landcover maps
4457
URL: http://www.sciencedirect.com/science/article/pii/S136481521630010X
4558
AuthorName: Estes et al. (2016)
59+
Citation: >
60+
Primary dataset: Estes et al. (2024). A region-wide, multi-year set of crop field boundary labels for Africa. arXiv:2412.18483.
61+
Additional labels: Khallaghi et al. (2025). Generalization enhancement strategies to enable cross-year cropland mapping with convolutional neural networks trained using historical samples. Remote Sensing, 17(3), 474.
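
Note on the Resources entries above: the added resource uses an ARN with a key prefix (`africa-field-boundary-labels/extra`) while still declaring `Type: S3 Bucket`. A minimal sketch of how a consumer might split such an ARN into a bucket name and a prefix follows. The helper name `split_s3_arn` is hypothetical and is not part of this repository or of any AWS SDK; it only does string handling and makes no network calls.

```python
from __future__ import annotations


def split_s3_arn(arn: str) -> tuple[str, str]:
    """Split an S3 ARN into (bucket, key_prefix).

    The prefix is an empty string when the ARN names only a bucket.
    Hypothetical helper for illustration; not an AWS SDK function.
    """
    marker = "arn:aws:s3:::"
    if not arn.startswith(marker):
        raise ValueError(f"not an S3 ARN: {arn!r}")
    resource = arn[len(marker):]
    # Everything before the first "/" is the bucket; the rest is the prefix.
    bucket, _, prefix = resource.partition("/")
    return bucket, prefix


# The two ARNs listed under Resources in this file's diff.
for arn in (
    "arn:aws:s3:::africa-field-boundary-labels",
    "arn:aws:s3:::africa-field-boundary-labels/extra",
):
    bucket, prefix = split_s3_arn(arn)
    print(f"bucket={bucket} prefix={prefix!r}")
```

With a split like this, both resources resolve to the same bucket, and the second differs only by the `extra` prefix.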

datasets/ag-loam.yaml

+36
@@ -0,0 +1,36 @@
1+
Name: AG-LOAM Dataset
2+
Description: |
3+
AG-LOAM dataset has been released to facilitate the evaluation of LiDAR-based odometry algorithms in agricultural environments.
4+
1) It was collected by a wheeled mobile robot at the Agricultural Experimental Station of the University of California, Riverside, during Winter 2022 and Winter 2023.
5+
2) It provides LiDAR point cloud data captured using a Velodyne VLP-16 sensor, along with ground-truth trajectories obtained from an RTK-GPS system.
6+
3) It consists of 18 sequences collected over three phases, covering diverse planting environments, terrain conditions, path patterns, and robot motion profiles.
7+
4) It spans a total operation time of 3 hours, covers a total distance of 7.5 km, and constitutes 150 GB of data.
8+
Documentation: https://github.com/UCR-Robotics/AG-LOAM
9+
Contact: Hanzhe Teng ([email protected]), Konstantinos Karydis ([email protected])
10+
ManagedBy: "[Autonomous Robots and Control Systems Lab](https://sites.google.com/view/arcs-lab)"
11+
UpdateFrequency: NA
12+
Tags:
13+
- aws-pds
14+
- robotics
15+
- agriculture
16+
- lidar
17+
- localization
18+
- mapping
19+
License: Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0).
20+
Resources:
21+
- Description: AG-LOAM Dataset sequences
22+
ARN: arn:aws:s3:::ucr-robotics/ag-loam-dataset
23+
Region: us-west-2
24+
Type: S3 Bucket
25+
DataAtWork:
26+
Tools & Applications:
27+
- Title: Source code of the LiDAR-only odometry and mapping system
28+
URL: https://github.com/UCR-Robotics/AG-LOAM
29+
AuthorName: Hanzhe Teng et al.
30+
Publications:
31+
- Title: Adaptive LiDAR Odometry and Mapping for Autonomous Agricultural Mobile Robots in Unmanned Farms
32+
URL: https://www.sciencedirect.com/science/article/pii/S0168169925001292
33+
AuthorName: Hanzhe Teng, Yipeng Wang, Dimitrios Chatziparaschis, Konstantinos Karydis
34+
- Title: Adaptive LiDAR Odometry and Mapping for Autonomous Agricultural Mobile Robots in Unmanned Farms
35+
URL: https://arxiv.org/abs/2412.02899
36+
AuthorName: Hanzhe Teng, Yipeng Wang, Dimitrios Chatziparaschis, Konstantinos Karydis

datasets/amazonia.yaml

+4 -4
@@ -11,7 +11,7 @@ Description: |
1111
Level 4 archive will be ingested by the end of October 2022.
1212
Documentation: http://www.inpe.br/amazonia1
1313
Contact: https://lists.osgeo.org/mailman/listinfo/cbers-pds
14-
ManagedBy: "[AMS Kepler](https://amskepler.com/)"
14+
ManagedBy: "[Frederico Liporace](https://github.com/fredliporace)"
1515
UpdateFrequency: Daily
1616
Collabs:
1717
ASDI:
@@ -36,7 +36,7 @@ Resources:
3636
Type: S3 Bucket
3737
RequesterPays: False
3838
Explore:
39-
- '[STAC V1.0.0 endpoint](https://stac.amskepler.com/v100)'
39+
- '[STAC V1.0.0 endpoint](https://stac.scitekno.com.br/v100)'
4040
- '[stacindex](https://stacindex.org/catalogs/cbers)'
4141
- Description: STAC static catalog
4242
ARN: arn:aws:s3:::br-eo-stac-1-0-0
@@ -71,8 +71,8 @@ DataAtWork:
7171
- Amazon Athena
7272
Tools & Applications:
7373
- Title: STAC V1.0.0 endpoint
74-
URL: https://stac.amskepler.com/v100
75-
AuthorName: AMS Kepler
74+
URL: https://stac.scitekno.com.br/v100
75+
AuthorName: Frederico Liporace
7676
AuthorURL: https://github.com/fredliporace
7777
- Title: Amazonia 1 stactools package
7878
URL: https://github.com/stactools-packages/amazonia-1

datasets/cbers.yaml

+3 -3
@@ -15,7 +15,7 @@ Description: |
1515
scenes are being ingested starting from 10-12-2022.
1616
Documentation: https://github.com/fredliporace/cbers-on-aws
1717
Contact: https://lists.osgeo.org/mailman/listinfo/cbers-pds
18-
ManagedBy: "[AMS Kepler](https://amskepler.com/)"
18+
ManagedBy: "[Frederico Liporace](https://github.com/fredliporace)"
1919
UpdateFrequency: Daily
2020
Collabs:
2121
ASDI:
@@ -39,7 +39,7 @@ Resources:
3939
Type: S3 Bucket
4040
RequesterPays: False
4141
Explore:
42-
- '[STAC V1.0.0 endpoint](https://stac.amskepler.com/v100)'
42+
- '[STAC V1.0.0 endpoint](https://stac.scitekno.com.br/v100)'
4343
- '[stacindex](https://stacindex.org/catalogs/cbers)'
4444
- Description: STAC static catalog
4545
ARN: arn:aws:s3:::br-eo-stac-1-0-0
@@ -69,7 +69,7 @@ DataAtWork:
6969
- Amazon DynamoDB
7070
Tools & Applications:
7171
- Title: STAC V1.0.0 endpoint
72-
URL: https://stac.amskepler.com/v100
72+
URL: https://stac.scitekno.com.br/v100
7373
AuthorName: AMS Kepler
7474
AuthorURL: https://github.com/fredliporace/cbers-2-stac
7575
- Title: EOS Land Viewer
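
Note on the hunk above: the endpoint URL moves to `stac.scitekno.com.br`, but the unchanged context line `AuthorName: AMS Kepler` remains in this file, whereas amazonia.yaml updates the corresponding field. The commit does not say whether that is intentional. A minimal sketch for listing leftover references after a rename like this follows; the function name `find_stale_lines` and the search strings are assumptions chosen for illustration, and the sample text copies the four lines shown in the hunk with indentation guessed.

```python
from __future__ import annotations


def find_stale_lines(text: str, needles: tuple[str, ...]) -> list[tuple[int, str]]:
    """Return (line_number, stripped_line) for lines containing any needle.

    Matching is case-insensitive. Hypothetical helper for illustration.
    """
    lowered_needles = [needle.lower() for needle in needles]
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        lowered = line.lower()
        if any(needle in lowered for needle in lowered_needles):
            hits.append((number, line.strip()))
    return hits


# Post-merge state of the entry, as shown in the hunk (indentation assumed).
SAMPLE = """\
- Title: STAC V1.0.0 endpoint
  URL: https://stac.scitekno.com.br/v100
  AuthorName: AMS Kepler
  AuthorURL: https://github.com/fredliporace/cbers-2-stac
"""

stale = find_stale_lines(SAMPLE, ("amskepler.com", "AMS Kepler"))
for number, line in stale:
    print(f"line {number}: {line}")
```

Run against the real files, the same function could be fed each YAML file's contents to flag any remaining mentions for manual review.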

datasets/cmas-data-warehouse.yaml

+6
@@ -60,6 +60,12 @@ Resources:
6060
Type: S3 Bucket
6161
Explore:
6262
- '[Browse Bucket](https://cmas-equates.s3.amazonaws.com/index.html)'
63+
- Description: CMAQ 2021 Modeling Platform
64+
ARN: arn:aws:s3:::2021platform
65+
Region: us-east-1
66+
Type: S3 Bucket
67+
Explore:
68+
- '[Browse Bucket](https://2021platform.s3.amazonaws.com/readme.html)'
6369
- Description: CMAQ 2019 Modeling Platform
6470
ARN: arn:aws:s3:::cmaq-2019-modeling-platform
6571
Region: us-east-1

datasets/geoschem-input-data.yaml

+10 -6
@@ -1,6 +1,6 @@
11
Name: GEOS-Chem Input Data
22
Description: Input data for the GEOS-Chem Chemical Transport Model, includes NASA/GMAO MERRA-2 and GEOS-FP [meteorological products](https://geos-chem.readthedocs.io/en/latest/gcclassic-user-guide/input-overview.html#met), [chemistry input data](https://geos-chem.readthedocs.io/en/latest/gcclassic-user-guide/input-overview.html#chemistry-input-data), [emissions input data](https://geos-chem.readthedocs.io/en/latest/gcclassic-user-guide/input-overview.html#emis-inputs), and other smaller datasets such as model [initial conditions](https://geos-chem.readthedocs.io/en/latest/gcclassic-user-guide/input-overview.html#initial-conditions-input-data>).
3-
Documentation: https://geos-chem.readthedocs.io/en/latest/geos-chem-shared-docs/supplemental-guides/geos-chem-input-data-on-aws.html
3+
Documentation: https://geos-chem.readthedocs.io/en/stable/geos-chem-shared-docs/doc/gcid-portal-overview.html
44
Contact: https://geoschem.github.io/support-team.html
55
UpdateFrequency: New meteorological and emission data will be added when available.
66
ManagedBy: "[GEOS-Chem Support Team](https://geoschem.github.io/support-team.html)"
@@ -28,18 +28,22 @@ Resources:
2828
- '[Browse Bucket](https://geos-chem.s3.amazonaws.com/index.html)'
2929
DataAtWork:
3030
Tutorials:
31-
- Title: Getting started with GEOS-Chem Input Data
32-
URL: https://geos-chem.readthedocs.io/en/latest/geos-chem-shared-docs/supplemental-guides/getting-started-aws.html
31+
- Title: GEOS-Chem Input Data on AWS cloud
32+
URL: https://geos-chem.readthedocs.io/en/stable/geos-chem-shared-docs/doc/gcid-portal-overview.html
3333
AuthorName: GEOS-Chem Support Team
3434
AuthorURL: http://geos-chem.org/support-team
3535
- Title: GEOS-Chem Classic Quickstart Guide
36-
URL: https://geos-chem.readthedocs.io/en/latest/getting-started/quick-start.html
36+
URL: https://geos-chem.readthedocs.io/en/stable/getting-started/quick-start.html
3737
AuthorName: GEOS-Chem Support Team
3838
AuthorURL: http://geos-chem.org/support-team
3939
- Title: GCHP Quickstart Guide
40-
URL: https://gchp.readthedocs.io/en/latest/getting-started/quick-start.html
40+
URL: https://gchp.readthedocs.io/en/stable/getting-started/quick-start.html
4141
AuthorName: GEOS-Chem Support Team
4242
AuthorURL: http://geos-chem.org/support-team
43+
- Title: Integrated Methane Inversion (IMI) Quickstart Guide
44+
URL: https://imi.readthedocs.io/en/latest/getting-started/quick-start.html
45+
AuthorName: Daniel Varon et al.
46+
AuthorURL: http://carboninversion.com/people
4347
Tools & Applications:
4448
Publications:
4549
- Title: "Enabling Immediate Access to Earth Science Models through Cloud Computing: Application to the GEOS-Chem Model"
@@ -50,4 +54,4 @@ DataAtWork:
5054
AuthorName: Jiawei Zhuang et al.
5155
- Title: "Improved advection, resolution, performance, and community access in the new generation (version 13) of the high-performance GEOS-Chem global atmospheric chemistry model (GCHP)"
5256
URL: https://doi.org/10.5194/gmd-15-8731-2022
53-
AuthorName: Randall V. Martin et al.
57+
AuthorName: Randall V. Martin et al.

datasets/geoschem-nested-input-data.yaml

+7 -3
@@ -1,6 +1,6 @@
11
Name: GEOS-Chem Nested Input Data
22
Description: Input data for nested-grid simulations using the GEOS-Chem Chemical Transport Model. This includes the NASA/GMAO MERRA-2 and GEOS-FP [meteorological products](https://geos-chem.readthedocs.io/en/latest/gcclassic-user-guide/input-overview.html#met), the [HEMCO emission inventories](https://geos-chem.readthedocs.io/en/latest/gcclassic-user-guide/input-overview.html#emis-inputs), and other small data such as [model initial conditions](https://geos-chem.readthedocs.io/en/latest/gcclassic-user-guide/restart-files.html).
3-
Documentation: https://geos-chem.readthedocs.io
3+
Documentation: https://geos-chem.readthedocs.io/en/stable/geos-chem-shared-docs/doc/gcid-special-portals.html#geos-chem-nested-input-data
44
Contact: http://geos-chem.org/support-team
55
UpdateFrequency: New meteorological and emission data will be added when available.
66
ManagedBy: "[GEOS-Chem Support Team](https://geoschem.github.io/support-team.html)"
@@ -28,6 +28,10 @@ Resources:
2828
- '[Browse Bucket](https://s3.amazonaws.com/gcgrid/index.html)'
2929
DataAtWork:
3030
Tutorials:
31+
- Title: GEOS-Chem nested input data
32+
URL: https://geos-chem.readthedocs.io/en/stable/geos-chem-shared-docs/doc/gcid-special-portals.html#geos-chem-nested-input-data
33+
AuthorName: GEOS-Chem Support Team
34+
AuthorURL: http://geos-chem.org/support-team
3135
- Title: GEOS-Chem input data on AWS cloud
3236
URL: https://geos-chem.readthedocs.io/en/latest/geos-chem-shared-docs/supplemental-guides/geos-chem-input-data-on-aws.html
3337
AuthorName: GEOS-Chem Support Team
@@ -42,9 +46,9 @@ DataAtWork:
4246
AuthorURL: http://geos-chem.org/support-team
4347
Tools & Applications:
4448
- Title: Integrated Methane Inversion
45-
URL: https://imi.seas.harvard.edu/
49+
URL: https://carboninversion.com
4650
AuthorName: Daniel Varon et al.
47-
AuthorURL: https://imi.seas.harvard.edu/people
51+
AuthorURL: https://carboninversion.com/people/
4852
Publications:
4953
- Title: "Enabling Immediate Access to Earth Science Models through Cloud Computing: Application to the GEOS-Chem Model"
5054
URL: https://doi.org/10.1175/BAMS-D-18-0243.1

datasets/noaa-gk2a-pds.yaml

+1 -1
@@ -24,7 +24,7 @@ Tags:
2424
- meteorological
2525
- disaster response
2626
- satellite imagery
27-
License: NOAA data disseminated through NODD is made available under the [Creative Commons 1.0 Universal Public Domain Dedication (CC0-1.0) license](https://creativecommons.org/publicdomain/zero/1.0/?ref=chooser-v1\), which is well-known and internationally recognized. There are no restrictions on the use of the data. The data are open to the public and can be used as desired. <br/><br/> NOAA has adopted the Creative Commons license to ensure maximum use of our data, to spur and encourage exploration and innovation throughout the industry. This license is applicable to each of the NOAA datasets made available by NODD. NOAA requests attribution for the use or dissemination of unaltered NOAA data. However, it is not permissible to state or imply endorsement by or affiliation with NOAA. If you modify NOAA data, you may not state or imply that it is original, unaltered NOAA data.
27+
License: Open Data ; Refer to [KMA Site](https://nmsc.kma.go.kr/enhome/html/base/cmm/selectPage.do?page=satellite.gk2a.fact) for more information.
2828
Resources:
2929
- Description: GK2A Imagery
3030
ARN: arn:aws:s3:::noaa-gk2a-pds

datasets/noaa-goes.yaml

+1 -1
@@ -5,7 +5,7 @@ Description: |
55
NEW GOES-19 Data!!! GOES-19 products at the Provisional maturity level have begun streaming. Data files will be available between Provisional and the Operational Declaration of the satellite, however, these data are subject to the caveat that they are "GOES-19 Preliminary, Non-Operational Data".
66
<br/>
77
<br/>
8-
NEW GOES 16 Reprocess Data!! The reprocessed GOES-16 ABI L1b data mitigates systematic data issues (including data gaps and image artifacts) seen in the Operational products, and improves the stability of both the radiometric and geometric calibration over the course of the entire mission life. These data were produced by recomputing the L1b radiance products from input raw L0 data using improved calibration algorithms and look-up tables, derived from data analysis of the NIST-traceable, on-board sources. In addition, the reprocessed data products contain enhancements to the L1b file format, including limb pixels and pixel timestamps, while maintaining compatibility with the operational products. The datasets currently available span the operational life of GOES-16 ABI, from early 2018 through the end of 2024. The Reprocessed L1b dataset shows improvement over the Operational L1b products but may still contain data gaps or discrepancies. Please provide feedback to Dan Lindsey ([email protected]) and Gary Lin ([email protected]).
8+
NEW GOES 16 Reprocess Data!! The reprocessed GOES-16 ABI L1b data mitigates systematic data issues (including data gaps and image artifacts) seen in the Operational products, and improves the stability of both the radiometric and geometric calibration over the course of the entire mission life. These data were produced by recomputing the L1b radiance products from input raw L0 data using improved calibration algorithms and look-up tables, derived from data analysis of the NIST-traceable, on-board sources. In addition, the reprocessed data products contain enhancements to the L1b file format, including limb pixels and pixel timestamps, while maintaining compatibility with the operational products. The datasets currently available span the operational life of GOES-16 ABI, from early 2018 through the end of 2024. The Reprocessed L1b dataset shows improvement over the Operational L1b products but may still contain data gaps or discrepancies. Please provide feedback to Dan Lindsey ([email protected]) and Gary Lin ([email protected]). More information can be found in the [GOES-R ABI Reprocess User Guide](https://github.com/NOAA-Big-Data-Program/nodd-data-docs/blob/main/GOES/GOES-R_ABI_Reprocessed_L1b_User_Guide-v1.1.pdf).
99
<br/>
1010
<br/>
1111
NOTICE: As of January 10th 2023, GOES-18 assumed the GOES-West position and all data files are deemed both operational and provisional, so no ‘preliminary, non-operational’ caveat is needed. GOES-17 is now offline, shifted approximately 105 degree West, where it will be in on-orbit storage. GOES-17 data will no longer flow into the GOES-17 bucket. Operational GOES-West products can be found in the GOES-18 bucket.
