The Weddell Sea Benthic Dataset: A computer vision-ready object detection dataset for in situ benthic biodiversity monitoring model development
We present the Weddell Sea Benthic Dataset (WSBD), a computer vision-ready collection of high-resolution seafloor imagery and corresponding annotations designed to support automated analysis of Antarctic benthic communities. The dataset comprises 100 top-down images captured during RV Polarstern Expedition PS118 (cruises 69-1 and 69-6) in 2019, using the Ocean Floor Observation and Bathymetry System (OFOBS) in the Weddell Sea, Antarctica. A subset of this imagery was manually annotated by ecologists at the British Antarctic Survey (BAS) to support ecological analyses, including benthic community composition and species interaction studies. These annotations were subsequently standardised into 25 morphotypes to serve as class labels for object detection tasks. Bounding box annotations are provided in COCO format, alongside the training, validation, and test splits used during model development at BAS. This dataset provides a benchmark for developing and evaluating machine learning models aimed at enhancing biodiversity monitoring in Antarctic benthic environments.
This work was funded by the UKRI Future Leaders Fellowship MR/W01002X/1 ''The past, present and future of unique cold-water benthic (sea floor) ecosystems in the Southern Ocean'' awarded to Rowan Whittle.
Simple
- Alternate title
- Polar Data Centre (PDC) record GB/NERC/BAS/PDC/02069
- Date (Publication)
- 2025-06-09
- Identifier
- http://www.antarctica.ac.uk/dms/metadata.php?id= / GB/NERC/BAS/PDC/02069
- Maintenance and update frequency
- unknown Unknown
- Keywords
-
- NDGO0001
- NERC OAI Harvesting
-
- NERC_DDC
- GCMD Parameter Valids
-
- EARTH SCIENCE > Biosphere > Ecological Dynamics > Biodiversity
- EARTH SCIENCE > Oceans > Marine Environment Monitoring
- BAS Free-text keywords
-
- Benthos
- biodiversity monitoring
- computer vision
- deep learning
- marine ecology
- Use limitation
- Data are supplied under Open Government Licence v3.0 http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/.
- Access constraints
- otherRestrictions Other restrictions
- Other constraints
- Data are supplied under Open Government Licence v3.0 http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/.
- Metadata language
- EnglishEnglish
- Topic category
-
- Biota
- Environment
- Oceans
- Begin date
- 2019-02-01
- End date
- 2019-04-30
- Reference system identifier
- OGP / urn:ogc:def:crs:EPSG::4326
- Distribution format
-
- Protocol
- http
- Name
- GET DATA
- Function
- download Download
- Hierarchy level
- dataset Dataset
Domain consistency
- Measure identification
- INSPIRE / Conformity_001
Conformance result
- Date
- Explanation
- See the referenced specification
- Pass
- No
- Statement
- All bounding boxes were manually checked after conversion from SVG format. Class labels have been reviewed by ecologists at BAS for accuracy. Given the high densities of organisms in the dataset, the prevalence of small-bodied taxa, and the well documented issues of fatigue and subjectivity in manual annotation processes for benthic imagery, it is likely some valid organisms were omitted from the ground truth.
- File identifier
- GB_NERC_BAS_PDC_02069 XML
- Metadata language
- EnglishEnglish
- Hierarchy level
- dataset Dataset
- Date stamp
- 2025-06-09
- Metadata standard name
- NERC profile of ISO19115:2003
- Metadata standard version
- 1.0
Overviews
Spatial extent
Provided by
NERC Data Catalogue Service