Abstract:
We present the Weddell Sea Benthic Dataset (WSBD), a computer vision-ready collection of high-resolution seafloor imagery and corresponding annotations designed to support automated analysis of Antarctic benthic communities. The dataset comprises 100 top-down images captured during RV Polarstern Expedition PS118 (cruises 69-1 and 69-6) in 2019, using the Ocean Floor Observation and Bathymetry System (OFOBS) in the Weddell Sea, Antarctica. A subset of this imagery was manually annotated by ecologists at the British Antarctic Survey (BAS) to support ecological analyses, including benthic community composition and species interaction studies. These annotations were subsequently standardised into 25 morphotypes to serve as class labels for object detection tasks. Bounding box annotations are provided in COCO format, alongside the training, validation, and test splits used during model development at BAS. This dataset provides a benchmark for developing and evaluating machine learning models aimed at enhancing biodiversity monitoring in Antarctic benthic environments.
This work was funded by the UKRI Future Leaders Fellowship MR/W01002X/1 'The past, present and future of unique cold-water benthic (sea floor) ecosystems in the Southern Ocean' awarded to Rowan Whittle.
Keywords:
Benthos, biodiversity monitoring, computer vision, deep learning, marine ecology
Trotter, C., Griffiths, H.J., Khan, T.M., Purser, A., & Whittle, R.J. (2025). The Weddell Sea Benthic Dataset: A computer vision-ready object detection dataset for in situ benthic biodiversity monitoring model development (Version 1.0) [Data set]. NERC EDS UK Polar Data Centre. https://doi.org/10.5285/1ba97e4b-efb7-460b-9f2d-90437e33ce09
Access Constraints: | Data are under embargo until publication of the related manuscript. |
---|---|
Use Constraints: | Data are supplied under Open Government Licence v3.0 http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/. |
Creation Date: | 2025-06-06 |
---|---|
Dataset Progress: | Complete |
Dataset Language: | English |
ISO Topic Categories: |
|
Parameters: |
|
Personnel: | |
Name | PDC BAS |
Role(s) | Metadata Author |
Organisation | British Antarctic Survey |
Name | Cameron Trotter |
Role(s) | Technical Contact, Investigator |
Organisation | British Antarctic Survey |
Name | Huw J Griffiths |
Role(s) | Investigator |
Organisation | British Antarctic Survey |
Name | Tasnuva M Khan |
Role(s) | Investigator |
Organisation | British Antarctic Survey |
Name | Autun Purser |
Role(s) | Investigator |
Organisation | Alfred Wegener Institute for Polar and Marine Research |
Name | Rowan J Whittle |
Role(s) | Investigator |
Organisation | British Antarctic Survey |
Parent Dataset: | N/A |
Reference: | [1] Purser, A., Dreutter, S., Griffiths, H., Hehemann, L., Jerosch, K., Nordhausen, A., Piepenburg, D., Richter, C., Schroeder, H., Dorschel, B., 2021. Seabed video and still images from the northern Weddell Sea and the western flanks of the Powell Basin. Earth Syst. Sci. Data 13, 609-615. https://doi.org/10.5194/essd-13-609-2021 [2] Purser, A., Hehemann, L., Dreutter, S., Dorschel, B., Nordhausen, A., 2020. OFOBS seafloor images from the antarctic peninsula and powell basin, collected during RV POLARSTERN cruise PS118. https://doi.org/10.1594/PANGAEA.911904 [3] Khan, T.M., Griffiths, H.J., Whittle, R.J., Stephenson, N.P., Delahooke, K.M., Purser, A., Manica, A., Mitchell, E.G., 2024. Network analyses on photographic surveys reveal that invertebrate predators do not structure epibenthos in the deep (~2000m) rocky Powell Basin, Weddell Sea, Antarctica. Front. Mar. Sci. 11, 1408828. https://doi.org/10.3389/fmars.2024.1408828 [4] Khan, T.M., Griffiths, H.J., Whittle, R.J., Stephenson, N., Delahooke, K., Purser, A., Manica, A., & Mitchell, E.G. (2025). Organisms identified from OFOBS images from PS118 Profiles 6_9 (Weddell Sea) and 69 (Powell Basin), April - May 2019 (Version 1.0) [Data set]. NERC EDS UK Polar Data Centre. https://doi.org/10.5285/7fb2f0c1-413c-4cd6-84ab-a504bf431290 |
|
---|---|---|
Quality: | All bounding boxes were manually checked after conversion from SVG format. Class labels have been reviewed by ecologists at BAS for accuracy. Given the high densities of organisms in the dataset, the prevalence of small-bodied taxa, and the well documented issues of fatigue and subjectivity in manual annotation processes for benthic imagery, it is likely some valid organisms were omitted from the ground truth. | |
Lineage: | Data was collected as part of expedition PS118, cruises 69-1 and 6-9, of the RV Polarstern in 2019 [1] using the OFOBS [2] and manually labelled for use in benthic community analysis [3]. These labels were then condensed into 25 morphotypes, with annotations converted to COCO-formatted bounding boxes, for use in object detection model development. Data was split into training, validation, and test sets based on substrate, depth, seafloor inclination. Imagery in this dataset is a subset of imagery collected during the expedition PS118 [1], available on PANGEA [2]. Dataset annotations are a subset of those present in [4] for use in benthic community analysis [3]. This dataset was used for the development of an object detection model capable of automated benthic organism detection [in-prep]. For model weights, see Related URLs. Some original source images [4] were not comprehensively annotated, e.g. due to distortion. For use in object detection model training, the unlabelled regions were cropped, resulting in images of varying dimensions (average size = 3,364×4,545px). |
Temporal Coverage: | |
---|---|
Start Date | 2024-09-01 |
End Date | 2025-05-31 |
Start Date | 2019-02-01 |
End Date | 2019-04-30 |
Spatial Coverage: | |
Latitude | |
Southernmost | -64.93935 |
Northernmost | -61.19232 |
Longitude | |
Westernmost | -57.81626 |
Easternmost | -50.99663 |
Altitude | |
Min Altitude | N/A |
Max Altitude | N/A |
Depth | |
Min Depth | 421 m |
Max Depth | 2202 m |
Location: | |
Location | Southern Ocean |
Detailed Location | Weddell Sea, Powell Basin |
Data Collection: | Data was collected using the OFOBS, a top-down towed camera system. Original labelling for benthic community analysis was performed in Inkscape v1.1. Resulting SVG files were then converted using a custom script file to JPGs with corresponding COCO bounding box JSON using Python v3.12.8. Converted bounding boxes were then manually edited (resized etc) using LabelMe v5.8.1, then converted back to COCO format. |
---|
Data Storage: | Images: 100 files (jpg format), 115.5MB Annotations: 4 files (json format), 22.8MB |
---|