Modelled spatial predictions of the distribution and density of Antarctic krill in the South Scotia Sea between 2011-2020
This dataset contains gridded spatial predictions of the distribution and density of Antarctic krill (Euphausia superba) in the South Scotia Sea, specifically within Subarea 48.2 of the Convention for the Conservation of Antarctic Marine Living Resources (CCAMLR). Both year-specific and decadal mean predictions are provided across years 2011-2020. All predictions were generated from a two-part hurdle model which used input data from (i) a spatially and temporally consistent acoustic krill survey around the South Orkney Islands and (ii) year-specific environmental covariates. The first hurdle model component was a binomial Generalized Additive Model (GAM) fitted to binary presence-absence krill data which predicts the probability of krill presence. The second component was a Gaussian GAM fitted to non-zero krill data which predicts krill density. Finally, these components were combined to identify where krill were both likely to be present and occur at high densities. Full model details are given in the associated publication. This dataset provides the spatial predictions generated from the binomial GAM, Gaussian GAM, and their combined product.
Funding:
PNT, SF and JJF were supported by the British Antarctic Survey's National Capability Antarctic Logistics and Infrastructure programme CONSEC, supported by the Natural Environment Research Council, a part of UK Research and Innovation.; VW-E and JJF were supported by the Pew Charitable Trusts under grant PA00034295. The South Orkney Islands acoustic trawl survey is part of the ongoing Norwegian Institute of Marine Research (IMR) project KRILL (p.no. 14246), which is supported by the Norwegian Research Council (NFR grant 222798), the Norwegian Ministry of Foreign Affairs, and IMR.
Simple
- Date (Creation)
- 2025-01-10
- Date (Revision)
- 2025-01-10
- Date (Publication)
- 2025-01-10
- Date (released)
- 2025-01-10
- Edition
- 1.0
- Unique resource identifier
- https://doi.org/10.5285/4fd0a1bf-da1a-4021-82eb-2fc513910e32
- Codespace
- doi
- Unique resource identifier
- GB/NERC/BAS/PDC/01974
- Codespace
- https://data.bas.ac.uk/
- Other citation details
- Please cite this item as: Freer, J.J., Warwick-Evans, V., Skaret, G., Krafft, B.A., Fielding, S., & Trathan, P.N. (2025). Modelled spatial predictions of the distribution and density of Antarctic krill in the South Scotia Sea between 2011-2020 (Version 1.0) [Data set]. NERC EDS UK Polar Data Centre. https://doi.org/10.5285/4fd0a1bf-da1a-4021-82eb-2fc513910e32
- Credit
- No credit.
- Status
- completed Completed
https://www.bas.ac.uk/team/business-teams/information-services/uk-polar-data-centre/
- Maintenance and update frequency
- asNeeded As needed
- Maintenance note
- completed Completed
- Theme
-
- Antarctic krill
- South Orkney Islands
- South Scotia Sea
- hurdle model
- interannual variability
- Place
-
- South Scotia Sea, South Orkney Plateau, CCAMLR subarea 48.2 Antarctica
- GEMET - INSPIRE themes, version 1.0
- Access constraints
- otherRestrictions Other restrictions
- Other constraints
- no limitations to public access
- Access constraints
- otherRestrictions Other restrictions
- Other constraints
- no limitations
- Use constraints
- license License
- Other constraints
- Open Government Licence v3.0
- Use constraints
- otherRestrictions Other restrictions
- Other constraints
- Data supplied under Open Government Licence v3.0
- Use constraints
- otherRestrictions Other restrictions
- Other constraints
- No restrictions apply.
- Unique resource identifier
- url
- Codespace
- url
- Association Type
- crossReference Cross reference
- Unique resource identifier
- url
- Codespace
- url
- Association Type
- dependency dependency
- Unique resource identifier
- url
- Codespace
- url
- Association Type
- dependency dependency
- Spatial representation type
- textTable Text, table
- Metadata language
- engEnglish
- Character set
- utf8 UTF8
- Topic category
-
- Oceans
- Begin date
- 2011-01-01
- End date
- 2011-02-28
- Supplemental Information
- It is recommended that careful attention be paid to the contents of any data, and that the author be contacted with any questions regarding appropriate use. If you find any errors or omissions, please report them to polardatacentre@bas.ac.uk.
Distributor
https://www.bas.ac.uk/team/business-teams/information-services/uk-polar-data-centre/
- Name
- image/tiff
- Units of distribution
- bytes
- Transfer size
- 5452595
- OnLine resource
-
Get Data
(
WWW:LINK-1.0-http--link
)
Download data
- Hierarchy level
- dataset Dataset
- Statement
-
Methodology:
Hurdle model approach:
To model the distribution of Antarctic krill (hereafter krill) we applied a two-part hurdle model or zero-altered model (Zuur et al. 2009), which allows for heavily skewed data distributions. The first model predicts the probability of presence of krill. To do this, the krill density data were transformed into a binary zero/non-zero form (n = 2709) and modelled against the environmental covariates using a binomial generalised additive model (GAM; Wood (2017)) with a logit link function. The second model investigates the relationship between non-zero data (i.e. presence-only data, n = 1833) and environmental covariates. This was carried out using a GAM with a Gaussian distribution and the default identity link function. Based on exploratory density plots, the presence-only data were log transformed to follow a normal distribution. Finally, outputs from both models were multiplied together. This allowed us to identify where krill were both likely to be present and occur at high densities.
Model fitting and selection:
All GAMs were fitted using the R package 'mgcv' (Wood 2019), with a Restricted Maximum Likelihood (REML) optimisation method to estimate splines, and penalised thin plate regression splines on all smooth terms (Eilers and Marx 1996). To reduce model overfitting the basis dimension (k) was limited to between 3 and 6 with the optimum number guided by edf values and associated p values reported in mgcv's gam.check, and by visualising the partial effects plots for each covariate with the raw data.
For both the binomial and Gaussian GAMs, model selection followed a forward stepwise selection approach with five-fold cross validation. Specifically, each environmental covariate was modelled against the response variable independently and repeated five times, each time withholding a different random subset of data (fold) for evaluation. The model coefficients from each run were used to predict the outcome for the withheld fold and performance metrics of the prediction - Root Mean Squared Error (RMSE) and R2 - were extracted. The best performing covariate (i.e. lowest RMSE and highest R2 averaged over five folds) was retained within the model. This selection process was repeated allowing for all possible combinations of environmental covariates at their different temporal scales (sample and decadal). At each iteration, the retained set of covariates were assessed for collinearity using Pearson correlation coefficients and Variance Inflation Factors (VIF), and for concurvity using the worst-case measure of overall concurvity for each smooth. If issues were identified (Pearson's r > 0.7, VIF >3, concurvity >0.8) the next highest-ranking covariate was selected. Forward selection continued until model performance metrics plateaued and/or issues of collinearity and concurvity could not be overcome. Predictions from the final Gaussian GAM were back transformed to obtain outputs on the original density scale. Once the final set of covariates was selected, predictions for the probability of occurrence and estimated krill density were projected onto year-specific grids at the scale of Subarea 48.2. The mean and +/-1 standard deviation of predictions and their product (interpreted as the krill density weighted by the probability of occurrence) across all years were generated.
Data collection:
All analyses were carried out in R version 4.3.3.
Data quality:
The quality of model outputs are, in part, dependent on the data used for model input, i.e. the krill density data and environmental covariates.
Krill density data:
As part of the ongoing Norwegian scientific contribution to monitoring distribution, abundance and population characteristics of Antarctic krill, an acoustic survey takes place annually in waters surrounding the South Orkney Islands (Longitudinal stratum boundaries at 43.5 degrees W and 48 degrees W, and latitudinal boundaries at 59.67 degrees S and 62 degrees S). In the present study we use data from the first ten-year time period of this survey (2011 to 2020). All survey transects occurred between January and February each year, but in two of the years - 2013 and 2015 - sea ice prevented full completion of the survey (Krafft et al. 2018). The surveys were conducted by using the Norwegian commercial fishing vessels 'Saga Sea' and 'Juvel' (Aker Biomarine ASA, Oslo, Norway and Rimfrost AS, Fosnavag, Norway) as research platforms except in 2019 when RV 'Kronprins Haakon' was used.
The acoustic data used in this study were collected using calibrated hull mounted Simrad echo sounders. We used the swarm-based approach for acoustic target identification of krill which is recommended by CCAMLR when the 38, 120 and 200 kHz frequency combination is not available. Details on the processing procedures used to determine krill density are reported and evaluated by Skaret et al. (2023). The retained nautical area scattering coefficient (NASC) allocated to krill per nautical mile was converted to biomass density (g m-2, hereafter referred to as density) using full Stochastic Distorted Wave Born Approximation (SDWBA) model runs to estimate backscattering cross-sectional areas (delta) for each krill length group of 1 mm increment present in the sample.
Each acoustic sample value was matched to the covariate raster data according to the latitude, longitude and year of collection. Finally, the combined krill density-covariate dataset was aggregated to the same spatial resolution as the environmental covariates (0.04×0.04 decimal degrees) by calculating the mean values within each grid cell for each year. This was done to avoid pseudo-replication given multiple acoustic samples within the same grid cell of environmental data, and to reduce any effect of spatial autocorrelation in model residuals.
Environmental covariate data:
Twelve environmental covariates were identified as candidate explanatory variables for the model. These included three static variables, i.e. unchanging with survey year: water depth (bathymetry), bathymetric slope and distance from shelf break defined as the 500m isobath (where values on-shelf were positive, and those off-shelf were negative). The nine remaining variables were dynamic across survey years: distance from sea ice edge (defined as 15 percentage ice concentration), seven sea surface variables (temperature, mixed layer thickness, sea surface height above geoid, salinity, chlorophyll a, primary productivity, and geostrophic current velocity) and bottom temperature. Raster grids of all covariates were obtained from a combination of empirical observations and model re-analyses (see Section 9. Related datasets). For each of the dynamic covariates, two different temporal scales were extracted. These were: i) a sample scale which averaged conditions during the sampling months (January to February) independently for each year; and ii) a decadal scale climatology which was the average of summer conditions (January to March) between 2011 and 2020.
- File identifier
- 4fd0a1bf-da1a-4021-82eb-2fc513910e32 XML
- Metadata language
- engEnglish
- Character set
- utf8 UTF8
- Hierarchy level
- dataset Dataset
- Hierarchy level name
- dataset
- Date stamp
- 2025-01-10
- Metadata standard name
- ISO 19115 Geographic Information - Metadata
- Metadata standard version
- ISO 19115:2003(E)
https://www.bas.ac.uk/team/business-teams/information-services/uk-polar-data-centre/
Overviews
Spatial extent
Provided by
NERC Data Catalogue Service