This repository is a testbed for shoreline modelling algorithms. It contains all benchmark datasets, input files, codes, and results.
Shoreline change prediction has witnessed the evolution of both hybrid and data-driven models, offering valuable insights into how coastlines respond to varying wave conditions across different timescales. Building on the success of the ShoreShop 1.0 (Blind testing of shoreline evolution models) in 2018, which tested 19 shoreline models in predicting shoreline positions over a four-year period, ShoreShop 2.0 aims to further explore and showcase the evolution of predictive methods.
- Diverse Methodology Showcase: With the rapid advancements in data-driven models and the increasing availability of shoreline measurements, ShoreShop 2.0 intends to highlight the diversity of methods available for predicting shoreline changes.
- Blind Testing Element: Drawing inspiration from ShoreShop 1.0, ShoreShop 2.0 retains its core feature of blind testing to provide unbiased evaluations of the predictive models.
- Key Focus Areas:
- Assessing the performance of shoreline models using freely available datasets such as satellite-derived shorelines and modelled wave conditions.
- Evaluating the capabilities of shoreline models for short-term (5 years) and medium-term (50 years) predictions while discussing their potential for long-term predictions (80 years).
- Investigating the variability in shoreline model accuracy along the coastline of an embayed beach.
Join us at ShoreShop 2.0 to learn about the latest advancements, share insights, and foster collaborative efforts in shoreline change prediction.
The following notebooks are available in this repo:
- 1.data_explore.ipynb: explore the input transect, shoreline and wave data.
- 2.model_compare.ipynb: compare and evaluate all the submitted model predictions against the target shoreline positions.
A real-world Beach_X serves as the target site for this workshop. To ensure a blind test, all geographical information regarding this beach has been intentionally removed.
What we do know is that Beach_X is an east-facing embayed sandy beach with a mean grain size D50 ≈ 0.3 mm.
9 shore-normal transects are established from North to South with a 100-meter longshore distance to measure shoreline position. The coordinates for the landward and seaward ends of transects are provided in transects_coords.csv
.
These coordinates are in a local coordinate system, deliberately shifted (not distorted or rotated), and are expressed in easting (x) and northing (y) with a unit of meters. They do not reveal the actual geographical location of Beach_X.
Given 20 years shoreline position data in the 1999-2018 period, along with the shoreline position in 1951-05-01 as well as wave and sea level data spanning from 1940 to 2100, participants are tasked with:
- Task1.Short-term prediction: Predict the shoreline position between 2019-01-01 and 2023-12-31 with daily timestep.
- Task2.Medium-term prediction: Predict the shoreline position between 1951-05-01 and 1998-12-31 with daily timestep.
- Task3.Long-term prediction: Predict the shoreline position between 2019-01-01 and 2100-12-31 with daily timestep (no evaluation).
- Target transects: The target transects used for evaluation include Transects 2, 5 and 8 in the North end, the middle and the South end of the beach respectively.
-
Target shorelines: For each target transect, the model prediction will be evaluated against the observed shoreline data at target datetimes. The target datetimes for short-term and medium-term tasks are in
shorelines_target_short.csv
andshorelines_target_medium.csv
. For blind testing, the observed shoreline data in the target window will be withheld for all participants. The long-term prediction will not be evaluated, and the results are only for discussion. -
Evaluation metrics: Taylor diagram (consisting of centered root-mean-square error (CRMSE, bias excluded), Correlation and standard deviation (STD)) will be used to visualize and compare the model performance for each of the target transect.
For quantitative comparison, root-mean-square error (RMSE, bias included), Correlation and STD are integrated into a single metrics. RMSE and STD of prediction are normalized by dividing them by the STD of the observation (target). This ensures that all three statistical parameters in the diagram are normalized. The loss function$\mathcal{L_i}$ is then defined by intergrating the three parameters for each target transect. The average loss across all target transects$\mathcal{L_avg}$ serves as the final metric for ranking the model performance. Please note that due to the difference between CRMSE and RMSE, the$\mathcal{L_i}$ is indicative of, but not identical to, the distance between the model points and the target point (the black triangle) in the Taylor diagram.
- Participants should not attempt to locate and retrieve extra shoreline information beyond that provided for Beach_X.
- Participants may use any type of model, including but not limited to hybrid and data-driven models.
- Participants must complete at least one task, although attempting both is encouraged.
- Participants must provide a brief description of the methodology used.
- Code submission is optional.
- Each participant can have multiple submissions.
The following files are provided for shoreline predictions.
transects_coords.csv
: Coordinates for transects.slope.csv
: Beach-face slope for transects.shorelines_obs.csv
: Shoreline position between 1999 and 2018 for model calibration/training for each transect.shorelines_target_short.csv
: Target dates where short-term shoreline prediction will be evaluated.shorelines_target_medium.csv
: Target dates where medium-term shoreline prediction will be evaluated. The shoreline position for 1951-05-01 is also provided as context for prediction.Hindcast wave data (Hs.csv, Tp.csv, Dir.csv)
: Hindcast significant wave height, peak wave period and mean wave direction between 1940 and 2024 for each transect. Hindcast wave data can be used for short-term and medium-term shoreline modelling.Forecast wave data (Hs.csv, Tp.csv, Dir.csv)
: Forecast (2006-2100) significant wave height, peak wave period and mean wave direction under RCP4.5 and 8.5 scenarios for each transect. Forecast wave data can be used for long-term shoreline modelling.Tide data (tide.csv)
: Tidal height between 1950 and 2100 with daily interval.Sea level data (sealevel_obs.csv, sealevel_proj.csv)
: Annual mean of sea level between 1950 and 2100, including the gauge measurement (1950-2018) and the projections (2019-2100) under RCP4.5 and 8.5 scenarios.
The following constants are also provided.
Depth of wave data
: 10 (m)Mean grain size D50
: 0.3 (mm)
The shoreline data (shorelines_obs.csv
) utilized in the ShoreShop 2.0 workshop is derived from public satellite images captured by Landsat 5, 7, 8, and 9 satellites. This data has been processed using CoastSat, a public toolbox specifically designed for coastline analysis.
The satellite-derived shoreline (SDS) data for model calibration/training starts from 1999 and ends at 2018 with 347 time steps.
The shoreline position (m) is defined as the distance from the landward end of a transect to the intersection of the shoreline and the transect.
All the shorelines have been corrected to reflect the instantaneous position at Mean Sea Level.
The satellite-derived shoreline position data exhibits a high level of accuracy and closely aligns with the ground truth data for Beach_X with RMSE (shown in the figure below) less than 7 (m) for transects 1~8 and 15 (m) for transect 9 which is close to the headland, ensuring reliable model calibration and training.
For short-term shoreline prediction, The shorelines_target_short.csv
contains the target dates from 2019 to 2023. All shoreline position values are intentionally withheld for all participants. During the evaluation phase, participants' predictions will be compared against the withheld shoreline position data from the SDS dataset.
For medium-term shoreline prediction, shorelines_target_medium.csv
provides shoreline data for the date 1st May 1951, serving as the context data for the medium-term prediction task. Additional target dates are included for medium-term prediction. Participants' predictions will be evaluated against photogrammetry (pre-1989) and SDS (1989-1998) data available for the respective target dates.
The hindcast wave data used in this workshop was obtained by downscaling offshore directional wave spectra to nearshore areas. The offshore wave data is from the ERA5 Wave Hindcast. The BinWaves approach was applied to transform the offshore wave data to nearshore. The significant wave height (Hs), peak wave period (Tp) and mean wave direction (Dir) were extracted along each shore-normal transect at 10 (m) depth contour with a daily interval from 1940 to 2023. Wave roses for the three target transects are shown below to demonstrate the minimal longshore variation of wave conditions.
The forecast wave data were made using a nested WAVEWATCH III wave model with surface wind data from the Australian Community Climate and Earth System Simulator (ACCESS) as inputs. Wave projections have been performed for medium range carbon (RCP4.5) and high range (RCP8.5) carbon emission scenarios throughout 2006-2100. The offshore wave forecast was transformed to nearshore following the same approach applied to hindcast wave data. For each transect, to ensure the consistency between the hindcast and forecast data, the forecast wave climates were calibrated to have the same seasonality and joint distribution of Hs, Tp and Dir over the period of 2006~2023 with the multivariate bias correction algorithm (MBCn). Calibration results with hindcast data as reference for Transect1 under RCP4.5 scenario are shown below.
tide.csv
contains the daily average of tidal height from FES2014 model. Tidal components M2, S2, K1 and O1 were used for reconstruction. Tidal height is the same for all transects.
sealevel_obs.csv
contains the annual sea level data from gauge measurement (1950-2018). sealevel_proj.csv
contains the sea level projection under the RCP 4.5 and 8.5 scenarios (2019-2100).
The projection data is from NASA's SEA LEVEL PROJECTION TOOL, which is based on the regional sea level projections provided in the IPCC 6th Assessment Report (AR6). Sea level is the same for all transects.
Similar to the vast majority of beaches worldwide, Beach_X does not have nearshore bathymetry data available. Instead, slope.csv
contains the beach-face slope for each transect derived from SDS with the CoastSat.slope tool. Therefore, participants are challenged to adapt their models accordingly to address the model applicability under data poor scenarios with only freely accessible datasets.
Participants should provide short-term, medium-term and long-term model predictions for all transects as shorelines_prediction_short.csv
, shorelines_prediction_medium.csv
, and shorelines_prediction_long.csv
.
Both the short-term and medium-term model predictions should have daily interval. For models that do not generate daily outputs, please interpolate the results to daily.
All prediction submissions should be placed in the designated submission folder.
The submission folder will have shoreline predictions from various participants.
An illustrative example of submission is provided in the submissions/ShoreFor
folder for reference.
Participants should copy and check the boxes below in the model description to assist in classifying the submitted model. An example of a model description can be found here.
- Process-Based Models (PBM): couple hydrodynamics, waves, and morphodynamics through mass and momentum conservation laws.
- Hybrid Models (HM): use observational data to calibrate free parameters in the equilibrium configuration of a system.
- Data-Driven Models (DDM): use observational data to train regression models (e.g. machine learning and statistical models).
- Cross-shore: model the shoreline position for each transect independently.
- Long-shore: incorporate the interaction of shoreline position across different transects.
- Sea level: consider the impact of sea level rise on shoreline position.
To submit your results, please:
- fork this repository;
- Begin by forking the original workshop repository to your GitHub account.
- Create a Submission Folder:
- Inside the submission folder, create a subfolder named "ModelName_AuthorInitials". Replace ModelName with the name of your model and AuthorInitials with your initials.
- Place Your Prediction Files:
- Copy your completed prediction files (
shorelines_prediction_short.csv
,shorelines_prediction_medium.csv
and/orshorelines_prediction_long.csv
) into the subfolder created in Step 2.
- Copy your completed prediction files (
- Provide Your Model Description (an example is here):
- Include a README.md file providing a clear description of the model you used.
- Copy and complete the model classification checkboxes to the README.md.
- Include Your Code (Optional):
- Create another subfolder within the algorithm folder named ModelName_AuthorInitials.
- Copy your commented code into this subfolder.
- Include a README.md file providing clear instructions on how to run your code to reproduce the prediction outputs.
- Submit Your Results:
- Create a Pull request to the original workshop repository to submit your prediction results and optional code.
If you need any help with this submission, please post on the GitHub Issues page.
The deadline for the submission is the end of the year (01/10/2024).
Please put any questions on the GitHub Issues page so that everybody can read/comment.
We extend our gratitude to Dr. Killian Vos, Dr. Laura Cagigal and Dr. Raimundo Ibaceta for generously sharing the CoastSat, BinWaves and ShoreFor codes, as well as to Aditya Deshmukh for supplying the forecast wave data.