Training Details

Model Training Code

Model training code for most of the models is available in our GitHub repository . Please note that this is an active project and documentation is provided on a best-effort basis.

Dataset Preparation

The datasets used to train the kelp segmentation model were a number of scenes collected using DJI Phantom remotely-piloted aircraft systems (RPAS). A total of 28 image mosaic scenes were used. The resolution of each image varied between 0.023m and 0.428m, with an average of 0.069m and standard deviation of 0.087m. These images were collected over a period from 2018 to 2021, all during summer.

For model training, each dataset was divided into 512 pixels square cropped sections, with 50% overlap between adjacent tiles. To balance the dataset, tiles containing no kelp where discarded. These sets of tiles where then divided into training, validation, and test splits.

Pre-processing overview

flowchart LR
    subgraph Labelling["Labelling"]
        A[/"UAV Orthomosaic (RGB)"/]
        B["Manual kelp classification"]
        C[/"Presence/Absence Dataset\n(not kelp = 0, kelp = 1)"/]
        D[/"Species Dataset\n(not kelp = 0, Macrocystis = 2,\n Nereocystis = 3)"/]
    end
    subgraph Filtering["Prepare image tiles"]
        E{{"Tile image and label\n(512x512 pixel tiles, 50% overlap)"}}
        F{{"Discard image and labels tiles with no kelp\nOR where >50% is no data"}}
    end
    subgraph PADataset["P/A Dataset"]
        G[("Training ~80%")]
        H[("Validation ~10%")]
        I[("Testing ~10%")]
    end
    subgraph SpeciesDataset["Species Dataset"]
        G1[("Training ~80%")]
        H1[("Validation ~10%")]
        I1[("Testing ~10%")]
    end
    subgraph DataPrep["Data Preparation"]
        Labelling
        Filtering
        PADataset
        SpeciesDataset
    end
    A --> B
    B --> C & D
    E --> F
    C --> Filtering
    D --> Filtering
    Filtering --> PADataset & SpeciesDataset

Dataset summaries

Kelp (presence/absence)

Split	Scenes	Tiles	Pixels_kelp	Pixels_total	Area (km²)	Res_max (m)	Res_min (m)	Res_\(\mu\) (m)	Res_\(\sigma\) (m)
Train	20	829404	216770596954	3738528864	11680.54	0.1040	0.0230	0.0487	0.0253
Validation	4	31610	8220459464	295310084	253.79	0.0420	0.0230	0.0296	0.0085
Test	6	92608	24093604354	639694012	819.45	0.0680	0.0230	0.0385	0.0161
Sum	30	953622	249084660772	4673532960	12,753.78

Kelp (species)

Split	Scenes	Tiles	Pixels_macro	Pixels_nereo	Pixels_total	Area (km²)	Res_max (m)	Res_min (m)	Res_\(\mu\) (m)	Res_\(\sigma\) (m)
Train	17	336740	605462674	1158650042	88008624978	4034.11	0.1040	0.0230	0.0488	0.0266
Validation	4	15805	127410722	20244320	4110229732	123.91	0.0420	0.0230	0.0296	0.0085
Test	6	46304	143277498	176569508	12046802177	409.72	0.0680	0.0230	0.0385	0.0161
Sum	27	398849	876150894	1355463870	104165656887	4567.75

Mussels (presence/absence)

Split	Scenes	Tiles	Pixels_mussels	Pixels_total	Area (km²)	Res_max (m)	Res_min (m)	Res_\(\mu\) (m)	Res_\(\sigma\) (m)
Train	39	4834	933287123	5068816384	23.8147	0.027	0.00231	0.0082244	0.00492054
Validation	8	1277	223598444	1339031552	5.74255	0.00518591	0.00330667	0.00421985	0.000545617
Test	8	1110	175226412	1163919360	4.64052	0.00578278	0.003671	0.00435929	0.000665025
Sum	55	7221	1332111979	7571767296	34.197799	0.037969	0.009288	0.016804	0.006131

Model Training

Source code for model training is available on GitHub at hakai-ml-train.

Training overview

flowchart LR
    subgraph Augmentation["Data Augmentation"]
        J["Randomly rotation in [0, 45] deg"]
        K["Random horizontal and/or vertical flip"]
        L["Randomly jitter brightness, contrast, and saturation"]
    end
    subgraph Validation["Validation"]
        V["Calculate average model error on all batches"]
        H2[("Validation")]
    end
    subgraph Testing["Testing"]
        Y["Calculate model performance on all batches"]
        I3[("Testing")]
        Z("Save model performance metrics")
    end
    subgraph Training["Model Training Loop"]
        P(["Start"])
        B2["Get next batch of training tiles"]
        G2[("Training")]
        Augmentation
        Q["Forward pass through model"]
        R["Calculate error of model relative to hand-labelled images"]
        S["Update model parameters with backpropagation"]
        B3{"More training tiles to process?"}
        Validation
        W{"Validation error is stabilized?\n(model converged)"}
        reset["Next training epoch"]
        X["Finished training, save model"]
        Testing
        End(["End"])
    end
    P --> B2
    G2 -.-> B2
    J --> K
    K --> L
    B2 --> Augmentation
    L --> Q
    Q --> R
    R --> S
    S --> B3
    B3 -- Yes --> B2
    H2 -.-> V
    B3 -- No --> Validation
    Validation --> W
    W -- No --> reset
    reset --> B2
    W -- Yes --> X
    I3 -.-> Y
    Y --> Z
    X --> Testing
    Testing --> End