<p dir="ltr">These datasets represent predictions and associated probabilities using four machine learning methods, associated with collection: Mapping Wetlands with High Resolution Planet SuperDove Satellite Imagery: An Assessment of Machine Learning Models Across the Diverse Waterscapes of New Zealand (<a href="https://doi.org/10.26021/canterburynz.c.7848596" target="_blank">10.26021/canterburynz.c.7848596</a>).</p><p dir="ltr">The following datasets are available:</p><ol><li>HGB prediction </li><li>HGB probability [this dataset]</li><li>MLPC prediction</li><li>MLPC probability</li><li>Random forest prediction</li><li>Random forest probability</li><li>XGBoost prediction</li><li>XGBoost probability</li></ol><p dir="ltr">For details of the models developed, please see the collection and associated paper. The following files are available in each dataset, each representing an area within New Zealand:</p><ul><li><b>xxxxx_mmm_prediction.tif</b>: model prediction, encoded as 8-bit integers where 1 is predicted as wetland (>50% probability), and NA (no data) is non-wetland.</li><li><b>xxxxx_mmm_probability.tif</b>: model wetland probability, encoded as 16-bit integers, with probability values from 0 to 1 rescaled from 0 to 10,000. Divide the values by 10,000 to obtain probabilities to four decimal places.</li></ul><p dir="ltr">In the tile filenames, <b>xxxxx</b> refers to the UUID of the grid area, which can be found in the file nzgrid_uuid.gpkg, and <b>mmm</b> is a code which refers to the model used:</p><ul><li><b>hgb</b>: histogram gradient boost</li><li><b>mlpc</b>: multi-layer perceptron classification</li><li><b>rf</b>: random forest</li><li><b>xgb</b>: extreme gradient boosting</li></ul><p dir="ltr">In addition to the tif images, two virtual raster tile files are included to enable mapping at the national scale:</p><ul><li><b>_mmm_prediction.vrt</b></li><li><b>_mmm_probability.vrt</b></li></ul><p dir="ltr">All tif images are saved using cloud optimised geotiff (COG), which makes them fast to display even at a national level, although increases the data size. Total size is around 700 MB for the prediction datasets, and ~75 GB for the probability datasets.</p><p dir="ltr">Metadata for the Planet SuperDove imagery used for each pixel of the predictions is available here: <a href="https://doi.org/10.26021/canterburynz.29231837.v1" target="_blank">https://doi.org/10.26021/canterburynz.29231837.v</a></p>