Article

Machine Learning Methods Applied to the Prediction of Pseudo-nitzschia spp. Blooms in the Galician Rias Baixas (NW Spain)

Citation

Bellas Aláez FM, Torres Palenzuela JM, Spyrakos E & Gonzalez Vilas L (2021) Machine Learning Methods Applied to the Prediction of Pseudo-nitzschia spp. Blooms in the Galician Rias Baixas (NW Spain). ISPRS International Journal of Geo-Information, 10 (4), Art. No.: 199. https://doi.org/10.3390/ijgi10040199

Abstract
This work presents new prediction models based on recent developments in machine learning methods, such as Random Forest (RF) and AdaBoost, and compares them with more classical approaches, i.e., support vector machines (SVMs) and neural networks (NNs). The models predict Pseudo-nitzschia spp. blooms in the Galician Rias Baixas. This work builds on a previous study by the authors (doi.org/10.1016/j.pocean.2014.03.003) but uses an extended database (from 2002 to 2012) and new algorithms. Our results show that RF and AdaBoost provide better prediction results compared to SVMs and NNs, as they show improved performance metrics and a better balance between sensitivity and specificity. Classical machine learning approaches show higher sensitivities, but at a cost of lower specificity and higher percentages of false alarms (lower precision). These results seem to indicate a greater adaptation of new algorithms (RF and AdaBoost) to unbalanced datasets. Our models could be operationally implemented to establish a short-term prediction system.

Keywords
harmful algal blooms (HABs); Pseudo-nitzschia spp.; Galician Rias Baixas; coastal embayment; support vector machines (SVMs); neural networks (NNs); Random Forest (RF); AdaBoost

Journal
ISPRS International Journal of Geo-Information: Volume 10, Issue 4

StatusPublished
FundersHorizon 2020 (Outputs)
Publication date30/04/2021
Publication date online31/03/2021
Date accepted by journal23/03/2021
URLhttp://hdl.handle.net/1893/32522
eISSN2220-9964