TY - JOUR
T1 - Improving semantic segmentation accuracy in thin cloud interference scenarios by mixing simulated cloud-covered samples
AU - Wang, Haoyu
AU - Li, Junli
AU - Shen, Zhanfeng
AU - Zhang, Zihan
AU - Bai, Linze
AU - Li, Ruifeng
AU - Zhou, Chenghu
AU - De Maeyer, Philippe
AU - Van de Voorde, Tim
N1 - Publisher Copyright:
© 2024 The Author(s)
PY - 2024/8/13
Y1 - 2024/8/13
N2 - Thin cloud interference presents a significant challenge for the semantic segmentation of optical satellite imagery, which directly degrades the model accuracy and causes difficulties in sample selection. This paper generated a dataset named Populus euphratica and Tamarix chinensis discrimination (PTD), containing both cloudless and thin cloud scenarios. Based on this PTD dataset, an enhanced Atmospheric Scattering Model with Nonlinear Optimization (ASM_NL) was proposed to simulate high-fidelity thin clouds by incorporating two vital nonlinear terms: the point spread function and the Perlin noise. Additionally, we adopt a strategy of mixing simulated thin cloud-covered images (STCI) into the training set at a certain proportion to improve the semantic segmentation accuracy in thin cloud-covered scenarios. The conclusions are as follows: 1) ASM_NL can simulate high-fidelity clouds at an average Jensen-Shannon distance of 0.0699. 2) When dealing with medium- and high-cloud density datasets, mixing STCI proved to be more effective than cloud removal in mitigating thin cloud interference, resulting in average macro F1 score improvements of 0.164 and 0.094, respectively. 3) The semantic segmentation accuracy improved significantly by mixing STCI with a minimal proportion of 1/60, demonstrating the activation of model transfer capabilities. This study provides a concise and efficient methodology for effectively mitigating thin cloud interference in deep learning-based optical satellite imagery analysis.
AB - Thin cloud interference presents a significant challenge for the semantic segmentation of optical satellite imagery, which directly degrades the model accuracy and causes difficulties in sample selection. This paper generated a dataset named Populus euphratica and Tamarix chinensis discrimination (PTD), containing both cloudless and thin cloud scenarios. Based on this PTD dataset, an enhanced Atmospheric Scattering Model with Nonlinear Optimization (ASM_NL) was proposed to simulate high-fidelity thin clouds by incorporating two vital nonlinear terms: the point spread function and the Perlin noise. Additionally, we adopt a strategy of mixing simulated thin cloud-covered images (STCI) into the training set at a certain proportion to improve the semantic segmentation accuracy in thin cloud-covered scenarios. The conclusions are as follows: 1) ASM_NL can simulate high-fidelity clouds at an average Jensen-Shannon distance of 0.0699. 2) When dealing with medium- and high-cloud density datasets, mixing STCI proved to be more effective than cloud removal in mitigating thin cloud interference, resulting in average macro F1 score improvements of 0.164 and 0.094, respectively. 3) The semantic segmentation accuracy improved significantly by mixing STCI with a minimal proportion of 1/60, demonstrating the activation of model transfer capabilities. This study provides a concise and efficient methodology for effectively mitigating thin cloud interference in deep learning-based optical satellite imagery analysis.
KW - Atmospheric scattering model
KW - Cloud simulation
KW - Data augmentation
KW - Semantic segmentation
KW - Thin cloud interference
UR - http://www.scopus.com/inward/record.url?scp=85201118022&partnerID=8YFLogxK
U2 - 10.1016/j.jag.2024.104087
DO - 10.1016/j.jag.2024.104087
M3 - Article
AN - SCOPUS:85201118022
SN - 1569-8432
VL - 133
JO - International Journal of Applied Earth Observation and Geoinformation
JF - International Journal of Applied Earth Observation and Geoinformation
M1 - 104087
ER -