Clustering SuperDARN Backscatter Echoes

Example output of clustering algorithm

This study aims to classify backscatter measured by the Super Dual Auroral Radar Network (SuperDARN) using unsupervised machine learning methods. The SuperDARN radars are used to study the ionosphere, and data from multiple radars can be combined to create ionospheric convection maps. SuperDARN backscatter can come from the ionosphere, the ground, or other sources. It is important to separate these categories cleanly so that data used for research (usually ionospheric scatter) is not contaminated by the wrong types of scatter. However, it is challenging to separate these categories because their characteristics in doppler velocity, spectral width, and other features can overlap and ground truth is rarely available. The traditional method of separating ionospheric scatter (IS) from ground scatter (GS) classifies data point by point, and misclassifies a large amount of low-velocity IS as GS to obtain clean IS data. A recent method that clusters data using a depth-first search improved on this using criteria developed specifically for conditions with low-velocity IS. However, this method uses boxcar filtering, which requires a trade-off between having the ability to find small-scale structures and contaminating the data with noise. In this study we apply unsupervised machine learning methods to cluster backscatter data, comparing Gaussian Mixture Model (GMM) and several variations on Density-Based Spatial Clustering of Applications with Noise (DBSCAN). We then classify the clusters as IS or GS using criteria from the literature. These clustering algorithms can create coherent structures in space and time separate from noise, preserve the original data, and identify more low-velocity IS, offering improvements over existing methods.

Shibaji Chakraborty
Shibaji Chakraborty
Postdoc at ‘Space@VT’ SuperDARN Lab

My research interests include solar flares, sudden ionospheric disturbances, geomagnetic storms, coronal holes.

Related