Submitted:
04 March 2024
Posted:
05 March 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
- This study aims to assess the effectiveness of the MINIROCKET ([6]) time series classifier, based on the MINImally RandOm Convolutional KErnel Transform, for real-time prediction of solar flares with minimal data manipulation. MINIROCKET, an efficient variant of the ROCKET classifier ([7]), achieves high precision at reduced computing costs by employing random convolutional kernels to transform input time series. The transformed features are then used to train a linear classifier. MINIROCKET, being a (nearly) deterministic reformulation of ROCKET, exhibits significantly faster performance on larger datasets while maintaining comparable accuracy.
- In this study, we compare MINIROCKET’s performance with Canonical Interval Forest (CIF) ([8]), Multiple Representations SEQuence Learner (Mr-SEQL), support vector machine (SVM) and Long Short-Term Memory (LSTM) models.
2. Related Work
3. Dataset
| Abbreviation | Description | Formula |
|---|---|---|
| ABSNJZH [14] | Absolute value of the net current helicity | |
| EPSX [25] | Sum of x-component of normalized Lorentz force | |
| EPSY [25] | Sum of y-component of normalized Lorentz force | |
| EPSZ [25] | Sum of z-component of normalized Lorentz force | |
| MEANALP [26] | Mean characteristic twist parameter, | |
| MEANGAM [14] | Mean angle of field from radial | |
| MEANGBH [14] | Mean gradient of horizontal field | |
| MEANGBT [14] | Mean gradient of total field | |
| MEANGBZ [14] | Mean gradient of vertical field | |
| MEANJZD [14] | Mean vertical current density | |
| MEANJZH [14] | Mean current helicity ( contribution) | |
| MEANPOT [27] | Mean photospheric magnetic free energy | |
| MEANSHR [27] | Mean shear angle | |
| R_VALUE [28] | Sum of flux near polarity inversion line | |
| SAVNCPP [14] | Sum of the modulus of the net current per polarity | |
| SHRGT45 [14] | Fraction of Area with shear | Area with shear total area |
| TOTBSQ [25] | Total magnitude of Lorentz force | |
| TOTFX [25] | Sum of x-component of Lorentz force | |
| TOTFY [25] | Sum of y-component of Lorentz force | |
| TOTFZ [25] | Sum of z-component of Lorentz force | |
| TOTPOT [14] | Total photospheric magnetic free energy density | |
| TOTUSJH [14] | Total unsigned current helicity | |
| TOTUSJZ [14] | Total unsigned vertical current | |
| USFLUX [14] | Total unsigned flux |
4. Methodology
5. Experiments
5.1. Performance Metrics: TSS Score and HSS2 Score
5.2. Comparing Different Classes of Classifiers
5.2.1. Long Short-Term Memory (LSTM)
5.2.2. Support Vector Machine (SVM)
5.2.3. Canonical Interval Forest (CIF)
5.2.4. Multiple Representations SEQuence Learner (Mr-SEQL)
5.3. Binary Classification
5.4. Multi-class: All Class Classification
5.5. Analysis with the Exclusion of B and C Class Flares
6. Conclusions
Funding
Data Availability Statement
References
- Ahmadzadeh, A.; Aydin, B.; Georgoulis, M.K.; Kempton, D.J.; Mahajan, S.S.; Angryk, R.A. How to train your flare prediction model: Revisiting robust sampling of rare events. The Astrophysical Journal Supplement Series 2021, 254, 23. [Google Scholar] [CrossRef]
- Larsen, E. Predicting Solar Flares with Remote Sensing and Machine Learning. CoRR 2021, abs/2110.07658, [2110.07658].
- Abd Elrahman, S.M.; Abraham, A. A review of class imbalance problem. Journal of Network and Innovative Computing 2013, 1, 332–340. [Google Scholar]
- Bobra, M.G.; Sun, X.; Hoeksema, J.T.; Turmon, M.; Liu, Y.; Hayashi, K.; Barnes, G.; Leka, K. The Helioseismic and Magnetic Imager (HMI) vector magnetic field pipeline: SHARPs–space-weather HMI active region patches. Solar Physics 2014, 289, 3549–3578. [Google Scholar] [CrossRef]
- Kusano, K.; Iju, T.; Bamba, Y.; Inoue, S. A physics-based method that can predict imminent large solar flares. Science 2020, 369, 587–591. [Google Scholar] [CrossRef] [PubMed]
- Dempster, A.; Schmidt, D.F.; Webb, G.I. Minirocket: A very fast (almost) deterministic transform for time series classification. In Proceedings of the Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining, 2021; pp. 248–257. [Google Scholar]
- Dempster, A.; Petitjean, F.; Webb, G.I. ROCKET: exceptionally fast and accurate time series classification using random convolutional kernels. Data Mining and Knowledge Discovery 2020, 34, 1454–1495. [Google Scholar] [CrossRef]
- Middlehurst, M.; Large, J.; Bagnall, A. The canonical interval forest (CIF) classifier for time series classification. In Proceedings of the 2020 IEEE international conference on big data (big data). IEEE, 2020; pp. 188–195. [Google Scholar]
- Bloomfield, D.S.; Higgins, P.A.; McAteer, R.J.; Gallagher, P.T. Toward reliable benchmarking of solar flare forecasting methods. The Astrophysical Journal Letters 2012, 747, L41. [Google Scholar] [CrossRef]
- McIntosh, P.S. The classification of sunspot groups. Solar Physics 1990, 125, 251–267. [Google Scholar] [CrossRef]
- Boubrahimi, S.F.; Aydin, B.; Kempton, D.; Angryk, R. Spatio-temporal interpolation methods for solar events metadata. In Proceedings of the 2016 IEEE International Conference on Big Data (Big Data). IEEE, 2016; pp. 3149–3157. [Google Scholar]
- Cui, Y.; Li, R.; Zhang, L.; He, Y.; Wang, H. Correlation between solar flare productivity and photospheric magnetic field properties: 1. Maximum horizontal gradient, length of neutral line, number of singular points. Solar Physics 2006, 237, 45–59. [Google Scholar] [CrossRef]
- Jing, J.; Song, H.; Abramenko, V.; Tan, C.; Wang, H. The statistical relationship between the photospheric magnetic parameters and the flare productivity of active regions. The Astrophysical Journal 2006, 644, 1273. [Google Scholar] [CrossRef]
- Leka, K.; Barnes, G. Photospheric magnetic field properties of flaring versus flare-quiet active regions. II. Discriminant analysis. The Astrophysical Journal 2003, 595, 1296. [Google Scholar] [CrossRef]
- Yu, D.; Huang, X.; Wang, H.; Cui, Y. Short-term solar flare prediction using a sequential supervised learning method. Solar Physics 2009, 255, 91–105. [Google Scholar] [CrossRef]
- Song, H.; Tan, C.; Jing, J.; Wang, H.; Yurchyshyn, V.; Abramenko, V. Statistical assessment of photospheric magnetic features in imminent solar flare predictions. Solar Physics 2009, 254, 101–125. [Google Scholar] [CrossRef]
- Ahmed, O.W.; Qahwaji, R.; Colak, T.; Higgins, P.A.; Gallagher, P.T.; Bloomfield, D.S. Solar flare prediction using advanced feature extraction, machine learning, and feature selection. Solar Physics 2013, 283, 157–175. [Google Scholar] [CrossRef]
- Al-Ghraibah, A.; Boucheron, L.; McAteer, R. An automated classification approach to ranking photospheric proxies of magnetic energy build-up. Astronomy & Astrophysics 2015, 579, A64. [Google Scholar]
- Nishizuka, N.; Sugiura, K.; Kubo, Y.; Den, M.; Watari, S.; Ishii, M. Solar flare prediction model with three machine-learning algorithms using ultraviolet brightening and vector magnetograms. The Astrophysical Journal 2017, 835, 156. [Google Scholar] [CrossRef]
- Bobra, M.G.; Couvidat, S. Solar flare prediction using SDO/HMI vector magnetic field data with a machine-learning algorithm. The Astrophysical Journal 2015, 798, 135. [Google Scholar] [CrossRef]
- Hamdi, S.M.; Kempton, D.; Ma, R.; Boubrahimi, S.F.; Angryk, R.A. A time series classification-based approach for solar flare prediction. In Proceedings of the 2017 IEEE International Conference on Big Data (Big Data). IEEE, 2017; pp. 2543–2551. [Google Scholar]
- Angryk, R.A.; Martens, P.C.; Aydin, B.; Kempton, D.; Mahajan, S.S.; Basodi, S.; Ahmadzadeh, A.; Cai, X.; Filali Boubrahimi, S.; Hamdi, S.M.; et al. Multivariate time series dataset for space weather data analytics. Scientific data 2020, 7, 227. [Google Scholar] [CrossRef] [PubMed]
- Pesnell, W.D.; Thompson, B.J.; Chamberlin, P.C. The Solar Dynamics Observatory (SDO). In The Solar Dynamics Observatory; Chamberlin, P., Pesnell, W.D., Thompson, B., Eds.; Springer US: New York, NY, 2012; pp. 3–15. [Google Scholar] [CrossRef]
- Hoeksema, J.T.; Liu, Y.; Hayashi, K.; Sun, X.; Schou, J.; Couvidat, S.; Norton, A.; Bobra, M.; Centeno, R.; Leka, K.; et al. The Helioseismic and Magnetic Imager (HMI) vector magnetic field pipeline: overview and performance. Solar Physics 2014, 289, 3483–3530. [Google Scholar] [CrossRef]
- Fisher, G.H.; Bercik, D.J.; Welsch, B.T.; Hudson, H.S. Global forces in eruptive solar flares: the lorentz force acting on the solar atmosphere and the solar interior. Solar Physics 2012, 277, 59–76. [Google Scholar] [CrossRef]
- Leka, K.; Skumanich, A. On the value of’αAR’from vector magnetograph data. Solar Physics 1999, 188, 3–19. [Google Scholar] [CrossRef]
- Wang, J.; Shi, Z.; Wang, H.; Lue, Y. Flares and the magnetic nonpotentiality. The Astrophysical Journal 1996, 456, 861. [Google Scholar] [CrossRef]
- Schrijver, C.J. A characteristic magnetic field pattern associated with all major solar flares and its use in flare forecasting. The Astrophysical Journal 2007, 655, L117. [Google Scholar] [CrossRef]
- Guo, X.; Yin, Y.; Dong, C.; Yang, G.; Zhou, G. On the class imbalance problem. In Proceedings of the 2008 Fourth international conference on natural computation. IEEE, 2008; Vol. 4, pp. 192–201. [Google Scholar]
- Allouche, O.; Tsoar, A.; Kadmon, R. Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS). Journal of Applied Ecology 2006, 43, 1223–1232. [Google Scholar] [CrossRef]
- Mason, J.P.; Hoeksema, J. Testing automated solar flare forecasting with 13 years of Michelson Doppler Imager magnetograms. The Astrophysical Journal 2010, 723, 634. [Google Scholar] [CrossRef]
- Woodcock, F. The evaluation of yes/no forecasts for scientific and administrative purposes. Monthly Weather Review 1976, 104, 1209–1214. [Google Scholar] [CrossRef]
- Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Computation 1997, 9, 1735–1780, https://direct.mit.edu/neco/article-pdf/9/8/1735/813796/neco.1997.9.8.1735.pdf. [Google Scholar] [CrossRef] [PubMed]
- Cortes, C.; Vapnik, V. Support-vector networks. Machine learning 1995, 20, 273–297. [Google Scholar] [CrossRef]
- Lubba, C.H.; Sethi, S.S.; Knaute, P.; Schultz, S.R.; Fulcher, B.D.; Jones, N.S. catch22: CAnonical Time-series CHaracteristics. CoRR 2019, abs/1901.10200, [1901.10200].
- Middlehurst, M.; Large, J.; Bagnall, A.J. The Canonical Interval Forest (CIF) Classifier for Time Series Classification. CoRR 2020, abs/2008.09172, [2008.09172].
- Nguyen, T.L.; Gsponer, S.; Ilie, I.; O’Reilly, M.; Ifrim, G. Interpretable Time Series Classification using Linear Models and Multi-resolution Multi-domain Symbolic Representations. CoRR 2020, abs/2006.01667, [2006.01667].
- Nguyen, T.L.; Gsponer, S.; Ifrim, G. Time Series Classification by Sequence Learning in All-Subsequence Space. In Proceedings of the 2017 IEEE 33rd International Conference on Data Engineering (ICDE), 2017; pp. 947–958. [Google Scholar] [CrossRef]
- Angryk, R.; Martens, P.; Aydin, B.; Kempton, D.; Mahajan, S.; Basodi, S.; Ahmadzadeh, A.; Cai, X.; Filali Boubrahimi, S.; Hamdi, S.M.; et al. SWAN-SF 2020. [CrossRef]
- Löning, M.; Király, F.; Bagnall, T.; Middlehurst, M.; Ganesh, S.; Oastler, G.; Lines, J.; Walter, M.; ViktorKaz; Mentel, L.; et al. sktime/sktime: v0.13.4 2022. [CrossRef]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research 2011, 12, 2825–2830. [Google Scholar]
- Hunter, J.D. Matplotlib: A 2D Graphics Environment. Computing in Science & Engineering 2007, 9, 90–95. [Google Scholar]
- Oliphant, T.E. A Guide to NumPy. Trelgol Publishing 2006. [Google Scholar]








| Flare Type |
Partitions | ||||
|---|---|---|---|---|---|
| P1 | P2 | P3 | P4 | P5 | |
| Q | 60,130 | 73,368 | 34,762 | 43,294 | 62,688 |
| B | 5,692 | 4,978 | 685 | 846 | 5,924 |
| C | 6,416 | 8,810 | 5,639 | 5,956 | 5,763 |
| M | 1,089 | 1,329 | 1,288 | 1,012 | 971 |
| X | 165 | 72 | 136 | 153 | 19 |
| sum | 73,492 | 88,557 | 42,510 | 51,261 | 75,365 |
| Hyper-parameters | ROCKET | MINIROCKET |
|---|---|---|
| Length | {7, 8, 11} | 9 |
| Weight | N(0,1) | (-1,2) |
| Bias | U(-1,1) | From convolution output |
| Dilation | Random | Fixed |
| Padding | Random | Fixed |
| Actual Positive | Actual Negative | |
|---|---|---|
| Predicted Positive | True Positive | False Positive |
| Predicted Negative | False Negative | True Negative |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).