Preprint Article Version 1 This version is not peer-reviewed

Fractals as Pre-training Datasets for Anomaly Detection and Localization

Version 1 : Received: 18 October 2024 / Approved: 21 October 2024 / Online: 21 October 2024 (12:06:01 CEST)

How to cite: Ugwu, C. I.; Caruso, E.; Lanz, O. Fractals as Pre-training Datasets for Anomaly Detection and Localization. Preprints 2024, 2024101579. https://doi.org/10.20944/preprints202410.1579.v1 Ugwu, C. I.; Caruso, E.; Lanz, O. Fractals as Pre-training Datasets for Anomaly Detection and Localization. Preprints 2024, 2024101579. https://doi.org/10.20944/preprints202410.1579.v1

Abstract

Anomaly detection is crucial in large-scale industrial manufacturing as it helps detect and localise defective parts. Pre-training feature extractors on large-scale datasets is a popular approach for this task. Stringent data security, privacy regulations, high costs and acquisition time hinder the development of large-scale datasets for training and benchmarking. Despite recent work focusing primarily on the development of new anomaly detection methods based on such extractors, not much attention has been paid to the importance of the data used for pre-training. This study compares representative models pre-trained with fractal images against those pre-trained with ImageNet, without subsequent task-specific fine-tuning. We evaluated the performance of eleven state-of-the-art methods on MVTecAD, MVTec LOCO AD, and VisA, well-known benchmark datasets inspired by real-world industrial inspection scenarios. Further, we propose a novel method to create a dataset by combining the dynamically generated fractal images creating a ``Multi-Formula'' dataset. Even though pre-training with ImageNet leads to better results, fractals can achieve close performance to ImageNet under proper parametrisation. This opens up the possibility for a new research direction where feature extractors could be trained on synthetically generated abstract datasets mitigating the ever-increasing demand for data in machine learning while circumventing privacy and security concerns.

Keywords

Fractals; Mandelbulb; Anomaly detection; Industrial inspection; Synthetic data

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.