Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Tweets Classification for Digital Epidemiology of Childhood Health Outcomes Using Pre-Trained Language Models

Version 1 : Received: 11 June 2024 / Approved: 12 June 2024 / Online: 13 June 2024 (09:42:13 CEST)

How to cite: Athukoralage, D.; Atapattu, T.; Thilakaratne, M.; Falkner, K. Tweets Classification for Digital Epidemiology of Childhood Health Outcomes Using Pre-Trained Language Models. Preprints 2024, 2024060860. https://doi.org/10.20944/preprints202406.0860.v1 Athukoralage, D.; Atapattu, T.; Thilakaratne, M.; Falkner, K. Tweets Classification for Digital Epidemiology of Childhood Health Outcomes Using Pre-Trained Language Models. Preprints 2024, 2024060860. https://doi.org/10.20944/preprints202406.0860.v1

Abstract

This paper presents our approaches for the SMM4H’24 Shared Task 5 on the binary classification of English tweets reporting children’s medical disorders. Our first approach involves fine-tuning a single RoBERTa-large model, while the second approach entails ensembling the results of three fine-tuned BERTweet-large models. We demonstrate that although both approaches exhibit identical performance on validation data, the BERTweet-large ensemble excels on test data. Our best-performing system achieves an F1-score of 0.938 on test data, outperforming the benchmark classifier by 1.18%.

Keywords

digital epidemiology, childhood health, pre-trained language models, ensemble models, natural language processing, tweets classification

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.