Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings

Version 1 : Received: 25 July 2024 / Approved: 26 July 2024 / Online: 26 July 2024 (14:26:58 CEST)

A peer-reviewed article of this Preprint also exists.

Galli, C.; Cusano, C.; Meleti, M.; Donos, N.; Calciolari, E. Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings. Metrics 2025, 1, 2. Galli, C.; Cusano, C.; Meleti, M.; Donos, N.; Calciolari, E. Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings. Metrics 2025, 1, 2.

Abstract

Systematic reviews represent a powerful instrument to summarize the existing evidence in medical literature. However, articles for a systematic review are hard to identify, and mostly require a structured search of the literature through a number of databases, using keyword-based search strategies, followed by the painstaking manual selection of pieces of evidence that are pertinent to the query. A.I. algorithms may offer solutions to reduce the workload on investigators. We applied BERTopic, a newer and much-recognized transformer-based topic-modeling algorithm, to two datasets of 6137 and 5309 articles of newly published systematic reviews in the area of peri-implantitis and bone regeneration in implant dentistry. In the two datasets, BERTopic identified 14 and 22 clusters, respectively, and it automatically created labels describing the nature of the topics for each individual cluster based on semantic interpretation of their titles. Most themes regarded the query theme, but in both conditions, BERTopic also uncovered articles related to off-themes, which composed around 30% of the dataset—with sensitivity up to 0.79 and specificities of at least 0.99. Our results suggest that adding a topic-modeling step to the screening process could potentially save working hours for researchers involved in systematic reviews of the literature.

Keywords

embedding; systematic reviews; topic modeling

Subject

Medicine and Pharmacology, Other

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.