Galli, C.; Cusano, C.; Meleti, M.; Donos, N.; Calciolari, E. Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings. Metrics2025, 1, 2.
Galli, C.; Cusano, C.; Meleti, M.; Donos, N.; Calciolari, E. Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings. Metrics 2025, 1, 2.
Galli, C.; Cusano, C.; Meleti, M.; Donos, N.; Calciolari, E. Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings. Metrics2025, 1, 2.
Galli, C.; Cusano, C.; Meleti, M.; Donos, N.; Calciolari, E. Topic Modeling for Faster Literature Screening Using Transformer-Based Embeddings. Metrics 2025, 1, 2.
Abstract
Systematic reviews represent a powerful instrument to summarize the existing evidence in medical literature. However, articles for a systematic review are hard to identify, and mostly require a structured search of the literature through a number of databases, using keyword-based search strategies, followed by the painstaking manual selection of pieces of evidence that are pertinent to the query. A.I. algorithms may offer solutions to reduce the workload on investigators. We applied BERTopic, a newer and much-recognized transformer-based topic-modeling algorithm, to two datasets of 6137 and 5309 articles of newly published systematic reviews in the area of peri-implantitis and bone regeneration in implant dentistry. In the two datasets, BERTopic identified 14 and 22 clusters, respectively, and it automatically created labels describing the nature of the topics for each individual cluster based on semantic interpretation of their titles. Most themes regarded the query theme, but in both conditions, BERTopic also uncovered articles related to off-themes, which composed around 30% of the dataset—with sensitivity up to 0.79 and specificities of at least 0.99. Our results suggest that adding a topic-modeling step to the screening process could potentially save working hours for researchers involved in systematic reviews of the literature.
Keywords
embedding; systematic reviews; topic modeling
Subject
Medicine and Pharmacology, Other
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.