Preprint Article, Version 1. Preserved in Portico. This version is not peer-reviewed.

Consore: A Powerful Federated Data Mining Tool Driving a French Research Network to Accelerate Cancer Research

Version 1: Received: 22 November 2023 / Approved: 24 November 2023 / Online: 26 November 2023 (05:13:14 CET)

A peer-reviewed article of this Preprint also exists.

Guérin, J.; Nahid, A.; Tassy, L.; Deloger, M.; Bocquet, F.; Thézenas, S.; Desandes, E.; Le Deley, M.-C.; Durando, X.; Jaffré, A.; Es-Saad, I.; Crochet, H.; Le Morvan, M.; Lion, F.; Raimbourg, J.; Khay, O.; Craynest, F.; Giro, A.; Laizet, Y.; Bertaut, A.; Joly, F.; Livartowski, A.; Heudel, P. Consore: A Powerful Federated Data Mining Tool Driving a French Research Network to Accelerate Cancer Research. Int. J. Environ. Res. Public Health 2024, 21, 189.


Background: Real-world data (RWD) on the health status and care of cancer patients reflect current medical practice, and their analysis yields essential real-world evidence. Advanced information technologies are vital for collecting, qualifying, and reusing these data in research projects. Methods: UNICANCER, the French federation of comprehensive cancer centres, has built a research network around Consore, a federated tool that enables the analysis of data from more than a million cancer patients across eleven French cancer centres. Results: Currently operational in eleven French cancer centres, Consore uses natural language processing to structure the therapeutic management data of approximately 1.3 million cancer patients. These data originate from the patients' electronic medical records, encompassing about 65 million medical records. Because the structured data are harmonized within a common data model and queried through a federated search tool, Consore can build patient cohorts based on patient characteristics, tumor characteristics, and treatment modalities. The ability to assemble larger cohorts is particularly valuable when studying rare cancers. Conclusions: Consore is a data mining instrument that brings French cancer centres into the big data era. Its federated technical architecture and shared data model facilitate compliance with regulations and accelerate cancer research projects.
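The federated cohort search described in the abstract can be illustrated with a minimal sketch. It is not Consore's implementation or API. It assumes one common federated pattern in which each centre evaluates the same criteria against its own records, expressed in a shared data model, and returns only an aggregate count. All names (PatientRecord, CentreNode, federated_count), the field names, and the records are hypothetical and synthetic.

```python
from dataclasses import dataclass
from typing import Callable, Dict, FrozenSet, Iterable, List


@dataclass(frozen=True)
class PatientRecord:
    """Hypothetical row of a shared data model (field names are illustrative)."""
    patient_id: str
    tumor_site: str
    treatments: FrozenSet[str]


# A cohort definition is modelled as a predicate over one record.
Criteria = Callable[[PatientRecord], bool]


class CentreNode:
    """Stands in for one centre's local data warehouse (assumption)."""

    def __init__(self, name: str, records: Iterable[PatientRecord]) -> None:
        self.name = name
        self._records: List[PatientRecord] = list(records)

    def count_matching(self, criteria: Criteria) -> int:
        # Only an aggregate count is returned; the records stay local.
        return sum(1 for record in self._records if criteria(record))


def federated_count(nodes: Iterable[CentreNode], criteria: Criteria) -> Dict[str, int]:
    """Apply the same criteria at every node and collect per-centre counts."""
    return {node.name: node.count_matching(criteria) for node in nodes}


def sarcoma_with_chemo(record: PatientRecord) -> bool:
    """Example criteria combining a tumor characteristic and a treatment modality."""
    return record.tumor_site == "sarcoma" and "chemotherapy" in record.treatments


# Synthetic records for two hypothetical centres.
nodes = [
    CentreNode("centre_a", [
        PatientRecord("P1", "sarcoma", frozenset({"surgery", "chemotherapy"})),
        PatientRecord("P2", "breast", frozenset({"surgery"})),
    ]),
    CentreNode("centre_b", [
        PatientRecord("P3", "sarcoma", frozenset({"chemotherapy"})),
        PatientRecord("P4", "sarcoma", frozenset({"radiotherapy"})),
    ]),
]

counts = federated_count(nodes, sarcoma_with_chemo)
print(counts)                 # {'centre_a': 1, 'centre_b': 1}
print(sum(counts.values()))   # 2
```

Summing the per-centre counts gives the size of the pooled cohort, which is why a federated query can reach cohort sizes for rare cancers that no single centre could provide alone.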


Keywords: cancer research; cancer; natural language processing; data mining; data warehouse; big data


Subject: Public Health and Healthcare, Public Health and Health Services
