Legionella pneumophila (
L. pneumophila) is a waterborne pathogen, ubiquitously infecting patients via water reservoirs and mostly known to cause mild to severe pneumonia. It can also be involved in extra-pulmonary infections such as cardiologic or neurological, hematopoietic disorders, hepatitis or cellulitis [
1,
2]. Infection episodes are mostly community-acquired and sporadic, but clusters and epidemics often happen despite a tight control of
L. pneumophila ratio in water systems [
3,
4]. In Belgium, each case leads to an environmental investigation by public health institutions to identify the source of contamination and prevent epidemics.
1.1. Case 1
A patient (P1) was hospitalized for deterioration of general status, in a secondary hospital of Wallonia, Belgium in October 2019, further mentioned as H1. The patient was also treated for a metastatic cancer for two years, recently complicated by an episode of febrile neutropenia. At day 16, the patient developed signs of sepsis and large spectrum antibiotics (glycopeptides and third generation cephalosporin) were initiated. Within 24 hours, they were admitted in intensive care unit as they became hypoxemic, developed a spontaneous and rapidly progressing abdominal wall cellulitis and signs of systemic shock. The X-ray performed at that time showed bilateral condensing infiltrates with right pleurisy. In this context, a urinary L. pneumophila antigen test was performed and turned out to be positive. Interestingly it was also the case of a liquid sample punctured in a small third space close to the cellulitis. Moxifloxacin was immediately added to treatment, but the patient died a few hours later.
Four clinical samples were sent to the Belgian National reference center (NRC) for Legionella pneumophila for culture and PCR detection: an endotracheal aspirate (ETA), a pleural fluid, an ascites fluid and a cutaneous biopsy. All turned out positive for L. pneumophila both by PCR and culture. The isolates were all serogroup 1 (Sg1) and sequence-based typing (SBT) sequence-type (ST) 1.
As the patient had no recent travel history, no activity suggesting a contamination via a recreational water system and was hospitalized for more than two weeks before the beginning of the symptoms, a diagnostic of nosocomial infection was retained.
The water distribution system of the patient’s room was sampled. The culture of the sink’s hot and cold-water systems samples reached 50 000 unit forming colonies (UFC)/L and 20 000 UFC/L of L. pneumophila Sg1, respectively, and confirmed to be ST1 by the NRC. The water system was then decontaminated following up-to-date protocols.
1.2. Case 2
One year later, another immunocompromised patient (P2) was hospitalized in the same ward. At day 10 after hospitalization, he was diagnosed with a clinical pneumonia caused by L. pneumophila, which was ST1 according to SBT (BAL isolate). As 10 days is the usual breakpoint used for nosocomial onset, an environmental investigation was set up. P2 did not stay in the same bedroom than P1 and two isolates of L. pneumophila ST1 were cultured from the water systems of this bedroom. While the link between environmental and clinical isolates was, here, difficult to clarify, the additional question of a link between the two clinical cases was triggered as the persistence of a strain in the hospital environment is a well described possibility.
Nowadays, when an investigation is set up, SBT is the reference technique for comparing
L. pneumophila isolates. But, as illustrated by the two cases described above, when the isolates investigated belong to ST1, the epidemiological link can rarely be confirmed [
9]. In Belgium, 19% of
L. pneumophila infections are related to Sg1 ST1, one of the five major STs found worldwide [
3,
4,
5,
6]. As opposed to the four other major lineages of
L. pneumophila (ST23, ST37, ST47 and ST62), ST1 is ubiquitously found in the environment and very often isolated from environmental investigations following both community-acquired (CA) and nosocomial
L. pneumophila cases [
5,
6]. In addition,
Legionella forms biofilms and lives at slow pace until factors favoring its multiplication occur [
7,
8], then the epidemiological link between the source and the patient P2 was only assumed. We thus decided to investigate the correlation between the isolates from both episodes by core-genome (cg) MultiLocus Sequence Typing (MLST), to extend the number of alleles included in the comparison, using the pattern designed by Moran-Gilad et al. [
10] that serves as a reference for many European studies on
Legionella (
Table 1).
Three out of four clinical samples from P1, the two environmental matching isolates and the isolates found in the environmental investigations of P2 clustered together when applying a maximum 4 allelic differences (AD) between two isolates [
10] (
Figure 1A). The clinical isolate from P2 is 5 allelic differences away from the closest isolates, putting it at the limit of being included the cluster. Interestingly, the P1 isolate LEG1116 is 6 allelic differences away from the closest isolate, 9 to 10 AD from isolates sampled from the same patient the very same day and 8AD away from the clinical isolate of P2. Consequently, along with the practical question of a possible link between these two specific cases trough the persistence of a strain in the ward environment [
7], the question of the adequate technique and the adequate threshold to infer reliable conclusions regarding ST1 isolates of
L. pneumophila relationship was raised.
Since NGS has been developed, its use as high-resolution typing tool to distinguish closely related isolates has contributed to improve infection control and outbreak management for a wide range of bacterial pathogens [
9,
10,
11,
12,
13,
14,
15]. Applied to
L. pneumophila , core-genome, whole-genome (wg) Single Nucleotide Polymorphism (SNP) and wg and cg-Multi-Locus Sequence Typing have been reported as showing good discrimination performances [
9,
10,
15,
16,
17]. Nevertheless, issues remain for highly represented STs like ST1 [
7,
10,
13,
14]. Indeed, ST1 is known to be both genetically highly conserved and ubiquitous, but its population structure remains poorly described [
18]. We carried out a mini review of
L. pneumophila genomic studies conducted since 2015 in order to explore the methods and threshold for clustering used to investigate both outbreaks and long-term surveillances to identify which to apply to our specific question (
Table 1).
Table 1.
Mini review of the literature about L. pneumophila genomic studies from 2015.
Table 1.
Mini review of the literature about L. pneumophila genomic studies from 2015.
As the pattern for cgMLST with the proposed threshold of 4 AD used as reference in most studies, did not give conclusive results using the 9 isolates described above (hospital 1 - H1), a larger analysis was performed (
Figure 1B,C). A panel of 26 ST1 well documented Belgian isolates collected between 1985 and 2020 was added. These isolates were related to two nosocomial ST1 epidemics that took place in the 1980’s in two other Belgian hospitals: Hospital 2 (H2) (6 isolates of which 2 clinical) and Hospital 3 (H3) (13 isolates; 11 clinical) but also isolates from another hospital (H4) (2 clinically related). Five unrelated (1 nosocomial, 4 community-acquired) and an environmental isolate from H3 were also used. To complete the panel, 41 ST1 isolates from 9 countries collected between 1992 and 2018 (
Supplementary Table S1a).
Most isolates grouped together by country (
Figure 1B). Regarding Belgian isolates, three groups appeared, the well-identified epidemics from H2 and H3 were at maximum separated by 5 allelic differences except for the H3 isolate LEG515, (> 50 AD from the H3 closest isolate) which is opposed to the conclusion made by the initial investigation but consistent with the monoclonal subtyping performed at the time (Philadelphia versus Benidorm) [
19]. However, in the same study LEG517, also subtyped Philadelphia belonged to the H3 cluster [
19] (
Supplementary Table S1). To cluster together isolates known to be epidemiologically linked (H3 epidemic isolates and H2 epidemic isolates, and P1 isolates), a threshold of 6 AD should be applied (
Figure 1C). Then, all the 9 clinical and environmental isolates from H1 would also cluster together. However, the H1 cluster would also involve LEG767, an epidemiologically unrelated 2017 community-acquired (CA) sporadic case from the same region, the H3 LEG515 isolate and the two H4 isolates (located in the same city as H3) from 15 years earlier (2003-2004). Similarly, the H3 epidemic cluster would englobe an environmental isolate from the same hospital (LEG723), but from 30 years later (2016 -
Supplementary Table S1) and a sporadic community isolate acquired in the same city than H3 (LEG325).
A wgSNP analysis was then run on the same group of strains. The commonly proposed SNPs threshold in the literature is 4 SNPs. The minimum spanning tree (
Figure 2A,B) shows that this threshold does not cluster together neither the H3 nor the P1 isolates. The fitting threshold would be 8 SNPs, which would allow more discrimination than cgMLST. Indeed, while still showing clusters by country of origin on the large scale this threshold allows for H2 and H3 epidemics to form well-defined clusters, excluding LEG723 from the H3 cluster but not LEG 325. In this analysis, P2 clinical isolate stays outside the cluster formed by all the other H1 isolates with a difference of 11 SNPs, as opposed to the H4 isolates, H3 LEG515 and the unrelated sporadic LEG767 isolate which remain inside the cluster (
Figure 2A,B).