Genomic blueprint of a relapsing fever pathogen in 15th century Scandinavia.
Journal: 2018/November - Proceedings of the National Academy of Sciences of the United States of America
ISSN: 1091-6490
Abstract:
Louse-borne relapsing fever (LBRF) is known to have killed millions of people over the course of European history and remains a major cause of mortality in parts of the world. Its pathogen, Borrelia recurrentis, shares a common vector with global killers such as typhus and plague and is known for its involvement in devastating historical epidemics such as the Irish potato famine. Here, we describe a European and historical genome of B recurrentis, recovered from a 15th century skeleton from Oslo. Our distinct European lineage has a discrete genomic makeup, displaying an ancestral oppA-1 gene and gene loss in antigenic variation sites. Our results illustrate the potential of ancient DNA research to elucidate dynamics of reductive evolution in a specialized human pathogen and to uncover aspects of human health usually invisible to the archaeological record.
Relations:
Content
Citations
(1)
Diseases
(1)
Drugs
(1)
Chemicals
(1)
Organisms
(3)
Processes
(2)
Similar articles
Articles by the same authors
Discussion board
Proceedings of the National Academy of Sciences of the United States of America. Oct/8/2018; 115(41): 10422-10427
Published online Sep/23/2018

Genomic blueprint of a relapsing fever pathogen in 15th century Scandinavia

Abstract

Significance

Louse-borne relapsing fever was one of the major diseases affecting Western human populations, with its last major pandemic killing millions after World War I. Despite the major role fevers have played in epidemic events throughout history, molecular evidence for the presence of their etiological agent has been extremely scarce in historical samples worldwide. By comparing our medieval Borrelia recurrentis genome with modern representatives of the species, we offer an historical snapshot of genomic changes in an immune-evasion system and of reductive evolution in a specialized vector-borne human pathogen. This shotgun sequencing project highlights the potential for ancient DNA research to uncover pathogens which are undetectable to osteological analysis but are known to have played major roles in European health historically.

Louse-borne relapsing fever (LBRF) is known to have killed millions of people over the course of European history and remains a major cause of mortality in parts of the world. Its pathogen, Borrelia recurrentis, shares a common vector with global killers such as typhus and plague and is known for its involvement in devastating historical epidemics such as the Irish potato famine. Here, we describe a European and historical genome of B. recurrentis, recovered from a 15th century skeleton from Oslo. Our distinct European lineage has a discrete genomic makeup, displaying an ancestral oppA-1 gene and gene loss in antigenic variation sites. Our results illustrate the potential of ancient DNA research to elucidate dynamics of reductive evolution in a specialized human pathogen and to uncover aspects of human health usually invisible to the archaeological record.

Louse-borne relapsing fever (LBRF), once one of many fevers ravaging Europe, has disappeared from the Western world and is now endemic to only eastern Africa. The disease is part of a group of well-known deadly louse-borne pathogens, which specialize in vector–human transmission. Its causative pathogen is the spirochete Borrelia recurrentis, whose only known vector is the human body louse, Pediculus humanus. This sets it apart from other tick-borne relapsing fever (TBRF) pathogens, such as its most closely related strains Borrelia duttonii and Borrelia crocidurae (1, 2).

Genomes of the genus Borrelia are unique. They are composed of up to 24 circular and linear plasmids with covalently closed hairpin telomeres showcasing AT-rich genomes of very small sizes. While the chromosomes, generally around 900–930 kbp, are very conserved across species, a high level of DNA rearrangements can be found among the plasmids (35), with some carrying essential genes, such as the telomere resolvase gene resT on plasmid pl23 of B. recurrentis and B. duttonii.

The B. recurrentis genome (1.24 Mbp) is composed of one linear chromosome and seven linear plasmids and is characterized by its low GC content (mean 28.1%). Genomic and ecological data on the pathogen are scarce, since it is challenging to cultivate and studies lack an animal model (6, 7). Research on the pathogen has therefore been limited to clinical samples, resulting thus far in the publication of one deposited reference sequence, strain A1 (5), and six additional datasets from eastern African strains (8).

LBRF is fatal in 10–40% of untreated cases (9) and is transmitted from vector to host when the hemocoel of crushed lice comes into contact with intact mucosa or skin (9). Studies (10) have also suggested a possible transmission of LBRF via lice feces. Like all relapsing fevers (RF), it is characterized by multiple febrile episodes separated by short periods of remission. Defining symptoms of LBRF are epistaxis and jaundice (9).

The disease was first mentioned in medical texts by Hippocrates from the fourth century BCE, in which he describes a series of fevers afflicting the populations of Thasos after a harsh winter (11). Further references to LBRF can be found throughout European history with prominent examples being outbreaks during the Great Irish Famine of 1846–1852 (1113) and the post-World War I pandemic (1919–1923), which is estimated to have killed more than five million people in central Europe and Russia alone (14, 15). It has also been hypothesized that the so-called “pestis flava” or “Buidhe Chonaill” of sixth century CE Ireland was an LBRF epidemic (16). LBRF frequently emerged with typhus, which is also louse-borne and caused by the bacterium Rickettsia prowazekii (12).

Today, outbreaks of the disease can be found sporadically in Ethiopia, Eritrea, Somalia, and Sudan, where it is still endemic and until recently was the fifth most common cause of death (17). In addition to recent European reports of the disease among refugees (18), which clearly highlight how quickly a decline in hygiene and living conditions can lead to the spread of LBRF, a study by Brouqui et al. (19) detected a clear increase of individuals with IgG antibodies for LBRF in 2000 and 2002. The study therefore suggests that small, undetected outbreaks of the disease still exist to this day in European populations exposed to body lice infestation.

While LBRF is the only known RF capable of reaching epidemic proportions, it was never possible to confirm the presence of the pathogen in archaeological samples. Previous attempts at detecting B. recurrentis in archaeological samples have been unsuccessful (20, 21) but have succeeded in detecting other louse-borne pathogens such as the typhus agent R. prowazekii. The only previously reported ancient DNA (aDNA) sequences of the genus Borrelia came from the Tyrolean Iceman Ötzi (22), who carried sequences matching the Lyme disease pathogen Borrelia burgdorferi.

Results

Skeleton OSL9/SZ50522 (Fig. 1A) was found during the excavation of a graveyard south of St. Nicolay’s Church, in Oslo (Fig. 1B, SI Appendix, and SI Appendix, Fig. S1) (23). It was part of a double burial (SA50521) situated close to the southern boundary of the graveyard and was identified as a female individual (age 28–35 y) who had been buried with a child (age 7–9 y) (24). A rib fragment was radiocarbon dated to Cal CE 1430–1465 (SI Appendix, Fig. S2 and Table S1).

Fig. 1.

Sample origin and site location. (A) In situ picture of the double burial SA50521 with individual OSL9/SZ50522 to the right. (B) Location of the archaeological site and C14 date of the burial displayed in A.

The initial metagenomic analysis of shotgun sequencing data from individual OSL9/SZ50522 revealed hits matching the spirochete B. recurrentis. After mapping the data to the B. recurrentis A1 genome assembly, we recovered 16.9% of the chromosome with a mean depth of coverage below 1 and deamination patterns matching aDNA.

To assemble the medieval B. recurrentis strain at an adequate depth and to cover more of the reference sequence, we sequenced 13 additional genomic libraries from two teeth (A+B) (SI Appendix, Fig. S3) over four lanes on a HiSeq 2500 Illumina system and generated ca. 1.2 billion raw DNA sequences (Dataset S8). Overall, each library contained less than 0.04% reads mapping to B. recurrentis (Dataset S4).

We were able to assemble a B. recurrentis genome at a mean depth of 6.4× with 95.2% of the genome being covered at least twice (Dataset S7), with a total of 152,490 reads mapping to B. recurrentis (Dataset S4). Ultimately, we were able to assemble 98.2% of the chromosome at a mean depth of coverage of 8.3× (Fig. 2A).

Fig. 2.

(A) Coverage plots for mapping of OSL9A-B reads to the B. recurrentis A1 reference sequence. Rings (from outer ring to inner ring) show coverage, GC skew, and GC content. GC content is further highlighted as being above (black) or below (gray) 26%. (B) Damage frequency for the mapping displayed in A with 10% of reads showing a clear deamination signature consistent with aDNA. (C) Comparison of OSL9A-B noncompetitive mappings to the chromosomes of B. recurrentis A1, B. duttonii Ly, and B. crocidurae Achema. It can be seen here that strain OSL9 does not have a CobQ/ParA gene and is also missing a 1-kbp-long sequence at the 5′ end of the B. duttonii chromosome, much like B. recurrentis A1. (D) Distribution of edit distances (plotted on the x axis) of OSL9A-B reads mapping to chromosomal assemblies of B. recurrentis A1, B. duttonii Ly, and B. crocidurae Achema (Left), to colinear pl23 plasmids of B. recurrentis A1 and B. duttonii Ly (Center), and to colinear pl6 plasmids of B. recurrentis A1 and B. crocidurae Achema (Right).

The read-length distribution (mean: 69 bp) (SI Appendix, Fig. S9) of all datasets showed that the DNA was in a highly fragmented state. Consistent with aDNA, we could also detect significant DNA damage patterns for the reads mapping to the B. recurrentis A1 assembly (Fig. 2B and SI Appendix, Fig. S5). This was further supported by the identification of a northern European mtDNA haplogroup and the DNA damage profile of the reads mapping to the revised Cambridge reference sequence (rCRS) build of the human mitochondrion (SI Appendix and SI Appendix, Fig. S6).

The unique genomic structure of these highly specialized bacteria allows high mapping specificity across the B. recurrentis genome (4, 5). This, in turn, allows us to infer the presence and absence of genomic regions via the level of coverage observed after mapping the raw datasets to Borrelia references. Additionally, the use of a shotgun dataset, as opposed to a target-enrichment sequencing strategy, did not restrict or shift the available data to known preselected modern sequences.

To ascertain that the organism represented in our metagenomic output was in fact a B. recurrentis strain and not one of its most closely related species, and potentially to detect signs of plasmid rearrangements, we mapped all datasets against available reference sequences for B. duttonii Ly and B. crocidurae Achema independently (Fig. 2C and SI Appendix, Fig. S4). The number of reads mapping to the chromosomal assemblies varies only slightly from one species to another, but we observe distinctly higher numbers of reads mapping with edit distance 0 to the B. recurrentis genome than to any of the other genomes (Fig. 2D and SI Appendix, Fig. S7 and Dataset S5).

Most B. recurrentis plasmids are colinear to B. duttonii plasmids with the exception of pl6, which is colinear to a 5-kbp plasmid in B. crocidurae. While pl6 is covered at a mean depth of coverage of 12.6× in our genome mapping, we clearly observed that noncolinear plasmids of B. duttonii and B. crocidurae are not present in our medieval strain (SI Appendix, Fig. S4).

Six newly published de novo genomes by Marosevic et al. (8) stemming from Eastern African refugees treated for relapsing fever seem to have reinforced previously raised doubts (25) regarding the length of plasmids pl124 and pl6 (NC_011263.1). The new strains all exhibit a ca. 40-kbp extension at the 5′ end of the previously reported pl124 and a ca. 1-kbp reduction at the 3′ end of pl6, much like their respective colinear plasmids pl165 in B. duttonii and a 5-kbp-long plasmid in B. crocidurae. Interestingly, our ancient B. recurrentis genome also displays signs of a longer pl124 and a shorter pl6 (SI Appendix, Figs. S4A and S10) compared with the reference strain A1. Our mapping to pl6 was completely missing 1 kbp at the 3′ end of the plasmid. Similarly, while aligning our data to pl165 of B. duttonii, we observed uniform mapping at a mean depth of coverage of 3× across the entire length of the plasmid, including the missing 40-kbp extension at the 5′ end. It is highly likely that these discrepancies can be linked to the deposition of an incomplete assembly of the original reference genome strain A1, since both our medieval European strain and the modern African strains show the same variations. This was further validated by pulse-field gel electrophoresis and the amplification of a gene ortholog to cihC in B. duttonii at 27,465–28,535 bp (26).

Upon further inspection of the plasmids, we found that three identical 582-bp-long variable short proteins (vsp) genes are missing at the 3′ end of plasmids pl33, pl37, and pl53 (Fig. 3). Strikingly, these genes mark the end of mapping for plasmid pl33 and pl37, while pl53 retains a hypothetical protein at its 3′ end. The first ca. 130 bp at the 5′ end of the vsp gene are covered by nonunique reads restricted to the gene across all three plasmids. This indicates that pl33, pl37, and pl53 are shorter in strain OSL9 and that we could be seeing signs of undetected plasmid rearrangements involving a similar vsp gene. The genes missing after the 582-bp vsp genes are copies of five variable long proteins (vlp) genes and multiple hypothetical proteins. Identical copies of a 785-bp-long vlp pseudogene are missing across all three plasmids, with the copy on pl33 being only 475 bp long. While the deposited assembly of this plasmid is known to be missing parts of its telomeric sequence, it might also be missing the end of the 785-bp vlp pseudogene and one or more of the four additional identical vlp pseudogenes missing across pl37 and pl53 (Fig. 3 and Dataset S2).

Fig. 3.

Alignment of OSL9A-B reads to B. recurrentis plasmids pl33 (A), pl37 (B), and pl53 (C). Across all plasmids a section at the 3′ end of the linear plasmids is missing starting with 80% of a vsp gene. The regions boxed in red share identities of 99–100%. Tiled reads seen within the boxed regions are nonunique reads mapping to sequences conserved across multiple plasmids.

Additionally, plasmids pl37 and pl53 have a decreased coverage in the intervals 23,822–28,471 and 39,440–44,087 bp, respectively. The sequences are both 4,648 bp long and share 99% identity over their entire length. These intervals distinguish themselves by a patchy and low coverage, with most vlp and vsp genes in the region being covered ca. 50–80% (mean coverage 75.2%), while the covered vlp and vsp genes outside these intervals show a mean coverage of 96.9%, hinting at potential sequence degradation or plasmid rearrangements. Overall, 11 vlp pseudogenes and three vsp genes are missing, with one additional vsp pseudogene and three vlp genes showing signs of potential degradation.

We investigated the presence of known frameshift and stop-gain mutations throughout our ancient genome and found that, with the exception of oppA-1, all pseudogenes, which B. recurrentis A1 acquired during its reductive evolution and divergence from B. duttonii Ly, were present in the medieval strain. Our alignments to both B. recurrentis A1 and B. duttonii Ly support (depth of coverage 5×) that the oppA-1 gene could still be active in the OSL9 strain by retaining its ancestral glutamine (residue 59) (SI Appendix, Fig. S8).

After aligning all OSL9A-B reads to the chromosome of the reference strain, B. recurrentis A1, we detected 321 SNPs (Dataset S3). These SNPs were then combined with all other SNPs found in RF Borrelia strains included in this study to build a phylogenetic tree using the maximum likelihood method. Before building the phylogeny, we checked for the presence of recombination using a phi test and ClonalFrameML software but could not detect any sign of recombination, as these approaches yielded a P value of 0.8287 and a likelihood of −612.47, respectively. The midpoint-rooted tree was generated using PhyML. The phylogeny clearly shows two clusters represented by LBRF and TBRF species (Fig. 4), and, as previously reported, the African RF isolates cluster together (8). Interestingly, the ancient genome recovered in Oslo is clustered between B. duttonii and the African B. recurrentis strains in the phylogeny. Compared with all other Borrelia in this study, 164 SNPs are specific to the OSL9 strain. These SNPs are distributed as 77 nonsynonymous SNPs, 77 synonymous SNPs, and 10 intergenic SNPs.

Fig. 4.

RL phylogeny based on chromosomal alignments (Datasets S3 and S9). The European LBRF strain (this study) is shown in red, and African LBRF strains are shown in blue. The legend displays the branch length.

Discussion

LBRF is characterized by multiple relapses of fever, which are believed to be caused by bacterial immune evasion systems. RF Borreliae use plasmid-encoded antigenic phase variation within a clonal population as a mechanism of immune evasion. It utilizes the sequential expression of different variants of an antigenic surface protein to evade the host immunity, prolong the infection, and promote transmission. Hence, each new wave of infection, in this case febrile relapse, is characterized by a new serotype (5, 6, 2729). In B. recurrentis, these proteins are the surface lipoproteins vlp and vsp, which are also known to be its main proinflammatory proteins (30). The genes expressing these proteins are arranged across plasmids in silent and expressed copies, and the interaction between the host immunity and the phase-variation mechanism encoded in the pathogen’s genome are believed to be responsible for the febrile relapses of LBRF (29). The number of phase-variation loci or loci families dictates the number of possible serotypes and, thus, the number of genome variants theoretically available to evade the host immunity, with some estimating up to 210 different states for every 20 phase-variation sites (28, 29, 31). As a result, although LBRF can be treated with antibiotics, no vaccine against the disease exists today. The only effective antibodies against LBRF seem to be directed at vlp and vsp genes and thus only can be manufactured to fight one randomly appearing serotype at a time (32).

We observed that a number of vlp and vsp genes are absent or potentially degraded in our medieval strain (Fig. 3 and Dataset S2), but are present in modern-day African strains and have orthologs in other RF genomes. Some are frameshifted or incomplete in A1 and might have been pseudogenes, but others, like the vsp copies found at the beginning of each missing interval, were fully functional genes. We can hypothesize, based on the data at hand, that the observed loss of antigenic phase-variation loci could have led to a difference in phenotype compared with modern strains and hypothetically could have influenced the number of febrile relapses, i.e., the number of serotypes that the medieval strain of the pathogen might have been capable of generating. While experimental infections by LBRF have yielded up to 10 relapses (9, 32), most sources cite up to five relapses for the pathogen (33). Similarly, most untreated non-European cases recorded during the World War I/World War II pandemics reached only up to five relapses (11, 34). Historically, Creighton (12) recorded varying numbers of observed relapses across known LBRF outbreaks. An outbreak in 18th century Dublin cites that patients were prone to relapses “even sometimes to the third” (35). Overall, most outbreaks, for which the number of observed relapses was supplied in historical texts, saw one or two or “one or more” relapses. However, it should be noted that data on modern untreated cases of LBRF are rare, and most available historical sources only refer to “multiple relapses” or “prone to relapse” (12, 36), considerably reducing the amount of data to be evaluated (36).

Compared with B. duttonii Ly, the modern reference strain B. recurrentis A1 has lost a large number of intact vlps and vsps and causes fewer relapses in human patients (5, 33), but this difference is even more pronounced in the medieval strain. The missing genes make up all archival copies of six phase-variation loci. Overall, this translates to a genome reduction of 1.2% of the pan-genome and 5.1–21% of the affected plasmids, which, in combination with a decrease in GC content (Dataset S4), is in line with observed reductive genome evolution of specialized and highly pathogenic bacteria (37, 38). The presence of additional unknown plasmids seems unlikely, as the number of plasmids tends to follow the same reductive trend as the overall genome (37). Instead, we can detect additional gene loss at the 3′ end of some plasmids.

This reductive evolution characterizes B. recurrentis’ epidemic potential and its increased virulence compared with other RFs. Similar to Bartonella quintana and Rickettsia prowazekii (38, 39), other well-known body lice-to-human transmission specialists, B. recurrentis shows accelerated rates of genome degradation caused by adaptation to host-restricted vectors and functional trade-offs, resulting in a degraded genome, reduced genome size, low coding content, and increased virulence (5, 31, 37, 38).

However, this type of evolution usually involves the loss of regulatory genes (37), and while this is also the case for our medieval strain, there is one exception. The oppA operon, which encodes for an ABC transporter, is significantly involved in the uptake of oligopeptides in many bacterial species (40, 41). OppA-1, which likely plays a critical role in host environment adaptation and essential metabolic functions (42), is a pseudogene in B. recurrentis A1 due to an in-frame stop codon. However, our medieval strain retains the ancestral glutamine much like the TBRF pathogen B. duttonii. While we can only speculate about the effect of this mutation on the ecological life cycle of the disease, it is interesting to note that the inactivation of oppA-1 seems to be much more recent than the rest of the pseudogenes in B. recurrentis. These pseudogenes are degraded to the same extent in both B. recurrentis lineages, with the exception of some antigenic phase-variation loci, which the medieval strain seems to have lost altogether. Therefore, we could hypothesize that, while both lineages have continued their reductive evolution, they have done so in different ways.

The full ecological dynamics of LBRF remain unclear (43). The European lineage presented in this study probably evolved in a distinctly different environment than known modern African strains. The extent to which anthropogenic pressure on pathogen and vector impacted this evolution is difficult to assess based on a single representative in the medieval lineage. As the only epidemic and louse-borne RF, LBRF has been assumed to be responsible for all RF epidemics recorded throughout European history. It was known to have reached epidemic proportions in susceptible populations around key events, such as wars and famines. LBRF is a disease that probably would only infrequently have resulted in the type of mass mortality that would warrant the planning of mass graves, and, historically, scattered cases of LBRF have been recorded between epidemics, usually among the poor (12). Although at the time of the burial of individual OSL9/SZ50522 the town was still affected by the economic decline caused by the Black Death in the mid-1300s (44, 45), which probably left parts of the population vulnerable to disease and malnourishment, the results reported in this study represent an isolated case of the disease. Given the available data, we cannot speculate on the presence or size of an outbreak of LBRF in 15th century Oslo, especially since the number of studied samples from the same time period is limited (Dataset S1). However, our results can tentatively corroborate the involvement of B. recurrentis in the European epidemics reported in historical text because of its being the only known epidemic RF, the specificity of the reported symptoms, the identification of the spirochete during epidemics in 1868, and the discovery of its relation to lice in 1907 (33, 46). Finally, the studied individual probably died at the height of bacteremia, allowing the detection of the pathogen in relatively high quantities within aDNA shotgun datasets.

Previously assembled ancient pathogens have generally been limited to diseases that could leave visible marks on the skeletons or mummified tissues of affected individuals (e.g., leprosy, tuberculosis, smallpox) (4749). However, recent studies on pathogens that are invisible in the archaeological record have started to emerge (48, 50, 51). These studies are of particular relevance, as they clearly illustrate the osteological paradox (52, 53). It is hoped that, as one of the few epidemic fevers believed to have played a major role in the disease landscape of historical Europe, the addition of LBRF will lead the way for the detection of diseases which might not have much relevance to Western health today and therefore are less represented in the genomic databases and literature. The unique genome of B. recurrentis has provided a rare opportunity in aDNA to study the genomic make-up of an ancient pathogen and catch a glimpse of the evolutionary process that accompanies the environmental adaptation and pathogenesis of specialized human pathogens. Furthermore, the results detailed in our study illustrate the importance of human body lice as a vector throughout European history, which has recently also been suggested for plague (54).

Future research into the ecological dynamics of LBRF and surveillance of populations affected by lice infestation in industrialized countries is needed to better understand the persistence mechanism of the disease and the dangers of spread in susceptible Western communities. Furthermore, more insights into potential outbreaks of the disease in lice-infested Western populations, as seen in Brouqui et al. (19), and genomic sequences from affected patients would allow us to determine if LBRF persisted in Europe and thus is phylogenetically related to the lineage recovered in this study or was imported on one or more occasions from outside Europe. Additional genomes, European or otherwise, may further elucidate the evolution and pathogenicity of LBRF.

Methods

Samples.

We sampled two well-preserved molars from nine individuals (Dataset S1) recovered from the site of St. Clement’s/St. Nicolay’s church graveyard in Oslo, under clean conditions at the Norwegian Institute for Cultural Heritage Research, Oslo. One exception was tooth OSL6B, which was an incisor. The individuals all stem from different periods of the graveyard, which spans the 11th to the 15th century (Dataset S1). Upon discovery of B. recurrentis reads in OSL9, all sampled individuals were screened for the presence of the pathogen via qPCR but were negative (SI Appendix, Fig. S3), including the only potentially contemporary individual OSL6/SZ19834.

Laboratory Work.

Full experimental procedures are provided in SI Appendix.

Data Preparation and Quality Filtering.

Following sequencing, the datasets were demultiplexed at the Norwegian Sequencing Centre, and quality control was performed using FastQC (55). Adapters and indices were trimmed using cutadapt2.0 (56). Sequences shorter than 30 bp and below a quality score of 10 were discarded. Trimmed reads were merged using FLASH (57).

Metagenomic Analysis.

The datasets were explored using the taxonomic classifier Kraken (58). Upon discovery of B. recurrentis reads, we further investigated the metagenomic data using MetaPhlAn2 (59) and the protein-level classification tool Kaiju (60). All tools indicated the presence of the bacterium in all datasets stemming from individual OSL9/SZ50522. No hits for B. recurrentis were found in any other samples or blank controls.

RF Borreliae Mappings.

The merged reads were mapped noncompetitively to the Borrelia recurrentis A1 strain (ASM1970v1), Borrelia duttonii Ly strain (ASM1968v1), and Borrelia crocidurae Achema strain (ASM25934v1) reference sequences using BWA-ALN (-n 0·01 -l 16500) and BWA Samse (61). The generated data were converted to bam format and sorted using SAMtools (62, 63). Duplicates were marked using Picard MarkDuplicates module (64). Indels were realigned using GATK RealignerTargetCreator and IndelRealigner modules (65, 66). Using mapDamage2.0 (67), we quantified aDNA damage patterns for each dataset and recalibrated the quality scores of likely damaged bases. Libraries from tooth B yielded significantly more sequences mapping to B. recurrentis than libraries from tooth A, with an average 11,346 reads per dataset mapping to the reference compared with an average of 3,402 reads per dataset for tooth A.

Phylogeny.

Species for which only contigs were available were assembled with Multi-CAR (68) using the reference sequences for B. recurrentis A1 and Borrelia hermsii HS1. For six recently published B. recurrentis strains (8), we mapped available Illumina reads to the B. recurrentis A1 chromosome assembly using BWA-MEM, sorted the data with SAMtools (62), marked duplicates using Picard (64), and realigned around indels with GATK (65, 66). For all BAM files, SNPs were called with SAMtools mpileup (-R -ugf -B) and BCFtools call (-vm) and filter (-s LOWQUAL -i ‘%QUAL > 19’). SNPs were annotated using snpToolkit (69). Complete genomes were compared using Parsnp (70), and polymorphic sites were extracted using gingr. Two methods, Phi test (71) and ClonalFrameML (72), were used to assess the presence of recombination in our dataset. The midpoint-rooted tree was generated using phyML.

Supplementary Material

Supplementary File
Supplementary File
Supplementary File
Supplementary File
Supplementary File
Supplementary File
Supplementary File
Supplementary File
Supplementary File

Footnotes

The authors declare no conflict of interest.

Data deposition: Sequencing data have been deposited at the European Nucleotide Archive (accession no. PRJEB24501).

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1807266115/-/DCSupplemental.

Acknowledgments

Data analysis was performed on the Abel Cluster owned by the University of Oslo and the Norwegian Metacenter for High Performance Computing and operated by the Department for Research Computing at USIT, the University of Oslo information technology department. Skeletal material was provided courtesy of the Museum of Cultural History (University of Oslo) after application to the museum and the Norwegian National Committee for Research Ethics on Human Remains. This project was funded by the European Research Council under FP7-IDEAS-ERC Program Grant 324249 MedPlag.

References

  • 1. ElbirH2012Complete genome sequence of Borrelia crociduraeJ Bacteriol19437233724[PubMed][Google Scholar]
  • 2. Fotso FotsoA2014Genome sequence of Borrelia crocidurae strain 03-02, a clinical isolate from SenegalGenome Announc2e01150-14[Google Scholar]
  • 3. GuptaRSMahmoodSAdeoluM2013A phylogenomic and molecular signature based approach for characterization of the phylum spirochaetes and its major clades: Proposal for a taxonomic revision of the phylumFront Microbiol4217[PubMed][Google Scholar]
  • 4. NorrisSJLinT2011Out of the woods: The remarkable genomes of the genus BorreliaJ Bacteriol19368126814[PubMed][Google Scholar]
  • 5. LescotM2008The genome of Borrelia recurrentis, the agent of deadly louse-borne relapsing fever, is a degraded subset of tick-borne Borrelia duttoniiPLoS Genet4e1000185[PubMed][Google Scholar]
  • 6. VidalVCutlerSScraggIGWrightDJMKwiatkowskiD2002Characterisation of silent and active genes for a variable large protein of Borrelia recurrentisBMC Infect Dis225[PubMed][Google Scholar]
  • 7. CutlerSJ1997Borrelia recurrentis characterization and comparison with relapsing-fever, Lyme-associated, and other Borrelia sppInt J Syst Bacteriol47958968[PubMed][Google Scholar]
  • 8. MarosevicD2017First insights in the variability of Borrelia recurrentis genomesPLoS Negl Trop Dis11e0005865[PubMed][Google Scholar]
  • 9. European Centre for Disease Prevention and Control2017Facts about louse-borne relapsing fever ecdc. Available at https://ecdc.europa.eu/en/louse-borne-relapsing-fever/facts. Accessed August 8, 2017
  • 10. HouhamdiLRaoultD2005Excretion of living Borrelia recurrentis in feces of infected human body liceJ Infect Dis19118981906[PubMed][Google Scholar]
  • 11. BrycesonA1969Louse-borne relapsing fever; a clinical and laboratory study of 62 cases in Ethiopia and a reconsideration of the literature. Naval Medical Research Unit No 3 APO New York 09319 Field Facility. Available at www.dtic.mil/docs/citations/AD0722494. Accessed July 12, 2017
  • 12. CreightonC1894A History of Epidemics in Britain: From the Extinction of Plague to the Present TimeCambridge Univ PressCambridge, UK
  • 13. HaysJN2005“Fevers” and the Great famine in Ireland, 1846–1850Epidemics and Pandemics: Their Impacts on Human HistoryHaysJNABC CLIOSanta Barbara, CA439448[Google Scholar]
  • 14. SparrowH1958Etude du foyer éthiopien de fièvre récurrenteBull World Health Organ19673710[PubMed][Google Scholar]
  • 15. SparrowH1962Tropical Health: A Report on a Study of Needs and ResourcesNational AcademiesWashington, DC
  • 16. MacarthurW1948Famine fevers in England and IrelandUlster Med J172833[PubMed][Google Scholar]
  • 17. CutlerSJ2010Relapsing fever–A forgotten disease revealedJ Appl Microbiol10811151122[PubMed][Google Scholar]
  • 18. European Centre for Disease Prevention and Control2015Rapid Risk Assessment: Louse-borne relapsing fever in the EU – 17 November 2015 (ECDC Stockholm). Available at https://ecdc.europa.eu/sites/portal/files/media/en/publications/Publications/louse-borne-relapsing-fever-in-eu-rapid-risk-assessment-17-nov-15.pdf. Accessed June 30, 2017
  • 19. BrouquiP2005Ectoparasitism and vector-borne diseases in 930 homeless people from MarseillesMedicine (Baltimore)846168[PubMed][Google Scholar]
  • 20. TranT-N-N2011High throughput, multiplexed pathogen detection authenticates plague waves in medieval Venice, ItalyPLoS One6e16735[PubMed][Google Scholar]
  • 21. Nguyen-HieuT2010Evidence of a louse-borne outbreak involving typhus in Douai, 1710-1712 during the war of Spanish successionPLoS One5e15405[PubMed][Google Scholar]
  • 22. KellerA2012New insights into the Tyrolean Iceman’s origin and phenotype as inferred by whole-genome sequencingNat Commun3698[PubMed][Google Scholar]
  • 23. DerrickM2018Follobaneprosjektet F04 Klypen Øst og Saxegaardsgata 15Arkeologisk Utgravning Mellom Bispegata og Loenga. Middelalderparken Og Saxegaardsgata 15 & 17, Oslo. NIKU Oppdragsrapport 40/2015Norsk institutt for kulturminneforskningOslo[Google Scholar]
  • 24. Jensen2018Osteologisk analyse av skjelettmaterialet fra Nikolaikirkens kirkegårdFollobaneprosjektet F04 Klypen Øst/Saxegaardsgata 15. NIKU Oppdragsrapport 160/2016Norsk institutt for kulturminneforskningOslo[Google Scholar]
  • 25. MillerSCPorcellaSFRaffelSJSchwanTGBarbourAG2013Large linear plasmids of Borrelia species that cause relapsing feverJ Bacteriol19536293639[PubMed][Google Scholar]
  • 26. GrosskinskyS2010Human complement regulators C4b-binding protein and C1 esterase inhibitor interact with a novel outer surface protein of Borrelia recurrentisPLoS Negl Trop Dis4e698[PubMed][Google Scholar]
  • 27. PalmerGHBankheadTSeifertHS2016Antigenic variation in bacterial pathogensMicrobiol Spectr4[Google Scholar]
  • 28. NorrisSJ2006Antigenic variation with a twist–The Borrelia storyMol Microbiol6013191322[PubMed][Google Scholar]
  • 29. FoleyJ2015Mini-review: Strategies for variation and evolution of bacterial antigensComput Struct Biotechnol J13407416[PubMed][Google Scholar]
  • 30. BarbourAGRestrepoBI2000Antigenic variation in vector-borne pathogensEmerg Infect Dis6449457[PubMed][Google Scholar]
  • 31. PallenMJWrenBW2007Bacterial pathogenomicsNature449835842[PubMed][Google Scholar]
  • 32. BarbourAGGuoBP2010Pathogenesis of relapsing feverBorrelia: Molecular Biology, Host Interaction and PathogenesisRadolfJDScott SamuelsDHorizon ScientificNorfolk, UK333357[Google Scholar]
  • 33. CutlerSJ2015Relapsing fever Borreliae: A global reviewClin Lab Med35847865[PubMed][Google Scholar]
  • 34. ChungH-LChangFC1939Relapsing fever. clinical and statistical study of 337 casesChin Med J55633[Google Scholar]
  • 35. RuttyJ1770A Chronological History of the Weather and Seasons, and of the Prevailing Diseases in Dublin: With Their Various Periods, Successions, and Revolutions, During the Space of Forty Years. With a Comparative View of the Difference of the Irish Climate and Diseases, and Those of England and Other CountriesRobinson and RobertsLondon
  • 36. LyonsRT1872A Treatise on Relapsing or Famine FeverHenry S. KingLondon
  • 37. DiopARaoultDFournierP-EDecember 27, 2017Rickettsial genomics and the paradigm of genome reduction associated with increased virulenceMicrobes Infect[PubMed][Google Scholar]
  • 38. AlsmarkCM2004The louse-borne human pathogen Bartonella quintana is a genomic derivative of the zoonotic agent Bartonella henselaeProc Natl Acad Sci USA10197169721[PubMed][Google Scholar]
  • 39. OgataH2001Mechanisms of evolution in Rickettsia conorii and R. prowazekiiScience29320932098[PubMed][Google Scholar]
  • 40. KornackiJAOliverDB1998Lyme disease-causing Borrelia species encode multiple lipoproteins homologous to peptide-binding proteins of ABC-type transportersInfect Immun6641154122[PubMed][Google Scholar]
  • 41. WangX-GLinBKidderJMTelfordSHuLT2002Effects of environmental changes on expression of the oligopeptide permease (opp) genes of Borrelia burgdorferiJ Bacteriol18461986206[PubMed][Google Scholar]
  • 42. WangX-G2004Analysis of differences in the functional properties of the substrate binding proteins of the Borrelia burgdorferi oligopeptide permease (Opp) operonJ Bacteriol1865160[PubMed][Google Scholar]
  • 43. AmanzougagheneN2017Detection of bacterial pathogens including potential new species in human head lice from MaliPLoS One12e0184621[PubMed][Google Scholar]
  • 44. NedkvitneANorsengPG2000Middelalderbyen Ved Bjørvika. Oslo 1000-1536CappelenOslo
  • 45. UngerCRHuitfeldt-KaasHJ1847Diplomatarium Norvegicum: Oldbreve Til Kundskab Om Norges Indre Og Ydre Forhold, Sprog, Slægter, Sæder, Lovgivning Og Rettergang I Middelalderen. Samling 11PT Mallings boghandels forlagChristiania, NorwayVol 5835
  • 46. SouthernPMSanfordJP1969Relapsing fever: A clinical and microbiological reviewMedicine (Baltimore)48129[Google Scholar]
  • 47. KayGL2015Eighteenth-century genomes show that mixed infections were common at time of peak tuberculosis in EuropeNat Commun66717[PubMed][Google Scholar]
  • 48. Patterson RossZ2018The paradox of HBV evolution as revealed from a 16th century mummyPLoS Pathog14e1006750[PubMed][Google Scholar]
  • 49. SchuenemannVJ2013Genome-wide comparison of medieval and modern Mycobacterium lepraeScience341179183[PubMed][Google Scholar]
  • 50. VågeneÅJ2018Salmonella enterica genomes from victims of a major sixteenth-century epidemic in MexicoNat Ecol Evol2520528[PubMed][Google Scholar]
  • 51. KayGL2014Recovery of a medieval Brucella melitensis genome using shotgun metagenomicsMBio5e01337e14[PubMed][Google Scholar]
  • 52. SiekT2013The osteological paradox and issues of interpretation in paleopathologyvis-à-vis: Explor Anthropol1292101[Google Scholar]
  • 53. WoodJW1992The osteological paradox: Problems of inferring prehistoric health from skeletal samplesCurr Anthropol33343370[Google Scholar]
  • 54. DeanKR2018Human ectoparasites and the spread of plague in Europe during the Second PandemicProc Natl Acad Sci USA11513041309[PubMed][Google Scholar]
  • 55. AndrewsS2010A quality control tool for high throughput sequence data. Available at https://www.bioinformatics.babraham.ac.uk/projects/fastqc/. Accessed August 14, 2017
  • 56. MartinM2011Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J 17:10–12
  • 57. MagočTSalzbergSL2011FLASH: Fast length adjustment of short reads to improve genome assembliesBioinformatics2729572963[PubMed][Google Scholar]
  • 58. WoodDESalzbergSL2014Kraken: Ultrafast metagenomic sequence classification using exact alignmentsGenome Biol15R46[PubMed][Google Scholar]
  • 59. SegataN2012Metagenomic microbial community profiling using unique clade-specific marker genesNat Methods9811814[PubMed][Google Scholar]
  • 60. MenzelPNgKLKroghA2016Fast and sensitive taxonomic classification for metagenomics with KaijuNat Commun711257[PubMed][Google Scholar]
  • 61. LiHDurbinR2009Fast and accurate short read alignment with Burrows-Wheeler transformBioinformatics2517541760[PubMed][Google Scholar]
  • 62. LiH1000 Genome Project Data Processing Subgroup2009The sequence alignment/map format and SAMtoolsBioinformatics2520782079[PubMed][Google Scholar]
  • 63. LiH2011A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing dataBioinformatics2729872993[PubMed][Google Scholar]
  • 64. Picard2017Available at broadinstitute.github.io/picard/. Accessed August 14, 2017
  • 65. DePristoMA2011A framework for variation discovery and genotyping using next-generation DNA sequencing dataNat Genet43491498[PubMed][Google Scholar]
  • 66. Van der AuweraGA2013From FastQ data to high confidence variant calls: The Genome Analysis Toolkit best practices pipelineCurr Protoc Bioinformatics4311.10.111.10.33[PubMed][Google Scholar]
  • 67. JónssonHGinolhacASchubertMJohnsonPLFOrlandoL2013mapDamage2.0: Fast approximate Bayesian estimates of ancient DNA damage parametersBioinformatics2916821684[PubMed][Google Scholar]
  • 68. ChenK-T2016Multi-CAR: A tool of contig scaffolding using multiple referencesBMC Bioinformatics17469[PubMed][Google Scholar]
  • 69. NamouchiA2018snpToolkit. Available at https://bitbucket.org/Amine-Namouchi/snptoolkit. Accessed June 26, 2018
  • 70. TreangenTJOndovBDKorenSPhillippyAM2014The Harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomesGenome Biol15524[PubMed][Google Scholar]
  • 71. BruenTCPhilippeHBryantD2006A simple and robust statistical test for detecting the presence of recombinationGenetics17226652681[PubMed][Google Scholar]
  • 72. DidelotXWilsonDJ2015ClonalFrameML: Efficient inference of recombination in whole bacterial genomesPLOS Comput Biol11e1004041[PubMed][Google Scholar]
Collaboration tool especially designed for Life Science professionals.Drag-and-drop any entity to your messages.