1. Lawrence, J. G. & Retchless, A. C. The interplay of homologous recombination and horizontal gene transfer in bacterial speciation. Methods Mol. Biol. 532, 29–53 (2009).

2. Fraser, C., Alm, E. J., Polz, M. F., Spratt, B. G. & Hanage, W. P. The bacterial species challenge: making sense of genetic and ecological diversity. Science 323, 741–746 (2009).

3. Staley, J. T. The bacterial species dilemma and the genomic-phylogenetic species concept. Phil. Trans. R. Soc. Lond. B 361, 1899–1909 (2006).

4. Moeller, A. H. et al. Cospeciation of gut microbiota with hominids. Science 353, 380–382 (2016).

5. Vandamme, P. et al. Polyphasic taxonomy, a consensus approach to bacterial systematics. Microbiol. Rev. 60, 407–438 (1996).

6. Cohan, F. M. & Perry, E. B. A systematics for discovering the fundamental units of bacterial diversity. Curr. Biol. 17, R373–R386 (2007).

7. Martin, J. S., Monaghan, T. M. & Wilcox, M. H. Clostridium difficile infection: epidemiology, diagnosis and understanding transmission. Nat. Rev. Gastroenterol. Hepatol. 13, 206–216 (2016).

8. Lessa, F. C., Winston, L. G., McDonald, L. C. & Emerging Infections Program C. difficile Surveillance Team. Burden of Clostridium difficile infection in the United States. N. Engl. J. Med. 372, 2369–2370 (2015).

9. Stabler, R. A. et al. Macro and micro diversity of Clostridium difficile isolates from diverse sources and geographical locations. PLoS ONE 7, e31559 (2012).

10. He, M. et al. Evolutionary dynamics of Clostridium difficile over short and long time scales. Proc. Natl Acad. Sci. USA 107, 7527–7532 (2010).

11. Drummond, A. J., Suchard, M. A., Xie, D. & Rambaut, A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol. Biol. Evol. 29, 1969–1973 (2012).

12. Jackson, M. & Spray, E. C. Health and Medicine in the Enlightenment (Oxford University Press, 2012).

13. Mostowy, R. et al. Efficient inference of recent and ancestral recombination within bacterial populations. Mol. Biol. Evol. 34, 1167–1182 (2017).

14. Lawley, T. D. et al. Proteomic and genomic characterization of highly infectious Clostridium difficile 630 spores. J. Bacteriol. 191, 5377–5386 (2009).

15. Pettit, L. J. et al. Functional genomics reveals that Clostridium difficile Spo0A coordinates sporulation, virulence and metabolism. BMC Genom. 15, 160 (2014).

16. Fimlaid, K. A. et al. Global analysis of the sporulation pathway of Clostridium difficile. PLoS Genet. 9, e1003660 (2013).

17. Lawley, T. D. et al. Use of purified Clostridium difficile spores to facilitate evaluation of health care disinfection regimens. Appl. Environ. Microbiol. 76, 6895–6900 (2010).

18. Connor, M. et al. Evolutionary clade affects resistance of Clostridium difficile spores to cold atmospheric plasma. Sci. Rep. 7, 41814 (2017).

19. Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M. & Tanabe, M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 44, D457–D462 (2016).

20. Cantarel, B. L. et al. The Carbohydrate-Active EnZymes database (CAZy): an expert resource for glycogenomics. Nucleic Acids Res. 37, D233–D238 (2009).

21. Lustig, R. H., Schmidt, L. A. & Brindis, C. D. Public health: the toxic truth about sugar. Nature 482, 27–29 (2012).

22. Collins, J. et al. Dietary trehalose enhances virulence of epidemic Clostridium difficile. Nature 553, 291–294 (2018).

23. Browne, H. P. et al. Culturing of ‘unculturable’ human microbiota reveals novel taxa and extensive sporulation. Nature 533, 543–546 (2016).

24. Merrigan, M. et al. Human hypervirulent Clostridium difficile strains exhibit increased sporulation as well as robust toxin production. J. Bacteriol. 192, 4904–4911 (2010).

25. Sebaihia, M. et al. The multidrug-resistant human pathogen Clostridium difficile has a highly mobile, mosaic genome. Nat. Genet. 38, 779–786 (2006).

26. He, M. et al. Emergence and global spread of epidemic healthcare-associated Clostridium difficile. Nat. Genet. 45, 109–113 (2013).

27. Cairns, M. D. et al. Comparative genome analysis and global phylogeny of the toxin variant Clostridium difficile PCR ribotype 017 reveals the evolution of two independent sublineages. J. Clin. Microbiol. 55, 865–876 (2017).

28. Dingle, K. E. et al. A role for tetracycline selection in recent evolution of agriculture-associated Clostridium difficile PCR ribotype 078. mBio 10, e02790-18 (2019).

29. Knetsch, C. W. et al. Zoonotic transfer of Clostridium difficile harboring antimicrobial resistance between farm animals and humans. J. Clin. Microbiol. 56, e01384-17 (2018).

30. Knight, D. R., Squire, M. M. & Riley, T. V. Nationwide surveillance study of Clostridium difficile in Australian neonatal pigs shows high prevalence and heterogeneity of PCR ribotypes. Appl. Environ. Microbiol. 81, 119–123 (2015).

31. Bauer, M. P. et al. Clostridium difficile infection in Europe: a hospital-based survey. Lancet 377, 63–73 (2011).

32. Tang, C. et al. The incidence and drug resistance of Clostridium difficile infection in Mainland China: a systematic review and meta-analysis. Sci. Rep. 6, 37865 (2016).

33. Argimon, S. et al. Microreact: visualizing and sharing data for genomic epidemiology and phylogeography. Microb. Genom. 2, e000093 (2016).

34. Croucher, N. J. et al. Rapid pneumococcal evolution in response to clinical interventions. Science 331, 430–434 (2011).

35. Harris, S. R. et al. Evolution of MRSA during hospital transmission and intercontinental spread. Science 327, 469–474 (2010).

36. Quail, M. A. et al. A large genome center’s improvements to the Illumina sequencing system. Nat. Methods 5, 1005–1010 (2008).

37. Zerbino, D. R. & Birney, E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18, 821–829 (2008).

38. Boetzer, M., Henkel, C. V., Jansen, H. J., Butler, D. & Pirovano, W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 27, 578–579 (2011).

39. Boetzer, M. & Pirovano, W. Toward almost closed genomes with GapFiller. Genome Biol. 13, R56 (2012).

40. Seemann, T. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069 (2014).

41. Chain, P. S. et al. Genome project standards in a new era of sequencing. Science 326, 236–237 (2009).

42. Page, A. J. et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics 31, 3691–3693 (2015).

43. Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).

44. Croucher, N. J. et al. Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins. Nucleic Acids Res. 43, e15 (2015).

45. Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).

46. Milne, I. et al. TOPALi v2: a rich graphical interface for evolutionary analyses of multiple alignments on HPC clusters and multi-core desktops. Bioinformatics 25, 126–127 (2009).

47. Ondov, B. D. et al. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol. 17, 132 (2016).

48. Popescu, A. A., Huber, K. T. & Paradis, E. ape 3.0: new tools for distance-based phylogenetics and evolutionary analysis in R. Bioinformatics 28, 1536–1537 (2012).

49. Letunic, I. & Bork, P. Interactive Tree Of Life v2: online annotation and display of phylogenetic trees made easy. Nucleic Acids Res. 39, W475–W478 (2011).

50. Delcher, A. L., Phillippy, A., Carlton, J. & Salzberg, S. L. Fast algorithms for large-scale genome alignment and comparison. Nucleic Acids Res. 30, 2478–2483 (2002).

51. Cheng, L., Connor, T. R., Siren, J., Aanensen, D. M. & Corander, J. Hierarchical and spatially explicit clustering of DNA sequences with BAPS software. Mol. Biol. Evol. 30, 1224–1228 (2013).

52. Tatusov, R. L., Galperin, M. Y., Natale, D. A. & Koonin, E. V. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 28, 33–36 (2000).

53. Jombart, T., Devillard, S. & Balloux, F. Discriminant analysis of principal components: a new method for the analysis of genetically structured populations. BMC Genet. 11, 94 (2010).

54. Jombart, T. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics 24, 1403–1405 (2008).

55. Yin, Y. et al. dbCAN: a web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 40, W445–W451 (2012).

56. Riley, M. Functions of the gene products of Escherichia coli. Microbiol. Rev. 57, 862–952 (1993).

57. Kanehisa, M., Sato, Y. & Morishima, K. BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences. J. Mol. Biol. 428, 726–731 (2016).

58. Finn, R. D. et al. Pfam: the protein families database. Nucleic Acids Res. 42, D222–D230 (2014).

59. Lerat, E. & Ochman, H. Recognizing the pseudogenes in bacterial genomes. Nucleic Acids Res. 33, 3125–3132 (2005).

60. Rambaut, A., Drummond, A. J., Xie, D., Baele, G. & Suchard, M. A. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 67, 901–904 (2018).

61. Karasawa, T., Ikoma, S., Yamakawa, K. & Nakamura, S. A defined growth medium for Clostridium difficile. Microbiology 141, 371–375 (1995).