In this blog post, I would like to share the Yearly Bills of Mortality dataset that I recently generated. The Bills of Mortality were a mortality statistic for London and were first published in the 17th century. Besides the number of deaths, they also provide information about the cause of death since 1629. The first time I read about these documents was in a newspaper article in The Telegraph about the absurdly sounding causes of death such as itch, lethargy, or grief.

Interestingly, Google digitized the book “Collection of Yearly Bills of Mortality, from 1657 to 1758 Inclusive” and made it available through Google Books . In addition to that, archive.org offers those scans in a variety of formats. Unfortunately, Google’s optical character recognition (OCR) does not work particularly well on historical documents especially if the data is presented in tables, which makes searching the PDF and analyzing the changes in certain numbers difficult.

Finding those kinds of statistics rather intriguing, I considered the Yearly Bills of Mortality to be a perfect test case for manual data entry services that I am currently testing. The task was to convert the “Diseases and Casualties” table of 102 pages into an Excel table. In the case of different variants of the same causes of death (e.g., hang’d and hanged), a consistent form of spelling was used.

If you are also interested in this subject or just want to explore what killed Londoners between the years 1657 and 1758, you can download the complete dataset here CSV file . Additionally, I uploaded the data to Tableau Public ; you can try it out in the interactive graph below. Please note that I of course cannot guarantee the accuracy of the dataset.



