Skip to content

Sam-Osian/PFD-toolkit

Repository files navigation

Python PyTest Licence DOI

PFD Toolkit

Turn raw PFD reports into structured insights — fast.

PFD Toolkit is a suite of tools that replaces the manual effort involved in the collection, screening, and thematic discovery of PFD reports. It helps researchers, journalists, and public health analysts turn raw reports into actionable insights.

For more information, please consult package documentation.

Getting started

Installation

pip install pfd_toolkit

Load PFD Report Data (in seconds)

To load PFD data, just import the module, specify the category of reports and your date-range:

from pfd_toolkit import load_reports

reports = load_reports(
    start_date="2024-01-01",
    end_date="2025-01-01"
)

reports will be a pandas DataFrame. Each row is a separate report, and each column is a report section. For example:

url date coroner area receiver investigation circumstances concerns
[...] 2025-05-01 A. Hodson Birmingham and... NHS England; The Rob... On 9th December 2024... At 10.45am on 23rd November... To The Robert Jones...
[...] 2025-04-30 J. Andrews West Sussex, Br... West Sussex C... On 2 November 2024 I... They drove their car into... The inquest was told t...
[...] 2025-04-30 A. Mutch Manchester Sou... Fluxton Road Medical... On 1 October 2024 I... They were prescribed long... The inquest heard evide...
[...] 2025-04-25 J. Heath North Yorkshire... Townhead Surgery On 4th June 2024 I... On 15 March 2024, Richar... When a referral docume...
[...] 2025-04-25 M. Hassell Inner North Lo... The President Royal... On 23 August 2024, on... They were a big baby and... With the benefit of a m...

PFD Toolkit updates each week with freshly published reports. To retrieve the latest reports, run load_reports() with refresh:

reports = load_reports(refresh=True)

Key features

Beyond loading reports, PFD Toolkit lets you:

  • Screen reports: find cases relevant to your specific research question.
  • Summarise text: distill full-length reports into a custom summary.
  • Discover themes: uncover recurring topics contained within a selection of reports.
  • Categorise: assign and tabulate reports by discovered or user-defined themes.

To get started with these features, please check out the documentation.

Licence

This project is distributed under the GNU Affero General Public License v3.0 (AGPL-3.0). See the LICENCE file for the full text.

Please note:

  • You are welcome to use, modify, and share this code under the terms of the AGPL-3.0.
  • If you use this code to provide a networked service, you are required to make the complete source code available to users of that service.
  • Some project dependencies may have their own licence terms, which could affect certain types of non-research use (e.g. commercial use). Please review all relevant licences to ensure compliance.

Collaborate

We welcome feedback as well as code collaborators! Please read our collaboration page here

How to cite

If you use PFD Toolkit in your research, please cite the archived release:

Osian, S., & Pytches, J. (2025). PFD Toolkit: Unlocking Prevention of Future Death Reports for Research (Version 0.3.5) [Software]. Zenodo. https://doi.org/10.5281/zenodo.15729717

Or, in BibTeX:

@software{osian2025pfdtoolkit,
  author       = {Sam Osian and Jonathan Pytches},
  title        = {PFD Toolkit: Unlocking Prevention of Future Death Reports for Research},
  year         = {2025},
  version      = {0.3.5},
  doi          = {10.5281/zenodo.15729717},
  url          = {https://github.com/sam-osian/PFD-toolkit}
}

About

A toolkit for using Prevention of Future Death (PFD) Reports for Research and Policy Projects.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 3

  •  
  •  
  •