Skip to content

Master's Research code - Preterm Birth Rate among cities and its reasons

Notifications You must be signed in to change notification settings

marciojunior159/ptb_ses_research

Repository files navigation

PTB and SES Research

This repository contains the main codes/notebooks and files used in the making of the Master's research Stratification of Preterm Birth Risk in Brazil Through Unsupervised Learning Methods and Socioeconomic Data.

Summary

Preterm birth (PTB) is a phenomenon that brings risks and challenges for the survival of the newborn child. Despite many advances in research, not all the causes of PTB are yet clear. It is currently understood that PTB risk is multi-factorial and may also be associated with socioeconomic factors. In order to analyse this possible relationship, this work seeks to stratify PTB risk in Brazil using only socioeconomic data, extracting and analysing those clusters that present relevant PTB divergence, all of which will be found by automatic clustering processes using a series of unsupervised machine learning methods. Through the use of datasets made publicly available by the Federal Government of Brazil, a new dataset was generated with municipality-level socioeconomic data and a PTB occurrence rate. This dataset was processed using two separate clustering methods, both built by assembling unsupervised learning techniques, such as k-means, principal component analysis (PCA), density-based spatial clustering of applications with noise (DBSCAN), self-organising maps (SOM) and hierarchical clustering. The methods dis- covered clusters of municipalities with both high levels and low levels of PTB occurrence. The clusters with high PTB were comprised mostly of municipalities with lower levels of education, worse quality of public services – such as basic sanitation and garbage collection – and a less white population. The regional distribution of the clusters was also observed, with clusters of high PTB located mostly in the North and Northeast regions of Brazil. The results indicate a positive influence of the quality of life and the offer of public services on the reduction of PTB risk

Keywords: Preterm birth, Clustering, Unsupervised learning, PTB risk, k-Means, Self-Organising Maps, Brazil.

Organisation

PS: The notebooks containing the generated plots will be added soon

About

Master's Research code - Preterm Birth Rate among cities and its reasons

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published