Skip to content

bsvab/python-plotting-challenge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Module 5 Challenge

Background / Scenario

You've just joined Pymaceuticals, Inc., a new pharmaceutical company that specializes in anti-cancer medications. Recently, it began screening for potential treatments for squamous cell carcinoma (SCC), a commonly occurring form of skin cancer.

As a senior data analyst at the company, you've been given access to the complete data from their most recent animal study. In this study, 249 mice who were identified with SCC tumors received treatment with a range of drug regimens. Over the course of 45 days, tumor development was observed and measured. The purpose of this study was to compare the performance of Pymaceuticals’ drug of interest, Capomulin, against the other treatment regimens.

The executive team has tasked you with generating all of the tables and figures needed for the technical report of the clinical study. They have also asked you for a top-level summary of the study results.

Data Preparation

Final Script Outputs:
Results

Summary Statistics

Final Script Outputs:
Results

Bar & Pie Charts

Final Script Outputs:
Results Results

Box Plot

Final Script Outputs:
Results

Line & Scatter Plots

Final Script Outputs:
Results Results

Correlation & Regression

Final Script Outputs:
Results

Analysis

  • Looking at the summary statistics by drug regimen, it is notable that of the 10 regimens, all mean and median values are in the 50-52 range except for two. Capomulin and Ramicane are both in the 40-41 range, roughly 10mm3 less than the other regimens. This may be indicative of these regimens being more effective at shrinking tumors than their counterparts.
  • The above observation is also reinforced when observing the box plot that was generated showing the lower tumor volume for the Capomulin and Ramicane regimens.
  • When observing the line graph of Capomulin treatment of mouse l509 over time, it shows that tumor volume drops over time with just a few blips of growth. Collectively the trend is showing shrinkage over time.
  • Lastly the linear regression plotted on mouse weight vs average tumor volume shows that there is a correlation between those two variables. Because of this, further analysis is needed to see whether the apparent positive results for Capomulin and Ramicane are being skewed by having smaller mice in their datasets on average. If so, the data will need to be controlled against mouse weight discrepancies between regimens.

References

Data generated by Mockaroo, LLC , (2022). Realistic Data Generator. Data for this dataset was generated by edX Boot Camps LLC, and is intended for educational purposes only. 📚



UTlogo

About

Module 5 Challenge

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors