A collection of my attempts to learn data science and machine learning on large datasets.
To-do
The opioid project is mainly a demonstration of preprocessing techniques. It also includes basic models that try to predict Adverse Opioid Events from a variety of other data. I also used the project's code to create an "Introduction to Data Science" tutorial, which is included as a PDF.
The finance scraper scrapes the raw HTML of Yahoo Finance pages for user-supplied tickers. From this information, it constructs a dataframe of key financial data. The project uses BeautifulSoup to parse the pages and pandas to handle the data manipulation.
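
As a rough illustration of the parse-then-tabulate flow described above, here is a minimal sketch. It is not the project's actual code: the function names `parse_key_stats` and `build_frame`, the two-cell table layout, and the sample values in `SAMPLE_PAGES` are all assumptions made for the example, and Yahoo Finance's real markup may differ. The HTML is hard-coded so the snippet runs without a network request; in the real scraper the pages would be downloaded for each ticker the user enters.

```python
from bs4 import BeautifulSoup
import pandas as pd

# Hypothetical stand-in for downloaded pages, keyed by ticker.
# The markup and the numbers are made up for illustration only.
SAMPLE_PAGES = {
    "AAPL": (
        "<table>"
        "<tr><td>Previous Close</td><td>189.50</td></tr>"
        "<tr><td>Market Cap</td><td>2.95T</td></tr>"
        "</table>"
    ),
    "MSFT": (
        "<table>"
        "<tr><td>Previous Close</td><td>410.20</td></tr>"
        "<tr><td>Market Cap</td><td>3.05T</td></tr>"
        "</table>"
    ),
}


def parse_key_stats(html):
    """Collect label/value pairs from every two-cell table row."""
    soup = BeautifulSoup(html, "html.parser")
    stats = {}
    for row in soup.find_all("tr"):
        cells = row.find_all("td")
        # Assumed layout: first cell is the label, second is the value.
        if len(cells) == 2:
            label = cells[0].get_text(strip=True)
            value = cells[1].get_text(strip=True)
            stats[label] = value
    return stats


def build_frame(pages):
    """Build a dataframe with one row per ticker and one column per label."""
    rows = {ticker: parse_key_stats(html) for ticker, html in pages.items()}
    return pd.DataFrame.from_dict(rows, orient="index")


frame = build_frame(SAMPLE_PAGES)
print(frame)
```

The values are kept as strings here; converting entries such as "2.95T" to numbers would be a separate cleaning step in pandas.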