ptorres001/README.md

Thanks for stopping by!

My name is Paul Torres.

I worked in physics, focusing on atmospheric physics research, before turning to data science full time. I am interested in telling stories using data. I spend my time learning new machine learning topics and writing articles on Medium about the techniques I learn.

I've written articles on everything from past projects, to algorithm reviews for further learning, to deep learning concepts like pseudo-supervised learning, and I plan to keep learning by doing and passing that knowledge on to the wider community.

As a former research scientist, I've found that exploratory analysis and using the data to build a narrative are the most exciting aspects of data science.

Projects I am currently working on include:

  • Tanzanian Water Pump Hackathon
  • Automated text extractor for transferring information from a detailed report to a tabular sheet format
  • Topic Modeling and Basic NLP Analysis of the Harry Potter Series

You can also find me at:

  • LinkedIn
  • Medium
  • Twitter

Pinned

  1. cluster_gentrification_new_york Public

    Cluster Classification using PCA, KMeans, and Hierarchical Agglomerative Clustering on 2000-2010 census data.

    Jupyter Notebook 6 3

  2. census_HI_classification_model Public

    Logistic Regression on Census Survey data to predict whether a person was covered by health insurance.

    Jupyter Notebook

  3. dolcikey/TimeSeriesForecasting_BeijingAirQuality Public

    Time Series/Forecasting Project using Beijing Air Quality Data

    Jupyter Notebook 3 1

  4. MLB_Salary_prediction_linear_regression Public

    Using Linear Regression to find the best indicators of increased salary for rookie arbitration hearings.

    Jupyter Notebook

  5. DS_movie_EDA Public

    Forked from stereopickle/DS_movie_EDA

    Preliminary exploratory data analyses of various movie data, with the intention of making business recommendations for a new movie studio.

    Jupyter Notebook 1 1

  6. nyc_food_deserts Public

    Web scraping data in order to cluster and classify food deserts in New York City.

    Jupyter Notebook 1 1