Development of a repeatable process for the analysis of Reddit conversations within specific condition and/or disease state with applicable threads and subreddit threads (subreddits) to potentially inform strategy and content development. Create a simplified and repeatable process that does not require the users to be fluent in Reddit.
While Reddit offers robust, open, and community-minded discussions surrounding conditions and disease states, Reddit also provides volumes of unstructured and unclassified data. The development of a repeatable process – that continues to monitor evolving conversations over time – currently requires multiple tools (ex. – tools to scrape threads, tools to analyze keyword content, tools to analyze sentiment, etc.).
After identifying priority conditions and/or disease states with active Reddit communities (ex. – prostate cancer, breast cancer, HIV, etc.), build relational taxonomy (ex. – medicine, treatment, and adherence all have specific topics but have relational discussions) of topical themes addressed within.
- Potential Output: Provide use case for healthcare companies on the importance of Reddit as an early source of social indicator of trends and conversational “lexicon” to be used for patient communications and programs.
That was it! 😄.
Update: Work in progress to the "RedditSmartApp" solution is available via Cloning or forking the project artifact.
Click on the button below to view the code in a Web IDE. Feel free to edit the code and then run it all from your browser.
Reda Mastouri is Security Data Scientist with a passion for teaching and coaching. | Data Analytics | Machine Learning | Predictive Modeling | Data Visualization | NLP | Network Analytics | Network Security | Ethical Hacking |
He is knowledgeable and technically certified engineer with 7 years of continued hands-on experience in the implementation, administration and troubleshooting.
Kalyani Pavuluri is Data Scientist having around 8.5 years of experience in design, analysis, development, implementation and maintenance of Python, J2EE applications and open source solutions.