This Repository Contains R-Codes executed on various Datasets in RStudio. I Hope This Repository is very helpful for those who are Willing to build their Career in Data Science, Big Data.
-
Updated
Aug 21, 2023 - R
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
This Repository Contains R-Codes executed on various Datasets in RStudio. I Hope This Repository is very helpful for those who are Willing to build their Career in Data Science, Big Data.
⛔ ARCHIVED ⛔ Accesses the Monkeylearn API for Text Classifiers and Extractors
Predicting NEXT word - Data Science Capstone Project by Johns Hopkins University on Coursera
Academic project for Advances in Data Science and Architecture course
Applying unsupervised learning using K-means clustering.
Warmth and Competence Detectors
Homework assignments from the course, Big Data. Topics covered include: data warehousing, linear regression, NLP, KMeans, TF-IDF, PCA, decision trees, data cleaning, and recommendation systems - UBCF and IBCF. The assingments were completed with the following tools: R, RStudio, DataGrip, MySQL, and R libraries such as ggplot2, recommenderlab, qu…
To find the accurate review and sentiment about the product and tag the opinion as positive, negative or neutral.
from kaggle natural language processing
As a customer or a potential investor who lists properties on Airbnb, one would always be interested in determining the quality of the listing. The rating score is one of the indicators which every stakeholder looks forward to, in order to gauge this metric. It is often observed that this is not an accurate indicator.
Analyzing ratings and reviews for restaurants across 31 European cities
A repo for analysing sentiments in WhatsApp Chat
Computational literature review of water resources research in Latin America and the Caribbean.
In this project, we implemented the detection algorithm (D-3 in the folder: ''doc/paper'') and correction algorithm (C-3 in the folder: ''doc/paper'') for post-processing of OCR technique.
Assignments for STAT 3106 (Machine Learning, NLP, Sentiment Analysis)
a pre-processing functions used for text cleaning in R.
textRec utlizes Latent Dirichlet Allocation and Jensen-Shannon-Divergence on the discrete probability distributions over LDA topics per document, in order to recommend unique and novel documents to specific users.
Research project to measure the firm Expected Investment Growth (EIG) based on a combination of machine learning tools and text regression.
Sentiment Analysis on Demonetization tweets
Created by Alan Turing