Twitter-sentiment-analysis

What is Sentiment Analysis?

Sentiment analysis (also known as opinion mining) is one of the many applications of Natural Language Processing. It is a set of methods and techniques used for extracting subjective information from text or speech, such as opinions or attitudes. In simple terms, it involves classifying a piece of text as positive, negative or neutral.

The objective of this task is to detect hate speech in tweets. For the sake of simplicity, we say a tweet contains hate speech if it has a racist or sexist sentiment associated with it. So, the task is to classify racist or sexist tweets from other tweets
We are given a training sample of tweets and labels, where label ‘1’ denotes the tweet is racist/ sexist and label ‘0’ denotes the tweet is not racist/sexist.

Evaluation Metric:

The metric used for evaluating the performance of classification model would be F1-Score.

Data

Our overall collection of tweets was split in the ratio of 65:35 into training and testing data. Out of the testing data, 30% is public and the rest is private.

Data Files

train.csv - For training the models, we provide a labelled dataset of 31,962 tweets. The dataset is provided in the form of a csv file with each line storing a tweet id, its label and the tweet. There is 1 test file (public)
test_tweets.csv - The test data file contains only tweet ids and the tweet text with each tweet in a new line.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.ipynb_checkpoints		.ipynb_checkpoints
1549269497113.png		1549269497113.png
README.md		README.md
Twitter_sentiment_analysis.ipynb		Twitter_sentiment_analysis.ipynb
gfg.py		gfg.py
test_tweets_anuFYb8.csv		test_tweets_anuFYb8.csv
train_E6oV3lV.csv		train_E6oV3lV.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.ipynb_checkpoints

.ipynb_checkpoints

1549269497113.png

1549269497113.png

README.md

README.md