Datasets which are usefully for sentimental analysis - NLP
Explore the docs »
View Demo
·
Report Bug
·
Request Feature
Table of Contents
In this repo, we are collecting datasets which are usefully for sentimental analysis - NLP. Kaggle is an online community platform for data scientists and machine learning enthusiasts. We also used web-scrapping on wikipedia website.
This are the frameworks & tools used in this repo
To get a local copy up and running follow these simple example steps.
List things you need to use the software and how to install them.
-
Python
python --version # Python 3.8.0
-
Jupiter Notebook
jupyter --version # Selected Jupyter core packages... # IPython : 8.4.0 # ipykernel : 6.15.2 # ipywidgets : not installed # jupyter_client : 7.3.5 # jupyter_core : 4.11.1 # jupyter_server : not installed # jupyterlab : not installed # nbclient : not installed # nbconvert : not installed # nbformat : not installed # notebook : not installed # qtconsole : not installed # traitlets : 5.3.0
-
Kaggle account
opendatasetsuses the Kaggle Official API for donwloading dataset from Kaggle. Follow these steps to find your API credentials:-
Sign in to https://kaggle.com/, then click on your profile picture on the top right and select "My Account" from the menu.
-
Scroll down to the "API" section and click "Create New API Token". This will download a file
kaggle.jsonwith the following contents:{ "username": "YOUR_KAGGLE_USERNAME", "key": "YOUR_KAGGLE_KEY" } -
When you run
opendatsets.download, you will be asked to enter your username & Kaggle API, which you can get from the file downloaded in step 2.
Note that you need to download the
kaggle.jsonfile only once. You can also place thekaggle.jsonfile in the same directory as the Jupyter notebook, and the credentials will be read automatically. -
- Get a free API Key at Kaggle
- Clone the repo
git clone -b develop https://github.com/nlp-project-sentiment-analysis/Get-data.git
- Install python packages
python -m pip install -r requirements.txt
In this repo, we are collecting datasets which are usefully for sentimental analysis - NLP. List of datasets are as follow:
abbreviations.csvuse to convert short-words into full and meaning full wordapostrophe.csvuse to convert apostrophe words into full wordemoji.csvuse to convert unicode emoji into english wordsemoticons.csvuse to convert special character emoji into english wordsneg_english_text.csvuse for NLP which contains all negative statementspos_english_text.csvuse for NLP which contains all positive statements
See the open issues for a list of proposed features (and known issues).
Contributions are what make the open source community such an amazing place to be learn, inspire, and create. Any contributions you make are greatly appreciated.
- Fork the Project
- Create your Feature Branch (
git checkout -b feature/AmazingFeature) - Commit your Changes (
git commit -m 'Add some AmazingFeature') - Push to the Branch (
git push origin develop:feature/AmazingFeature) - Open a Pull Request
Distributed under the MIT License. See LICENSE for more information.
- Pranav - PranavPremakumaran
- Komal - komal0608
- Karthik - KarthikUdyawar
