Skip to content

nlp-project-sentiment-analysis/Get-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PyPI - Python Version MIT License


Logo

Get data

Datasets which are usefully for sentimental analysis - NLP
Explore the docs »

View Demo · Report Bug · Request Feature

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Roadmap
  5. Contributing
  6. License
  7. Contributors
  8. Acknowledgements

About The Repository

Product Name Screen Shot

In this repo, we are collecting datasets which are usefully for sentimental analysis - NLP. Kaggle is an online community platform for data scientists and machine learning enthusiasts. We also used web-scrapping on wikipedia website.

Built With

This are the frameworks & tools used in this repo

Getting Started

To get a local copy up and running follow these simple example steps.

Prerequisites

List things you need to use the software and how to install them.

  • Python

    python --version
    # Python 3.8.0
  • Jupiter Notebook

    jupyter --version
    # Selected Jupyter core packages...
    # IPython          : 8.4.0
    # ipykernel        : 6.15.2
    # ipywidgets       : not installed
    # jupyter_client   : 7.3.5
    # jupyter_core     : 4.11.1
    # jupyter_server   : not installed
    # jupyterlab       : not installed
    # nbclient         : not installed
    # nbconvert        : not installed
    # nbformat         : not installed
    # notebook         : not installed
    # qtconsole        : not installed
    # traitlets        : 5.3.0
  • Kaggle account

    opendatasets uses the Kaggle Official API for donwloading dataset from Kaggle. Follow these steps to find your API credentials:

    1. Sign in to https://kaggle.com/, then click on your profile picture on the top right and select "My Account" from the menu.

    2. Scroll down to the "API" section and click "Create New API Token". This will download a file kaggle.json with the following contents:

      { "username": "YOUR_KAGGLE_USERNAME", "key": "YOUR_KAGGLE_KEY" }
    3. When you run opendatsets.download, you will be asked to enter your username & Kaggle API, which you can get from the file downloaded in step 2.

    Note that you need to download the kaggle.json file only once. You can also place the kaggle.json file in the same directory as the Jupyter notebook, and the credentials will be read automatically.

Installation

  1. Get a free API Key at Kaggle
  2. Clone the repo
    git clone -b develop https://github.com/nlp-project-sentiment-analysis/Get-data.git
  3. Install python packages
    python -m pip install -r requirements.txt

Usage

In this repo, we are collecting datasets which are usefully for sentimental analysis - NLP. List of datasets are as follow:

  • abbreviations.csv use to convert short-words into full and meaning full word
  • apostrophe.csv use to convert apostrophe words into full word
  • emoji.csv use to convert unicode emoji into english words
  • emoticons.csv use to convert special character emoji into english words
  • neg_english_text.csv use for NLP which contains all negative statements
  • pos_english_text.csv use for NLP which contains all positive statements

Roadmap

See the open issues for a list of proposed features (and known issues).

Contributing

Contributions are what make the open source community such an amazing place to be learn, inspire, and create. Any contributions you make are greatly appreciated.

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin develop:feature/AmazingFeature)
  5. Open a Pull Request

License

Distributed under the MIT License. See LICENSE for more information.

Contributors

Acknowledgements

About

Get required datasets for sentiment analysis

Resources

License

Stars

Watchers

Forks

Packages

No packages published