Skip to content

greyatomtestuser3/nlp_day_01_project

 
 

Repository files navigation

Natural Language Processing Project

NLP is one of the components of Artificial intelligence which is the ability of a computer program to understand the human speech.

Now, let's dig deep to learn how we can make a fully fledged model to understand textual data.

What have we learnt so far

In today's lecture we learned some of the core concepts invloved in Natural language processing such as :-

  • Sentence Splitter
  • Tokenization
  • POS (part of speech tagging)
  • Stop-word Removal
  • Stemming
  • Lemmatization
  • Chunking
  • Deep Parsing
  • Word Cloud
  • TF-IDF

Why solve this assignment?

Lets review some of the concepts taught in the class. At the end of the assignment you'll be able to:

  • Implement the necessary preprocessing steps required while processing natural language.
  • Show contextual understanding learned by machine.

Dataset

For this exercise, we will use the Twenty Newsgroups Data Set,which comprises of 20000 messages taken from 20 newsgroups.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 90.7%
  • Jupyter Notebook 9.3%