Skip to content

aymansalama/NLP-and-regression-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NLP-and-regression-analysis

Author: Ayman Salama Contact: ayman3salama@gmail.com Python scripts for testing purposes

Note: Each folder contains:

  1. Python Script
  2. The data file the is used by python script. The data files sometimes are zipped because of the size.
  3. Results folder contains the output of the script. File names are descriptive as per the sent the document.Some results are zipped beacuse of the size.

The code are solving problems such as:

  1. Get the Median of each product, Megre the Median result with the product ID
  2. Get the Mean, Min and Max of each product, Megre the Mean result with the product ID
  3. Get the Best Performing Peoduct (Based on volume)
  4. Identify the most promising product using regression analysis
  5. Identify the top 5 worst performing products on a biweekly basis
  6. Identify outliers from the data and output the corresponding week numbers using normal distribution
  7. Using NLP to extract information from text like tile, duration,location, discription ..etc giving the incosistency in data.
  8. Deal with Null vs N/A values in data.
  9. Perform several text processing to extract information
  10. Store the data in Dataframe and SQL and S3 Bucket
  11. Get the similarity between several discription

About

Python code for testing purposes

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages