Skip to content

shunjizhan/MovieBase

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Project 02A May 21 2018

In this project, we complete the sanitize function by meeeeeeting all the specs the project provides by 1. Replace new lines and tab characters with a single space. 2. Remove URLs. Replace them with the empty string 3.Split text on a single space 4.Separate all external punctuations 5.Remove all punctuation 6.Convert all text to lowercase 7.return one data structure containing four collections: a string containing the parsed text, a string of all unigrams (single tokens) in any order separated by a space, a string of all bigrams 8. Fixing bugs

Thank you!

DL, SZ, ZW

Project 02B June 08 2018

We finished all the basic requirements and our deliverables meet all the basic specifications. We attempted to go for extra credit but decided to stop after hitting some obstacles since all our members have other commitments. 

In the code that we are handing in, we commented out some codes that were used to speed up our implementations such as saving and loading intermediate results. 

Most of our results seem reasonable except for the data by state. Our result data for comment positiveness and negativeness by state seems correct but when using the template to plot the result in the graph, all states are in the same color. We suspect that there is something wrong in our graphs-plotting code but we couldn't locate the error. 

Thank you!
DL, SZ, ZW

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published