Skip to content

Using Vector Space Model in Simple Tweets Database with Custom Test Cases for COVID-19 related Misinformation Data.

Notifications You must be signed in to change notification settings

devshashwat/Tweets-Vector-Space-Model

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 

Repository files navigation

TweetsVSM

Using Vector Space Model in Simple Tweets Database using Custom Test Cases.

Below are the steps to successfully run the sample project in your system:

• Open the VSMTweets Folder > src > TweetsVSM.java

• External Libraries folder contains add two jar files common-lang3.jar and opencsv-5.6.jar

• TweetsVSM also contains stopwords.txt and TweetSample.csv files in src folder.

• Before running, add two jar files common-lang3.jar and opencsv-5.6.jar to Files > Project Structures > Dependencies. ( For adding dependencies in JGrasp, Settings > Path/ClassPath > Workspace > ClassPath > add two jar files path individually from external libraries folder. )

• These are the external libraries used to read csv files in java.

• I have used a sample Tweets data here containing 1000 rows.

• When you Run and debugging keep the Tweets and stop words file in same folder as this TweetsVSM.java file.

• All the input for Tweets and stop words have functions but are called in the main function.

• All the Test Cases have functions which are called in main function with test values.

• This program gives VSM tf idf, and Search Query Test Cases for ranked search as output.

• This program gives finds similarity of the phrase search query using cosine similarity.

• The Ranked output has been sorted in descending order so that we can know which tweet is the most like the test cases we have in the input.

• There a plenty more test cases which can be used to detect the vaccine misinformation spreading users and if they are verified profiles.

• I have added the screenshot of test cases and GUI in the folder Test Cases.

• This program gives summary statistics for each Test Cases.

About

Using Vector Space Model in Simple Tweets Database with Custom Test Cases for COVID-19 related Misinformation Data.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages