Skip to content

Sentiment Analysis on Fake News and Amazon Products Reviews

Notifications You must be signed in to change notification settings

AKASH2907/nlp_setiment_analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

sentiment analysis

Dataset Info

Fake-News Detection

This dataset is available at this Github repo.

This dataset has evidence sentences extracted automatically from the full-text verdict report written by journalists in Politifact. Objective of this dataset is to provide a benchmark for evidence retrieval and show empirically that including evidence information in any automatic fake news detection method (regardless of features or classifier) always results in superior performance to any method lacking such information.

Below is the description of the TSV file taken as is from the original LIAR dataset, which was published in this paper. A new column is added at the end that has the extracted justification.

  • Column 1: the ID of the statement ([ID].json).
  • Column 2: the label.
  • Column 3: the statement.
  • Column 4: the subject(s).
  • Column 5: the speaker.
  • Column 6: the speaker's job title.
  • Column 7: the state info.
  • Column 8: the party affiliation.
  • Columns 9-13: the total credit history count, including the current statement.
    • 9: barely true counts.
    • 10: false counts.
    • 11: half true counts.
    • 12: mostly true counts.
    • 13: pants on fire counts.
  • Column 14: the context (venue / location of the speech or statement).
  • Column 15: the extracted justification

The justification extraction method is done as follows:

  • Get all sentences in the 'Our Ruling' section of the report if it exists or get the last five sentences.
  • Remove any sentence that have the verdict and any verdict-related words. Verdict-related words are provided in the forbidden words file.

Releases

No releases published

Packages

No packages published