Skip to content

a-bagrov/FactivaParser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Factiva Article Parser

Simple parser for Factiva articles. Can be useful for further analyzing with TopicMiner_LINIS.

Usage:

  1. Open 100 articles from Factiva in one page, save HTML page. Repeat as many times you wish. You can store your HTML files in different folders, next to each other.
  2. Run FactivaHtmlParser.Core.exe from cmd with arguments:
  • -p - Set path where factiva html files would be searched.
  • -d - Set depth of finding folders with factiva html data.
  1. Parser will find all *.html files, and write text content of each article to separate files with file name like article title. Files will be stored in 'out' directory, next to original articles located.

Releases

No releases published

Packages

No packages published

Languages