Skip to content

Various experiments related to information extraction.

Notifications You must be signed in to change notification settings

LQR471814/research-toolkit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

research-toolkit

Various experiments relating to information extraction.

Experiments

AX Extract

Extract text formatted as markdown from webpages using headless chrome and the accessibility tree. This is far more accurate for generic parsing as all websites that have decent SEO scores should be able to be parsed this way.

About

Various experiments related to information extraction.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages