Skip to content

philipp-eisen/adam-crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

#A.D.A.M.-Crawler

This small scrapy spider crawls the The A.D.A.M. Medical Encyclopedia which can be found here: https://www.nlm.nih.gov/medlineplus/encyclopedia.html .

This script ignores patientinstructions.

The output will be a JSON Array of JSON Objects in the following format:

[
{"Title":"<the title>",
"URL":"<the url of the article>",
"<section name>":"<section text>",

...

},

...

]

##How tu run:

first install requirements: pip install -r requirements.txt

run scrapy spider: scrapy runspider adamcrawler/adam_spider.py -o adam-articles.json

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages