Skip to content

Releases: CodeSante/medical-crawler

Medical Crawler - Version 0.0.1

15 May 12:41
Compare
Choose a tag to compare

Medical Crawler - Version 0.0.1

We are thrilled to announce the first release of Medical Crawler, an open-source project designed specifically for web crawling and data extraction from the Vidal.fr website.

What is Medical Crawler?

Medical Crawler is a powerful tool that enables exploration and extraction of relevant medical information from the Vidal.fr website. Leveraging crawling techniques, it analyzes web pages, extracts relevant HTML tags, and collects valuable medical data.

What's New in Version 0.0.1

  • Integrated Vidal.fr Crawler: Our crawler has extensively traversed the Vidal.fr website, respecting server load limits and excluding irrelevant media. We have obtained a complete backup of the collected data, which is now available for future use.

How to Use the Vidal.fr Crawling Backup?

The Vidal.fr crawling backup is provided in a compressed format for ease of use. You can download the backup file and restore the extracted data using the following PostgreSQL command:

sudo -u postgres pg_restore -U crawler -h localhost -p 5432 -d medical_crawler -F c backup

Please ensure that PostgreSQL is installed on your system and you have the necessary permissions to execute this command.