Pathfindr helps you analyze linking within your website by generating a comprehensive report on internal and external links.
- Crawls a given base URL to enumerate all links.
- Ignores pages with
Content-Type
other thantext/html
. - Uses concurrent crawling for improved performance.
- Logs results to CSV file
Clone the repository:
git clone https://github.com/nronzel/pathfindr.git
Navigate to the project directory:
cd pathfindr
Install the required dependencies:
npm install
Run Pathfindr by providing the base URL you wish to crawl. This URL serves as both the starting point and scope limiter for the crawl.
npm start https://url-to-crawl.com
A report will be printed to your console upon completion.
To run the available unit tests:
npm test
- Enhance error handling to accommodate larger, more complex sites.
Include external link counts in report.doneAdd CSV export functionality.done- Implement logging for comprehensive crawling records.
- Incorporate data visualization to represent internal linking graphically.