Skip to content

jlevere/docs-scraper

Repository files navigation

Microsoft Docs Scraper

Print-view Markdown scraper for Microsoft Learn.

Install

pnpm i && pnpm build

Usage

pnpm scrape -- --seed <url> --allow <prefix> --out <dir> [--conc N] [--user-agent UA] [--robots]

Example

pnpm scrape -- --seed https://learn.microsoft.com/en-us/windows-hardware/customize/desktop/unattend/ --allow https://learn.microsoft.com/en-us/windows-hardware/customize/desktop/unattend/ --out docs

Output

  • Markdown files with frontmatter in docs/
  • docs/toc.json (canonical URL list)

About

Scrape Microsoft or other docs cleanly with ctrl+p and convert to markdown

Resources

Stars

Watchers

Forks