Skip to content

Crawls a GitHub repo for any spelling errors/typos in HTML and MD files

Notifications You must be signed in to change notification settings

yeoji/github-typo-crawler

Repository files navigation

GitHub Typo Crawler

This is a simple Node.js script that will crawl through a GitHub repo, checking for any spelling errors/typos.

It only analyzes .html and .md files.

This project uses yeoji/Proofreader which is forked from kdzwinel/Proofreader.

Requirements

node v8

Usage

  1. Clone the repository
git clone git@github.com:yeoji/github-typo-crawler.git
  1. Install dependencies
yarn install
  1. Run crawler on GitHub repo
# REPO is in the format username/repo (eg. yeoji/github-typo-crawler)
yarn crawl <REPO>

Custom Ignored Words

Because adding custom dictionaries did not work on kdzwinel/Proofreader, I have added a post-processor.

To add your own words to the list of words to ignore in spellchecks, create a new file under dictionaries and start building your list (separated by a newline).

The words will be picked up and filtered out from the final result.

Ignored files

You can define files to ignore in a file tree by appending it into the ignoredfiles.txt file. The filename provided must be the exact match of the file to be excluded from checking.

For your convenience, CHANGELOG.md is ignored by default (as specified in the file).

About

Crawls a GitHub repo for any spelling errors/typos in HTML and MD files

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published