
Get title of a URL from its HTML

@wolfram77 released this 16 Nov 12:04 · 2 commits to main since this release

Get title of a URL from its HTML.


$ node index.js urls.txt -o titles.md
# Generated URL titles saved to titles.md

$ export DEVTOOLS_PATH="/path/to/chrome-linux/chrome"
$ export DEVTOOLS_DATA_DIR="/path/to/chrome-linux/data-dir"
$ node index.js urls.txt -o titles.md --unique
# Generated URL titles saved to titles.md, with unique URLs

$ node index.js urls.txt -o titles.md --unique --sort
# Generated URL titles saved to titles.md, with unique and sorted URLs
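
As a rough illustration of what "getting the title from HTML" means, here is a minimal sketch that pulls the `<title>` text out of an HTML string. `extractTitle` is a hypothetical helper, not code from this project; the tool itself appears to load pages through Chrome (see `$DEVTOOLS_PATH`), so its actual approach may differ.

```javascript
// Hypothetical helper (an assumption, not the project's implementation):
// returns the text of the first <title> element, or null if none is found.
function extractTitle(html) {
  // [\s\S]*? matches lazily across newlines, so multi-line titles work.
  const match = /<title\b[^>]*>([\s\S]*?)<\/title>/i.exec(html);
  if (!match) return null;
  // Collapse runs of whitespace and trim the ends.
  return match[1].replace(/\s+/g, ' ').trim();
}

const sample = '<html><head><title>\n  Example Domain\n</title></head></html>';
console.log(extractTitle(sample)); // Example Domain
```

A regular expression is enough for a sketch like this, but it does not decode HTML entities or see titles set by JavaScript after load, which is one reason to use a real browser instead.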


Usage

$ script-url-title [options] <input-file>

# Options:
#   <input-file>                Input file with URLs.
#   -o, --output <output file>  Write output to file.
#   -u, --unique                Make URLs unique.
#   -s, --sort                  Sort URLs.
#   -t, --throttle <ms>         Throttle requests.
#   -h, --help                  Show this help message.

# Environment Variables:
#   $DEVTOOLS_PATH                   Path to the Chrome executable.
#   $DEVTOOLS_DATA_DIR               Path to the Chrome data directory.
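
The `--unique` and `--sort` options can be pictured as a small preprocessing step on the list of URLs read from the input file. The sketch below is an assumption about that behavior, not the project's code; `prepareUrls` and its option names are hypothetical.

```javascript
// Hypothetical helper (an assumption, not the project's implementation):
// cleans up a list of URL strings before titles are fetched.
function prepareUrls(urls, { unique = false, sort = false } = {}) {
  // Trim each entry and drop blank lines.
  let result = urls.map((u) => u.trim()).filter((u) => u.length > 0);
  // A Set keeps only the first occurrence of each URL, in input order.
  if (unique) result = [...new Set(result)];
  // Default Array.prototype.sort compares strings by UTF-16 code units.
  if (sort) result = [...result].sort();
  return result;
}

const input = ['https://b.example/', 'https://a.example/', 'https://b.example/', ''];
console.log(prepareUrls(input, { unique: true, sort: true }));
// [ 'https://a.example/', 'https://b.example/' ]
```

Deduplicating before fetching avoids requesting the same page twice, which also matters when `--throttle` is used to space out requests.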

