Extract the article title of a HTML document
HTML JavaScript
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.
fixture
.editorconfig
.gitattributes
.gitignore
.travis.yml
cli.js
index.js
license
package.json
readme.md
test.js

readme.md

article-title Build Status

Extract the article title of a HTML document

It's often quite hard to get the actual title of an article from a page as authors either add a bunch of trash to <title> or don't use it at all. There's also no standardized way to indicate the title of an article in the markup. This module uses various ways for extracting it cleanly.

Install

$ npm install --save article-title

Usage

const articleTitle = require('article-title');
const htmlDocument = '<!doctype html><html><head><title>My awesome unicorn website</title></head><body><article><h1>How unicorns sleep</h1><p>...</p></body></html>';

articleTitle(htmlDocument);
//=> 'How unicorns sleep'

CLI

$ npm install --global article-title
$ article-title --help

  Usage
    $ article-title <file>
    $ curl <url> | article-title

  Example
    $ curl http://updates.html5rocks.com/2014/06/Automating-Web-Performance-Measurement | article-title
    Automating Web Performance Measurement

License

MIT © Sindre Sorhus