Skip to content

mheap/url-tagger

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

url-tagger

Provide a list of URLs, regexes and tags and have the URL returned with associated tags

Usage

const tagger = new UrlTagger(
  {
    "is-https": "^https://",
    "is-michael-domain": "michaelheap.com",
    "is-michael-content": "I like to learn and I like to teach"
  },
  {
    url: {
      "michael-url": ["is-michael-domain"],
      "michael-secure-url": [["is-michael-domain", "is-https"]]
    },
    content: {
      "michael-content": ["is-michael-content"]
    }
  }
);

await tagger.run("http://michaelheap.com");
# ["michael-content", "michael-url"];

await tagger.run("https://michaelheap.com");
# ["michael-content", "michael-url", "michael-secure-url"];

HTML Caching

To save you hitting the URLs you're testing too hard, URL Tagger allows you to cache the returned HTML using cacheman.

To use a cache, provide a third parameter when creating a URLTagger instance. The options provided are passed directly through to cacheman.

Here is how to use a file based cache:

const tagger = new UrlTagger(
  regexes,
  rules,
  {
    engine: 'file',
    tmpDir: '/tmp/url-tagger'
  }
);

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Packages

No packages published