@extractus

Extractus

A set of extractor tools for devs

Welcome to Extractus

We develop and share open source tools for collecting media content.

You can use one of them, or a combination, to build news sites, create automated content systems for marketing campaigns, or gather datasets for NLP projects.

Here is an example based on our news engine.

If you have an idea, want more features, or run into a problem while using them, please create an issue.

In the future, we would like to add more dedicated tools for extracting links, tweets, audio, video, products, and crypto/stock prices.

We do not have much time. This is a self-training, non-profit side project. Contributions and collaborators are always welcome 🙂


Pinned

  1. article-extractor Public

    To extract main article from given URL with Node.js

    JavaScript · 1.7k stars · 151 forks

  2. oembed-extractor Public

    Extract oEmbed data from given webpage

    JavaScript · 117 stars · 46 forks

  3. feed-extractor Public

    Simplest way to read & normalize RSS/ATOM/JSON feed data

    JavaScript · 177 stars · 36 forks

Repositories

  • feed-extractor Public

    Simplest way to read & normalize RSS/ATOM/JSON feed data

    JavaScript · MIT · 177 stars · 36 forks · 5 issues · 0 pull requests · Updated May 14, 2025
  • article-extractor Public

    To extract main article from given URL with Node.js

    JavaScript · MIT · 1,733 stars · 151 forks · 6 issues · 1 pull request · Updated May 14, 2025
  • oembed-extractor Public

    Extract oEmbed data from given webpage

    JavaScript · MIT · 117 stars · 46 forks · 1 issue · 0 pull requests · Updated May 4, 2025
  • extractus Public
    HTML · MIT · 14 stars · 0 forks · 4 issues (1 issue needs help) · 0 pull requests · Updated Jul 25, 2024
  • .github Public

    Organization meta data

    MIT · 1 star · 0 forks · 0 issues · 0 pull requests · Updated Dec 3, 2022

Top languages

JavaScript HTML
