@extractus

Extractus

A set of extractor tools for devs

Welcome to Extractus

We develop and share open source tools for collecting media content.

You can use one tool or a combination of them to build news sites, create automated content systems for marketing campaigns, or gather datasets for NLP projects.

Here is an example based on our news engine.

If you have an idea, want more features, or run into a problem while using them, please create an issue.

In the future, we would like to add more dedicated tools for extracting links, tweets, audio, videos, products, and crypto/stock prices.

We do not have much time. This is a self-training, non-profit side project. Contributions and collaborators are always welcome 🙂


Pinned

  1. article-extractor Public

    To extract main article from given URL with Node.js

    JavaScript, 1.7k stars, 147 forks

  2. oembed-extractor Public

    Extract oEmbed data from given webpage

    JavaScript, 112 stars, 45 forks

  3. feed-extractor Public

    Simplest way to read & normalize RSS/ATOM/JSON feed data

    JavaScript, 173 stars, 34 forks

Repositories

  • article-extractor Public

    To extract main article from given URL with Node.js

    JavaScript, 1,674 stars, MIT license, 147 forks, 5 open issues, 1 open pull request, updated Feb 9, 2025
  • feed-extractor Public

    Simplest way to read & normalize RSS/ATOM/JSON feed data

    JavaScript, 173 stars, MIT license, 34 forks, 5 open issues, 0 open pull requests, updated Feb 9, 2025
  • oembed-extractor Public

    Extract oEmbed data from given webpage

    JavaScript, 112 stars, MIT license, 45 forks, 0 open issues, 0 open pull requests, updated Feb 9, 2025
  • extractus Public
    HTML, 14 stars, MIT license, 0 forks, 4 open issues (1 issue needs help), 0 open pull requests, updated Jul 25, 2024
  • .github Public

    Organization metadata

    1 star, MIT license, 0 forks, 0 open issues, 0 open pull requests, updated Dec 3, 2022
