Skip to content

Pinned

  1. Norconex Web Crawler (or spider) is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.

    Java 153 63

  2. Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search …

    Java 19 11

  3. importer Public

    Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allo…

    Java 28 23

Repositories

  • commons-lang Public

    Generic library shared between several projects.

    Java 11 Apache-2.0 6 0 0 Updated Apr 1, 2023
  • importer Public

    Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allows you to perform any manipulation on the extracted text before using it in your own service or application.

    Java 28 Apache-2.0 23 14 0 Updated Mar 3, 2023
  • committer-core Public

    Norconex Committer is a java library and command line application used to route content to local or remote target repositories, such as a search engine index.

    Java 3 Apache-2.0 11 5 0 Updated Feb 8, 2023
  • collector-http Public

    Norconex Web Crawler (or spider) is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.

    Java 153 Apache-2.0 63 21 0 Updated Feb 6, 2023
  • commons-maven-parent Public

    Maven parent POM for many Norconex Maven projects.

    JavaScript 0 Apache-2.0 2 0 0 Updated Feb 6, 2023
  • collector-core Public

    Collector-related code shared between different collector implementations

    Java 6 Apache-2.0 15 6 1 Updated Feb 6, 2023
  • collector-filesystem Public

    Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search engines.

    Java 19 11 6 3 Updated Dec 6, 2022
  • committer-sql Public

    Implementation of Norconex Committer for SQL (JDBC) databases.

    Java 1 Apache-2.0 5 1 0 Updated Feb 5, 2022
  • committer-solr Public

    Solr implementation of Norconex Committer. Should also work with any Solr-based products, such as LucidWorks.

    Java 3 Apache-2.0 5 8 0 Updated Jan 5, 2022
  • committer-neo4j Public

    Implementation of Norconex Committer for Neo4j.

    Java 2 Apache-2.0 1 2 0 Updated Jan 4, 2022

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…