Pinned
Repositories
-
- importer Public
Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allows you to perform any manipulation on the extracted text before using it in your own service or application.
- committer-core Public
Norconex Committer is a java library and command line application used to route content to local or remote target repositories, such as a search engine index.
- collector-http Public
Norconex Web Crawler (or spider) is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.
-
-
- collector-filesystem Public
Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search engines.
-
- committer-solr Public
Solr implementation of Norconex Committer. Should also work with any Solr-based products, such as LucidWorks.
-
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…