Repositories
- crawlers Public
Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or filesystem to various data repositories such as search engines.
- importer Public
Norconex Importer is a Java library and command-line application that parses files and extracts their content as plain text, whatever the format (HTML, PDF, Word, etc.). It also lets you manipulate the extracted text before using it in your own service or application.
- collector-filesystem Public
Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data from sources ranging from local hard drives to network locations, and sending it to various data repositories such as search engines.
- committer-solr Public
Solr implementation of Norconex Committer. Should also work with any Solr-based product, such as LucidWorks.
- committer-core Public
Norconex Committer is a Java library and command-line application used to route content to local or remote target repositories, such as a search engine index.