A Java application based on the Lucene 8.0.0 API that, given a .txt file of a book, returns the book's title and date along with their offsets

Coursal/Lucene-Book-Info-Finder

Lucene-Book-Info-Finder

A Java application based on the Lucene 8.0.0 API that, given a .txt file of a book, returns the book's title and date along with their offsets.

  1. An analyzer splits the text of the file into tokens
  2. The first few tokens (up to 1000 characters in) are searched for labels such as Title: and Date:
  3. The text right after each label is printed to the screen together with its offsets
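
The steps above can be illustrated with a minimal sketch. It is not the repository's code: to stay free of dependencies it swaps the Lucene analyzer for a plain java.util.regex pattern, and the class name LabelScanSketch, the Field type, and the rule that a value runs to the end of its line are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch: a regex stands in for the Lucene analyzer used by the real tool.
public class LabelScanSketch {

    /** A labelled value plus the character offsets of that value in the text. */
    public static final class Field {
        public final String label;
        public final String value;
        public final int start; // inclusive offset of the value
        public final int end;   // exclusive offset of the value

        Field(String label, String value, int start, int end) {
            this.label = label;
            this.value = value;
            this.start = start;
            this.end = end;
        }
    }

    // Only the head of the file is scanned, mirroring the 1000-character limit.
    static final int SCAN_LIMIT = 1000;

    // A label, a colon, then the rest of the line (trimmed) as the value.
    // Assumption: the value ends at the end of the line.
    private static final Pattern LABEL_LINE =
            Pattern.compile("(?m)\\b(Title|Date):[ \\t]*(\\S.*?)[ \\t]*$");

    /** Returns every Title:/Date: value found in the first SCAN_LIMIT characters. */
    public static List<Field> scan(String text) {
        String head = text.substring(0, Math.min(text.length(), SCAN_LIMIT));
        List<Field> fields = new ArrayList<>();
        Matcher m = LABEL_LINE.matcher(head);
        while (m.find()) {
            // Group 2 is the value; its start/end are offsets into the original text.
            fields.add(new Field(m.group(1), m.group(2), m.start(2), m.end(2)));
        }
        return fields;
    }

    public static void main(String[] args) {
        String sample = "Title: Moby Dick\nRelease Date: June, 2001\n";
        for (Field f : scan(sample)) {
            System.out.println(f.label + ": " + f.value + " [" + f.start + ", " + f.end + ")");
        }
    }
}
```

Offsets here are character positions in the input string, with the end offset exclusive, so text.substring(start, end) gives back the value. Offsets reported by the actual application may follow a different convention.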

Execution

java book_info_finder.BookInfoFinder <.txt file path>

Examples

Moby Dick

Metamorphosis

The Odyssey

.txt book files taken from Project Gutenberg.

Essential Lucene 8.0.0 API .jar files included.
