maxcom/tikaserver-ex

This is a JAX-RS Tika server for https://issues.apache.org/jira/browse/TIKA-593

Building
--------
Please build and install the latest Tika snapshot from SVN trunk first.

Test data files are not available right now (I need some time to check and remove private data), so skip
the tests when building: "mvn -Dmaven.test.skip=true install".

Running
-------
java -jar target/tikaserver-1.5-SNAPSHOT.jar

Usage
-----

Usage examples from the command line with the curl utility:

1) Extract text:

curl -T price.xls https://localhost:9998/tika

2) Extract text with mime-type hint:

curl -v -H "Content-type: application/vnd.openxmlformats-officedocument.wordprocessingml.document" -T document.docx https://localhost:9998/tika

3) Get all document attachments as ZIP-file:

curl -v -T Doc1_ole.doc https://localhost:9998/unpacker > /var/tmp/x.zip

4) Extract metadata in CSV format:

curl -T price.xls https://localhost:9998/meta
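
The four calls above share one pattern: upload a file with HTTP PUT (which is what curl -T does) to a resource under the server address. A minimal sketch of a wrapper for that pattern follows. The names TIKA_URL, tika_endpoint and tika_put are hypothetical helpers invented for this example, not part of the server; the default address is simply the one used in the examples above.

```shell
# Assumed variable: base address of the server, defaulting to the one used above.
TIKA_URL="${TIKA_URL:-https://localhost:9998}"

# Build the URL for a resource name (tika, meta, unpacker).
# A trailing slash on TIKA_URL is dropped so the result has exactly one.
tika_endpoint() {
    printf '%s/%s\n' "${TIKA_URL%/}" "$1"
}

# Upload a file with HTTP PUT and write the response body to stdout.
# Usage: tika_put <resource> <file> [mime-type]
tika_put() {
    resource=$1
    file=$2
    mime=${3:-}
    if [ -n "$mime" ]; then
        # Optional mime-type hint, as in example 2.
        curl -s -H "Content-type: $mime" -T "$file" "$(tika_endpoint "$resource")"
    else
        curl -s -T "$file" "$(tika_endpoint "$resource")"
    fi
}
```

With these helpers, example 4 could be written as "tika_put meta price.xls" and example 3 as "tika_put unpacker Doc1_ole.doc > /var/tmp/x.zip".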

HTTP Codes
----------
200 - OK
204 - No content (for example when we are unpacking file without attachments)
415 - Unknown file type
422 - Unparsable document of known type (password protected documents and unsupported versions like BIFF5 Excel)
500 - Internal error
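
A script calling the server will usually want to branch on these codes rather than read them from curl -v output. The sketch below shows one possible way to do that with curl's -w '%{http_code}' option. The function names tika_status_message and tika_extract are hypothetical and exist only for this example; the messages are paraphrased from the list above.

```shell
# Map a status code from the list above to a short message.
tika_status_message() {
    case "$1" in
        200) echo "ok" ;;
        204) echo "no content (e.g. no attachments to unpack)" ;;
        415) echo "unknown file type" ;;
        422) echo "unparsable document of known type" ;;
        500) echo "internal server error" ;;
        *)   echo "unexpected status $1" ;;
    esac
}

# Hypothetical wrapper: PUT a file, save the response body, report the status.
# Usage: tika_extract <url> <file> <output>
# Returns 0 only for status 200, and 2 if curl itself fails (e.g. server not running).
tika_extract() {
    code=$(curl -s -o "$3" -w '%{http_code}' -T "$2" "$1") || return 2
    echo "$code: $(tika_status_message "$code")" >&2
    [ "$code" = 200 ]
}
```

For example, "tika_extract https://localhost:9998/tika price.xls price.txt" would write the extracted text to price.txt and print the status line to stderr.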
