Welcome to the r-tika project!

This project uses the Apache Tika library to provide mimetype detection and parsing of MS Office (OLE2 and OOXML), possibly malformed HTML and XML, ODF, PDF, EPUB, MBOX and RTF documents.
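
To illustrate what content-based mimetype detection means for some of the formats listed above, here is a minimal sketch in Python. It is not Tika's implementation and not part of r-tika: the function name sniff_mime and the signature table are hypothetical, and the check only compares the leading bytes of a file against a few well-known signatures. OOXML, ODF and EPUB files are ZIP containers, so this sketch can only report them as a generic ZIP archive.

```python
# Hypothetical signature table for illustration: (leading bytes, MIME type).
# Real detectors such as Apache Tika use far richer rules than this.
SIGNATURES = [
    (b"%PDF-", "application/pdf"),
    # OLE2 compound file (legacy MS Office); the label is an unofficial one.
    (b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1", "application/x-ole-storage"),
    # ZIP container: OOXML, ODF and EPUB all start this way.
    (b"PK\x03\x04", "application/zip"),
    (b"{\\rtf", "application/rtf"),
    # MBOX files begin with a "From " separator line.
    (b"From ", "application/mbox"),
]


def sniff_mime(data: bytes) -> str:
    """Return a MIME type guessed from the first bytes of `data`."""
    for magic, mime in SIGNATURES:
        if data.startswith(magic):
            return mime
    # No signature matched: fall back to the generic binary type.
    return "application/octet-stream"


if __name__ == "__main__":
    print(sniff_mime(b"%PDF-1.7\n"))
    print(sniff_mime(b"{\\rtf1\\ansi Hello}"))
```

Telling the ZIP-based formats apart would require looking inside the archive, which is one reason to rely on a full library rather than a table like this.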

You can find the project summary page here.