An Autopsy file ingest module that uses Apache Tika to detect the language of common document files.
It currently attempts to process files with the following extensions:
- .doc
- .docx
- .xls
- .xlsx
- .ppt
- .pptx
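
The extension check described above could be sketched roughly as follows. This is a minimal illustration and not the module's actual code: the names `SUPPORTED_EXTENSIONS` and `is_supported` are hypothetical, and case-insensitive matching is an assumption.

```python
import os

# Extensions taken from the list above.
# Matching them case-insensitively is an assumption of this sketch.
SUPPORTED_EXTENSIONS = frozenset(
    {".doc", ".docx", ".xls", ".xlsx", ".ppt", ".pptx"}
)


def is_supported(file_name):
    """Return True if file_name ends with one of the listed extensions."""
    _, ext = os.path.splitext(file_name)
    return ext.lower() in SUPPORTED_EXTENSIONS


if __name__ == "__main__":
    for name in ("report.DOCX", "budget.xls", "notes.txt"):
        print(name, is_supported(name))
```

Files that fail this check would simply be skipped by the module rather than passed to Tika.
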
Tika | Supported | Language | Models |
---|---|---|---|
Belarusian | Catalan | Danish | German |
Esperanto | Estonian | Greek | English |
Spanish | Finnish | French | Persian |
Galician | Hungarian | Icelandic | Italian |
Lithuanian | Dutch | Norwegian | Polish |
Portuguese | Romanian | Russian | Slovakian |
Slovenian | Swedish | Thai | Ukrainian |
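
If the detector returns two-letter ISO 639-1 codes (an assumption here; check the Tika version in use), the codes can be mapped back to the language names in the table with a small lookup. The names `LANGUAGE_NAMES` and `language_name` are hypothetical and only serve this sketch.

```python
# ISO 639-1 codes for the languages in the table above.
# That the detector reports these exact codes is an assumption.
LANGUAGE_NAMES = {
    "be": "Belarusian", "ca": "Catalan", "da": "Danish", "de": "German",
    "eo": "Esperanto", "et": "Estonian", "el": "Greek", "en": "English",
    "es": "Spanish", "fi": "Finnish", "fr": "French", "fa": "Persian",
    "gl": "Galician", "hu": "Hungarian", "is": "Icelandic", "it": "Italian",
    "lt": "Lithuanian", "nl": "Dutch", "no": "Norwegian", "pl": "Polish",
    "pt": "Portuguese", "ro": "Romanian", "ru": "Russian", "sk": "Slovakian",
    "sl": "Slovenian", "sv": "Swedish", "th": "Thai", "uk": "Ukrainian",
}


def language_name(code):
    """Return the display name for a language code, or "Unknown"."""
    return LANGUAGE_NAMES.get(code.strip().lower(), "Unknown")


if __name__ == "__main__":
    print(language_name("de"))
    print(language_name("zz"))
```
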
Results are displayed as Interesting Items with a sub-category of Language_Detected.
Tika Language Detector reports processing summary results in the same way as other plugins.
Once an ingest is complete, the total processing time and the number of files processed are reported.
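
The summary bookkeeping can be pictured with a small accumulator like the one below. This is only a sketch of the idea: `IngestSummary` and its methods are hypothetical names, it does not use the Autopsy API, and the report format is invented for illustration.

```python
import time


class IngestSummary:
    """Accumulates the number of files processed and the time spent."""

    def __init__(self):
        self.files_processed = 0
        self.total_seconds = 0.0

    def record(self, seconds):
        """Count one processed file that took the given number of seconds."""
        self.files_processed += 1
        self.total_seconds += seconds

    def timed(self, func, *args):
        """Call func(*args), record its duration, and return its result."""
        start = time.perf_counter()
        try:
            return func(*args)
        finally:
            self.record(time.perf_counter() - start)

    def report(self):
        """Return a one-line summary (format is illustrative only)."""
        return "Files processed: %d, total processing time: %.3f s" % (
            self.files_processed,
            self.total_seconds,
        )


if __name__ == "__main__":
    summary = IngestSummary()
    summary.timed(len, "stand-in for a per-file detection call")
    print(summary.report())
```

In a real module the per-file detection call would take the place of `len`, and the report would be posted through whatever mechanism the host application provides for ingest messages.
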