You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is not an issue but rather a question. I am currently using this Dockerfile and manage to have containerized Tika Server running successfully using Docker for Windows.
My question is that.. can the steps on Dockerfile be replicated using purely Windows approach?
I cannot do text extraction from images using tika-server-1.20.jar on my end, while that works using the containerized Tika Server. I have setup Tesseract correctly with data file/s on Windows, but only Tesseract works, not my Tika Server using the .jar file.
Pardon my ignorance but please enlighten me. Thank you!
The text was updated successfully, but these errors were encountered:
Whilst I've not tried on Windows personally, knowing the TesseractOCRParser, this should be possible. The default config assume tesseract is on the system path, if not or you don't want to have it on it, you'll need to specify the path it using a custom configuration[1].
If you still have issues, either drop me a mail directly or jump on the Tika Users mailing list
Hi,
This is not an issue but rather a question. I am currently using this Dockerfile and manage to have containerized Tika Server running successfully using Docker for Windows.
My question is that.. can the steps on Dockerfile be replicated using purely Windows approach?
I cannot do text extraction from images using
tika-server-1.20.jar
on my end, while that works using the containerized Tika Server. I have setup Tesseract correctly with data file/s on Windows, but only Tesseract works, not my Tika Server using the .jar file.Pardon my ignorance but please enlighten me. Thank you!
The text was updated successfully, but these errors were encountered: