Issue in extracting text with regional languages. #973
Jisan09
started this conversation in
Ask for help with specific PDFs
Replies: 1 comment
-
Hi @Jisan09, the issue here seems to be that this is a scanned PDF, and that the OCR (converting the image to text) has not succeeded well, even before you start working with it in |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi, I have this PDF when i try extract
page.extract_text()
it giving me some weird texts as output. Is there way to get proper text or skip saving if the text not proper?here output sample i got:
Beta Was this translation helpful? Give feedback.
All reactions