Skip to content

Commit

Permalink
words
Browse files Browse the repository at this point in the history
  • Loading branch information
ipitio committed Oct 12, 2024
1 parent 6eed523 commit 040e305
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@

Make your unsearchable PDFs searchable with the help of a pretrained neural net!

The core logic resides in a Python script that extracts all the PDFs from `todo`, transforms their pages with Google's LSTM-based OCR engine Tesseract, and loads them into `done`. The Bash script installs the dependencies for and runs the Python script in a virtual environment. Docker sets up another virtual environment within which it installs the dependencies for and runs the Bash script. The GitHub Actions workflow uses a third virtual layer in the cloud within which it starts the Docker container.
The core logic resides in a Python script that extracts all the PDFs from `todo`, transforms their pages with Google's LSTM-based OCR engine Tesseract, and loads them into `done`. The Bash script installs the dependencies for and runs the Python script in a virtual environment. Docker sets up another virtual environment within which it installs the dependencies for and runs the Bash script. Google Colab and GitHub Actions both use a third virtual layer in the cloud within which they run the Docker container.

You choose how much you want to automate and virtualize.

Expand Down

0 comments on commit 040e305

Please sign in to comment.