Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
bonyt
on Nov 6, 2021
|
parent
|
context
|
favorite
| on:
Pdfsandwich
Interesting. I've used ocrmypdf for this a lot -
https://ocrmypdf.readthedocs.io/en/latest/
pronoiac
on Nov 6, 2021
|
next
[–]
I tried both recently on a scanned, color document recently. pdfsandwich gave me really unpleasant, monochrome, and blown out results; ocrmypdf did what I expected, giving me a searchable pdf.
kingcharles
on Nov 6, 2021
|
prev
|
next
[–]
Looks great.. I'm just trying to find the Windows binary?
jck
on Nov 6, 2021
|
parent
|
next
[–]
Maybe use their docker image?
https://ocrmypdf.readthedocs.io/en/latest/docker.html
kingcharles
on Nov 6, 2021
|
root
|
parent
|
next
[–]
This Docker app looks useful, thank you.
Shared404
on Nov 6, 2021
|
parent
|
prev
|
next
[–]
And people say Linux doesn't have any programs :P
zeppelin101
on Nov 6, 2021
|
parent
|
prev
|
next
[–]
I've been using ocrmypdf on Windows through WSL very successfully. Works perfectly.
schmorptron
on Nov 6, 2021
|
parent
|
prev
|
next
[–]
Looks like there isn't any
diarrhea
on Nov 6, 2021
|
prev
[–]
It's great and I use it almost every day: paperless-ng uses it in the background.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: