Content-type: text/html Downes.ca ~ Stephen's Web ~ Running OCR against PDFs and images directly in your browser

Stephen Downes

Knowledge, Learning, Community

I tested this and it does work, though with the caveats expressed by Simon Willison in this post. What he has developed, in a nutshell, is a script that will convert a PDF or image to text (using an optical character recognition (OCR) algorithm called Tesseract) right in your browser - no uploading required! Here it is. This post describes how he created the tool, a process that involved working with Claude 3. This, I think, is becoming a new normal. Even if they do nothing more than save typing time, having an AI coding assistant is becoming a powerful developer tool.

Today: 3 Total: 121 [Direct link] [Share]


Stephen Downes Stephen Downes, Casselman, Canada
stephen@downes.ca

Copyright 2024
Last Updated: Nov 23, 2024 4:10 p.m.

Canadian Flag Creative Commons License.

Force:yes