OCRing Old Math: Do It Faster with olmOCR
Anyone working on translations of old math and science works will be familiar with spending hours typing up old documents, even when they are readable scans that OCR should, in principle, be able to handle. The problem, of course, is the math and the tables. Typing up the equations in LaTeX is a major hurdle, even if you use a snippets-based approach like Gilles Castel's. I personally use Obsidian as a note-taking app with the LaTeX Suite plugin and some minor tweaks to the shortcuts, and I'm quite fast at this point. But seeing a dense page of multi-line equations (e.g. by Gauss) still makes me tense up a bit. Transcribing one page is no trouble, but 60 pages turns into a multi-week endeavor, and I'm limited by an issue with my ulnar nerves that makes typing for longer than 30 minutes quite painful. Needless to say, transcribing is a burden that I would very much like to lessen.
Now I think I've cracked it, thanks to olmOCR. I tried it out on their web demo, and it se...