Free Capitalist Network - Community Archive
Mises Community Archive
An online community for fans of Austrian economics and libertarianism, featuring forums, user blogs, and more.

Works to OCR

Pages

Tags

Page Details

First published by:
David V
on Fri, Feb 20 2009
Last revision by:
Anders Mikkelsen
on Fri, Nov 19 2010
4 people found this article useful.

100% of people found this useful
Works to OCR

Filed under: [Edit Tags]

List of works that you would like to see digitized:

  • De Moneta by Nicole Oresme (PDF; we need this one in HTML. Direct questions to BK Marcus.)

If you are working on something, please add your name so work is not duplicated.)

  • Michael S Costello - De Moneta conversion to HTML, downloaded March 02 2009.
  • Stanley Pinchak - Free Banking (Sechrest) ocr and corrections currently, downloaded March 03, 2009.
  • Anders Mikkelsen - American Economy lecture series by Rothbard. (Transcribing 3 lectures as of 11/19/2010. Please contact me if you wish to help transcribe.)

Recent Comments

By: Stanley Pinchak Posted on Tue, Mar 3 2009 2:33 PM

I have found the tesseract ocr engine to be very accurate with the Free Banking text.  Of course the occasional ff and rn errors will be found, as well as mistakes on en and em dashes (I think that is what they are called).

By: Stanley Pinchak Posted on Tue, Mar 3 2009 2:59 PM

I have encountered graphs, is there a specific format that these should be converted to?  I was thinking either eps, or SVG.  From eps, any raster format may be exported, however, modern browsers natively support SVG.  As to browser compliance with the whole of the SVG specification, I do not possess this knowledge.

By: Michael S Costello Posted on Tue, Mar 24 2009 11:10 AM

Progress and Commentary:

Graphs question by Stanley kind of hit me as well, after a fashion, as there is an image on page 23 or so of de moneta (a set of coins) that will be interesting to hear how to handle.  Any advice appreciated.  Currently have paragraphed about 40 pages worth of De Moneta.  I know more about British coinage than mortal man was meant to as well.

By: jtucker Posted on Wed, Nov 4 2009 9:47 PM

I'm thinking that we should turn this into a google group for Mises Documents.

By: GeneralTelegraph Posted on Mon, Mar 15 2010 5:25 PM

Does anyone know if rehosting tesseract-converted google books violates any Google-EULA ?  I have a bunch of books

I converted with Tesseract for a research project. Some conversions are barely usable. Others are pretty good.  

I get the impression that all Google OCR's its books but  doesn't release the plaintext.  Anyone know how to access whole

OCR'ed copies of Google Books ?