Hans Lenting Netherlands Member (2006) German to Dutch
Mar 29, 2020
I’ve just OCR’d a PDF with 7 pages of legal text with a Keyboard Maestro macro. The conversion is nearly perfect and was very fast. This feature makes KM a very good investment.
Dylan Jan Hartmann
The issue we’ve found with OCR tools (and the reason why many LSPs forbid their use) is that formatting becomes a nightmare once the text has been translated in a CAT tool. This is especially the case when the OCR tool tries to match the original formatting.
The only workaround we found was to export the OCR output as plain text, clear the formatting, insert the correct formatting, run the file through the CAT tool and then do a FE before delivery.
Let us know what the OCR macro results are like post-TR.
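The "export as plain text, clear formatting" step above could be sketched roughly as follows. This is only a minimal illustration in Python, not the tooling used in the workflow described; the function name `clean_ocr_text` is hypothetical, and it assumes that blank lines in the OCR output mark paragraph breaks.

```python
import re


def clean_ocr_text(raw: str) -> str:
    """Flatten OCR plain-text output into one line per paragraph."""
    # Normalise Windows and old Mac line endings to "\n".
    text = raw.replace("\r\n", "\n").replace("\r", "\n")
    # Rejoin words hyphenated across a line break: "trans-\nlation" -> "translation".
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)
    paragraphs = []
    # Assumption: one or more blank lines separate paragraphs.
    for block in re.split(r"\n\s*\n", text):
        # Collapse hard line breaks and runs of whitespace into single spaces.
        flat = " ".join(block.split())
        if flat:
            paragraphs.append(flat)
    return "\n".join(paragraphs)
```

Each paragraph ends up on a single line, which tends to segment more predictably in a CAT tool than text with a hard break at the end of every scanned line. Note that the de-hyphenation rule will also join genuine compounds that happen to break at a line end, so the result still needs a read-through.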
Camila Barbosa Brazil Portuguese to English + ...
OCR
Mar 30, 2020
Dylan Jan Hartmann wrote:
The issue we’ve found with OCR tools (and the reason why many LSPs forbid their use) is that formatting becomes a nightmare once the text has been translated in a CAT tool. This is especially the case when the OCR tool tries to match the original formatting.
The only workaround we found was to export the OCR output as plain text, clear the formatting, insert the correct formatting, run the file through the CAT tool and then do a FE before delivery.
Let us know what the OCR macro results are like post-TR.
I agree. From experience, you need a very clear .pdf document to start with before running OCR.
I personally think that the SmartCat OCR program does a good job but, like everything else related to OCR, it is not perfect.
Hans Lenting Netherlands Member (2006) German to Dutch
TOPIC STARTER
Nice additional feature
Mar 31, 2020
Rather than buying an expensive OCR suite for occasional use, you can use this new Keyboard Maestro feature (based on the open-source Tesseract software) to quickly OCR a dialogue box, a single page or whatever you have to translate.
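For anyone who wants to try Tesseract outside Keyboard Maestro, a minimal Python sketch of calling its command-line tool might look like this. It is not how Keyboard Maestro invokes Tesseract internally (the thread does not say); it assumes the `tesseract` executable is installed and on the PATH together with the language data you need, and the function names are hypothetical.

```python
import subprocess


def tesseract_command(image_path: str, lang: str = "eng") -> list[str]:
    """Build the command line for a single-image OCR run."""
    # Passing "stdout" as the output base makes Tesseract print the
    # recognised text instead of writing it to a .txt file.
    return ["tesseract", image_path, "stdout", "-l", lang]


def ocr_image(image_path: str, lang: str = "eng") -> str:
    """Run Tesseract on one image file and return the recognised text."""
    result = subprocess.run(
        tesseract_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if Tesseract reports failure
    )
    return result.stdout
```

Calling `ocr_image("page1.png", lang="deu")` would then return the text of a German page image (`page1.png` is a placeholder file name). The command-line tool works on image files, so a PDF has to be converted to page images first.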