How to disable column select in FineReader 8 Trådens avsändare: Samuel Murray
| Samuel Murray Nederländerna Local time: 23:49 Medlem (2006) Engelska till Afrikaans + ...
G'day everyone
I want to scan and OCR a glossary with FineReader 8.0. FineReader automatically detects columns and scans them separately, which is often a good thing, but in the case of a glossary (which is typed in a table without table borders) it is a bad thing.
See, if FineReader gets a source file with this:
cat[tab]kat
dog[tab]hond
horse[tab]perd
then it outputs a text file that looks like this:
cat
dog ... See more G'day everyone
I want to scan and OCR a glossary with FineReader 8.0. FineReader automatically detects columns and scans them separately, which is often a good thing, but in the case of a glossary (which is typed in a table without table borders) it is a bad thing.
See, if FineReader gets a source file with this:
cat[tab]kat
dog[tab]hond
horse[tab]perd
then it outputs a text file that looks like this:
cat
dog
horse
kat
hond
perd
which is of no use to me.
Do you know of a way in FineReader 8.0 to disable this automatic column selection? I know I can select the blocks myself, manually, but there are 200 pages here and I don't want to spend time setting blocks manually. I want FineReader to assume that the entire page is one block to be scanned as-is. Do you know if this is possible?
Thanks ▲ Collapse | | | Tools -> Options -> Legacy options ... | Sep 3, 2009 |
try to check "Read as plain text formatted with spaces".
Good luck,
Egidijus | | | Adam Łobatiuk Polen Local time: 23:49 Medlem (2009) Engelska till Polska + ... Doesn't that depend on the output file settings? | Sep 3, 2009 |
I use version 9 and I haven't had such a problem, but I would expect FR to produce an acceptable Word file with a strict formatting option, like "Editable copy". Then it would be quite easy to manipulate the content in Word. | | | Samuel Murray Nederländerna Local time: 23:49 Medlem (2006) Engelska till Afrikaans + ... TOPIC STARTER Thanks, but... | Sep 3, 2009 |
Adam Łobatiuk wrote:
I would expect FR to produce an acceptable Word file with a strict formatting option, like "Editable copy".
The problem is that FR first creates the blocks, then reads them, and then creates the output format. Even if I select "strict formatting", the formatting is only done in the third step, and steps 1 and 2 (create blocks, read blocks) are done without taking the final formatting into account.
There are two ways to ensure that FR reads the page as a table. The first is to draw a block manually on the page, right-click it and select "Analyze table structure" and then OCR it. The second is to select Image, then Choose a Tool, then Table Block, and then draw a block manually on each page. The block is purple, and when it is read, it is OCR'ed as a table.
Saving a table from FR into plaintext yields the desired result.
Now... I have to write an AutoIt script to automate the page selection process. *sigh* | |
|
|
Selcuk Akyuz Turkiet Local time: 00:49 Engelska till Turkiska + ... Legacy options | Sep 3, 2009 |
Egiz wrote:
Tools -> Options -> Legacy options ...
try to check "Read as plain text formatted with spaces".
Good luck,
Egidijus
Tools -> Options -> General -> Legacy options | | | Samuel Murray Nederländerna Local time: 23:49 Medlem (2006) Engelska till Afrikaans + ... TOPIC STARTER
Selcuk Akyuz wrote:
Tools -> Options -> General -> Legacy options
Wow... I wish I had seen this setting earlier -- I would have scanned all my previous work with this setting enabled. As it is, I'm almost done with my scanning and I don't see myself re-OCR-ing 20000 pages. | | | To report site rules violations or get help, contact a site moderator: You can also contact site staff by submitting a support request » How to disable column select in FineReader 8 Wordfast Pro | Translation Memory Software for Any Platform
Exclusive discount for ProZ.com users!
Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value
Buy now! » |
| Trados Business Manager Lite | Create customer quotes and invoices from within Trados Studio
Trados Business Manager Lite helps to simplify and speed up some of the daily tasks, such as invoicing and reporting, associated with running your freelance translation business.
More info » |
|
| | | | X Sign in to your ProZ.com account... | | | | | |