How to disable column select in FineReader 8
Trådens avsändare: Samuel Murray
Samuel Murray
Samuel Murray  Identity Verified
Nederländerna
Local time: 23:49
Medlem (2006)
Engelska till Afrikaans
+ ...
Sep 3, 2009

G'day everyone

I want to scan and OCR a glossary with FineReader 8.0. FineReader automatically detects columns and scans them separately, which is often a good thing, but in the case of a glossary (which is typed in a table without table borders) it is a bad thing.

See, if FineReader gets a source file with this:

cat[tab]kat
dog[tab]hond
horse[tab]perd

then it outputs a text file that looks like this:

cat
dog
... See more
G'day everyone

I want to scan and OCR a glossary with FineReader 8.0. FineReader automatically detects columns and scans them separately, which is often a good thing, but in the case of a glossary (which is typed in a table without table borders) it is a bad thing.

See, if FineReader gets a source file with this:

cat[tab]kat
dog[tab]hond
horse[tab]perd

then it outputs a text file that looks like this:

cat
dog
horse
kat
hond
perd

which is of no use to me.

Do you know of a way in FineReader 8.0 to disable this automatic column selection? I know I can select the blocks myself, manually, but there are 200 pages here and I don't want to spend time setting blocks manually. I want FineReader to assume that the entire page is one block to be scanned as-is. Do you know if this is possible?

Thanks
Collapse


 
Egidijus Slepetys
Egidijus Slepetys  Identity Verified
Local time: 00:49
Tyska till Litauiska
Tools -> Options -> Legacy options ... Sep 3, 2009

try to check "Read as plain text formatted with spaces".

Good luck,
Egidijus


 
Adam Łobatiuk
Adam Łobatiuk  Identity Verified
Polen
Local time: 23:49
Medlem (2009)
Engelska till Polska
+ ...
Doesn't that depend on the output file settings? Sep 3, 2009

I use version 9 and I haven't had such a problem, but I would expect FR to produce an acceptable Word file with a strict formatting option, like "Editable copy". Then it would be quite easy to manipulate the content in Word.

 
Samuel Murray
Samuel Murray  Identity Verified
Nederländerna
Local time: 23:49
Medlem (2006)
Engelska till Afrikaans
+ ...
TOPIC STARTER
Thanks, but... Sep 3, 2009

Adam Łobatiuk wrote:
I would expect FR to produce an acceptable Word file with a strict formatting option, like "Editable copy".


The problem is that FR first creates the blocks, then reads them, and then creates the output format. Even if I select "strict formatting", the formatting is only done in the third step, and steps 1 and 2 (create blocks, read blocks) are done without taking the final formatting into account.

There are two ways to ensure that FR reads the page as a table. The first is to draw a block manually on the page, right-click it and select "Analyze table structure" and then OCR it. The second is to select Image, then Choose a Tool, then Table Block, and then draw a block manually on each page. The block is purple, and when it is read, it is OCR'ed as a table.

Saving a table from FR into plaintext yields the desired result.

Now... I have to write an AutoIt script to automate the page selection process. *sigh*


 
Selcuk Akyuz
Selcuk Akyuz  Identity Verified
Turkiet
Local time: 00:49
Engelska till Turkiska
+ ...
Legacy options Sep 3, 2009

Egiz wrote:
Tools -> Options -> Legacy options ...

try to check "Read as plain text formatted with spaces".

Good luck,
Egidijus


Tools -> Options -> General -> Legacy options


 
Samuel Murray
Samuel Murray  Identity Verified
Nederländerna
Local time: 23:49
Medlem (2006)
Engelska till Afrikaans
+ ...
TOPIC STARTER
I wish... Sep 3, 2009

Selcuk Akyuz wrote:
Tools -> Options -> General -> Legacy options


Wow... I wish I had seen this setting earlier -- I would have scanned all my previous work with this setting enabled. As it is, I'm almost done with my scanning and I don't see myself re-OCR-ing 20000 pages.


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

How to disable column select in FineReader 8






Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

Buy now! »
Trados Business Manager Lite
Create customer quotes and invoices from within Trados Studio

Trados Business Manager Lite helps to simplify and speed up some of the daily tasks, such as invoicing and reporting, associated with running your freelance translation business.

More info »