https://sve.proz.com/forum/software_applications/144392-how_to_disable_column_select_in_finereader_8.html

How to disable column select in FineReader 8
Thread poster: Samuel Murray
Samuel Murray
Netherlands
Member (2006)
English to Afrikaans
Sep 3, 2009

G'day everyone

I want to scan and OCR a glossary with FineReader 8.0. FineReader automatically detects columns and scans them separately, which is often a good thing, but in the case of a glossary (which is typed in a table without table borders) it is a bad thing.

See, if FineReader gets a source file with this:

cat[tab]kat
dog[tab]hond
horse[tab]perd

then it outputs a text file that looks like this:

cat
dog
horse
kat
hond
perd

which is of no use to me.
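
For pages that have already come out column by column like this, one possible workaround is to re-pair the two halves after the fact. Below is a minimal Python sketch, not a FineReader feature. It assumes that both columns have exactly the same number of entries, that no OCR line was dropped or split, and that each page (or block) is processed separately. The function name `repair_columns` is hypothetical.

```python
def repair_columns(text: str) -> str:
    """Re-pair column-wise OCR output into tab-separated glossary rows.

    Assumption: the first half of the non-empty lines is the source
    column and the second half is the target column, in the same order.
    """
    # Drop blank lines and surrounding whitespace left over from OCR.
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if len(lines) % 2 != 0:
        # An odd count means the two columns cannot line up one to one.
        raise ValueError("odd number of entries; columns cannot be paired")
    half = len(lines) // 2
    # Pair entry i of the first column with entry i of the second column.
    pairs = zip(lines[:half], lines[half:])
    return "\n".join(f"{src}\t{tgt}" for src, tgt in pairs)


ocr_output = "cat\ndog\nhorse\nkat\nhond\nperd\n"
print(repair_columns(ocr_output))
```

If a single entry is missing or wraps onto two lines, every following pair shifts, so the result still needs a visual check.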

Do you know of a way in FineReader 8.0 to disable this automatic column selection? I know I can select the blocks myself, manually, but there are 200 pages here and I don't want to spend time setting blocks manually. I want FineReader to assume that the entire page is one block to be scanned as-is. Do you know if this is possible?

Thanks


 
Egidijus Slepetys
German to Lithuanian
Tools -> Options -> Legacy options ... Sep 3, 2009

try to check "Read as plain text formatted with spaces".

Good luck,
Egidijus


 
Adam Łobatiuk
Poland
Member (2009)
English to Polish
Doesn't that depend on the output file settings? Sep 3, 2009

I use version 9 and I haven't had such a problem, but I would expect FR to produce an acceptable Word file with a strict formatting option, like "Editable copy". Then it would be quite easy to manipulate the content in Word.

 
Samuel Murray
Netherlands
Member (2006)
English to Afrikaans
TOPIC STARTER
Thanks, but... Sep 3, 2009

Adam Łobatiuk wrote:
I would expect FR to produce an acceptable Word file with a strict formatting option, like "Editable copy".


The problem is that FR first creates the blocks, then reads them, and then creates the output format. Even if I select "strict formatting", the formatting is only done in the third step, and steps 1 and 2 (create blocks, read blocks) are done without taking the final formatting into account.

There are two ways to ensure that FR reads the page as a table. The first is to draw a block manually on the page, right-click it and select "Analyze table structure" and then OCR it. The second is to select Image, then Choose a Tool, then Table Block, and then draw a block manually on each page. The block is purple, and when it is read, it is OCR'ed as a table.

Saving a table from FR into plaintext yields the desired result.

Now... I have to write an AutoIt script to automate the page selection process. *sigh*


 
Selcuk Akyuz
Turkey
English to Turkish
Legacy options Sep 3, 2009

Egiz wrote:
Tools -> Options -> Legacy options ...

try to check "Read as plain text formatted with spaces".

Good luck,
Egidijus


Tools -> Options -> General -> Legacy options


 
Samuel Murray
Netherlands
Member (2006)
English to Afrikaans
TOPIC STARTER
I wish... Sep 3, 2009

Selcuk Akyuz wrote:
Tools -> Options -> General -> Legacy options


Wow... I wish I had seen this setting earlier. I would have scanned all my previous work with this setting enabled. As it is, I'm almost done with my scanning and I don't see myself re-OCR-ing 20,000 pages.


 

