Tutorial Abbyy Recognition Server (en): Difference between revisions
From Kallimachos
(24 intermediate revisions by the same user are not shown)
| Line 9: | Line 9: |
These settings can be accepted as provided, which saves time-consuming troubleshooting.
=== Choosing the user account the server runs under ===
| Line 28: | Line 24: |
It's advisable to create a folder structure like the following:
[[Bild:Ordnerstruktur.PNG|600px|center]]
| Line 44: | Line 44: |
** check the box ''share this folder''
** privileges
*** choose the user(s) that may get access
| Line 58: | Line 58: |
**** change
**** read
Mount the higher-level folder, e.g. ''Daheim'', as a network share on the computers on which Verification Station is installed.
* open Windows Explorer
| Line 69: | Line 71: |
*** reconnect during login
*** finish
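For several Verification Station computers, the mapping could also be scripted instead of clicked through in Windows Explorer. The following is only a sketch built on the Windows command <code>net use</code>; the drive letter <code>Z:</code> and the server name <code>ocr-server</code> are hypothetical placeholders, and <code>/persistent:yes</code> is used here as the counterpart of the ''reconnect during login'' option.

```python
import subprocess
import sys


def build_net_use_command(drive, server, share, persistent=True):
    """Return the argument list for the Windows `net use` command."""
    unc_path = f"\\\\{server}\\{share}"  # e.g. \\ocr-server\Daheim
    flag = "/persistent:yes" if persistent else "/persistent:no"
    return ["net", "use", drive, unc_path, flag]


def map_share(drive, server, share):
    """Map the share as a network drive; only meaningful on Windows."""
    if sys.platform != "win32":
        raise RuntimeError("net use is only available on Windows")
    subprocess.run(build_net_use_command(drive, server, share), check=True)


if __name__ == "__main__":
    # Hypothetical values: replace them with your own drive letter and server.
    print(" ".join(build_net_use_command("Z:", "ocr-server", "Daheim")))
```

Calling <code>map_share("Z:", "ocr-server", "Daheim")</code> on a Windows machine would then attempt the actual mapping with the credentials of the logged-in user.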
=== Workflow settings ===
[[Bild:Erweiterte_Workfloweinstellungen.png|600px|center]]
==== Explanation of "Entzerren/dewarping" ====
This option straightens the image on the basis of straight lines. If the lines form a trapezoid, like the flag in this example, the result looks like this:
| Line 92: | Line 99: |
That's why you should uncheck the box ''Entzerren/dewarping''.
=== Quality check ===
| Line 115: | Line 124: |
== Create training file for characters ==
If Abbyy Recognition Server doesn't recognize some characters, a training file can be created in ''FineReader'', exported as an .fbt file and imported into Recognition Server. A demo version of ''FineReader'' can be downloaded from the Abbyy homepage.
In order to create a training file, e.g. for Gothic type, check the box "Use training in order to recognize new characters and ligatures". Now click on "read page". A dialog box opens.
| Line 123: | Line 132: |
In this case the letter "M" wasn't recognized properly. Extend the box by using the opening double arrow >> until the letter is completely covered. Then click on "Training". If the box contains more than one letter (ligatures excepted), you can shrink it by using the closing double arrow << until it covers only the single letter.
[[Bild:Mustertraining.png|600px|center]]
[[Bild:Mustertraining_2.png|600px|center]]
| Line 140: | Line 151: |
Their properties, such as '''bold''' or ''italic style'', can be set as well.
| Line 145: | Line 157: |
To check the accuracy of the OCR, you can run OCR on some documents with and without your custom training file.
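One way to make such a with/without comparison measurable is to score each OCR result against a manually corrected transcription of the same page. The sketch below uses the similarity ratio from Python's <code>difflib</code>, which is only a rough stand-in for a proper character error rate; the function name and the sample strings are made up for illustration and are not output of Recognition Server.

```python
from difflib import SequenceMatcher


def similarity(ocr_text, reference_text):
    """Return a similarity ratio between 0.0 and 1.0 (1.0 = identical)."""
    matcher = SequenceMatcher(None, reference_text, ocr_text, autojunk=False)
    return matcher.ratio()


if __name__ == "__main__":
    # Illustrative strings only; use your own transcription and OCR exports.
    reference = "Morgenstund hat Gold im Mund"
    without_training = "Worgenstund hat Gold im Wund"
    with_training = "Morgenstund hat Gold im Mund"
    print("without training file:", similarity(without_training, reference))
    print("with training file:   ", similarity(with_training, reference))
```

A higher ratio for the run with the training file suggests that the training helped on that page; checking several pages gives a more reliable picture than a single one.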
The training file can be exported to Recognition Server by clicking on
* Tools
** OCR
| Line 155: | Line 169: |
and then add this file as described in the workflow settings.
[[Bild:Benutzermuster_Explorer.png|600px|center]]
[[Bild:Benutzermuster_hinzufügen.png|600px|center]]
| Line 166: | Line 183: |
===Page numbers not displayed===
If a text box shows no page numbers even though they are enclosed by it, the font color may be the same as the background color. Click on the double arrow as shown in the picture to open the menu, click into the color box and choose black.
[[Bild:Nicht_eingelesene_Seitenzahlen_Schriftfarbe.png]]
=== Specifics of print type ''Gothic'' when drawing a new text box ===
Every time a new text box is drawn, the print type is reset to the default, which only includes "normal" print types. If Gothic type is to be recognized, the ''Gothic'' checkbox has to be set.
[[Bild:Neue_Textbox.png]]
After this you need to check the reading order and correct it.
===Settings in Verification Station===
====Spell Checking====
The Verification Station has a built-in spell check that can be invoked by clicking the corresponding button. As the spell check is quite faulty, sooner or later error messages will appear (see ''Error messages''). Alternatively, a regular text file in UTF-16 encoding can act as a dictionary when you apply it in the workflow settings. It's added via the 2nd tab -> ''Verarbeitung/Processing''. -> internal link to "Tab Workflow -> Workfloweinstellungen/Workflow settings"
[[Bild:Rechtschreibprüfung.PNG|600px|center]]
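Such a UTF-16 text file can be written with any editor that offers this encoding, or generated by a small script. The following sketch assumes a layout of one word per line, which is not stated above and should be checked against the Recognition Server documentation; the function name and the file name are hypothetical.

```python
from pathlib import Path


def write_utf16_word_list(words, path):
    """Write the unique, sorted words to `path` as UTF-16 text, one per line.

    The one-word-per-line layout is an assumption, not a documented format.
    Returns the number of words written.
    """
    unique_sorted = sorted({w.strip() for w in words if w.strip()})
    # Python's "utf-16" codec writes a byte order mark at the start of the file.
    Path(path).write_text("\n".join(unique_sorted) + "\n", encoding="utf-16")
    return len(unique_sorted)


if __name__ == "__main__":
    # Hypothetical file name and sample words.
    count = write_utf16_word_list(["Straße", "Buch", "Buch"], "custom_dictionary.txt")
    print(count, "words written")
```

Duplicates and blank entries are dropped before writing so that the file stays small and easy to review.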
=Error Messages=
| Line 199: | Line 225: |
* when using the spell check
* quite rarely at runtime
These are bugs in the program.