Read Text (OCRMax) Identification Tool - Training Tab
The Training tab is used to train, view, rename and remove characters. The tab is divided into two groups: Training, where the character training parameters are defined, and Characters, where the trained fonts are managed.
- While in the Training tab, the region can be adjusted and the segmentation graphics (character rectangles) will be updated as the region is modified.
- Multiple-byte character names (e.g. Kanji) are not supported. Multiple-byte characters can be trained, but the name given to the character must be a single byte name.
- Train String: Specifies the text string to be trained. The number of characters in the Train String text entry box must match the number of segmented characters. A maximum of 50 characters may be trained.
- Train All: Specifies that all of the characters in the region will be trained. The characters expected to be trained are entered into the Train String text entry box, before the Train All button is pressed. The number of characters in the Train String text entry box must match the number of segmented characters.
- Train New: Specifies that only new characters in the region will be trained. The characters expected to be trained are entered into the Train String text entry box, before the Train button is pressed. The number of characters in the Train String text entry box must match the number of segmented characters.
- Train Selected: Specifies that specific characters in the region will be trained. When this option is selected, the Train String text entry box is disabled, and the Train Selected button launches the Add Selected Characters to Font dialog. This dialog contains an unwrapped image of the region, with a label and a text-entry box below each of the segmented characters. The label is the symbol currently associated with that character (a "?" denotes an unknown or untrained character). Enter a label for each segmented character in its text-entry box; leave a text-entry box empty to skip that character, so that it is not retrained. Press the Train button to close the dialog.
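The Train String rules above can be sketched as a small pre-check. This is only an illustration in Python, not part of the tool: the function names check_train_string and labels_to_train are hypothetical, and "single byte" is approximated here as ASCII, which is an assumption.

```python
from typing import Dict, List

MAX_TRAIN_CHARS = 50  # maximum number of characters stated for the Train String


def check_train_string(train_string: str, segmented_count: int) -> List[str]:
    """Return a list of problems; an empty list means the string passes."""
    problems = []
    if len(train_string) > MAX_TRAIN_CHARS:
        problems.append("more than 50 characters")
    if len(train_string) != segmented_count:
        problems.append("length does not match the number of segmented characters")
    # Assumption: a "single byte name" is approximated as an ASCII character.
    if not train_string.isascii():
        problems.append("contains a multiple-byte character name")
    return problems


def labels_to_train(entries: List[str]) -> Dict[int, str]:
    """Train Selected: keep only the text-entry boxes that were filled in.

    The key is the position of the segmented character in the region.
    """
    return {index: text for index, text in enumerate(entries) if text}
```

For example, check_train_string("AB8", 3) returns an empty list, while labels_to_train(["A", "", "8"]) keeps only positions 0 and 2, mirroring how an empty text-entry box leaves a character untouched.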
After training, each trained character will be visible in the tree, where the character will be assigned a folder and a label; characters with matching labels are grouped together in the same folder.
- Select the root Font folder to display all of the characters in the font in the panel on the right side. The characters will be displayed as icons with a label underneath.
- Select a character folder to display all of the trained instances of that character in the panel on the right side. The characters will be displayed as icons with a label underneath.
- Select a trained instance of a character to display that character, zoomed to fit the panel on the right side.
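The grouping described above (one folder per label, holding every trained instance with that label) can be pictured as a mapping from label to instances. The following is a minimal Python sketch of that idea only; build_font_tree and the instance identifiers are hypothetical and do not correspond to any function of the tool.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def build_font_tree(trained: List[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Group (label, instance_id) pairs into one folder per label."""
    tree = defaultdict(list)
    for label, instance_id in trained:
        # Instances with matching labels end up in the same folder.
        tree[label].append(instance_id)
    return dict(tree)


# Hypothetical training results: two instances of "8" and one of "B".
font_tree = build_font_tree([("8", "img0"), ("B", "img1"), ("8", "img2")])
```

In this model, the keys of font_tree play the role of the character folders, and font_tree["8"] lists the trained instances shown when the "8" folder is selected.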
Tip: Providing the tool with multiple training instances of each character to be read at run-time allows the function to better discriminate between similar character types. However, training multiple instances of the same character has advantages and disadvantages. The advantage is that better classification performance can be expected, especially when reading text that could easily be misclassified (e.g. 8 and B). The disadvantages are: an increase in the function's execution time; possible decreases in the confidence scores, because multiple training instances can improve not only the highest score but also the second-highest score; and the possibility of training bad characters.
- Import: Launches the Open dialog, where a font, saved as an OCRMax data file (*.ocm), may be imported.
- Export: Launches the Save As dialog, where a font can be saved as an OCRMax data file (*.ocm).
- Rename: Press after selecting an item to rename it. Any element in the tree or in the list panel on the right side can be renamed.
Note: Renaming a group of characters will assign a new label to all of the characters in the group. Renaming a single character will assign a new label to that single, selected character; the character will be moved to a new or different group in the tree.
- Delete: Press after selecting an item to remove it from the font. Any element in the tree or in the list panel on the right side can be deleted.
Note: Deleting the Font folder will clear the font; all characters will be deleted. Deleting a group of characters will delete all of the characters whose label matched the deleted group's label.
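The Rename and Delete behavior in the notes above can be sketched on a simple label-to-instances mapping. This Python model is illustrative only: all function names are hypothetical, and two details are assumptions not stated above, namely that renaming a group onto an existing label merges the two groups, and that a group left empty is removed from the tree.

```python
from typing import Dict, List

Font = Dict[str, List[str]]  # label -> trained instance ids (hypothetical model)


def rename_group(font: Font, old: str, new: str) -> None:
    """Assign a new label to every character in the group."""
    if old == new:
        return
    # Assumption: if the new label already exists, the groups are merged.
    font.setdefault(new, []).extend(font.pop(old))


def rename_instance(font: Font, label: str, instance_id: str, new: str) -> None:
    """Relabel one character; it moves to a new or different group."""
    if label == new:
        return
    font[label].remove(instance_id)
    if not font[label]:
        del font[label]  # assumption: an empty group disappears
    font.setdefault(new, []).append(instance_id)


def delete_group(font: Font, label: str) -> None:
    """Delete all characters whose label matches the deleted group."""
    font.pop(label, None)


def clear_font(font: Font) -> None:
    """Deleting the Font folder removes every character."""
    font.clear()
```

For instance, starting from {"8": ["img0", "img2"], "B": ["img1"]}, calling rename_instance(font, "8", "img2", "B") moves img2 into the "B" group, and delete_group(font, "B") then removes both img1 and img2.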