Character Separation
Character separation is a simple step when scanning an image in NewOCR. Since each piece of a character (the dot of an i, the top part of an equals sign, the two separate circles of a percent sign, etc.) is defined as a completely separate character, no character merging is required yet.
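To make the idea of treating every piece as its own character concrete, here is a minimal sketch that groups touching ink pixels with a flood fill and returns one bounding box per piece. It is an illustration only, not NewOCR's actual implementation: the class name PieceSeparator, the boolean[][] image representation, and the 8-neighbour connectivity are all assumptions made for this example.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Hypothetical helper, not part of NewOCR.
public class PieceSeparator {

    /**
     * Returns one bounding box {minX, minY, maxX, maxY} (inclusive) per
     * connected piece of ink. The image is assumed to be rectangular,
     * with ink[y][x] == true marking a dark pixel.
     */
    public static List<int[]> separate(boolean[][] ink) {
        int height = ink.length;
        int width = height == 0 ? 0 : ink[0].length;
        boolean[][] seen = new boolean[height][width];
        List<int[]> boxes = new ArrayList<>();
        for (int y = 0; y < height; y++) {
            for (int x = 0; x < width; x++) {
                if (ink[y][x] && !seen[y][x]) {
                    boxes.add(fill(ink, seen, x, y, width, height));
                }
            }
        }
        return boxes;
    }

    // Iterative flood fill over the 8 neighbours of each pixel.
    private static int[] fill(boolean[][] ink, boolean[][] seen,
                              int startX, int startY, int width, int height) {
        int minX = startX, maxX = startX, minY = startY, maxY = startY;
        Deque<int[]> stack = new ArrayDeque<>();
        stack.push(new int[] {startX, startY});
        seen[startY][startX] = true;
        while (!stack.isEmpty()) {
            int[] p = stack.pop();
            minX = Math.min(minX, p[0]);
            maxX = Math.max(maxX, p[0]);
            minY = Math.min(minY, p[1]);
            maxY = Math.max(maxY, p[1]);
            for (int dy = -1; dy <= 1; dy++) {
                for (int dx = -1; dx <= 1; dx++) {
                    int nx = p[0] + dx;
                    int ny = p[1] + dy;
                    boolean inside = nx >= 0 && nx < width && ny >= 0 && ny < height;
                    if (inside && ink[ny][nx] && !seen[ny][nx]) {
                        seen[ny][nx] = true;
                        stack.push(new int[] {nx, ny});
                    }
                }
            }
        }
        return new int[] {minX, minY, maxX, maxY};
    }
}
```

With this approach the dot and the stem of an i come back as two separate boxes, which matches the behaviour described above: the pieces stay separate at this stage.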
Line Separation
Before characters are separated, a process called line separation occurs. The dot of an i or a _ will be above or below all other characters, and will result in a separate line. This is overcome by the OCROptions#setMaxPercentDiffToMerge(double) option.
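The problem described above can be sketched in a few lines: rows of pixels are grouped into bands of consecutive inked rows, and bands separated by only a small gap are then merged. This is an illustration under stated assumptions, not NewOCR's code. The class LineBands, its methods, and the maxGapRatio threshold are hypothetical, and the merge rule used here (gap compared against the taller band's height) is a stand-in; the exact meaning of the value passed to OCROptions#setMaxPercentDiffToMerge(double) is not described in this section.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of NewOCR.
public class LineBands {

    /**
     * Groups consecutive rows that contain ink into bands.
     * Each band is {topRow, bottomRow}, inclusive.
     */
    public static List<int[]> findBands(boolean[][] ink) {
        List<int[]> bands = new ArrayList<>();
        int start = -1; // first row of the band being built, or -1 if none
        for (int y = 0; y < ink.length; y++) {
            boolean hasInk = false;
            for (boolean pixel : ink[y]) {
                if (pixel) {
                    hasInk = true;
                    break;
                }
            }
            if (hasInk && start < 0) {
                start = y;
            } else if (!hasInk && start >= 0) {
                bands.add(new int[] {start, y - 1});
                start = -1;
            }
        }
        if (start >= 0) {
            bands.add(new int[] {start, ink.length - 1});
        }
        return bands;
    }

    /**
     * Merges a band into the previous one when the blank gap between them is
     * at most maxGapRatio times the height of the taller of the two bands.
     * Assumed rule for illustration only.
     */
    public static List<int[]> mergeBands(List<int[]> bands, double maxGapRatio) {
        List<int[]> merged = new ArrayList<>();
        for (int[] band : bands) {
            if (!merged.isEmpty()) {
                int[] prev = merged.get(merged.size() - 1);
                int gap = band[0] - prev[1] - 1;
                int prevHeight = prev[1] - prev[0] + 1;
                int bandHeight = band[1] - band[0] + 1;
                if (gap <= maxGapRatio * Math.max(prevHeight, bandHeight)) {
                    prev[1] = band[1]; // extend the previous band downwards
                    continue;
                }
            }
            merged.add(new int[] {band[0], band[1]});
        }
        return merged;
    }
}
```

In this sketch, a one-row dot sitting one blank row above a three-row stem is first reported as its own band and is then folded into the stem's band, while a band several rows further down stays a separate line.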
Character Separation
The separate pieces of each character are then compiled into a single character object.