Question
When training and using an OCR algorithm for handwriting recognition, is it helpful to indicate the author of the handwriting?
Use Case
Have a warehouse full of documents that need to be transcribed into digital format.
We'd like to feed the documents into an OCR algorithm first. If the OCR algorithm reports a low confidence score, then we will pass the documents off to a real person for transcription - and of course use the results for additional algorithm training - so that future documents, especially by the same author, will have a higher chance of being transcribed at a satisfactory confidence level.
For each document, it is feasible but non-trivial, to determine the author of the document and give that to the OCR algorithm as well. We anticipate there should be on the order of 100 authors for 8 million documents.
Intuitively, I assumed that knowing the author would increase the effectiveness of the algorithm, but on further reflection, I am unsure if this is the case. When I read handwriting I don't usually think about the author, but instead intuit how to decipher the handwriting based on the style.
Note: By effectiveness I mean primarily higher accuracy, and secondarily lower resource usage.