Add to Dictionary Function

Hey Wallace, so nice to see you here. Hope all is going well at Cambridge.

We’ve been thinking about exactly this kind of problem, and the plan, in the medium term, is to introduce a way for users to fine-tune the Leo model using their own transcripts. Rather than adding specific words, this would involve the user correcting whole transcripts. The idea is that you would add all relevant items to a list, and then set the transcription status to “Finalized” for the images that have a manually corrected transcription (known as “ground truth”). There would then be a button, probably in the “…” options dropdown for that list, to fine-tune the model using that data. It would then very quickly learn these place names and specific abbreviations for that set of manuscripts. So this should both address this issue and improve transcription accuracy in general. Let us know if you have any thoughts!