Keeping formatting when exporting transcript

Would it be possible to have the option to keep the formatting in the transcription when it gets exported to a txt file?

For instance, within Leo, my document is transcribed as:

“21 GEO. III. 31° Octobris. 5.
Parliament Meets.
THE King’s most Excellent Majesty having…”

But when I export it as a txt file it turns into:

“21 GEO. III. 31° Octobris. 5. Parliament Meets. THE King’s most Excellent Majesty having…”

It’s useful to keep the formatting in the transcript so I can easily identify and remove the ‘header’ text. Is it possible for this to be added as an option when exporting?

Hi Nell! Thanks for flagging this issue – we’ll update it so that newlines are properly preserved in txt exports. I’ll make sure this is done ASAP.

Other formatting (such as strikethroughs and superscripts) cannot be preserved in txt files, however, as the txt format doesn’t support it.

I’ll let you know here once this is done!

Hi Nell – this is now fixed in version 0.1.10. See release notes here. Let me know if you run into any problems!
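
With newlines preserved in the txt export, the ‘header’ text mentioned above can be removed with a short script. This is only a minimal sketch: the function name `strip_header` and the idea that the header occupies a fixed number of lines at the top of the file are assumptions for illustration, not anything Leo provides.

```python
def strip_header(text: str, header_lines: int = 1) -> str:
    """Drop the first `header_lines` lines of an exported transcript.

    Assumes the header sits on its own line(s) at the very top,
    which only holds if the export keeps the original line breaks.
    """
    lines = text.splitlines()
    return "\n".join(lines[header_lines:])


# Sample text taken from the example at the top of this thread.
exported = (
    "21 GEO. III. 31° Octobris. 5.\n"
    "Parliament Meets.\n"
    "THE King’s most Excellent Majesty having…"
)

# Treating only the first line as the header is an assumption.
body = strip_header(exported, header_lines=1)
print(body)
```

How many lines count as header will vary from document to document, so `header_lines` would need adjusting per source.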

I’d add to this: it would help to keep more of the text formatting when exporting to a pdf. It would be nice if the transcript looked more like the manuscript page - in my case, page numbers centered at the top, some words out in the margin, lines across the page separating parts of the document.

Hadn’t thought of that - very good idea. It should be possible once we teach the model to preserve location information in the image. We’ll then be able to match parts of the image with the transcript, so that the user can hover over transcription text and it will highlight the relevant part of the image in the app.
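
To make the idea concrete, here is a rough sketch of how transcript text could be linked to image regions for hover highlighting. Everything in it is hypothetical: the `Region` class, its fields, and `region_at` are illustrative names, and it assumes the model would return a character range and a pixel bounding box for each piece of text. It is not how the app works today.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Region:
    """Hypothetical link between a span of transcript text and the image."""
    start: int  # index of the first character in the transcript
    end: int    # index one past the last character
    box: tuple[int, int, int, int]  # (left, top, right, bottom) in image pixels


def region_at(regions: list[Region], char_index: int) -> Optional[Region]:
    """Return the region containing the hovered character, if any."""
    for region in regions:
        if region.start <= char_index < region.end:
            return region
    return None


# Made-up coordinates for two lines of a page.
regions = [
    Region(start=0, end=29, box=(120, 40, 880, 75)),
    Region(start=29, end=47, box=(60, 90, 300, 125)),
]

hovered = region_at(regions, 35)
if hovered is not None:
    print("Highlight box:", hovered.box)
```

The same bounding boxes could in principle also drive a pdf layout that places text where it sits on the manuscript page, though that is a separate step.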

These export issues should be addressed in our latest release, v0.2.3 – let us know if you run into any problems.