Shundlikht is a Python program (Jupyter Notebook) that, once run locally, automatically transcribes, translates (if you want), and collates the pdf images associated with each installment of a given work in the Shund.org database. The images associated with each installment are hosted by the National Library of Israel (NLI).
With the Shund.org corpus approaching 25,000 installments (from a single publication!), exploratory analysis has become a daunting prospect. Distant viewing and textual analysis techniques provide us a way forward, but there are emerging alternatives to help us make sense of large and unwieldy data.
How might one glimpse the contents of a text corpus, to generate preliminary research questions that might inform downstream search and more sophisticated analyses related to topics, sentiments, parts of speech, or named entities? One way is to engage with the full text of a work in virtual reality, using Longhand.