Age | Commit message | Author
2019-05-08 | Set DPI for images, and maximally compress jpg (with binarisation it doesn't ... | Nick White
2019-05-08 | Add format-for-hocr-pdf.sh script | Nick White
2019-04-23 | Save dehyphenated text to a different file, rather than overwriting the original | Nick White
2019-04-23 | Add dehyphenate script | Nick White
2019-04-09 | Modify traintessv4.sh to include step to construct final training | Nick White
2019-04-02 | Fix bugs in traintessv4.sh | Nick White
2019-04-02 | Add tesseractv4 training script | Nick White
2019-03-26 | Make book graph scripts more robust to dodgy page filenames, and name bookgra... | Nick White
2019-03-26 | Add nonewlines script | Nick White
2019-03-11 | Add basic bsb scraper | Nick White
2019-02-25 | Make bookgraph script more readable | Nick White
2019-02-25 | Add various helper scripts | Nick White