path: root/pkg/hocr
Age | Commit message | Author
2021-03-23 | hocr: Add ability to specify a custom image path for hocr line extraction, and use it in extracthocrlines | Nick White
2021-02-09 | hocr: Use extracted page name for line naming | Nick White
This means that even in multi-page hocrs containing lines with the same id (like line_1_1), the page name will be different, so extracthocrlines no longer mistakenly gives different lines the same name and overwrites them.
2021-02-09 | hocr: Use image specified in ocr_page title, so can support multipage hocrs cleanly | Nick White
2020-04-14 | Add godoc documentation | Nick White
2020-03-13 | Add copyright statements to each file | Nick White
2020-01-22 | Add GetWordConfs function to hocr pkg | Nick White
2019-10-31 | Export a couple of more generally useful functions | Nick White
2019-10-30 | Simplify and document hocr package slightly better | Nick White
2019-10-23 | Make bucket-lines and related packages more robust | Nick White
bucket-lines would crash for any line that didn't have a corresponding image. Lines which weren't grayscale would also cause crashes; now they are just converted to grayscale if necessary. As a bonus, lines in jpeg can also be decoded successfully.
2019-10-08 | Remove parts that have been moved elsewhere, and rename to rescribe.xyz/utils | Nick White
bookpipeline is now at rescribe.xyz/bookpipeline
preproc is now at rescribe.xyz/preproc
integralimg is now at rescribe.xyz/preproc/integralimg