Age | Commit message (Collapse) | Author | |
---|---|---|---|
2021-03-23 | hocr: Add ability to specify a custom image path for hocr line extraction, ↵ | Nick White | |
and use it in extracthocrlines | |||
2021-02-09 | hocr: Use extracted page name for line naming | Nick White | |
This means that even in multi page hocrs with lines with the same id (like line_1_1), then the page name will be different, so extracthocrlines now won't mistakenly name different lines the same and therefore overwrite them. | |||
2021-02-09 | hocr: Use image specified in ocr_page title, so can support multipage hocrs ↵ | Nick White | |
cleanly | |||
2020-04-14 | Add godoc documentation | Nick White | |
2020-03-13 | Add copyright statements to each file | Nick White | |
2020-01-22 | Add GetWordConfs function to hocr pkg | Nick White | |
2019-10-31 | Export a couple of more generally useful functions | Nick White | |
2019-10-30 | Simplify and document hocr package slightly better | Nick White | |
2019-10-23 | Make bucket-lines and related packages more robust | Nick White | |
bucket-lines would crash for any line that didn't have a corresponding image. Lines which weren't grayscale would also cause crashes; now they are just converted to grayscale if necessary. As a bonus, lines in jpeg can also be decoded successfull. | |||
2019-10-08 | Remove parts that have been moved elsewhere, and rename to rescribe.xyz/utils | Nick White | |
bookpipeline is now at rescribe.xyz/bookpipeline preproc is now at rescribe.xyz/preproc integralimg is now at rescribe.xyz/preproc/integralimg |