Age | Commit message (Collapse) | Author | |
---|---|---|---|
2019-10-08 | Remove parts that have been moved elsewhere, and rename to rescribe.xyz/utils | Nick White | |
bookpipeline is now at rescribe.xyz/bookpipeline preproc is now at rescribe.xyz/preproc integralimg is now at rescribe.xyz/preproc/integralimg | |||
2019-09-05 | Handle no words found error in a better way so any page that is actually 0 ↵ | Nick White | |
confidence is recognised | |||
2019-09-05 | Don't abort analysis if we encounter a hocr with no words, just skip it | Nick White | |
2019-05-15 | Return an error if page average calculation cant be done with hocr | Nick White | |
2019-05-14 | Rewrite pgconf to be more accurate by measuring average word confidence ↵ | Nick White | |
rather than average line confidence | |||
2019-02-25 | Generalise get text from hocr lines | Nick White | |
2019-02-25 | Add tool to extract plain text from hocr | Nick White | |
2019-01-25 | gofmt | Nick White | |
2019-01-25 | Reorganisation and cleanup | Nick White | |