Age | Commit message (Expand) | Author |
2019-06-11 | Name hocrs as pdfimages does, and preserve entities for hocr | Nick White |
2019-06-11 | Add basic utility to turn an eebo xml into a set of hocr files (for hocr2pdf) | Nick White |
2019-06-03 | Add option to disable wiping for preproc and preprocmulti | Nick White |
2019-06-03 | Add -m option to wipe to set minimum content area for wipe to proceed | Nick White |
2019-05-15 | Return an error if page average calculation cant be done with hocr | Nick White |
2019-05-14 | Rewrite pgconf to be more accurate by measuring average word confidence rathe... | Nick White |
2019-05-14 | pgconf: Don't print NaN if a page has no lines, and show the percentage, rath... | Nick White |
2019-05-14 | Add pgconf tool that prints the overall confidence for a whole page of hocr | Nick White |
2019-05-14 | Basic cleanup of preprocmulti | Nick White |
2019-05-14 | gofmt | Nick White |
2019-05-14 | Add preprocmulti tool, that outputs multiple binarisation options quickly | Nick White |
2019-05-13 | Add preproc command, that binarises and preprocesses together | Nick White |
2019-05-13 | Define flags in each test, so they arent erroneously picked up and used by cm... | Nick White |
2019-05-13 | Use general integralimg functions for wipe functions | Nick White |
2019-05-13 | Add -slow flag to test to skip slow tests by default | Nick White |
2019-05-13 | Reorganise image manipulation to separate integral image parts | Nick White |
2019-05-13 | Start switching preproc to use interfaces more | Nick White |
2019-05-13 | Rename cleanup to wipe, and only export main function | Nick White |
2019-05-13 | Rename cleanup package to preproc, and add basic cmd version | Nick White |
2019-05-13 | Improve error handling in sauvola tests | Nick White |
2019-05-13 | Make cleanup a basic library | Nick White |
2019-05-13 | Add some basic tests for cleanup | Nick White |
2019-05-13 | Use the simplified findbestedge function, and simplify code | Nick White |
2019-04-18 | Simplify cleanup code | Nick White |
2019-04-18 | Put edge in middle of window slice, rather than at left side, and gofmt | Nick White |
2019-04-18 | Add basic cleanup tool; working, but more refinements planned. | Nick White |
2019-04-17 | Add basic dehyphenate tool | Nick White |
2019-03-28 | Remove todo for integral image testing for now | Nick White |
2019-03-28 | Improve tests; test regular sauvola, and add option to update golden files | Nick White |
2019-03-26 | Add zeroinv option for binarize command | Nick White |
2019-03-26 | Move sauvola binarization tool to cmd/binarize | Nick White |
2019-03-26 | Better error handling with hocr lines | Nick White |
2019-02-25 | Generalise get text from hocr lines | Nick White |
2019-02-25 | Add tool to extract plain text from hocr | Nick White |
2019-02-15 | Separate out binarize into a package, and start adding tests for it | Nick White |
2019-01-30 | Set window size automatically based on resolution | Nick White |
2019-01-30 | Remove dependency on Imger package | Nick White |
2019-01-30 | Add integral image functionality to enable massive speedup of Sauvola | Nick White |
2019-01-29 | Switch binarization to Sauvola algorithm | Nick White |
2019-01-25 | Simplify writing of sort functions in line pkg | Nick White |
2019-01-25 | gofmt | Nick White |
2019-01-25 | Use consistent naming for .prob and .hocr OcrName | Nick White |
2019-01-25 | Add html output including all images, by writing them to an html directory | Nick White |
2019-01-25 | Rename line-conf-avg to avg-lines | Nick White |
2019-01-25 | Rewrite line-conf-avg to use libraries, and support hocr | Nick White |
2019-01-25 | Update location of libraries | Nick White |
2019-01-25 | Add simple Otsu binarize tool (written a while ago) | Nick White |
2019-01-25 | Reorganisation and cleanup | Nick White |
2019-01-24 | Fix bug: if non-prob/hocr file was encountered a dupe old line could be proce... | Nick White |
2019-01-24 | Export hocr Parse() function as its likely to be useful elsewhere | Nick White |