Age | Commit message (Expand) | Author |
2019-09-05 | Don't abort analysis if we encounter a hocr with no words, just skip it | Nick White |
2019-09-05 | gofmt | Nick White |
2019-09-05 | Update Pipeliner interface in getpipelinebook, and update some comments | Nick White |
2019-09-04 | Rewrite heartbeat so errors during it will be reported, and the aws api doesn... | Nick White |
2019-09-04 | Ensure any channels that need to be consumed before goroutine is finished are... | Nick White |
2019-09-03 | Improve debug logging | Nick White |
2019-09-02 | Log upload and download events | Nick White |
2019-09-02 | Add initial getpipelinebook cmd (untested) | Nick White |
2019-08-28 | Add medium and bad lines to graphs | Nick White |
2019-08-28 | Add standalone graph tool; confgraph | Nick White |
2019-08-28 | Move booktopipeline and mkpipeline into bookpipeline/cmd | Nick White |
2019-08-28 | Split out bookpipeline to cmd/ | Nick White |
2019-08-28 | Move graph function to its own file, and further improve layout | Nick White |
2019-08-28 | Separate graph creation from analyse(). | Nick White |
2019-08-27 | Print x axis ticks nicely | Nick White |
2019-08-27 | Add annotations for pages with confidence below 70 | Nick White |
2019-08-27 | Add basic graphing (still work to do, but basics are working) | Nick White |
2019-08-27 | Add basic analyse step, working but incomplete | Nick White |
2019-08-23 | Expect source files to be .jpg | Nick White |
2019-08-23 | Fix gaping bugs by using correct queues and downloads | Nick White |
2019-08-22 | Generalise preprocessing and ocring to reuse common code | Nick White |
2019-08-22 | Switch to using flag to process command line, and allow different training to... | Nick White |
2019-08-22 | gofmt | Nick White |
2019-08-22 | Update usage string, and comments | Nick White |
2019-08-22 | Improve timing of queue checks | Nick White |
2019-08-22 | Fix process finishing by closing dl channel | Nick White |
2019-08-20 | Handle errors properly with goroutines | Nick White |
2019-08-20 | Handle errors correctly in main parts of program | Nick White |
2019-08-20 | Substantially improve problematic object listing part of API | Nick White |
2019-08-20 | Add basic OCR support, and reorganise code | Nick White |
2019-08-20 | Split aws implementation from main.go in pipelinepreprocess | Nick White |
2019-08-20 | Export qmsg type | Nick White |
2019-08-19 | Fix pipelinepreprocess segfaults | Nick White |
2019-08-19 | Work in progress rearchitecture to use interfaces; currently pointers are scr... | Nick White |
2019-08-13 | Various improvements to pipelinepreprocess | Nick White |
2019-08-13 | Correct typo in bucket name for pipelinepreprocess; tested and seems to work,... | Nick White |
2019-08-13 | Add bonus verbose log points | Nick White |
2019-08-13 | Add booktopipeline tool (only lightly tested) | Nick White |
2019-08-13 | Reduce SQS WaitTime to something in-spec, and add bonus verbose log points | Nick White |
2019-08-13 | Switch ksizes to use by preprocmulti | Nick White |
2019-08-13 | Add basic verbose logging capabilities to pipelinepreprocess | Nick White |
2019-07-25 | Add first draft of pipelinepreprocess - completely untested, will contain bugs | Nick White |
2019-07-19 | rename setupawspipeline to mkpipeline | Nick White |
2019-07-19 | rename pipelineaws to setupawspipeline | Nick White |
2019-07-19 | Add aws pipeline setup | Nick White |
2019-06-25 | Remove 0.6 binarisation threshold option from preprocmulti | Nick White |
2019-06-25 | Experimentally adjust wipe threshold according to binarisation level | Nick White |
2019-06-11 | Name hocrs as pdfimages does, and preserve entities for hocr | Nick White |
2019-06-11 | Add basic utility to turn an eebo xml into a set of hocr files (for hocr2pdf) | Nick White |
2019-06-03 | Add option to disable wiping for preproc and preprocmulti | Nick White |