path: root/bookpipeline
Age | Commit message | Author
2019-10-02 | Improve usage notice for booktopipeline | Nick White
2019-10-02 | Add -prebinarised flag to booktopipeline | Nick White
2019-10-02 | Add wipeonly queue and functionality | Nick White
    This is useful for prebinarised images, which don't need full preprocessing, but do require wiping, albeit with a more conservative threshold.
2019-09-27 | Fix crash bug when graph was used on source with less than 10 pages | Nick White
2019-09-27 | One more update of graph.go to correspond to new go-chart, and improve usage for wipe | Nick White
2019-09-27 | Hardcode to ignore "workhorse" from logs | Nick White
2019-09-27 | Update usage of go-chart to correspond to latest version of library | Nick White
2019-09-24 | gofmt | Nick White
2019-09-24 | Improve ssh logs; ensure only fully operational servers are tried, and ensure connections to new ips not in known_hosts still succeed | Nick White
2019-09-24 | Do ssh log collection concurrently | Nick White
2019-09-24 | Get ssh logs from all running servers | Nick White
2019-09-24 | Add list of books done and in progress to lspipeline | Nick White
2019-09-24 | Rewrite GetInstanceDetails so page function is separate | Nick White
2019-09-24 | Move ec2 stuff out of lspipeline and into aws.go | Nick White
2019-09-23 | gofmt | Nick White
2019-09-23 | Move the sqs stuff out to aws.go | Nick White
2019-09-19 | Add queue listing to lspipeline | Nick White
2019-09-19 | Switch to using a goroutine for ec2 instance info, so can do all aws requests concurrently in due course | Nick White
2019-09-18 | Add start of lspipeline | Nick White
2019-09-17 | gofmt | Nick White
2019-09-16 | Be more careful to try to grab the message after a heartbeat failure more quickly | Nick White
    Rather than waiting for the whole length of a visibility timeout, in which time another process may grab the message, instead wait a short amount of time, each time the message is searched for. Also add a bit more logging.
2019-09-14 | Ensure enough time has elapsed before looking for the message to reget in the case of heartbeat running out | Nick White
2019-09-12 | Don't prefix date/time to logs, as logger will store that anyway | Nick White
2019-09-11 | Increase size of graph to 4k | Nick White
2019-09-11 | Fix bug with graph that prevented the ticks from being correct, thus ruining the graph | Nick White
2019-09-11 | Work around the SQS limit of 12 hours of visibility timeout | Nick White
    This is done by checking for the error that is emitted in such a case, and if it's found trying several times to find the message back in the queue, and returning the message with an updated handle back to the caller to use in the future.
2019-09-06 | Add flags to disable checking various queues | Nick White
2019-09-05 | Handle no words found error in a better way so any page that is actually 0 confidence is recognised | Nick White
2019-09-05 | Don't abort analysis if we encounter a hocr with no words, just skip it | Nick White
2019-09-05 | gofmt | Nick White
2019-09-05 | Update Pipeliner interface in getpipelinebook, and update some comments | Nick White
2019-09-04 | Rewrite heartbeat so errors during it will be reported, and the aws api doesn't rely on channels | Nick White
2019-09-04 | Ensure any channels that need to be consumed before goroutine is finished are done in the case of an error | Nick White
2019-09-03 | Improve debug logging | Nick White
2019-09-02 | Log upload and download events | Nick White
2019-09-02 | Add initial getpipelinebook cmd (untested) | Nick White
2019-08-28 | Add medium and bad lines to graphs | Nick White
2019-08-28 | Add standalone graph tool; confgraph | Nick White
2019-08-28 | Move booktopipeline and mkpipeline into bookpipeline/cmd | Nick White
2019-08-28 | Split out bookpipeline to cmd/ | Nick White
2019-08-28 | Move graph function to its own file, and further improve layout | Nick White
2019-08-28 | Separate graph creation from analyse(). | Nick White
2019-08-27 | Print x axis ticks nicely | Nick White
2019-08-27 | Add annotations for pages with confidence below 70 | Nick White
2019-08-27 | Add basic graphing (still work to do, but basics are working) | Nick White
2019-08-27 | Add basic analyse step, working but incomplete | Nick White
2019-08-23 | Expect source files to be .jpg | Nick White
2019-08-23 | Fix gaping bugs by using correct queues and downloads | Nick White
    This has involved refactoring to make the interface simpler, and just use the URLs / IDs for the necessary queues and storage locations, rather than wrap these in functions.
2019-08-22 | Generalise preprocessing and ocring to reuse common code | Nick White
2019-08-22 | Switch to using flag to process command line, and allow different training to be passed | Nick White