Age | Commit message (Collapse) | Author | |
---|---|---|---|
2019-10-29 | Debugging: kill process immediately a heartbeat error is detected (systemd ↵ | Nick White | |
will restart it soon thereafter) | |||
2019-10-29 | Another attempt to fix the ongoing heartbeat issue | Nick White | |
This time wait up to 1 second between attempts, reduce long polling time significantly, and attempt for longer before giving up. | |||
2019-10-28 | Try to fix heartbeat renew issue more fully | Nick White | |
This approach first sets the remaining visibility timeout to zero. This should ensure that the message is available to re-find as soon as the process looks for it. Correspondingly the delay between checks is much shorter, as there shouldn't be a reason for much delay. | |||
2019-10-23 | getpipelinebook: default to downloading corresponding page images, and add ↵ | Nick White | |
option to download the original page images too | |||
2019-10-23 | Manually calculate yticks, so they fall on reasonable numbers | Nick White | |
2019-10-23 | Add more annotations to graph; anything outside of the 80% "normal" band ↵ | Nick White | |
gets an annotation now, and that band is labelled | |||
2019-10-17 | Adjust the heartbeat searching function to hopefully have better luck at ↵ | Nick White | |
finding it and not letting another process steal it. | |||
2019-10-16 | Rewrite booktopipeline to use bookpipeline aws interface | Nick White | |
2019-10-16 | Sort book list in lspipeline by modified date | Nick White | |
2019-10-16 | Ensure booktopipeline complains if given too many arguments | Nick White | |
2019-10-16 | Another attempted fix to "too many open files" issue | Nick White | |
2019-10-16 | Ensure files are promptly closed by booktopipeline | Nick White | |
2019-10-11 | Ensure graph produces output by falling back on generic page numbers if none ↵ | Nick White | |
can be determined | |||
2019-10-09 | Make confgraph and graph in general more resilient to bad input | Nick White | |
2019-10-09 | Match prebinarised presegmented output from ocropus in wipepattern (named ↵ | Nick White | |
like "010001.bin.png") | |||
2019-10-08 | Update paths of other rescribe imports | Nick White | |
2019-10-08 | Separate out bookpipeline from catch-all go.git repo, and rename to ↵ | Nick White | |
rescribe.xyz/bookpipeline The dependencies from the go.git repo will follow in due course. | |||
2019-10-07 | Ensure wipe pipeline uses the expected png files | Nick White | |
2019-10-02 | Improve usage notice for booktopipeline | Nick White | |
2019-10-02 | Add -prebinarised flag to booktopipeline | Nick White | |
2019-10-02 | gofmt | Nick White | |
2019-10-02 | Add wipeonly queue and functionality | Nick White | |
This is useful for prebinarised images, which don't need full preprocessing, but do require wiping, albeit with a more conservative threshold. | |||
2019-09-27 | Improve wiping procedure to work better with 2 column layouts | Nick White | |
2019-09-27 | Fix crash bug when graph was used on source with less than 10 pages | Nick White | |
2019-09-27 | One more update of graph.go to correspond to new go-chart, and improve usage ↵ | Nick White | |
for wipe | |||
2019-09-27 | Hardcode to ignore "workhorse" from logs | Nick White | |
2019-09-27 | Update usage of go-chart to correspond to latest version of library | Nick White | |
2019-09-24 | gofmt | Nick White | |
2019-09-24 | Improve ssh logs; ensure only fully operational servers are tried, and ↵ | Nick White | |
ensure connections to new ips not in known_hosts still succeed | |||
2019-09-24 | Do ssh log collection concurrently | Nick White | |
2019-09-24 | Get ssh logs from all running servers | Nick White | |
2019-09-24 | Add list of books done and in progress to lspipeline | Nick White | |
2019-09-24 | Rewrite GetInstanceDetails so page function is separate | Nick White | |
2019-09-24 | Move ec2 stuff out of lspipeline and into aws.go | Nick White | |
2019-09-23 | gofmt | Nick White | |
2019-09-23 | Move the sqs stuff out to aws.go | Nick White | |
2019-09-19 | Add queue listing to lspipeline | Nick White | |
2019-09-19 | Switch to using a goroutine for ec2 instance info, so can do all aws ↵ | Nick White | |
requests concurrently in due course | |||
2019-09-18 | Add start of lspipeline | Nick White | |
2019-09-17 | gofmt | Nick White | |
2019-09-16 | Be more careful to try to grab the message after a heartbeat failure more ↵ | Nick White | |
quickly Rather than waiting for the whole length of a visibility timeout, in which time another process may grab the message, instead wait a short amount of time, each time the message is searched for. Also add a bit more logging. | |||
2019-09-14 | Ensure enough time has elapsed before looking for the message to reget in ↵ | Nick White | |
the case of heartbeat running out | |||
2019-09-12 | Don't prefix date/time to logs, as logger will store that anyway | Nick White | |
2019-09-11 | Increase size of graph to 4k | Nick White | |
2019-09-11 | Fix bug with graph that prevented the ticks from being correct, thus ruining ↵ | Nick White | |
the graph | |||
2019-09-11 | Work around the SQS limit of 12 hours of visibility timeout | Nick White | |
This is done by checking for the error that is emitted in such a case, and if it's found trying several times to find the message back in the queue, and returning the message with an updated handle back to the caller to use in the future. | |||
2019-09-06 | Add flags to disable checking various queues | Nick White | |
2019-09-05 | Handle no words found error in a better way so any page that is actually 0 ↵ | Nick White | |
confidence is recognised | |||
2019-09-05 | Don't abort analysis if we encounter a hocr with no words, just skip it | Nick White | |
2019-09-05 | gofmt | Nick White | |