AgeCommit messageAuthor
2020-07-14Do vertical wiping before horizontalv0.1.4Nick White
This should help the horizontal wiping, hopefully, as there will be less inconsistent noise inside of the text area it processes.
2020-07-13Improve wiper algorithmNick White
Several changes made to wiper to improve things:
- If findbestedge finds more than one 'best' line, choose the middle one, rather than the first.
- findbestedge selects the line with the lowest number of black pixels, rather than (erroneously) the highest.
- Added a findedgesOutin() function, which finds edges like findedges() but looking from the outside edges of the image inwards, rather than from the inside out. More testing is needed to decide whether this is useful; initial testing showed very little difference, but that was limited to lightly binarised images. This is in use by VWipe() at present, where worries about several gutters thanks to the edges of other pages aren't present.
- Added separate vwsize, vthresh and vmin arguments to WipeFile so that vertical wiping can have different parameters (which is very important for good quality vertical wiping).
- Updated wiping tests.
- Rewrote the wipe cmd to use WipeFile() directly.
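The first two points above can be illustrated with a minimal Go sketch. It is not the repository's findbestedge: the name bestEdge, the plain slice of per-line black pixel counts and the tie-breaking for an even number of candidates are all assumptions made for illustration.

```go
package main

import "fmt"

// bestEdge is a hypothetical helper, not the repository's findbestedge.
// Given the number of black pixels in each candidate line, it returns the
// index of the line with the fewest black pixels. When several lines tie,
// it returns the middle one of the tied candidates (the upper middle when
// the number of ties is even). It returns -1 for empty input.
func bestEdge(blackCounts []int) int {
	if len(blackCounts) == 0 {
		return -1
	}
	lowest := blackCounts[0]
	var best []int // indexes of all lines sharing the lowest count so far
	for i, n := range blackCounts {
		switch {
		case n < lowest:
			// A strictly better line: discard earlier candidates.
			lowest = n
			best = []int{i}
		case n == lowest:
			best = append(best, i)
		}
	}
	return best[len(best)/2]
}

func main() {
	// Lines 1, 3 and 5 tie with 2 black pixels each; the middle one is 3.
	counts := []int{9, 2, 7, 2, 8, 2, 9}
	fmt.Println(bestEdge(counts)) // prints 3
}
```

Choosing the middle of the tied lines, rather than the first, places the edge in the centre of a run of equally clean lines instead of at its boundary.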
2020-06-01Mention documentation URLNick White
2020-05-04Have wipefile do vertical as well as horizontal wipingv0.1.3Nick White
2020-04-14Update integralimg dependencyv0.1.2Nick White
2020-04-14Improve documentation throughoutNick White
2020-03-05Add another top/bottom wipe testv0.1.1Nick White
2020-03-04Add vertical test to wipesidesNick White
2020-03-03Add experimental vertical wiping feature to wiperNick White
2020-03-03Add edge detection tests for wipesidesNick White
These should be nice and robust to small changes that are still valid, as they don't depend on the output not changing, only on the detected sides being within appropriate limits.
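A range-based check of this kind might look like the following Go sketch. The helper name withinLimits and the example limits are hypothetical and are not taken from the repository's tests; the point is only that the assertion is on a range, not on exact output.

```go
package main

import "fmt"

// withinLimits is a hypothetical helper. It reports whether a detected
// edge position lies in the inclusive range [lo, hi].
func withinLimits(edge, lo, hi int) bool {
	return edge >= lo && edge <= hi
}

func main() {
	// Assumed example: the left edge of the text area is expected
	// somewhere between x=30 and x=50.
	fmt.Println(withinLimits(41, 30, 50)) // prints true
	fmt.Println(withinLimits(72, 30, 50)) // prints false
}
```

A test written this way keeps passing if the algorithm shifts the detected edge by a few pixels, as long as the result stays inside the accepted range.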
2020-02-27Add go.mod and go.sum for v0.1.0v0.1.0Nick White
2020-02-27Clean up readmeNick White
2020-02-26Pretty up READMENick White
2020-02-26Fix import lines to use new integralimg locationNick White
2020-02-26Remove the integralimg package from here; it now has its own repository at rescribe.xyz/integralimgNick White
2020-02-26Fix flag parsing with go test, and switch to -short to gate long testsNick White
2020-02-26Add README and GPLv3 LICENSE and mention license in every source fileNick White
2020-02-20Update wipesides testNick White
2020-02-13Add experimental and partial splittable toolNick White
2019-10-08Continue separating the repository; remove all but preproc, and move integralimg package under itNick White
2019-10-07Ensure wipe pipeline uses the expected png filesNick White
2019-10-02Improve usage notice for booktopipelineNick White
2019-10-02Add -prebinarised flag to booktopipelineNick White
2019-10-02gofmtNick White
2019-10-02Add wipeonly queue and functionalityNick White
This is useful for prebinarised images, which don't need full preprocessing, but do require wiping, albeit with a more conservative threshold.
2019-09-27Improve wiping procedure to work better with 2 column layoutsNick White
2019-09-27Fix crash bug when graph was used on source with less than 10 pagesNick White
2019-09-27One more update of graph.go to correspond to new go-chart, and improve usage for wipeNick White
2019-09-27Hardcode to ignore "workhorse" from logsNick White
2019-09-27Update usage of go-chart to correspond to latest version of libraryNick White
2019-09-24gofmtNick White
2019-09-24Improve ssh logs; ensure only fully operational servers are tried, and ensure connections to new ips not in known_hosts still succeedNick White
2019-09-24Do ssh log collection concurrentlyNick White
2019-09-24Get ssh logs from all running serversNick White
2019-09-24Add list of books done and in progress to lspipelineNick White
2019-09-24Rewrite GetInstanceDetails so page function is separateNick White
2019-09-24Move ec2 stuff out of lspipeline and into aws.goNick White
2019-09-23gofmtNick White
2019-09-23Move the sqs stuff out to aws.goNick White
2019-09-19Add queue listing to lspipelineNick White
2019-09-19Switch to using a goroutine for ec2 instance info, so can do all aws requests concurrently in due courseNick White
2019-09-18Add start of lspipelineNick White
2019-09-17gofmtNick White
2019-09-16Be more careful to try to grab the message after a heartbeat failure more quicklyNick White
Rather than waiting for the whole length of a visibility timeout, in which time another process may grab the message, instead wait a short amount of time, each time the message is searched for. Also add a bit more logging.
2019-09-14Ensure enough time has elapsed before looking for the message to reget in the case of heartbeat running outNick White
2019-09-12Don't prefix date/time to logs, as logger will store that anywayNick White
2019-09-11Increase size of graph to 4kNick White
2019-09-11Fix bug with graph that prevented the ticks from being correct, thus ruining the graphNick White
2019-09-11Work around the SQS limit of 12 hours of visibility timeoutNick White
This is done by checking for the error that is emitted in such a case, and if it's found trying several times to find the message back in the queue, and returning the message with an updated handle back to the caller to use in the future.
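The retry described above could be sketched roughly as follows in Go. This does not use the AWS SDK and is not the repository's code: Message, Finder, errNotFound, reget and fakeQueue are all hypothetical names, and the real implementation would also need to detect the specific SQS error first.

```go
package main

import (
	"errors"
	"fmt"
	"time"
)

// Message is a hypothetical stand-in for a queue message.
type Message struct {
	ID     string
	Handle string // receipt handle, which changes each time the message is received
}

// Finder is a hypothetical interface: Find looks for a message by ID in
// the queue and, if it is visible, returns it with a fresh handle.
type Finder interface {
	Find(id string) (Message, error)
}

var errNotFound = errors.New("message not found")

// reget tries up to attempts times to find the message back in the queue,
// sleeping for wait after each miss, and returns it with its updated handle.
func reget(f Finder, id string, attempts int, wait time.Duration) (Message, error) {
	for i := 0; i < attempts; i++ {
		msg, err := f.Find(id)
		if err == nil {
			return msg, nil
		}
		if !errors.Is(err, errNotFound) {
			return Message{}, err // unexpected error: give up immediately
		}
		time.Sleep(wait)
	}
	return Message{}, fmt.Errorf("message %s not found after %d attempts: %w", id, attempts, errNotFound)
}

// fakeQueue simulates a queue in which the message only becomes visible
// again from the visibleAfter-th call to Find onwards.
type fakeQueue struct {
	calls, visibleAfter int
}

func (q *fakeQueue) Find(id string) (Message, error) {
	q.calls++
	if q.calls < q.visibleAfter {
		return Message{}, errNotFound
	}
	return Message{ID: id, Handle: fmt.Sprintf("handle-%d", q.calls)}, nil
}

func main() {
	q := &fakeQueue{visibleAfter: 3}
	msg, err := reget(q, "book-1", 5, 10*time.Millisecond)
	fmt.Println(msg.Handle, err) // prints: handle-3 <nil>
}
```

The caller would then use the returned handle for any later heartbeat or delete request, since the old handle is no longer valid once the message has been received again.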
2019-09-06Add flags to disable checking various queuesNick White