Age | Commit message (Collapse) | Author | |
---|---|---|---|
2020-07-14 | Do vertical wiping before horizontalv0.1.4 | Nick White | |
This should help the horizontal wiping, hopefully, as there will be less inconsistent noise inside of the text area it processes. | |||
2020-07-13 | Improve wiper algorithm | Nick White | |
Several changes made to wiper to improve things: - If findbestedge finds more than one 'best' line, choose the middle one, rather than the first. - findbestedge selects the line with the lowest number of black pixels, rather than (erroneously) the highest. - Added a findedgesOutin() function, which finds edges like findedges() but looking from the outside edges of the image inwards, rather than from the inside out. More testing is needed to decide whether this is useful; initial testing showed very little difference, but that was limited to lightly binarised images. This is in use by VWipe() at present, where worries about several gutters thanks to the edges of other pages aren't present. - Added separate vwsize, vthresh and vmin arguments to WipeFile so that vertical wiping can have different parameters (which is very important for good quality vertical wiping). - Updated wiping tests. - Rewrote the wipe cmd to use WipeFile() directly. | |||
2020-06-01 | Mention documentation URL | Nick White | |
2020-05-04 | Have wipefile do vertical as well as horizontal wipingv0.1.3 | Nick White | |
2020-04-14 | Update integralimg dependencyv0.1.2 | Nick White | |
2020-04-14 | Improve documentation throughout | Nick White | |
2020-03-05 | Add another top/bottom wipe testv0.1.1 | Nick White | |
2020-03-04 | Add vertical test to wipesides | Nick White | |
2020-03-03 | Add experimental vertical wiping feature to wiper | Nick White | |
2020-03-03 | Add edge detection tests for wipesides | Nick White | |
These should be nice and robust to small changes that are still valid, as they don't depend on the output not changing, only on the detected sides being within appropriate limits. | |||
2020-02-27 | Add go.mod and go.sum for v0.1.0v0.1.0 | Nick White | |
2020-02-27 | Clean up readme | Nick White | |
2020-02-26 | Pretty up README | Nick White | |
2020-02-26 | Fix import lines to use new integralimg location | Nick White | |
2020-02-26 | Remove the integralimg package from here; it now has its own repository at ↵ | Nick White | |
rescribe.xyz/integralimg | |||
2020-02-26 | Fix flag parsing with go test, and switch to -short to gate long tests | Nick White | |
2020-02-26 | Add README and GPLv3 LICENSE and mention license in every source file | Nick White | |
2020-02-20 | Update wipesides test | Nick White | |
2020-02-13 | Add experimental and partial splittable tool | Nick White | |
2019-10-08 | Continue separating the repository; remove all but preproc, and move ↵ | Nick White | |
integralimg package under it | |||
2019-10-07 | Ensure wipe pipeline uses the expected png files | Nick White | |
2019-10-02 | Improve usage notice for booktopipeline | Nick White | |
2019-10-02 | Add -prebinarised flag to booktopipeline | Nick White | |
2019-10-02 | gofmt | Nick White | |
2019-10-02 | Add wipeonly queue and functionality | Nick White | |
This is useful for prebinarised images, which don't need full preprocessing, but do require wiping, albeit with a more conservative threshold. | |||
2019-09-27 | Improve wiping procedure to work better with 2 column layouts | Nick White | |
2019-09-27 | Fix crash bug when graph was used on source with less than 10 pages | Nick White | |
2019-09-27 | One more update of graph.go to correspond to new go-chart, and improve usage ↵ | Nick White | |
for wipe | |||
2019-09-27 | Hardcode to ignore "workhorse" from logs | Nick White | |
2019-09-27 | Update usage of go-chart to correspond to latest version of library | Nick White | |
2019-09-24 | gofmt | Nick White | |
2019-09-24 | Improve ssh logs; ensure only fully operational servers are tried, and ↵ | Nick White | |
ensure connections to new ips not in known_hosts still succeed | |||
2019-09-24 | Do ssh log collection concurrently | Nick White | |
2019-09-24 | Get ssh logs from all running servers | Nick White | |
2019-09-24 | Add list of books done and in progress to lspipeline | Nick White | |
2019-09-24 | Rewrite GetInstanceDetails so page function is separate | Nick White | |
2019-09-24 | Move ec2 stuff out of lspipeline and into aws.go | Nick White | |
2019-09-23 | gofmt | Nick White | |
2019-09-23 | Move the sqs stuff out to aws.go | Nick White | |
2019-09-19 | Add queue listing to lspipeline | Nick White | |
2019-09-19 | Switch to using a goroutine for ec2 instance info, so can do all aws ↵ | Nick White | |
requests concurrently in due course | |||
2019-09-18 | Add start of lspipeline | Nick White | |
2019-09-17 | gofmt | Nick White | |
2019-09-16 | Be more careful to try to grab the message after a heartbeat failure more ↵ | Nick White | |
quickly Rather than waiting for the whole length of a visibility timeout, in which time another process may grab the message, instead wait a short amount of time, each time the message is searched for. Also add a bit more logging. | |||
2019-09-14 | Ensure enough time has elapsed before looking for the message to reget in ↵ | Nick White | |
the case of heartbeat running out | |||
2019-09-12 | Don't prefix date/time to logs, as logger will store that anyway | Nick White | |
2019-09-11 | Increase size of graph to 4k | Nick White | |
2019-09-11 | Fix bug with graph that prevented the ticks from being correct, thus ruining ↵ | Nick White | |
the graph | |||
2019-09-11 | Work around the SQS limit of 12 hours of visibility timeout | Nick White | |
This is done by checking for the error that is emitted in such a case, and if it's found trying several times to find the message back in the queue, and returning the message with an updated handle back to the caller to use in the future. | |||
2019-09-06 | Add flags to disable checking various queues | Nick White | |