Age | Commit message | Author | |
---|---|---|---|
2020-11-09 | [rescribe] Local only combo tool basically now working. Testing is still minimal. | Nick White | |
2020-11-09 | [rescribe] work in progress at a self-contained local pipeline processor, called rescribe | Nick White | |
2020-11-09 | [bookpipeline] Split most functionality out to package internal/pipeline | Nick White | |
No functionality changes, but this should make it easier to make custom builds using the pipeline in slightly different ways. | |||
2020-11-09 | Add -autostop, so time to shutdown can be specified, and so the process can just be stopped after a period, rather than the whole computer shut down | Nick White | |
2020-11-09 | [bookpipeline] Improve interface, particularly for local use, by disabling (failing) log saving, mail sending, and removing erroneous references to AWS | Nick White | |
2020-11-09 | Set hocr config options directly rather than relying on 'hocr' config file | Nick White | |
This ensures that bookpipeline will still work even if TESSDATA_PREFIX has been set to a directory without configs in it. | |||
2020-11-06 | Fix the README to be valid markdown in the local example | Nick White | |
2020-11-06 | Document the local mode | Nick White | |
2020-11-06 | Add git clone advice to readme | Nick White | |
2020-10-21 | Fix a bug that caused analyse step to not be triggered with local connection | Nick White | |
2020-10-20 | Improve logging by using Println, which ensures there is a space between arguments, even if all are strings | Nick White | |
2020-10-20 | Fix local queue deletion properly | Nick White | |
2020-10-20 | Hopefully fix off-by-one error causing errors with local bookpipeline | Nick White | |
2020-10-20 | Add postprocess-bythresh cmd | Nick White | |
2020-10-20 | Update spot image to use | Nick White | |
2020-09-22 | [booktopipeline] Check that all images are valid before adding to pipeline | Nick White | |
2020-09-15 | Abort and delete a failed wipeonly job, like we do with preprocessing | Nick White | |
There was no reason not to do this with wipeonly as well, and sure enough a single broken PNG image in a wipeonly task would cause the queue to exponentially fill as happened previously. | |||
2020-09-07 | Update spot instance ami once again | Nick White | |
2020-09-01 | Update spot instance ami to use | Nick White | |
2020-09-01 | Fix confusing usage message for booktopipeline | Nick White | |
2020-08-24 | update getsamplepages to just get jpg pages | Nick White | |
2020-08-19 | Add getsamplepages | Nick White | |
2020-08-18 | Update preproc to v0.4.0 to enable vertical wiping | Nick White | |
2020-07-28 | Allow override of autodetected queues for booktopipeline | Nick White | |
2020-07-28 | Autodetect queue for booktopipeline based on file extension | Antonia Karaisl | |
2020-07-27 | Use os.Getenv() to find config dir, rather than rely on os.UserConfigDir(), as that isnt present on go1.11 | Nick White | |
2020-07-27 | Update AMI to new one which includes a mailsettings file | Nick White | |
2020-07-27 | Switch mail settings to an externally set file | Nick White | |
2020-07-21 | [bookpipeline] If preprocessing fails, email us and remove the job from the queue | Nick White | |
This prevents the current situation where a failed preprocessing job is endlessly repeated, potentially spawning thousands of ocrpage jobs in its wake each time. Note that the email stuff works but requires putting secrets into .go files, so need to rewrite that to read from somewhere more sensible like a dotfile on the host. | |||
2020-07-20 | Fix typo | Nick White | |
2020-07-20 | Merge branch 'master' of https://git.rescribe.xyz/bookpipeline | Nick White | |
2020-07-20 | Update preproc to v0.1.4 to take advantage of vertical wiping parameters, and change WipeFile() to take advantage of them (tag: v0.2.5) | Nick White | |
2020-06-16 | [getallhocrs] Skip files which have already been downloaded | Nick White | |
2020-06-15 | Add getallhocrs tool | Nick White | |
2020-06-03 | Hopefully fix last bug in analyse step of bookpipeline | Nick White | |
2020-06-03 | Fix bug in analyse step of bookpipeline | Nick White | |
2020-06-02 | Fix race condition that could cause errors to be silently discarded | Nick White | |
This was a nasty one. By closing the up channel, the up() function would finish and send to the done channel. This means that the select between err and done would be random as to which was picked, whereas of course if there has been an error that path must be taken. | |||
2020-06-02 | Proper full fix for local queue handling (hopefully) | Nick White | |
2020-06-02 | Fix bug with local queue deletion | Nick White | |
2020-06-01 | Mention documentation URL | Nick White | |
2020-05-29 | [bookpipeline] Remove local copy of original page image once preprocessed | Nick White | |
2020-05-29 | Merge branch 'minimisedisk' (tag: v0.2.4) | Nick White | |
2020-05-26 | Merge branch 'local' | Nick White | |
2020-05-26 | Add -c conntype for necessary tools to allow local connection to be used | Nick White | |
2020-05-26 | Fix DelFromQueue and Upload for local connections | Nick White | |
2020-05-22 | Fix CheckQueue for LocalConn (branch: local) | Nick White | |
2020-05-22 | Fix bookpipeline failing if shutdown option isnt used | Nick White | |
2020-05-22 | Fix bookpipeline failing if shutdown option isnt used | Nick White | |
2020-05-22 | Add experimental local connection type | Nick White | |
2020-05-22 | [untested] Use less disk space (branch: minimisedisk) | Nick White | |
There are several ways that disk usage is reduced with this patch: files are deleted as soon as they have been uploaded; and once a page image has been added to a PDF, it is immediately deleted. This should allow much larger books to be processed without needing bigger disks. | |||