bookpipeline - Tools to process books in a cloud based pipeline system

Age	Commit message (Collapse)	Author
2022-02-28	Add PreNoWipe queue, that just does binarisation but no wiping	Nick White

2022-01-31	Make pipeline context-aware, so the rescribe tool can cancel jobs	Nick White

2021-08-17	pipeline: use regular storage for tests, rather than a separate one	Nick White

2021-08-02	internal/pipeline: Add test (incomplete but working) for UploadImages	Nick White

2020-11-24	[booktopipeline] Add a check to disallow adding a book that already exists	Nick White
	This is important as if a book is added which has already been done, then an analyse job will be added every time a page is OCRed, which will clog up the pipeline with unnecessary work. Also if a book was added with the same name but differently named files, or a different number of pages, the results would almost certainly not be as intended. In the case of a book really wanting to be added with a particular name, either the original directory can be removed on S3, or "v2" or similar can be appended to the book name before calling booktopipeline.
2020-11-10	Switch booktopipeline to use internal pipeline functions	Nick White

2020-11-09	Add a couple of things that should not be forgotten	Nick White

2020-10-20	Improve logging by using Println, which ensures there is a space between ↵	Nick White
	arguments, even if all are strings
2020-09-22	[booktopipeline] Check that all images are valid before adding to pipeline	Nick White

2020-09-01	Fix confusing usage message for booktopipeline	Nick White

2020-07-28	Allow override of autodetected queues for booktopipeline	Nick White

2020-07-28	Autodetect queue for booktopipeline based on file extension	Antonia Karaisl

2020-05-26	Add -c conntype for necessary tools to allow local connection to be used	Nick White

2020-04-14	Briefly document each of the commands in a godoc friendly way, and improve ↵	Nick White
	the cloudsettings documentation slightly
2020-02-27	Add documentation, license notices, and license	Nick White

2019-12-11	Clarify use of -training in tools	Nick White

2019-12-11	Add ability to set a different training for the ocr job	Nick White

2019-12-11	Use aws.go with mkpipeline too, plus fix one log.Fatal call in aws.go which ↵	Nick White
	should have been handled by caller
2019-10-16	Rewrite booktopipeline to use bookpipeline aws interface	Nick White

2019-10-16	Ensure booktopipeline complains if given too many arguments	Nick White

2019-10-16	Another attempted fix to "too many open files" issue	Nick White

2019-10-16	Ensure files are promptly closed by booktopipeline	Nick White

2019-10-08	Separate out bookpipeline from catch-all go.git repo, and rename to ↵	Nick White
	rescribe.xyz/bookpipeline The dependencies from the go.git repo will follow in due course.