path: root/cmd
Age | Commit message | Author
2020-05-22 | Fix bookpipeline failing if shutdown option isn't used | Nick White
2020-05-22 | [untested] Use less disk space (ref: minimisedisk) | Nick White
    There are several ways that disk usage is reduced with this patch:
    - Files are deleted as soon as they have been uploaded
    - Once a page image has been added to a PDF, immediately delete it
    This should allow much larger books to be processed without needing bigger disks.
2020-05-19 | Add getandpurgequeue debugging tool | Nick White
2020-04-21 | Simplify spotme | Nick White
2020-04-14 | Add getbests tool that was previously in the utils repo | Nick White
2020-04-14 | Briefly document each of the commands in a godoc friendly way, and improve the cloudsettings documentation slightly | Nick White
2020-04-07 | Remove unused OCR queue (was superseded by the ocrpage queue some time ago) | Nick White
2020-04-07 | gofmt | Nick White
2020-04-07 | Separate out cloud settings into a separate file; cloudsettings.go | Nick White
2020-03-31 | Disable autoshutdown by default for bookpipeline, and update to ami 0.11 (which reenables it for spot instances) | Nick White
2020-03-31 | [bookpipeline] Fix typo in previous commit and rename HeartbeatTime to HeartbeatSeconds, as it is not a Time | Nick White
2020-03-31 | [bookpipeline] Stop using filepath.Join for storage keys, as we want to ensure it is always a / delimiter | Nick White
2020-03-31 | [bookpipeline] Improve logging output | Nick White
2020-03-31 | [bookpipeline] Add (experimental) log saving functionality | Nick White
2020-03-30 | [bookpipeline] Clean up autoshutdown | Nick White
2020-03-30 | [bookpipeline] Enable real shutdown when bookpipeline has been idle for 5 minutes | Nick White
2020-03-30 | [bookpipeline] Neaten shutdown fix | Nick White
2020-03-30 | [bookpipeline] Fix hang bug when restarting shutdown timer | Nick White
2020-03-30 | Rewrite autoshutdown to do things right [bugs excluded] (wip) | Nick White
2020-03-24 | [bookpipeline] Improve autoshutdown wip | Nick White
2020-03-24 | [bookpipeline] Add experimental (dummy) shutdown part | Nick White
2020-03-23 | [getpipelinebook] Switch to MinimalInit() so that it can be run without SQS permissions | Nick White
2020-03-23 | Add Log() function to Pipeliner interface | Nick White
    This simplifies things nicely from using conn.GetLogger().Println() to conn.Log()
2020-03-23 | Replace errors.New(fmt.Sprintf with fmt.Errorf | Nick White
    Embarrassing I hadn't noticed the fmt.Errorf function before, but better late than never.
2020-03-23 | Don't try to make a graph with one line (it will fail), and don't mark analysis as failed if graph isn't made for that reason | Nick White
2020-03-23 | [getpipelinebook] Add -binarisedpdf and -colourpdf flags | Nick White
2020-03-23 | [getpipelinebook] Add -graph flag to download just graphs | Nick White
2020-03-09 | Add nobooks flag to lspipeline so it has a faster mode | Nick White
2020-02-27 | Remove fonttobytes (use the one in rescribe.xyz/utils repo instead) | Nick White
2020-02-27 | Add documentation, license notices, and license | Nick White
2020-02-27 | Improve usage description of confgraph and pagegraph | Nick White
2020-02-05 | Fix allOCRed for wipeonly books (hopefully) | Nick White
    allOCRed was checking for wipePattern files, however they should have been transformed into the regular preprocessedPattern for OCR anyway, so shouldn't have been directly OCRed. Thus, allOCRed was mistakenly looking for .hocr versions of the original wipePattern files, which never would have been produced.
2020-01-22 | [pagegraph] Stop printing debug output | Nick White
2020-01-22 | [pagegraph] Fix bug where word graphs weren't stable as their number wasn't parsed by graph, and add line or word option | Nick White
2020-01-22 | Make pagegraph use lines again | Nick White
2020-01-22 | Remove unused function in pagegraph | Nick White
2020-01-21 | Add pagegraph tool | Nick White
2019-12-17 | Add png flag to getpipelinebook | Nick White
2019-12-17 | Add pdf flag to getpipelinebook | Nick White
2019-12-16 | Fix error message syntax in getpipelinebook | Nick White
2019-12-16 | Add a new tool, addtoqueue, which can be used to generically add any message to any queue | Nick White
2019-12-16 | Fix usage message for getpipelinebook, and trim final slashes in lspipeline output | Nick White
2019-12-13 | Hopefully fix empty training bug | Nick White
2019-12-13 | Mention training in ocr error message | Nick White
2019-12-13 | Print stdout and stderr output when tesseract fails | Nick White
2019-12-11 | Add addtoanalysequeue tool, which is useful for debugging | Nick White
2019-12-11 | Fix typo incorrectly screwing up PDFs | Nick White
2019-12-11 | Clarify use of -training in tools | Nick White
2019-12-11 | Clean up and correct book name parsing in the pipeline, and update usage of getpipelinebook | Nick White
2019-12-11 | Add ability to set a different training for the ocr job | Nick White