Age         Commit message  Author
2020-02-05  Fix allOCRed for wipeonly books (hopefully)  Nick White
2020-01-22  [pagegraph] Stop printing debug output  Nick White
2020-01-22  [pagegraph] Fix bug where word graphs werent stable as their number wasnt par...  Nick White
2020-01-22  Make pagegraph use lines again  Nick White
2020-01-22  Remove unused function in pagegraph  Nick White
2020-01-21  Add pagegraph tool  Nick White
2019-12-17  Add png flag to getpipelinebook  Nick White
2019-12-17  Add pdf flag to getpipelinebook  Nick White
2019-12-16  Fix error message syntax in getpipelinebook  Nick White
2019-12-16  Add a new tool, addtoqueue, which can be used to generically add any message ...  Nick White
2019-12-16  Fix usage message for getpipelinebook, and trim final slashes in lspipeline o...  Nick White
2019-12-13  Update StartInstance to point to the newest image  Nick White
2019-12-13  Hopefully fix empty training bug  Nick White
2019-12-13  Mention training in ocr error message  Nick White
2019-12-13  Print stdout and stderr output when tesseract fails  Nick White
2019-12-11  Add addtoanalysequeue tool, which is useful for debugging  Nick White
2019-12-11  Fix typo incorrectly screwing up PDFs  Nick White
2019-12-11  Clarify use of -training in tools  Nick White
2019-12-11  Clean up and correct book name parsing in the pipeline, and update usage of g...  Nick White
2019-12-11  Add ability to set a different training for the ocr job  Nick White
2019-12-11  Use aws.go with mkpipeline too, plus fix one log.Fatal call in aws.go which s...  Nick White
2019-12-06  Don't abort PDF generation if pages aren't found, just do the best that can b...  Nick White
2019-12-05  Remove (the generally empty) files in the case of a failed download  Nick White
2019-12-05  Default getpipelinebook to downloading pdfs instead of images  Nick White
2019-12-05  Fix the PDF in analyse step part of bookpipeline  Nick White
2019-12-05  Add pdf generation to analyse step (untested)  Nick White
2019-12-03  Rewrite lspipeline book listing part to be much faster by taking advantage of...  Nick White
2019-12-03  Don't pause between OCR page jobs; this should save us significant amounts of...  Nick White
2019-11-29  Make error message clear what page is causing issues  Nick White
2019-11-26  Improve usage notice  Nick White
2019-11-26  Ensure error in file walking is correctly returned  Nick White
2019-11-20  Add x/image to go.mod  Nick White
2019-11-20  Merge branch 'addpdf'  Nick White
2019-11-20  Implement image resizing option into PDF generation, so that smaller PDFs to ...  Nick White
2019-11-19  Send pages to the individual OCR Page queue by default  Nick White
2019-11-19  Add ocrpage queue for processing individual pages  Nick White
2019-11-12  Merge branch 'addpdf'  Nick White
2019-11-12  Embed a font, compressed, into the binary  Nick White
2019-11-12  Fix sleep in unstickocr  Nick White
2019-11-12  Add unstickocr tool, until the heartbeat bug is eliminated  Nick White
2019-11-12  Add spotme command to start appropriate spot instances  Nick White
2019-11-12  Merge branch 'addpdf'  Nick White
2019-11-11  Add go.mod and go.sum  Nick White
2019-11-11  Switch to main gofpdf, now our SetTextRenderingMode has been merged  Nick White
2019-11-01  Compress the font with zlib, and include it in repo  Nick White
2019-10-31  Add capability to embed font files into tool  Nick White
2019-10-31  PDF: add functionality to use "best" file if it exists  Nick White
2019-10-31  PDF: add space to each word to ensure copy-past ability from more PDF readers  Nick White
2019-10-31  PDF: lay out every word with coordinates separately  Nick White
2019-10-31  Add flag to switch between binarised and colour output  Nick White