Age | Commit message | Author |
2020-03-23 | Replace errors.New(fmt.Sprintf with fmt.Errorf | Nick White |
2020-03-23 | Don't try to make a graph with one line (it will fail), and don't mark analys... | Nick White |
2020-03-23 | [getpipelinebook] Add -binarisedpdf and -colourpdf flags | Nick White |
2020-03-23 | [getpipelinebook] Add -graph flag to download just graphs | Nick White |
2020-03-09 | Add nobooks flag to lspipeline so it has a faster mode | Nick White |
2020-02-27 | Remove fonttobytes (use the one in rescribe.xyz/utils repo instead) | Nick White |
2020-02-27 | Add documentation, license notices, and license | Nick White |
2020-02-27 | Improve usage description of confgraph and pagegraph | Nick White |
2020-02-05 | Fix allOCRed for wipeonly books (hopefully) | Nick White |
2020-01-22 | [pagegraph] Stop printing debug output | Nick White |
2020-01-22 | [pagegraph] Fix bug where word graphs werent stable as their number wasnt par... | Nick White |
2020-01-22 | Make pagegraph use lines again | Nick White |
2020-01-22 | Remove unused function in pagegraph | Nick White |
2020-01-21 | Add pagegraph tool | Nick White |
2019-12-17 | Add png flag to getpipelinebook | Nick White |
2019-12-17 | Add pdf flag to getpipelinebook | Nick White |
2019-12-16 | Fix error message syntax in getpipelinebook | Nick White |
2019-12-16 | Add a new tool, addtoqueue, which can be used to generically add any message ... | Nick White |
2019-12-16 | Fix usage message for getpipelinebook, and trim final slashes in lspipeline o... | Nick White |
2019-12-13 | Hopefully fix empty training bug | Nick White |
2019-12-13 | Mention training in ocr error message | Nick White |
2019-12-13 | Print stdout and stderr output when tesseract fails | Nick White |
2019-12-11 | Add addtoanalysequeue tool, which is useful for debugging | Nick White |
2019-12-11 | Fix typo incorrectly screwing up PDFs | Nick White |
2019-12-11 | Clarify use of -training in tools | Nick White |
2019-12-11 | Clean up and correct book name parsing in the pipeline, and update usage of g... | Nick White |
2019-12-11 | Add ability to set a different training for the ocr job | Nick White |
2019-12-11 | Use aws.go with mkpipeline too, plus fix one log.Fatal call in aws.go which s... | Nick White |
2019-12-06 | Don't abort PDF generation if pages aren't found, just do the best that can b... | Nick White |
2019-12-05 | Default getpipelinebook to downloading pdfs instead of images | Nick White |
2019-12-05 | Fix the PDF in analyse step part of bookpipeline | Nick White |
2019-12-05 | Add pdf generation to analyse step (untested) | Nick White |
2019-12-03 | Rewrite lspipeline book listing part to be much faster by taking advantage of... | Nick White |
2019-12-03 | Don't pause between OCR page jobs; this should save us significant amounts of... | Nick White |
2019-11-29 | Make error message clear what page is causing issues | Nick White |
2019-11-26 | Improve usage notice | Nick White |
2019-11-26 | Ensure error in file walking is correctly returned | Nick White |
2019-11-20 | Merge branch 'addpdf' | Nick White |
2019-11-20 | Implement image resizing option into PDF generation, so that smaller PDFs to ... | Nick White |
2019-11-19 | Send pages to the individual OCR Page queue by default | Nick White |
2019-11-19 | Add ocrpage queue for processing individual pages | Nick White |
2019-11-12 | Fix sleep in unstickocr | Nick White |
2019-11-12 | Add unstickocr tool, until the heartbeat bug is eliminated | Nick White |
2019-11-12 | Add spotme command to start appropriate spot instances | Nick White |
2019-11-01 | Compress the font with zlib, and include it in repo | Nick White |
2019-10-31 | Add capability to embed font files into tool | Nick White |
2019-10-31 | PDF: add functionality to use "best" file if it exists | Nick White |
2019-10-31 | Add flag to switch between binarised and colour output | Nick White |
2019-10-31 | Move PDF handling code to a separate file | Nick White |
2019-10-31 | Many improvements to pdfbook; basically working now | Nick White |