Age | Commit message (Expand) | Author |
2020-01-22 | [pagegraph] Fix bug where word graphs werent stable as their number wasnt par... | Nick White |
2020-01-22 | Make pagegraph use lines again | Nick White |
2020-01-22 | Remove unused function in pagegraph | Nick White |
2020-01-21 | Add pagegraph tool | Nick White |
2019-12-17 | Add png flag to getpipelinebook | Nick White |
2019-12-17 | Add pdf flag to getpipelinebook | Nick White |
2019-12-16 | Fix error message syntax in getpipelinebook | Nick White |
2019-12-16 | Add a new tool, addtoqueue, which can be used to generically add any message ... | Nick White |
2019-12-16 | Fix usage message for getpipelinebook, and trim final slashes in lspipeline o... | Nick White |
2019-12-13 | Hopefully fix empty training bug | Nick White |
2019-12-13 | Mention training in ocr error message | Nick White |
2019-12-13 | Print stdout and stderr output when tesseract fails | Nick White |
2019-12-11 | Add addtoanalysequeue tool, which is useful for debugging | Nick White |
2019-12-11 | Fix typo incorrectly screwing up PDFs | Nick White |
2019-12-11 | Clarify use of -training in tools | Nick White |
2019-12-11 | Clean up and correct book name parsing in the pipeline, and update usage of g... | Nick White |
2019-12-11 | Add ability to set a different training for the ocr job | Nick White |
2019-12-11 | Use aws.go with mkpipeline too, plus fix one log.Fatal call in aws.go which s... | Nick White |
2019-12-06 | Don't abort PDF generation if pages aren't found, just do the best that can b... | Nick White |
2019-12-05 | Default getpipelinebook to downloading pdfs instead of images | Nick White |
2019-12-05 | Fix the PDF in analyse step part of bookpipeline | Nick White |
2019-12-05 | Add pdf generation to analyse step (untested) | Nick White |
2019-12-03 | Rewrite lspipeline book listing part to be much faster by taking advantage of... | Nick White |
2019-12-03 | Don't pause between OCR page jobs; this should save us significant amounts of... | Nick White |
2019-11-29 | Make error message clear what page is causing issues | Nick White |
2019-11-26 | Improve usage notice | Nick White |
2019-11-26 | Ensure error in file walking is correctly returned | Nick White |
2019-11-20 | Merge branch 'addpdf' | Nick White |
2019-11-20 | Implement image resizing option into PDF generation, so that smaller PDFs to ... | Nick White |
2019-11-19 | Send pages to the individual OCR Page queue by default | Nick White |
2019-11-19 | Add ocrpage queue for processing individual pages | Nick White |
2019-11-12 | Fix sleep in unstickocr | Nick White |
2019-11-12 | Add unstickocr tool, until the heartbeat bug is eliminated | Nick White |
2019-11-12 | Add spotme command to start appropriate spot instances | Nick White |
2019-11-01 | Compress the font with zlib, and include it in repo | Nick White |
2019-10-31 | Add capability to embed font files into tool | Nick White |
2019-10-31 | PDF: add functionality to use "best" file if it exists | Nick White |
2019-10-31 | Add flag to switch between binarised and colour output | Nick White |
2019-10-31 | Move PDF handling code to a separate file | Nick White |
2019-10-31 | Many improvements to pdfbook; basically working now | Nick White |
2019-10-31 | Add work in progress PDF producer | Nick White |
2019-10-29 | Print heartbeat error on failure | Nick White |
2019-10-29 | Debugging: kill process immediately a heartbeat error is detected (systemd wi... | Nick White |
2019-10-28 | Try to fix heartbeat renew issue more fully | Nick White |
2019-10-23 | getpipelinebook: default to downloading corresponding page images, and add op... | Nick White |
2019-10-16 | Rewrite booktopipeline to use bookpipeline aws interface | Nick White |
2019-10-16 | Sort book list in lspipeline by modified date | Nick White |
2019-10-16 | Ensure booktopipeline complains if given too many arguments | Nick White |
2019-10-16 | Another attempted fix to "too many open files" issue | Nick White |
2019-10-16 | Ensure files are promptly closed by booktopipeline | Nick White |