bookpipeline
Tools to process books in a cloud-based pipeline system
Age         Author      Commit message
2019-12-13  Nick White  Mention training in ocr error message
2019-12-13  Nick White  Print stdout and stderr output when tesseract fails
2019-12-11  Nick White  Add addtoanalysequeue tool, which is useful for debugging
2019-12-11  Nick White  Fix typo incorrectly screwing up PDFs
2019-12-11  Nick White  Clarify use of -training in tools
2019-12-11  Nick White  Clean up and correct book name parsing in the pipeline, and update usage of g...
2019-12-11  Nick White  Add ability to set a different training for the ocr job
2019-12-11  Nick White  Use aws.go with mkpipeline too, plus fix one log.Fatal call in aws.go which s...
2019-12-06  Nick White  Don't abort PDF generation if pages aren't found, just do the best that can b...
2019-12-05  Nick White  Default getpipelinebook to downloading pdfs instead of images
2019-12-05  Nick White  Fix the PDF in analyse step part of bookpipeline
2019-12-05  Nick White  Add pdf generation to analyse step (untested)
2019-12-03  Nick White  Rewrite lspipeline book listing part to be much faster by taking advantage of...
2019-12-03  Nick White  Don't pause between OCR page jobs; this should save us significant amounts of...
2019-11-29  Nick White  Make error message clear what page is causing issues
2019-11-26  Nick White  Improve usage notice
2019-11-26  Nick White  Ensure error in file walking is correctly returned
2019-11-20  Nick White  Merge branch 'addpdf'
2019-11-20  Nick White  Implement image resizing option into PDF generation, so that smaller PDFs to ...
2019-11-19  Nick White  Send pages to the individual OCR Page queue by default
2019-11-19  Nick White  Add ocrpage queue for processing individual pages
2019-11-12  Nick White  Fix sleep in unstickocr
2019-11-12  Nick White  Add unstickocr tool, until the heartbeat bug is eliminated
2019-11-12  Nick White  Add spotme command to start appropriate spot instances
2019-11-01  Nick White  Compress the font with zlib, and include it in repo
2019-10-31  Nick White  Add capability to embed font files into tool
2019-10-31  Nick White  PDF: add functionality to use "best" file if it exists
2019-10-31  Nick White  Add flag to switch between binarised and colour output
2019-10-31  Nick White  Move PDF handling code to a separate file
2019-10-31  Nick White  Many improvements to pdfbook; basically working now
2019-10-31  Nick White  Add work in progress PDF producer
2019-10-29  Nick White  Print heartbeat error on failure
2019-10-29  Nick White  Debugging: kill process immediately a heartbeat error is detected (systemd wi...
2019-10-28  Nick White  Try to fix heartbeat renew issue more fully
2019-10-23  Nick White  getpipelinebook: default to downloading corresponding page images, and add op...
2019-10-16  Nick White  Rewrite booktopipeline to use bookpipeline aws interface
2019-10-16  Nick White  Sort book list in lspipeline by modified date
2019-10-16  Nick White  Ensure booktopipeline complains if given too many arguments
2019-10-16  Nick White  Another attempted fix to "too many open files" issue
2019-10-16  Nick White  Ensure files are promptly closed by booktopipeline
2019-10-09  Nick White  Make confgraph and graph in general more resilient to bad input
2019-10-09  Nick White  Match prebinarised presegmented output from ocropus in wipepattern (named lik...
2019-10-08  Nick White  Update paths of other rescribe imports
2019-10-08  Nick White  Separate out bookpipeline from catch-all go.git repo, and rename to rescribe....