index
:
bookpipeline
fullsizepdf
guirefactor
local
master
minimisedisk
rotation
separatelocal
Tools to process books in a cloud based pipeline system
summary
refs
log
tree
commit
diff
log msg
author
committer
range
Age
Commit message (
Expand
)
Author
2021-06-22
rescribe: allow use of embedded training even if -systess is used
Nick White
2021-06-22
cloud: update spot image to latest version that wont attempt to build rescrib...
Nick White
2021-06-22
rescribe: Add go generate command to download the needed files to embed
Nick White
2021-06-22
rescribe: Add an embedded tessdata
Nick White
2021-06-21
Merge remote-tracking branch 'ssh/master'
Nick White
2021-06-21
rescribe: Set up so only Tesseract needed for the build platform is embedded
Nick White
2021-06-21
rescribe: Embed Tesseract into binary so that no Tesseract install is necessary
Nick White
2021-06-21
update spot image used
Nick White
2021-06-15
pipeline: Ignore hidden files when checking and uploading
Nick White
2021-05-31
local: Only create a file once we are sure that it will be writeable
Nick White
2021-05-31
Add a test for up(), and document download() and up() properly
Nick White
2021-05-31
Fix bug after changing pipeliner for tests, to ensure DeleteObjects is availa...
Nick White
2021-05-19
Close process channel after writing to err channel in download(), in case of ...
Nick White
2021-05-19
Add tests for download()
Nick White
2021-05-19
Fix syntax with another Errorf call
Nick White
2021-05-19
Local download now tries to open the source file before creating a destinatio...
Nick White
2021-05-19
Add basic DeleteObjects implementation to local.go
Nick White
2021-05-19
Fix syntax for some fmt.Errorf calls
Nick White
2021-04-12
Update preproc dependency
Nick White
2021-03-16
rescribe: change default training directory to trainings/
v0.3.3
Nick White
2021-02-22
lspipeline: Rename to lspipeline-ng, and restore pre concurrency version to l...
Nick White
2021-02-15
getsamplepages: Add -prefix option, and use 'best' to get random page numbers
Nick White
2021-02-05
Merge branch 'master' of ssh://ssh.phx.nearlyfreespeech.net/home/public/bookp...
Nick White
2021-02-05
Update go-chart dependency
Nick White
2021-02-01
Update AWS dependency to 1.37.1
Nick White
2021-02-01
Ensure DeleteObjects can handle over 1000 files to delete; fixes rmbook for l...
Nick White
2021-01-26
Make ListObjectsWithMeta generic again and create a specialised ListObjectWit...
Nick White
2021-01-26
Improve lspipeline concurrency by removing WaitGroup stuff
Nick White
2021-01-26
Speed up lspipeline by making s3 requests concurrently and only processing si...
Nick White
2021-01-26
Stop limiting keys returned from listobjectprefixes' api usage; this speeds u...
Nick White
2020-12-15
[rmbook] Append / to end of bookname, to ensure e.g. "1" doesnt match all boo...
Nick White
2020-12-15
[rmbook] Add -dryrun flag
Nick White
2020-12-14
Add rmbook tool
Nick White
2020-12-14
Update preproc module used to incorporate an important crash fix
Nick White
2020-12-07
[rescribe] Fix up *.hocr glob, which ensures that using a savedir that alread...
v0.3.2
Nick White
2020-12-07
[rescribe] Allow saving of results to somewhere other than a directory named ...
Nick White
2020-12-04
Ensure mkdir will succeed in upload
Nick White
2020-12-03
[rescribe] Fix portability issue where hocrs may not be correctly moved and t...
Nick White
2020-12-03
Don't upload binarised pdf twice needlessly
Nick White
2020-11-30
Merge branch 'master' of ssh://hammerhead/home/nick/rescribe/src/bookpipeline
Nick White
2020-11-30
Add getstats tool
Nick White
2020-11-24
[booktopipeline] Add a check to disallow adding a book that already exists
Nick White
2020-11-18
Switch to a maintained version of gofpdf
Nick White
2020-11-18
Describe rescribe tool in documentation
v0.3.1
Nick White
2020-11-17
Add trimqueue and logwholequeue utilities which can help deal with weird queu...
Nick White
2020-11-17
Remove _bin0.x from txt filenames
v0.3.0
Nick White
2020-11-16
Some changes to ensure the pipeline works correctly on Windows
Nick White
2020-11-16
[rescribe] Default to an appropriate tesscmd for Windows
Nick White
2020-11-16
[rescribe] Add txt output, only keep colour pdf, and reorganise files so they...
Nick White
2020-11-16
[rescribe] Mention in usage that things can be saved in a different directory
Nick White
[next]