summaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2021-06-21rescribe: Embed Tesseract into binary so that no Tesseract install is necessaryNick White
2021-06-21update spot image usedNick White
2021-05-31local: Only create a file once we are sure that it will be writeableNick White
2021-05-31Add a test for up(), and document download() and up() properlyNick White
2021-05-31Fix bug after changing pipeliner for tests, to ensure DeleteObjects is availa...Nick White
2021-05-19Close process channel after writing to err channel in download(), in case of ...Nick White
2021-05-19Add tests for download()Nick White
2021-05-19Fix syntax with another Errorf callNick White
2021-05-19Local download now tries to open the source file before creating a destinatio...Nick White
2021-05-19Add basic DeleteObjects implementation to local.goNick White
2021-05-19Fix syntax for some fmt.Errorf callsNick White
2021-04-12Update preproc dependencyNick White
2021-03-16rescribe: change default training directory to trainings/v0.3.3Nick White
2021-02-22lspipeline: Rename to lspipeline-ng, and restore pre concurrency version to l...Nick White
2021-02-15getsamplepages: Add -prefix option, and use 'best' to get random page numbersNick White
2021-02-05Merge branch 'master' of ssh://ssh.phx.nearlyfreespeech.net/home/public/bookp...Nick White
2021-02-05Update go-chart dependencyNick White
2021-02-01Update AWS dependency to 1.37.1Nick White
2021-02-01Ensure DeleteObjects can handle over 1000 files to delete; fixes rmbook for l...Nick White
2021-01-26Make ListObjectsWithMeta generic again and create a specialised ListObjectWit...Nick White
2021-01-26Improve lspipeline concurrency by removing WaitGroup stuffNick White
2021-01-26Speed up lspipeline by making s3 requests concurrently and only processing si...Nick White
2021-01-26Stop limiting keys returned from listobjectprefixes' api usage; this speeds u...Nick White
2020-12-15[rmbook] Append / to end of bookname, to ensure e.g. "1" doesnt match all boo...Nick White
2020-12-15[rmbook] Add -dryrun flagNick White
2020-12-14Add rmbook toolNick White
2020-12-14Update preproc module used to incorporate an important crash fixNick White
2020-12-07[rescribe] Fix up *.hocr glob, which ensures that using a savedir that alread...v0.3.2Nick White
2020-12-07[rescribe] Allow saving of results to somewhere other than a directory named ...Nick White
2020-12-04Ensure mkdir will succeed in uploadNick White
2020-12-03[rescribe] Fix portability issue where hocrs may not be correctly moved and t...Nick White
2020-12-03Don't upload binarised pdf twice needlesslyNick White
2020-11-30Merge branch 'master' of ssh://hammerhead/home/nick/rescribe/src/bookpipelineNick White
2020-11-30Add getstats toolNick White
2020-11-24[booktopipeline] Add a check to disallow adding a book that already existsNick White
2020-11-18Switch to a maintained version of gofpdfNick White
2020-11-18Describe rescribe tool in documentationv0.3.1Nick White
2020-11-17Add trimqueue and logwholequeue utilities which can help deal with weird queu...Nick White
2020-11-17Remove _bin0.x from txt filenamesv0.3.0Nick White
2020-11-16Some changes to ensure the pipeline works correctly on WindowsNick White
2020-11-16[rescribe] Default to an appropriate tesscmd for WindowsNick White
2020-11-16[rescribe] Add txt output, only keep colour pdf, and reorganise files so they...Nick White
2020-11-16[rescribe] Mention in usage that things can be saved in a different directoryNick White
2020-11-16Add makefile for generating cross compiled rescribe binariesNick White
2020-11-10gofmtNick White
2020-11-10[rescribe] Enable custom paths to tesseract command to be set (also improve s...Nick White
2020-11-10[rescribe] Change -t to the path of the traineddata file, and set TESSDATA_PR...Nick White
2020-11-10[rescribe] Handle errors in processbook correctly, and improve console outputNick White
2020-11-10[getpipelinebook] Rewrite to use internal package functionsNick White
2020-11-10Switch booktopipeline to use internal pipeline functionsNick White