summaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2021-03-16rescribe: change default training directory to trainings/v0.3.3Nick White
2021-02-22lspipeline: Rename to lspipeline-ng, and restore pre concurrency version to l...Nick White
2021-02-15getsamplepages: Add -prefix option, and use 'best' to get random page numbersNick White
2021-02-05Merge branch 'master' of ssh://ssh.phx.nearlyfreespeech.net/home/public/bookp...Nick White
2021-02-05Update go-chart dependencyNick White
2021-02-01Update AWS dependency to 1.37.1Nick White
2021-02-01Ensure DeleteObjects can handle over 1000 files to delete; fixes rmbook for l...Nick White
2021-01-26Make ListObjectsWithMeta generic again and create a specialised ListObjectWit...Nick White
2021-01-26Improve lspipeline concurrency by removing WaitGroup stuffNick White
2021-01-26Speed up lspipeline by making s3 requests concurrently and only processing si...Nick White
2021-01-26Stop limiting keys returned from listobjectprefixes' api usage; this speeds u...Nick White
2020-12-15[rmbook] Append / to end of bookname, to ensure e.g. "1" doesnt match all boo...Nick White
2020-12-15[rmbook] Add -dryrun flagNick White
2020-12-14Add rmbook toolNick White
2020-12-14Update preproc module used to incorporate an important crash fixNick White
2020-12-07[rescribe] Fix up *.hocr glob, which ensures that using a savedir that alread...v0.3.2Nick White
2020-12-07[rescribe] Allow saving of results to somewhere other than a directory named ...Nick White
2020-12-04Ensure mkdir will succeed in uploadNick White
2020-12-03[rescribe] Fix portability issue where hocrs may not be correctly moved and t...Nick White
2020-12-03Don't upload binarised pdf twice needlesslyNick White
2020-11-30Merge branch 'master' of ssh://hammerhead/home/nick/rescribe/src/bookpipelineNick White
2020-11-30Add getstats toolNick White
2020-11-24[booktopipeline] Add a check to disallow adding a book that already existsNick White
2020-11-18Switch to a maintained version of gofpdfNick White
2020-11-18Describe rescribe tool in documentationv0.3.1Nick White
2020-11-17Add trimqueue and logwholequeue utilities which can help deal with weird queu...Nick White
2020-11-17Remove _bin0.x from txt filenamesv0.3.0Nick White
2020-11-16Some changes to ensure the pipeline works correctly on WindowsNick White
2020-11-16[rescribe] Default to an appropriate tesscmd for WindowsNick White
2020-11-16[rescribe] Add txt output, only keep colour pdf, and reorganise files so they...Nick White
2020-11-16[rescribe] Mention in usage that things can be saved in a different directoryNick White
2020-11-16Add makefile for generating cross compiled rescribe binariesNick White
2020-11-10gofmtNick White
2020-11-10[rescribe] Enable custom paths to tesseract command to be set (also improve s...Nick White
2020-11-10[rescribe] Change -t to the path of the traineddata file, and set TESSDATA_PR...Nick White
2020-11-10[rescribe] Handle errors in processbook correctly, and improve console outputNick White
2020-11-10[getpipelinebook] Rewrite to use internal package functionsNick White
2020-11-10Switch booktopipeline to use internal pipeline functionsNick White
2020-11-09Add a couple of things that should not be forgottenNick White
2020-11-09Switch Preprocess() to take the thresholds to use, and have rescribe tool onl...separatelocalNick White
2020-11-09[rescribe] Local only combo tool basically now working. Testing is still mini...Nick White
2020-11-09[rescribe] work in progress at a self-contained local pipeline processor, cal...Nick White
2020-11-09[bookpipeline] Split most functionality out to package internal/pipelineNick White
2020-11-09Add -autostop, so time to shutdown can be specified, and so the process can j...Nick White
2020-11-09[bookpipeline] Improve interface, particularly for local use, by disabling (f...Nick White
2020-11-09Set hocr config options directly rather than relying on 'hocr' config fileNick White
2020-11-06Fix the README to be valid markdown in the local exampleNick White
2020-11-06Document the local modeNick White
2020-11-06Add git clone advice to readmeNick White
2020-10-21Fix a bug that caused analyse step to not be triggered with local connectionNick White