Age | Commit message | Author
2020-09-28 | [iiifdownloader] Work around oxford needing the iiif suffix adding to its id | Nick White
2020-09-28 | [iiifdownloader] Default to iiifmanifest type if none is given and no definit... | Nick White
2020-09-28 | Make page numbering more generic to handle more iiif variety, and add harvard... | Nick White
2020-09-28 | Add ability to pass -service to choose which download type to use, plus add a... | Nick White
2020-09-22 | [analysestats] complete | Nick White
2020-09-22 | [analysestats] Parse hocr for training used | Nick White
2020-09-21 | Add wip analysestats command | Nick White
2020-09-21 | Use strings.Replace rather than strings.ReplaceAll so that it works on older ... | Nick White
2020-09-12 | Add todos | Nick White
2020-09-08 | Add the option to force METS usage for BSB | Nick White
2020-09-08 | Switch to using generic page downloader for BNF | Nick White
2020-09-08 | Improve urlToPgName so it can be used by BNF too | Nick White
2020-09-08 | Improve urlToPgName and documentation | Nick White
2020-09-08 | Sanitise URLs so that // in url doesn't cause issues (bsb site can spew these) | Nick White
2020-09-08 | Switch from METS to IIIF manifest for BSB downloading, as it returns higher q... | Nick White
2020-09-08 | [iiifdownloader] BSB downloading works now, by parsing METS XML | Nick White
2020-09-07 | [iiifdownloader] Split out NoPgNums downloading to its own function | Nick White
2020-09-07 | Add skeleton of bsb support | Nick White
2020-08-25 | Move dehyphenate string code into its own function | Nick White
2020-08-25 | Fixes to dehyphenate | Nick White
2020-08-25 | Add text mode for dehyphenate tool | Nick White
2020-06-23 | [iiifdownloader] Only remove 1 duplicate page, as 2nd one may not be duplicat... | Nick White
2020-06-23 | [iiifdownloader] Add support for BNF urls with a dot after book id | Nick White
2020-06-23 | Add IIIF downloader, that just supports BNF for now | Nick White
2020-06-01 | Mention documentation URL | Nick White
2020-04-14 | Remove getbests; it belongs with bookpipeline (and putting it there removes a... (tag: v0.1.3) | Nick White
2020-04-14 | Add godoc documentation | Nick White
2020-03-13 | Update go.mod now that getbests util has a dependency (tag: v0.1.2) | Nick White
2020-03-13 | Add simple "getbests" utility, useful for statistics gathering | Nick White
2020-03-13 | Add copyright statements to each file | Nick White
2020-02-28 | Add license, copyright statements and a basic readme (tag: v0.1.1) | Nick White
2020-02-27 | Add go.mod (tag: v0.1.0) | Nick White
2020-02-27 | Reorganise all commands to be behind cmd/ | Nick White
2020-02-20 | [pare-gt] gofmt | Nick White
2020-02-20 | [pare-gt] Fix sampling formula, make robust in the face of a 100% sample requ... | Nick White
2020-02-20 | [pare-gt] Add some tests, and make deterministic | Nick White
2020-02-20 | [pare-gt] gofmt | Nick White
2020-02-19 | Split sampling functionality in pare-gt into a separate function that can be ... | Nick White
2020-02-11 | Add pare-gt tool | Nick White
2020-01-22 | Fix up boxtotxt tool | Nick White
2020-01-22 | Add GetWordConfs function to hocr pkg | Nick White
2020-01-22 | Add simple boxtotxt tool | Nick White
2019-11-12 | Clean up, and add comment explaining design choice to fonttobytes | Nick White
2019-11-12 | Add fonttobytes, to embed the font into pdf tools in due course | Nick White
2019-10-31 | Export a couple of more generally useful functions | Nick White
2019-10-30 | Simplify and document hocr package slightly better | Nick White
2019-10-23 | Add tiny doc.go, hopefully will ensure 'go get rescribe.xyz/utils' doesn't re... | Nick White
2019-10-23 | Make bucket-lines and related packages more robust | Nick White
2019-10-08 | Remove parts that have been moved elsewhere, and rename to rescribe.xyz/utils | Nick White
2019-10-07 | Ensure wipe pipeline uses the expected png files | Nick White