index
:
bookpipeline
fullsizepdf
guirefactor
local
master
minimisedisk
rotation
separatelocal
Tools to process books in a cloud based pipeline system
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
lib
Age
Commit message (
Collapse
)
Author
2019-10-08
Separate out bookpipeline from catch-all go.git repo, and rename to ↵
Nick White
rescribe.xyz/bookpipeline The dependencies from the go.git repo will follow in due course.
2019-09-05
Handle no words found error in a better way so any page that is actually 0 ↵
Nick White
confidence is recognised
2019-09-05
Don't abort analysis if we encounter a hocr with no words, just skip it
Nick White
2019-05-15
Return an error if page average calculation cant be done with hocr
Nick White
2019-05-14
Rewrite pgconf to be more accurate by measuring average word confidence ↵
Nick White
rather than average line confidence
2019-05-14
Add pgconf tool that prints the overall confidence for a whole page of hocr
Nick White
2019-03-26
Better error handling with hocr lines
Nick White
2019-02-25
Generalise get text from hocr lines
Nick White
2019-02-25
Add tool to extract plain text from hocr
Nick White
2019-01-25
Simplify writing of sort functions in line pkg
Nick White
2019-01-25
gofmt
Nick White
2019-01-25
Use consistent naming for .prob and .hocr OcrName
Nick White
2019-01-25
Update location of libraries
Nick White
2019-01-25
Reorganisation and cleanup
Nick White