<feed xmlns='http://www.w3.org/2005/Atom'>
<title>bookpipeline/lib/hocr, branch v0.5.0</title>
<subtitle>Tools to process books in a cloud based pipeline system</subtitle>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/'/>
<entry>
<title>Separate out bookpipeline from catch-all go.git repo, and rename to rescribe.xyz/bookpipeline</title>
<updated>2019-10-08T11:52:33+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2019-10-08T11:52:33+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=7482157a03ed3e9d7f45e54a126b391001f34948'/>
<id>7482157a03ed3e9d7f45e54a126b391001f34948</id>
<content type='text'>
The dependencies from the go.git repo will follow in due course.
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
The dependencies from the go.git repo will follow in due course.
</pre>
</div>
</content>
</entry>
<entry>
<title>Handle no words found error in a better way so any page that is actually 0 confidence is recognised</title>
<updated>2019-09-05T21:42:30+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2019-09-05T21:24:37+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=561d8461cbe19316762489cd7b04f95b9014bcda'/>
<id>561d8461cbe19316762489cd7b04f95b9014bcda</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Don't abort analysis if we encounter a hocr with no words, just skip it</title>
<updated>2019-09-05T21:20:35+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2019-09-05T21:20:35+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=60a198f7ee5843a0f77b6dfb845c3b0413e83705'/>
<id>60a198f7ee5843a0f77b6dfb845c3b0413e83705</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Return an error if page average calculation cant be done with hocr</title>
<updated>2019-05-15T15:03:29+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2019-05-15T15:03:29+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=d7f07893d08d9c29f46e50c4f779b0e701f411e4'/>
<id>d7f07893d08d9c29f46e50c4f779b0e701f411e4</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Rewrite pgconf to be more accurate by measuring average word confidence rather than average line confidence</title>
<updated>2019-05-14T17:02:34+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2019-05-14T17:02:34+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=f49a8a74a8ef2c96cc2bbf34461a8387f7e324d8'/>
<id>f49a8a74a8ef2c96cc2bbf34461a8387f7e324d8</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Add pgconf tool that prints the overall confidence for a whole page of hocr</title>
<updated>2019-05-14T10:20:33+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2019-05-14T10:20:33+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=6b4e704befb7f82627d2c9a4e3f4e2971fdaf883'/>
<id>6b4e704befb7f82627d2c9a4e3f4e2971fdaf883</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Better error handling with hocr lines</title>
<updated>2019-03-26T15:21:43+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2019-03-26T15:21:43+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=7d4cc9f7dd297c08e02447dda1f8aad9db0b0768'/>
<id>7d4cc9f7dd297c08e02447dda1f8aad9db0b0768</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Generalise get text from hocr lines</title>
<updated>2019-02-25T13:01:28+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2019-02-25T13:01:28+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=cd1fb1c9f6e1384ac0add8904425e6f92b17a704'/>
<id>cd1fb1c9f6e1384ac0add8904425e6f92b17a704</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Add tool to extract plain text from hocr</title>
<updated>2019-02-25T12:29:59+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2019-02-25T12:09:06+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=3c4c5f7c202b7c54ca8d23e7bd7bff4a4bb696cc'/>
<id>3c4c5f7c202b7c54ca8d23e7bd7bff4a4bb696cc</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>gofmt</title>
<updated>2019-01-25T17:41:52+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2019-01-25T17:41:52+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=988658936cbcd8e92b35a66e1943bea0f9eaf3bc'/>
<id>988658936cbcd8e92b35a66e1943bea0f9eaf3bc</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
</feed>
