<feed xmlns='http://www.w3.org/2005/Atom'>
<title>bookpipeline/internal/pipeline, branch v0.3.3</title>
<subtitle>Tools to process books in a cloud based pipeline system</subtitle>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/'/>
<entry>
<title>[rescribe] Allow saving of results to somewhere other than a directory named after the book being processed</title>
<updated>2020-12-07T16:53:58+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-12-07T16:53:58+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=068ad0b666705a49ab22d7b48cd6a7d67b37f234'/>
<id>068ad0b666705a49ab22d7b48cd6a7d67b37f234</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Don't upload binarised pdf twice needlessly</title>
<updated>2020-12-03T15:16:14+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-12-03T15:13:22+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=38dbdd0b21fb363e3f63fd3ea50272975e98eb77'/>
<id>38dbdd0b21fb363e3f63fd3ea50272975e98eb77</id>
<content type='text'>
This can also result in the file being uploaded twice simultaneously,
as up() is running in a separate goroutine. This can cause failures
on Windows as the file is attempted to be removed by one upload
process while being open to upload by the other process. Probably it
could also fail if the process completed by one (so the file was
deleted) before being started by the other.
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
This can also result in the file being uploaded twice simultaneously,
as up() is running in a separate goroutine. This can cause failures
on Windows as the file is attempted to be removed by one upload
process while being open to upload by the other process. Probably it
could also fail if the process completed by one (so the file was
deleted) before being started by the other.
</pre>
</div>
</content>
</entry>
<entry>
<title>Some changes to ensure the pipeline works correctly on Windows</title>
<updated>2020-11-16T17:43:27+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-11-16T17:43:27+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=cfbb3481368714adcd734906d8a460b873551c90'/>
<id>cfbb3481368714adcd734906d8a460b873551c90</id>
<content type='text'>
There were a couple of places where a file was uploaded while still open,
which resulted in an attempt to remove it, which causes an error from
Windows.

The allOCRed function also included an assumption that the path separator
would be a /, which is always correct for AWS, and correct for local on
Linux and OSX, but not for local Windows. Fixed by leaving the separator
well alone.

Also, the local connection was not stripping leading \, like it did /,
which caused an issue with Windows local.

Windows local is now tested and working, at least through wine.
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
There were a couple of places where a file was uploaded while still open,
which resulted in an attempt to remove it, which causes an error from
Windows.

The allOCRed function also included an assumption that the path separator
would be a /, which is always correct for AWS, and correct for local on
Linux and OSX, but not for local Windows. Fixed by leaving the separator
well alone.

Also, the local connection was not stripping leading \, like it did /,
which caused an issue with Windows local.

Windows local is now tested and working, at least through wine.
</pre>
</div>
</content>
</entry>
<entry>
<title>gofmt</title>
<updated>2020-11-10T12:30:15+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-11-10T12:30:15+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=33f1726a4c9f8013dcde39e644281059d9766bc4'/>
<id>33f1726a4c9f8013dcde39e644281059d9766bc4</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>[rescribe] Enable custom paths to tesseract command to be set (also improve some error output)</title>
<updated>2020-11-10T12:28:50+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-11-10T12:28:50+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=ad7aaf490e78e969bb5495dfda06a33d2a176aec'/>
<id>ad7aaf490e78e969bb5495dfda06a33d2a176aec</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>[getpipelinebook] Rewrite to use internal package functions</title>
<updated>2020-11-10T10:41:15+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-11-10T10:41:15+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=198f8215f8dd0460608abcd03fa49451462c9d11'/>
<id>198f8215f8dd0460608abcd03fa49451462c9d11</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Switch Preprocess() to take the thresholds to use, and have rescribe tool only use 0.1,0.2,0.3</title>
<updated>2020-11-09T18:55:36+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-11-09T18:55:36+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=f19df9e8c1213a49c426caefd2fadc711f5faf11'/>
<id>f19df9e8c1213a49c426caefd2fadc711f5faf11</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>[rescribe] Local only combo tool basically now working. Testing is still minimal.</title>
<updated>2020-11-09T18:54:50+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-11-09T18:29:56+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=a1de8862a091f9584220db40671a0d43346c4519'/>
<id>a1de8862a091f9584220db40671a0d43346c4519</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>[rescribe] work in progress at a self-contained local pipeline processor, called rescribe</title>
<updated>2020-11-09T17:33:52+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-11-09T17:33:52+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=fc6becf5ed98e9c0815532fd76639c15eb481ed1'/>
<id>fc6becf5ed98e9c0815532fd76639c15eb481ed1</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>[bookpipeline] Split most functionality out to package internal/pipeline</title>
<updated>2020-11-09T16:46:43+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-11-09T16:46:43+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=4c7cdeb5646e84af3f15d4a7cd48f64d8086a6b9'/>
<id>4c7cdeb5646e84af3f15d4a7cd48f64d8086a6b9</id>
<content type='text'>
No functionality changes, but this should make it easier to make custom
builds using the pipeline in slightly different ways.
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
No functionality changes, but this should make it easier to make custom
builds using the pipeline in slightly different ways.
</pre>
</div>
</content>
</entry>
</feed>
