<feed xmlns='http://www.w3.org/2005/Atom'>
<title>bookpipeline/cmd, branch fullsizepdf</title>
<subtitle>Tools to process books in a cloud based pipeline system</subtitle>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/'/>
<entry>
<title>Add initial support for full-size PDF generation</title>
<updated>2022-03-11T13:36:59+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2022-03-11T13:36:59+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=9d1382b69700129a66541d786ba3b784eda56e36'/>
<id>9d1382b69700129a66541d786ba3b784eda56e36</id>
<content type='text'>
Some issues:
1) The PDF generation stores every page in memory while it constructs it. That means that
there's a higher chance of failure due to running out of memory with these. There's no
getting around this except by improving the PDF generation library, which is not easy.

2) Currently I've just changed the pipeline to always generate these full size PDFs, and
then the rescribe tool will just delete them if they weren't requested. This is bad in
particular because of point 1, and would probably cause issues of failures in the server
pipeline as a result

Therefore the plan is to add a tag to queue messages so that full size generation can be
selectively enabled.

Also, it should be split from the loop with colour pdf generation, as holding them both in RAM at
the same time is unnecessary.
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
Some issues:
1) The PDF generation stores every page in memory while it constructs it. That means that
there's a higher chance of failure due to running out of memory with these. There's no
getting around this except by improving the PDF generation library, which is not easy.

2) Currently I've just changed the pipeline to always generate these full size PDFs, and
then the rescribe tool will just delete them if they weren't requested. This is bad in
particular because of point 1, and would probably cause issues of failures in the server
pipeline as a result

Therefore the plan is to add a tag to queue messages so that full size generation can be
selectively enabled.

Also, it should be split from the loop with colour pdf generation, as holding them both in RAM at
the same time is unnecessary.
</pre>
</div>
</content>
</entry>
<entry>
<title>Name PDF extracted images so they sort correctly</title>
<updated>2022-03-11T10:17:24+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2022-03-11T10:17:24+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=d6e1cb61da7a9155023ce9dece96da79c7246790'/>
<id>d6e1cb61da7a9155023ce9dece96da79c7246790</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>rescribe: improve layout of completed popup</title>
<updated>2022-02-28T17:56:12+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2022-02-28T17:56:12+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=326bbf430125b69614ac4f4160cf5f1d8c6dedbe'/>
<id>326bbf430125b69614ac4f4160cf5f1d8c6dedbe</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>rescribe: Further improve getembeds error reporting and recognition</title>
<updated>2022-02-28T17:21:27+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2022-02-28T17:21:27+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=fe9fd2e460762f60c1887b07727e6dd6668970a8'/>
<id>fe9fd2e460762f60c1887b07727e6dd6668970a8</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>rescribe: Fix error printing with getembeds</title>
<updated>2022-02-28T17:11:02+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2022-02-28T17:11:02+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=7ebcab53b0e8e90dca1ed024e4d815529d49fd09'/>
<id>7ebcab53b0e8e90dca1ed024e4d815529d49fd09</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>rescribe: Add embed_darwin.go to include getbook into OSX builds</title>
<updated>2022-02-28T17:05:28+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2022-02-28T17:05:28+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=65be1a688b9213e9e020ef00fe5726f18404a99b'/>
<id>65be1a688b9213e9e020ef00fe5726f18404a99b</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>rescribe: Add " searchable" to file name for saved PDF</title>
<updated>2022-02-28T17:00:27+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2022-02-28T17:00:27+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=2bf8016a4572d5f83812c03785db3d4248dac449'/>
<id>2bf8016a4572d5f83812c03785db3d4248dac449</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>rescribe: Add popup on completion reporting where files were saved</title>
<updated>2022-02-28T17:00:08+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2022-02-28T17:00:08+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=a116e67e9b297c32c7dfa802cdf9fef54a2e0a74'/>
<id>a116e67e9b297c32c7dfa802cdf9fef54a2e0a74</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>rescribe: Improve wording of training dropdown</title>
<updated>2022-02-28T16:38:23+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2022-02-28T16:38:23+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=fac998f847f42ca2a849c425cb6d9fa7f002ada0'/>
<id>fac998f847f42ca2a849c425cb6d9fa7f002ada0</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Add PreNoWipe queue, that just does binarisation but no wiping</title>
<updated>2022-02-28T16:17:35+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2022-02-28T16:17:35+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=21d49b546a27de6c53d8fe7d1a68d5a3b5506c93'/>
<id>21d49b546a27de6c53d8fe7d1a68d5a3b5506c93</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
</feed>
