<feed xmlns='http://www.w3.org/2005/Atom'>
<title>bookpipeline, branch v0.5.0</title>
<subtitle>Tools to process books in a cloud based pipeline system</subtitle>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/'/>
<entry>
<title>lspipeline-ng: Limit number of book details requests so we don't run into EC2's rate limiting</title>
<updated>2021-08-19T16:50:11+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2021-08-19T16:50:11+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=9f3fec3e0982c5b419338f68428f12bbeed4c2bb'/>
<id>9f3fec3e0982c5b419338f68428f12bbeed4c2bb</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>rescribe: Update documentation on how to deal with M1 signing, and move makefile to where it makes sense</title>
<updated>2021-08-18T21:37:40+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2021-08-18T21:37:40+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=bde651bbde9df3a8c33b705dbe33bbcaf4e3e73d'/>
<id>bde651bbde9df3a8c33b705dbe33bbcaf4e3e73d</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>pdf: Stretch words to fit in their boxes, for more perfect embedding</title>
<updated>2021-08-17T12:39:09+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2021-08-17T12:39:09+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=767b60db23311adaf1035e821bc189877d63b7f0'/>
<id>767b60db23311adaf1035e821bc189877d63b7f0</id>
<content type='text'>
- Words are stretched to fit their boxes, which means the accuracy
  is now very high indeed. This was done by modifying gofpdf to add
  the SetCellStretchToFit function, which will hopefully be
  upstreamed in due course.
- Copy pasting from a PDF works well with lines rarely if ever being
  erroneously broken by the PDF reader. There was quite a bit of
  trial-and-error to improve this, and the stretched text plus a space
  being added after the word in CellFormat was the best (plus preserves
  accuracy of word and character locations).
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
- Words are stretched to fit their boxes, which means the accuracy
  is now very high indeed. This was done by modifying gofpdf to add
  the SetCellStretchToFit function, which will hopefully be
  upstreamed in due course.
- Copy pasting from a PDF works well with lines rarely if ever being
  erroneously broken by the PDF reader. There was quite a bit of
  trial-and-error to improve this, and the stretched text plus a space
  being added after the word in CellFormat was the best (plus preserves
  accuracy of word and character locations).
</pre>
</div>
</content>
</entry>
<entry>
<title>pipeline: use regular storage for tests, rather than a separate one</title>
<updated>2021-08-17T12:18:04+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2021-08-17T12:18:04+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=48c68cfa7f18992b26765c7b67c52c11687ad74a'/>
<id>48c68cfa7f18992b26765c7b67c52c11687ad74a</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>pdf: use same line height and origin for all words on a line as it makes things neater in the PDF in most cases</title>
<updated>2021-08-09T15:34:01+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2021-08-09T15:34:01+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=2bac685f600b97676de14bb850e5b087f0d54b83'/>
<id>2bac685f600b97676de14bb850e5b087f0d54b83</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>pdf: significantly improve character coordinates</title>
<updated>2021-08-09T14:51:50+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2021-08-09T14:51:50+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=69eeb41a33f6a764fc6baf1a95e629a6482b67ea'/>
<id>69eeb41a33f6a764fc6baf1a95e629a6482b67ea</id>
<content type='text'>
A few good changes to make word coordinate lookups significantly
more accurate:
- Set font size dynamically based on the line height (previously it was
  fixed as size 10)
- Correct height and width of word boxes (previously they were way too
  large, which probably didn't make a difference in the general case,
  but now they're correct)
- Set word box margin to zero

Also change PDF size to A5 paper, as that's closer to an average book page size.
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
A few good changes to make word coordinate lookups significantly
more accurate:
- Set font size dynamically based on the line height (previously it was
  fixed as size 10)
- Correct height and width of word boxes (previously they were way too
  large, which probably didn't make a difference in the general case,
  but now they're correct)
- Set word box margin to zero

Also change PDF size to A5 paper, as that's closer to an average book page size.
</pre>
</div>
</content>
</entry>
<entry>
<title>rescribe: Add experimental m1 build</title>
<updated>2021-08-02T16:28:03+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2021-08-02T16:28:03+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=1f2a05e466c195dde83effd82c96d4329259d249'/>
<id>1f2a05e466c195dde83effd82c96d4329259d249</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>internal/pipeline: Add test (incomplete but working) for UploadImages</title>
<updated>2021-08-02T09:45:53+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2021-08-02T09:45:53+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=4b7a4bd1b87d1dbba283e577aa03ce7e390d85d8'/>
<id>4b7a4bd1b87d1dbba283e577aa03ce7e390d85d8</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>internal/pipeline: Add test to check that hidden files are skipped</title>
<updated>2021-07-27T15:31:46+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2021-07-27T15:31:46+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=5310a5e77e945debdc7bd6f149ba87fab13902db'/>
<id>5310a5e77e945debdc7bd6f149ba87fab13902db</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Update dependencies</title>
<updated>2021-07-27T13:25:17+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2021-07-27T13:25:17+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/bookpipeline/commit/?id=02b537b790624057c3f9d0b287a37bf8182847b4'/>
<id>02b537b790624057c3f9d0b287a37bf8182847b4</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
</feed>
