<feed xmlns='http://www.w3.org/2005/Atom'>
<title>preproc/testdata, branch v0.4.2</title>
<subtitle>Miscellaneous OCR related utilities and packages</subtitle>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/preproc/'/>
<entry>
<title>Update testdata to reflect updated integral package</title>
<updated>2020-08-04T16:12:24+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-08-04T16:12:24+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/preproc/commit/?id=add5499c74e7884edeca03ca045e41b1ac7888c5'/>
<id>add5499c74e7884edeca03ca045e41b1ac7888c5</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Improve sauvola window code slightly</title>
<updated>2020-08-04T15:59:43+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-08-04T15:59:43+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/preproc/commit/?id=38e235f95895d96c03be4e64ce4263eaac477687'/>
<id>38e235f95895d96c03be4e64ce4263eaac477687</id>
<content type='text'>
The sauvola and integral image versions are still slightly different,
irritatingly.
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
The sauvola and integral image versions are still slightly different,
irritatingly.
</pre>
</div>
</content>
</entry>
<entry>
<title>Update to replace integralimg package with integral package</title>
<updated>2020-08-04T15:19:01+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-08-04T15:19:01+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/preproc/commit/?id=aa5bd02bd5ed5a6a8d371f953ef128fce64e61a7'/>
<id>aa5bd02bd5ed5a6a8d371f953ef128fce64e61a7</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Switch to integralimg 0.3.0 and use the new functions for integral sauvola</title>
<updated>2020-07-26T22:44:41+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-07-26T22:44:41+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/preproc/commit/?id=97753052db0239d907631d4a480ac742412827d3'/>
<id>97753052db0239d907631d4a480ac742412827d3</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Update to v0.2.1 of integralimg, and improve various things</title>
<updated>2020-07-23T22:09:43+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-07-23T22:09:43+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/preproc/commit/?id=1ef9df29fb2d5bb6585e0d0ed99e13a900289e7b'/>
<id>1ef9df29fb2d5bb6585e0d0ed99e13a900289e7b</id>
<content type='text'>
- Improve integral sauvola by rounding threshold correctly
- The Sauvola functions can now process any image.Image, not
  just an image.Gray
- Add ImageWindower interface and use it to generalise
  wipesides.go
- Rely on Bounds() for image bounds rather than implementation-
  specific stuff in integralimg

Note that a couple of the wipesides tests are now failing. It's
possible that this is due to fixed or introduced bugs (let's hope
the former) changing sensible thresholds. Will need to look into
this and sort it.
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
- Improve integral sauvola by rounding threshold correctly
- The Sauvola functions can now process any image.Image, not
  just an image.Gray
- Add ImageWindower interface and use it to generalise
  wipesides.go
- Rely on Bounds() for image bounds rather than implementation-
  specific stuff in integralimg

Note that a couple of the wipesides tests are now failing. It's
possible that this is due to fixed or introduced bugs (let's hope
the former) changing sensible thresholds. Will need to look into
this and sort it.
</pre>
</div>
</content>
</entry>
<entry>
<title>Improve sauvola by ensuring threshold is rounded correctly before comparison</title>
<updated>2020-07-23T22:04:49+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-07-23T22:04:49+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/preproc/commit/?id=6a36400351bea5052431bb1feace358fa67a5cf9'/>
<id>6a36400351bea5052431bb1feace358fa67a5cf9</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Improve wiper algorithm</title>
<updated>2020-07-13T17:17:48+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-07-13T17:09:15+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/preproc/commit/?id=0ceaa5d97b585987539c51b8b8ce0705c06fc3a8'/>
<id>0ceaa5d97b585987539c51b8b8ce0705c06fc3a8</id>
<content type='text'>
Several changes made to wiper to improve things:
- If findbestedge finds more than one 'best' line, choose
  the middle one, rather than the first.
- findbestedge selects the line with the lowest number of
  black pixels, rather than (erroneously) the highest.
- Added a findedgesOutin() function, which finds edges like
  findedges() but looking from the outside edges of the image
  inwards, rather than from the inside out. More testing is
  needed to decide whether this is useful; initial testing
  showed very little difference, but that was limited to lightly
  binarised images. This is in use by VWipe() at present, where
  worries about several gutters thanks to the edges of other
  pages aren't present.
- Added separate vwsize, vthresh and vmin arguments to WipeFile
  so that vertical wiping can have different parameters (which
  is very important for good quality vertical wiping).
- Updated wiping tests.
- Rewrote the wipe cmd to use WipeFile() directly.
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
Several changes made to wiper to improve things:
- If findbestedge finds more than one 'best' line, choose
  the middle one, rather than the first.
- findbestedge selects the line with the lowest number of
  black pixels, rather than (erroneously) the highest.
- Added a findedgesOutin() function, which finds edges like
  findedges() but looking from the outside edges of the image
  inwards, rather than from the inside out. More testing is
  needed to decide whether this is useful; initial testing
  showed very little difference, but that was limited to lightly
  binarised images. This is in use by VWipe() at present, where
  worries about several gutters thanks to the edges of other
  pages aren't present.
- Added separate vwsize, vthresh and vmin arguments to WipeFile
  so that vertical wiping can have different parameters (which
  is very important for good quality vertical wiping).
- Updated wiping tests.
- Rewrote the wipe cmd to use WipeFile() directly.
</pre>
</div>
</content>
</entry>
<entry>
<title>Add another top/bottom wipe test</title>
<updated>2020-03-05T14:44:17+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-03-05T14:44:17+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/preproc/commit/?id=822ce40a7d708ed44d6e638e2e25db9261254d13'/>
<id>822ce40a7d708ed44d6e638e2e25db9261254d13</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Add vertical test to wipesides</title>
<updated>2020-03-04T17:32:47+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-03-04T17:32:47+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/preproc/commit/?id=e8c47ee22391bc395bc4c8001c8edde9b6eff708'/>
<id>e8c47ee22391bc395bc4c8001c8edde9b6eff708</id>
<content type='text'>
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
</pre>
</div>
</content>
</entry>
<entry>
<title>Add edge detection tests for wipesides</title>
<updated>2020-03-03T18:16:25+00:00</updated>
<author>
<name>Nick White</name>
<email>git@njw.name</email>
</author>
<published>2020-03-03T18:16:25+00:00</published>
<link rel='alternate' type='text/html' href='https://git.rescribe.xyz/cgit/cgit.cgi/preproc/commit/?id=83f6144cfe0e91df991e107e66b73d9e037ea969'/>
<id>83f6144cfe0e91df991e107e66b73d9e037ea969</id>
<content type='text'>
These should be nice and robust to small changes that are still valid, as they don't depend on the output not changing, only on the detected sides being within appropriate limits.
</content>
<content type='xhtml'>
<div xmlns='http://www.w3.org/1999/xhtml'>
<pre>
These should be nice and robust to small changes that are still valid, as they don't depend on the output not changing, only on the detected sides being within appropriate limits.
</pre>
</div>
</content>
</entry>
</feed>
