Age  Commit message  Author
2022-03-11  Separate out fullsize pdf creation from colour pdf creation, so less memory is needed  [fullsizepdf]  Nick White
2022-03-11  Add initial support for full-size PDF generation  Nick White
    Some issues:

    1) The PDF generation stores every page in memory while it constructs the PDF. That means there's a higher chance of failure from running out of memory with these. There's no getting around this except by improving the PDF generation library, which is not easy.

    2) Currently I've just changed the pipeline to always generate these full-size PDFs, and the rescribe tool then deletes them if they weren't requested. This is bad, particularly because of point 1, and would probably cause failures in the server pipeline as a result.

    Therefore the plan is to add a tag to queue messages so that full-size generation can be selectively enabled. It should also be split from the loop with colour PDF generation, as holding both in RAM at the same time is unnecessary.
2022-03-11  Name PDF extracted images so they sort correctly  Nick White
2022-02-28  rescribe: improve layout of completed popup  Nick White
2022-02-28  rescribe: Further improve getembeds error reporting and recognition  Nick White
2022-02-28  rescribe: Fix error printing with getembeds  Nick White
2022-02-28  rescribe: Add embed_darwin.go to include getbook into OSX builds  Nick White
2022-02-28  rescribe: Add " searchable" to file name for saved PDF  Nick White
2022-02-28  rescribe: Add popup on completion reporting where files were saved  Nick White
2022-02-28  adjusted file renaming to make suffixes of png and jpg files lowercase and change jpeg to jpg  Antonia Rescribe
2022-02-28  rescribe: Improve wording of training dropdown  Nick White
2022-02-28  Add PreNoWipe queue, that just does binarisation but no wiping  Nick White
2022-02-28  bookpipeline: ensure context is initialised before using it, to avoid panic  Nick White
2022-02-28  bookpipeline: Switch to rescribev9 as default training  Nick White
2022-02-24  rescribe: Add embedded getgbook  Nick White
2022-02-24  rescribe: Improve getgbook failure error dialog by keeping it simple  Nick White
2022-02-23  rescribe: fix typo with embedded getgbook running  Nick White
2022-02-23  Add getgbook embedding for darwin  Nick White
2022-02-23  rescribe: Add embedded support for getgbook, for linux only so far  Nick White
2022-02-21  rescribe: Add getgbook use to the GUI (not embedded yet)  Nick White
2022-02-21  Ensure that no new console windows are opened on Windows when executing Tesseract  Nick White
2022-02-21  rescribe: add .zip version of .app for mac  Nick White
2022-02-14  rescribe: Add gui elements for getgbook integration (wip)  Nick White
2022-02-09  rescribe: ensure go generate is called when needed in makefile, and remove unneeded rules  Nick White
2022-02-09  rescribe: makefile now supports cross-compiling with fyne  Nick White
    This also necessitated a version bump to fyne.
2022-02-09  rescribe: make go generate skip already downloaded files, checking that checksum matches expected for safety  Nick White
2022-01-31  rescribe: remove unnecessary extra cancel calls; anything which errors should clean up well enough, and this has the potential for more hard-to-find bugs  Nick White
2022-01-31  Ensure cancel is sent to any errant processes in case of an error, and stick with "Start OCR" for go button  Nick White
2022-01-31  pipeline: Fail if no images are present  Nick White
2022-01-31  rescribe: Add context cancelling to extractPdfImgs(), so it's no longer possible to get the gui into a bad state by cancelling before startProcess began (hopefully)  Nick White
2022-01-31  rescribe: Ensure status isn't overwritten after an abort, when wipe-only preprocessing  Nick White
2022-01-31  rescribe: fix bug where a successful run would segfault  Nick White
2022-01-31  Make pipeline context-aware, so the rescribe tool can cancel jobs  Nick White
2022-01-24  Update spot image id  Nick White
2022-01-17  rescribe: Surface errors properly, with a dialogue box  Nick White
2022-01-17  rescribe: Show friendly names for the trainings, and hide "osd" training  Nick White
2022-01-17  internal/pipeline: if a graph cannot be created, don't leave an empty graph.png file, and allow failure to download that as it won't be created in the case of a 1 page book, which is fine  Nick White
2022-01-10  rescribe: Increase size of file & folder picker dialog windows  Nick White
2022-01-10  rescribe: Put log in an accordion, disable buttons when processing, and don't lock gui when processing  Nick White
2022-01-10  rescribe: ensure books with a space in the name are handled correctly in the gui  Nick White
2022-01-10  rescribe: Rename PDFs taking into account that in some cases one or the other of binarised or colour may not exist  Nick White
2022-01-10  internal/pipeline: Have DownloadPdfs() try to download all PDFs, but only return an error if none downloaded, as there are times when the colour PDF will not exist, which is fine  Nick White
2022-01-10  rescribe: handle PDF errors much more gracefully  Nick White
2022-01-04  rescribe: parse stdout and set progress bar based on it, using appropriate labels for the progress bar text to show what's being done  Nick White
2022-01-04  rescribe: Restrict file types to select for .pdf and .traineddata file pickers  Nick White
2022-01-04  rescribe: add select box to choose training to use, including an Other... option  Nick White
2021-12-20  rescribe: Ensure temporary tesseract data is only removed when the program ends, so multiple books can be processed by the gui one after the other  Nick White
2021-12-20  rescribe: Improve layout of gui, and make dir entry box read only  Nick White
2021-12-20  rescribe: Ensure temporary tesseract dir is removed in gui mode too  Nick White
2021-12-20  rescribe: add "Choose PDF" button, and make chosen dir/file section a label rather than an entry  Nick White