Age | Commit message | Author
2022-03-21 | rescribe: Improve error messages if no pages are found | Nick White
2022-03-21 | rescribe: Update copyright years and add TODO file | Nick White
2022-03-21 | rescribe: Update traineddata descriptions in command line version | Nick White
2022-03-21 | rescribe: remove unneeded old tessdata | Nick White
2022-03-21 | Update tessdata to only include a few trainings | Nick White
2022-03-21 | rescribe: fix bug in gui where choosing "other" then cancelling would leave the "other" training selected | Nick White
2022-03-21 | rescribe: move getBookIdFromUrl() to gbook.go, and add tests for it | Nick White
2022-03-21 | rescribe: Remove unneeded clause and add example urls for gbook id function | Nick White
2022-03-21 | added support for new type of Google Books URLS | Antonia Rescribe
2022-03-21 | rescribe: Simplify disabling and enabling common widgets | Nick White
2022-03-21 | rescribe: disable & enable checkboxes alongside other parts when processing | Nick White
2022-03-21 | rescribe: Improve cli wording and simplify PDF stuff slightly | Nick White
2022-03-21 | Only generate full-size PDF if requested | Nick White
    This avoids the issue that large PDFs require a lot of RAM, so there are chances of running out of memory. Plus it's a waste of space and time.
2022-03-11 | Separate out fullsize pdf creation from colour pdf creation, so less memory is needed [fullsizepdf] | Nick White
2022-03-11 | Add initial support for full-size PDF generation | Nick White
    Some issues:
    1) The PDF generation stores every page in memory while it constructs it. That means that there's a higher chance of failure due to running out of memory with these. There's no getting around this except by improving the PDF generation library, which is not easy.
    2) Currently I've just changed the pipeline to always generate these full size PDFs, and then the rescribe tool will just delete them if they weren't requested. This is bad in particular because of point 1, and would probably cause issues of failures in the server pipeline as a result.
    Therefore the plan is to add a tag to queue messages so that full size generation can be selectively enabled. Also, it should be split from the loop with colour pdf generation, as holding them both in RAM at the same time is unnecessary.
2022-03-11 | Name PDF extracted images so they sort correctly | Nick White
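One common way to make numbered file names sort correctly is zero-padding, so that plain string order matches page order. The sketch below illustrates that idea only; the helper name `imageName`, the `page-` prefix and the four-digit width are assumptions, not details taken from the commit.

```go
package main

import (
	"fmt"
	"sort"
)

// imageName is a hypothetical helper: it zero-pads the page number so that
// lexical (string) order matches numeric order.
func imageName(page int, ext string) string {
	return fmt.Sprintf("page-%04d.%s", page, ext)
}

func main() {
	names := []string{imageName(10, "jpg"), imageName(2, "jpg"), imageName(1, "jpg")}
	sort.Strings(names)
	fmt.Println(names) // [page-0001.jpg page-0002.jpg page-0010.jpg]
}
```

Without padding, "page-10.jpg" would sort before "page-2.jpg" in a plain string sort.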
2022-02-28 | rescribe: improve layout of completed popup | Nick White
2022-02-28 | rescribe: Further improve getembeds error reporting and recognition | Nick White
2022-02-28 | rescribe: Fix error printing with getembeds | Nick White
2022-02-28 | rescribe: Add embed_darwin.go to include getbook into OSX builds | Nick White
2022-02-28 | rescribe: Add " searchable" to file name for saved PDF | Nick White
2022-02-28 | rescribe: Add popup on completion reporting where files were saved | Nick White
2022-02-28 | adjusted file renaming to make suffixes of png and jpg files lowercase and change jpeg to jpg | Antonia Rescribe
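The renaming rule in the commit above (lowercase png and jpg suffixes, and turn jpeg into jpg) can be sketched in a few lines of Go. The function name `normaliseExt` is hypothetical, and leaving other extensions untouched is an assumption; the actual code in the repository may differ.

```go
package main

import (
	"fmt"
	"path/filepath"
	"strings"
)

// normaliseExt is a hypothetical helper: it lowercases .png and .jpg
// suffixes and rewrites .jpeg (in any case) to .jpg. Names with other
// extensions are returned unchanged (an assumption for this sketch).
func normaliseExt(name string) string {
	ext := filepath.Ext(name)
	base := strings.TrimSuffix(name, ext)
	switch lower := strings.ToLower(ext); lower {
	case ".jpeg":
		return base + ".jpg"
	case ".jpg", ".png":
		return base + lower
	default:
		return name
	}
}

func main() {
	for _, n := range []string{"Scan01.JPEG", "page.PNG", "cover.jpg", "notes.TXT"} {
		fmt.Println(n, "->", normaliseExt(n))
	}
}
```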
2022-02-28 | rescribe: Improve wording of training dropdown | Nick White
2022-02-28 | Add PreNoWipe queue, that just does binarisation but no wiping | Nick White
2022-02-28 | bookpipeline: ensure context is initialised before using it, to avoid panic | Nick White
2022-02-28 | bookpipeline: Switch to rescribev9 as default training | Nick White
2022-02-24 | rescribe: Add embedded getgbook | Nick White
2022-02-24 | rescribe: Improve getgbook failure error dialog by keeping it simple | Nick White
2022-02-23 | rescribe: fix typo with embedded getgbook running | Nick White
2022-02-23 | Add getgbook embedding for darwin | Nick White
2022-02-23 | rescribe: Add embedded support for getgbook, for linux only so far | Nick White
2022-02-21 | rescribe: Add getgbook use to the GUI (not embedded yet) | Nick White
2022-02-21 | Ensure that no new console windows are opened on Windows when executing Tesseract | Nick White
2022-02-21 | rescribe: add .zip version of .app for mac | Nick White
2022-02-14 | rescribe: Add gui elements for getgbook integration (wip) | Nick White
2022-02-09 | rescribe: ensure go generate is called when needed in makefile, and remove unneeded rules | Nick White
2022-02-09 | rescribe: makefile now supports cross-compiling with fyne | Nick White
    This also necessitated a version bump to fyne.
2022-02-09 | rescribe: make go generate skip already downloaded files, checking that checksum matches expected for safety | Nick White
2022-01-31 | rescribe: remove unnecessary extra cancel calls; anything which errors should clean up well enough, and this has the potential for more harder to find bugs | Nick White
2022-01-31 | Ensure cancel is sent to any errant processes in case of an error, and stick with "Start OCR" for go button | Nick White
2022-01-31 | pipeline: Fail if no images are present | Nick White
2022-01-31 | rescribe: Add context cancelling to extractPdfImgs(), so it's no longer possible to get the gui into a bad state by cancelling before startProcess began (hopefully) | Nick White
2022-01-31 | rescribe: Ensure status isnt overwritten after an abort, when wipe-only preprocessing | Nick White
2022-01-31 | rescribe: fix bug where a successful run would segfault | Nick White
2022-01-31 | Make pipeline context-aware, so the rescribe tool can cancel jobs | Nick White
2022-01-24 | Update spot image id | Nick White
2022-01-17 | rescribe: Surface errors properly, with a dialogue box | Nick White
2022-01-17 | rescribe: Show friendly names for the trainings, and hide "osd" training | Nick White
2022-01-17 | internal/pipeline: if a graph cannot be created, don't leave an empty graph.png file, and allow failure to download that as it won't be created in the case of a 1 page book, which is fine | Nick White