summaryrefslogtreecommitdiff
path: root/cmd/iiifdownloader
AgeCommit message (Collapse)Author
2021-06-08iiifdownloader: remove old and incorrect part which could cause errorsNick White
2021-05-11Handle pages with png suffix correctlyNick White
Example book: https://content.staatsbibliothek-berlin.de/dc/687222079/manifest
2021-01-25Fix generic IIIF downloading to fix special-case for the Bodleian onlyNick White
This was also triggering for erara, causing it to fail. As it's clearly a Bodleian special case, we now check the URL is Bodleian before applying it.
2020-10-26[iiifdownloader] Add -insecure flag to ignore TLS errorsNick White
At the time of writing, the https://manuscrits-france-angleterre.org website has expired certificates, which make accessing their images a pain. While the issue is obviously with them, it's reasonable for us to add a -insecure flag (emphatically not the default) to override cert checking for cases like this.
2020-10-13[iiifdownloader] Catch SIGINT when writing a file to remove half-written ↵Nick White
files before exit
2020-10-13Improve error handling, and ensure incomplete page downloads are removedNick White
2020-09-28[iiifdownloader] Add a TODO to switch to tile based downloadingNick White
2020-09-28[iiifdownloader] Work around oxford needing the iiif suffix adding to its idNick White
2020-09-28[iiifdownloader] Default to iiifmanifest type if none is given and no ↵Nick White
definitive service can be found
2020-09-28Make page numbering more generic to handle more iiif variety, and add ↵Nick White
harvardartmuseums iiif manifest example url
2020-09-28Add ability to pass -service to choose which download type to use, plus add ↵Nick White
a -bookdir flag to set download directory
2020-09-21Use strings.Replace rather than strings.ReplaceAll so that it works on older ↵Nick White
versions of go
2020-09-12Add todosNick White
2020-09-08Add the option to force METS usage for BSBNick White
2020-09-08Switch to using generic page downloader for BNFNick White
2020-09-08Improve urlToPgName so it can be used by BNF tooNick White
2020-09-08Improve urlToPgName and documentationNick White
2020-09-08Sanitise URLs so that // in url doesn't cause issues (bsb site can spew these)Nick White
2020-09-08Switch from METS to IIIF manifest for BSB downloading, as it returns higher ↵Nick White
quality images with no visible watermark
2020-09-08[iiifdownloader] BSB downloading works now, by parsing METS XMLNick White
2020-09-07[iiifdownloader] Split out NoPgNums downloading to its own functionNick White
2020-09-07Add skeleton of bsb supportNick White
2020-06-23[iiifdownloader] Only remove 1 duplicate page, as 2nd one may not be ↵Nick White
duplicate (no way of knowing as if it is its downsized)
2020-06-23[iiifdownloader] Add support for BNF urls with a dot after book idNick White
2020-06-23Add IIIF downloader, that just supports BNF for nowNick White