author | Nick White <git@njw.name> | 2020-11-24 12:40:54 +0000 |
---|---|---|
committer | Nick White <git@njw.name> | 2020-11-24 12:40:54 +0000 |
commit | 0d914a5de3f8169d41df4fcff1ee4aea6d01afbe | |
tree | 6ba24389250bfc13edd32798af120b3f56dc0d73 /internal/pipeline/testdata/good | |
parent | 0b9bd466dd2e099bf6c7d3165f1285f4b7a8f38e | |
[booktopipeline] Add a check to disallow adding a book that already exists

This is important because if a book that has already been processed is
added again, an analyse job will be added every time a page is OCRed,
which will clog up the pipeline with unnecessary work. Also, if a book
were added with the same name but with differently named files, or a
different number of pages, the results would almost certainly not be as
intended.

If a book really does need to be added under a particular name, either
the original directory can be removed on S3, or "v2" or similar can be
appended to the book name before calling booktopipeline.
Diffstat (limited to 'internal/pipeline/testdata/good')
0 files changed, 0 insertions, 0 deletions