Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

incorrect paths for covers in metadata.yaml #18

Open
rdhyee opened this issue Aug 5, 2015 · 3 comments
Open

incorrect paths for covers in metadata.yaml #18

rdhyee opened this issue Aug 5, 2015 · 3 comments

Comments

@rdhyee
Copy link
Contributor

rdhyee commented Aug 5, 2015

Consider https://github.com/GITenberg/Adventures-of-Huckleberry-Finn_76/blob/0a41eef0f59c26a6c6527d74fbd315cbb035adb7/metadata.yaml, we have the following paths for covers:

  • 76/76-h/76-h/images/cover.jpg
  • 76/images/bookcover.jpg
  • ///public/vhost/g/gutenberg/private/publish/76/images/titlepage.jpg
  • 76/76-h/images/bookcover.jpg
  • 76/76-h/images/cover.jpg
  • cover.jpg

we have

  • images/bookcover.jpg
  • images/cover.jpg
  • /old/orig76-h/images/bookcover.jpg
  • /old/orig76-h/images/cover.jpg
  • cover.jpg

Besides cover.jpg, which is the RTC cover, there might be some correspondence betwen some of the rest of the images and those listed in the Project Gutenberg RDF. But between the drift in Gutenberg metadata and any reorganization of file structure from GITenberg, doing a match might not be straightforward.

@rdhyee
Copy link
Contributor Author

rdhyee commented Aug 5, 2015

What to do while we don't have a longterm fix in place? I can loop through the Second Folio repos and remove cover metadata that don't match the literal paths in the repo.

@eshellman
Copy link
Contributor

So, either the initial paths (in rdf metadata) were wrong, or the files have been moved when they were loaded into gitenberg. If the latter, we should be able to fix the paths in yaml.

@rdhyee
Copy link
Contributor Author

rdhyee commented Aug 6, 2015

Re #18 (comment)

If the latter, we should be able to fix the paths in yaml.

In the case of Huck Finn, by looking at the list of covers, we could manually do the remapping. Don't have any census of all the repos in general. For Second Folio, there aren't that many repos affected (5) -- so they could all be fixed by hand if we want.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants