Add built in feature to generate SAF files #3758

alexklbuckley opened this issue Dec 18, 2024 · 2 comments
Labels
help wanted (needs a volunteer to claim to move forward), new feature, tools: packager (related to package or AIP importer/exporter)

alexklbuckley commented Dec 18, 2024

Is your feature request related to a problem? Please describe.

The 'Batch import (ZIP)' feature in the DSpace admin interface requires the user to upload a zipped SAF (Simple Archive Format) file.

Currently there are 4 GitHub repos that people have made available offering solutions for generating the SAF files:

These repos are not necessarily maintained, and some of them have not been updated in a long time, which makes them an unreliable and unsustainable solution.

Describe the solution you'd like

It would be nice if there was a built-in DSpace feature where either:

  1. Users could upload their files and DSpace would generate the SAF file and import it.
  2. Or the 'Batch import (ZIP)' feature was enhanced so that users did not have to upload zipped SAF files, but could instead upload their CSV (in Dublin Core schema format) and bitstream files for DSpace to import.
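
To make the second option more concrete, here is a minimal sketch of the CSV-to-SAF conversion it would involve. It is only an illustration, not how DSpace or any of the existing tools implement it. The function name `build_saf_zip`, the `filename` column, the `dc.element[.qualifier]` column headers, the `||` separator for multiple values, and the `item_000` directory naming are all assumptions made for this example. It is written in Python for brevity; anything contributed to DSpace itself would have to be Java. The sketch assumes the usual SAF layout of one directory per item holding a `dublin_core.xml` file, a `contents` file listing the bitstreams, and the bitstream files themselves.

```python
import csv
import io
import zipfile
from xml.etree import ElementTree as ET


def build_saf_zip(csv_text, bitstreams):
    """Build a SAF zip (as bytes) from CSV text and a {filename: bytes} map.

    Hypothetical helper: the column names and "||" separator are assumptions.
    """
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        for index, row in enumerate(csv.DictReader(io.StringIO(csv_text))):
            item_dir = f"item_{index:03d}"  # one directory per CSV row
            filenames = [f.strip() for f in (row.pop("filename", "") or "").split("||") if f.strip()]

            # Turn every "dc.element[.qualifier]" column into <dcvalue> nodes.
            root = ET.Element("dublin_core")
            for column, cell in row.items():
                parts = (column or "").split(".")
                if parts[0] != "dc" or len(parts) < 2:
                    continue  # this sketch only handles the dc schema
                qualifier = parts[2] if len(parts) > 2 else "none"
                for value in (cell or "").split("||"):
                    if value.strip():
                        node = ET.SubElement(root, "dcvalue", element=parts[1], qualifier=qualifier)
                        node.text = value.strip()

            archive.writestr(f"{item_dir}/dublin_core.xml", ET.tostring(root, encoding="unicode"))
            # The contents file lists one bitstream filename per line.
            archive.writestr(f"{item_dir}/contents", "".join(name + "\n" for name in filenames))
            for name in filenames:
                archive.writestr(f"{item_dir}/{name}", bitstreams[name])
    return buffer.getvalue()
```

For example, calling `build_saf_zip(csv_text, {"report.pdf": pdf_bytes})` with a one-row CSV whose `filename` cell is `report.pdf` would produce a zip containing `item_000/dublin_core.xml`, `item_000/contents` and `item_000/report.pdf`. A real implementation would also need to handle other metadata schemas, bundles, collections and validation, none of which this sketch attempts.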

Describe alternatives or workarounds you've considered

We have tried using the SAFBuilder repo; however, it repeatedly errored out for us. Because the repo notes that it is not being actively maintained, it is hard to know whether these errors are caused by the codebase needing updates:

    :~/SAFBuilder$ ./safbuilder.sh -c src/sample_data/AAA_batch-metadata.csv -z
    Recompiling SAFBuilder, just a moment...
    Detected input CSV as:UTF-8
    Opened CSV File:src/sample_data/AAA_batch-metadata.csv
    Output directory is: /home/<name>/SAFBuilder/src/sample_data/SimpleArchiveFormat
    [ERROR] Failed to execute goal org.codehaus.mojo:exec-maven-plugin:3.5.0:java (default-cli) on project safbuilder: An exception occurred while executing the Java class. org/apache/commons/io/file/attribute/FileTimes: org.apache.commons.io.file.attribute.FileTimes -> [Help 1]
    [ERROR]
    [ERROR] To see the full stack trace of the errors, re-run Maven with the -e switch.
    [ERROR] Re-run Maven using the -X switch to enable full debug logging.
    [ERROR]
    [ERROR] For more information about the errors and possible solutions, please read the following articles:
    [ERROR] [Help 1] http://cwiki.apache.org/confluence/display/MAVEN/MojoExecutionException


tdonohue (Member) commented

While I'm generally in favor of this idea, these tools have always been maintained separately from DSpace. Adopting one would require a developer who wants to contribute their code back into DSpace and also help maintain it there. At this time, DSpace can also only accept tools written in Java or Angular, as our volunteer Committers group only has expertise in those areas.

From what I've seen, the current "best" tool (in terms of being the most maintained) is https://github.com/jcreel/SAFCreator. However, I do not have experience with some of those other tools.

I can verify though that https://github.com/DSpace-Labs/SAFBuilder has been unmaintained since its core developer moved on to another job. See the note in the README about that: https://github.com/DSpace-Labs/SAFBuilder?tab=readme-ov-file#similar-projects-to-also-consider

This ticket needs a volunteer. Any tool given back to DSpace must be written in either Java or Angular.

alanorth (Contributor) commented Jan 7, 2025

I use https://github.com/DSpace-Labs/SAFBuilder often: once a month or more over the past year at least. It works fine with Linux and DSpace 7. The only hard requirement is probably Java 8; I'm not sure it works with anything else.

Moving batches of metadata + files into DSpace is crucial functionality. It is in our interest to make sure we have tools for this. It doesn't need to be built in, as long as there is something that works.
