Feature #2568
Updated by Bernhard Koschiček-Krombholz 4 months ago
Create an admin interface with the magical one click button to create an all-inclusive ARCHE dump.
This dump should include:
* Files sorted by extension ✅
* Metadata for files in a ttl (#2466) ✅
* Lists of failed files due to ARCHE restrictions (no license, no creator, no license holder, etc.) ✅
* Stripped SQL dump ✅ (currently, all data is in there)
* RDF dump (#2551) ✅
One major issue is, how the ARCHE metadata is stored/transferred into OpenAtlas. Currently, the data is stored in the production.py, which is not very handy to handle.
Done:
* change folder structure into @data@, @metadata@, and @debug@ ✅
* add statistic to debug (how large, how many files, how many folders, etc.) ✅
* check for duplicate files via hashes ✅
* add the reference as named entity of class acdh:Publication and link it with acdh:isSourceOf ✅
* convert URL to ASCII ✅
* Enrich the description of files ✅
** if linked entity has ext ref system URL, check with "arche_assets":https://github.com/acdh-oeaw/arche-assets?tab=readme-ov-file#python, if correct, then add a link to a new entity (only for Actors and Places)✅
** if no ext ref system, then just add a named entity ✅
* Add file checker ✅ --> functionality will be used in #2580
* create API endpoint for ARCHE metadata ✅
* write manual entry (what is needed, where and who can enter metadata, who can export, which file checkers are there) ✅
Todo:
* SQL dump with only the project data -> #2613