OpenAtlas bITEM to ARCHE meeting 2025-10-17, 11:00¶
Location: Online
Updated information in the course of the meeting is in color and/or marked with an ✅. Every participant is welcome to add and adapt.
Topics are about archiving the bItem project, that used OpenAtlas respectively THANADOS for data acquisition and structuring, in the long term archiving system ARCHE
Participants¶
ARCHE
- Martina Trognitz
- Seta Stuhec
OpenAtlas
- Alexander Watzinger
- Bernhard Koschiček-Krombholz
- Nina Richards
bITEM
- Viola Winkler
- Stefan Eichert
Topics¶
See also: meeting protocol https://redmine.openatlas.eu/projects/uni/wiki/Meeting_2025-06-03
ARCHE needs some more information from bITEM; while having a look at the data and metadata, some problems were found that can't be solved technically by ARCHE:
Images¶
Multiple creators and license holders need to be separated
Who contributed to the project and should be stated as MetadataCreator
File clean up:
- No creator (490)
- No license holder (499)
- Not public (5)
- No license (1)
- No files (1)
- Duplicates (31)
- It was discussed what to do with bigger files that weren't uploaded to OpenAtlas (e.g. original 3D scans) and it was decided not to add them additionally because they are also archived at the NHM.
License issues
- Some sources still have a copyright attached, so they can't be archived in ARCHE in that way; please check and adjust if necessary
- Resolution of the 3D models provided in bITEM is low - should they be archived in ARCHE in low resolution and linked to NHM repository? bITEM: NHM repository is meant for long-term archiving as well, so that would be ideal esp. to prevent duplication of data; ARCHE is happy to link to other repositories in this case to avoid duplication and redundancy
- Sometimes formats were used that are not perfectly suited for the pictures - such as .png for scans. bITEM: were used as they also provide an alpha channel, and it was needed for those special cases; so the formats will be kept in that way
- Some images don't have a creator/license listed in their metadata - sources for some maps are missing; bITEM: that's human error and has to be checked again and will be corrected; every image not produced by the team
- Problem: ARCHE can't use Bildzitat as they need a proper license; therefore images with Bildzitat can't be used and would have to be taken out of the datapool (if they are not old enough to be public domain); bITEM: most of the images stem from publications though and would not be usable, they have to be deleted from the ARCHE dump; ARCHE: Wikimedia images are still okay though as they have a prober license (check if license and creator are mentioned in each case); Metadata for the images will be kept but images themselves will not be archived in ARCHE -> Therefore important that the citation of the deleted images is mentioned and stored as metadata, so the pictures are still findable if neccessary
- -> Everything that is Bildzitat will be taken out of the datapool automatically by Bernhard in a new dump; other images will be checked again if creator and license are cited
- -> All metadata will be archived in ARCHE but not every image will be
Descriptions on bITEM¶
- ARCHE: Every item on bITEM has a description next to the digital twin - is this text also included in OpenAtlas; bITEM: yes, all information presented is provided in the OpenAtlas instance (as description of the artifact/actor/...)
Duplicates and formats¶
Some file check issues
- PDF/A violations
- duplicated files
- ARCHE: duplicates of files and PDFs in the wrong format were submitted; PDF should be submitted in PDF/A format - bITEM/OpenAtlas: files are actually duplicates that were added to two entities; Bernhard provides a file checker that can show all duplicates; clean up should be done by bITEM
More bITEM Metadata¶
More metadata for the top collection
- better description
- contact
- etc. -> example on ARCHE: IUENNA
Deposition Agreement
ARCHE needs more background information on bITEM itself for the top collection - see IUENNA on ARCHE as a sample; description of project is needed, other information can be retracted from Datenübergabevertrag; a table with needed information will be provided by Martina to Viola, Roland and Stefan, Viola will provide a description for the bITEM project; furthermore the bITEM team (+ Bernhard) will keep working on the deposition agreement and will contact the ARCHE team if help is needed
Someone should serve as contact person who can be contacted by the ARCHE team if more problems arise during the archiving process - Stefan (CC an Nina) will serve as direct contact to ARCHE for any non-OpenAtlas questions; Viola and Roland will sign the deposition agreement
Further Information¶
Is it ok to incorporate the data into PFP?
PFP as prosopographic research platform at the ACDH ("Wikidata for names") - can bITEM data also be incorporated in there? bITEM: Yes of course it can be; Kulturpool and ARIADNE use ARCHE as source as well, but neither ARIADNE nor Kulturpool makes sense in the case of bITEM; ARIADNE gets the information vial THANADOS anyways; for Kulturpool data isn't suited; 3D models are integrated into Kulturpool anyway -> so it would be twice the work for the same outcome; if the bITEM team changes their mind about that, they'll reach out again
Archive Statistics
- Total size: 1274044869 bytes -> 1.27 Gigabytes
- Total entries: 620
- Files: 620
- Directories: 31