Text annotation » History » Version 12

Alexander Watzinger, 2024-08-09 13:46

h1. Text annotation

Issue #2079

Text annotation has been on our radar for some time now, so I created an issue to discuss how we can proceed.
Basically, it's about linking entities (actors, places, ...) to specific parts of a text instead of only to the whole text, which is all that is currently possible.

h2. Scope (for the first version)

* It would be used for *Source* (E33) entities and their translations
* Only entities that are already linked are offered
* No overlapping annotations
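The "no overlapping annotations" rule could be checked with a simple interval comparison before an annotation is saved. The following is only a sketch: it assumes that annotations are stored as character offsets with an exclusive end, and @Span@, @overlaps@ and @can_add@ are hypothetical names for illustration, not existing code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """Annotated part of a text (hypothetical helper type)."""
    start: int  # offset of the first annotated character
    end: int    # offset just past the last annotated character (assumed exclusive)


def overlaps(a: Span, b: Span) -> bool:
    """Two half-open spans overlap if each one starts before the other ends."""
    return a.start < b.end and b.start < a.end


def can_add(existing: list[Span], new: Span) -> bool:
    """Accept a new annotation only if it is non-empty and overlaps nothing."""
    return new.start < new.end and not any(overlaps(new, s) for s in existing)


existing = [Span(0, 5), Span(10, 20)]
print(can_add(existing, Span(5, 10)))  # fits exactly into the gap
print(can_add(existing, Span(4, 12)))  # collides with both existing spans
```

With an exclusive end, two annotations that directly follow each other do not count as overlapping. Whether that is the desired behaviour would still have to be decided.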

h2. User interface

We need a tool that lets users annotate texts. Although this is difficult to implement, text changes and annotations should be possible in one form element.

*Advantages*
* Easy to use
* It's not possible for users to "break" things
* Concurrent changes can be avoided with already implemented functionality
* Maybe take a look at the APIS implementation: https://github.com/acdh-oeaw/apis-highlighter-ng

h2. Information storing

We will save the information in an extra database table. Draft for fields:

* *id* - (int, required) generic internal database identifier
* *source_id* - (int, required)
* *entity_id* - (int, required) the entity that is linked in the annotation, e.g. actor, place, artifact, ...
* *link_start* - (int, required)
* *link_end* - (int, required) or maybe the length?
* *user_id* - (int, not required) to track who added it
* *text* - (text, not required) a kind of description field for text information; the name is still open, e.g. annotation, description, text, ...
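To make the draft more concrete, the fields above could be sketched as a table definition. This is only an illustration under several assumptions: SQLite in memory is used purely to keep the example runnable, the table name @annotation_text@ is hypothetical, a single stand-in @entity@ table represents both sources and linked entities, and the foreign keys and @CHECK@ constraints are suggestions rather than decided design.

```python
import sqlite3

# SQLite in memory is used only to keep the sketch runnable;
# table names and constraints are assumptions, not a final design.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces references only if enabled
conn.executescript("""
    CREATE TABLE entity (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
    CREATE TABLE annotation_text (
        id         INTEGER PRIMARY KEY,
        source_id  INTEGER NOT NULL REFERENCES entity(id) ON DELETE CASCADE,
        entity_id  INTEGER NOT NULL REFERENCES entity(id) ON DELETE CASCADE,
        link_start INTEGER NOT NULL CHECK (link_start >= 0),
        link_end   INTEGER NOT NULL CHECK (link_end > link_start),
        user_id    INTEGER,  -- optional: who added the annotation
        text       TEXT      -- optional: description of the annotation
    );
""")

INSERT = ("INSERT INTO annotation_text "
          "(source_id, entity_id, link_start, link_end) VALUES (?, ?, ?, ?)")


def annotation_count() -> int:
    return conn.execute("SELECT COUNT(*) FROM annotation_text").fetchone()[0]


conn.execute("INSERT INTO entity (id, name) VALUES (1, 'A letter'), (2, 'An actor')")
conn.execute(INSERT, (1, 2, 10, 18))

# An end offset that lies before the start offset violates the CHECK constraint
try:
    conn.execute(INSERT, (1, 2, 30, 25))
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

before = annotation_count()
conn.execute("DELETE FROM entity WHERE id = 2")  # cascades to the annotation
after = annotation_count()
print(rejected, before, after)
```

If the length were stored instead of @link_end@, the second check would become a simple "greater than zero" constraint.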

*Advantages*
* We can guarantee that there are no orphaned links
* It can then be used to present the annotations as e.g. HTML, TEI, "Web Annotation Data Model":https://www.w3.org/TR/annotation-model/, ...
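As an illustration of the export idea, one stored annotation could be mapped to the Web Annotation Data Model using its @TextPositionSelector@, whose @end@ position is exclusive. This is a minimal sketch: the function name @to_web_annotation@, the URL layout and @https://example.org@ are hypothetical, and it assumes that @link_end@ is stored as an exclusive offset.

```python
import json


def to_web_annotation(row: dict, base_url: str) -> dict:
    """Map one stored annotation to a Web Annotation with a TextPositionSelector."""
    return {
        "@context": "http://www.w3.org/ns/anno.jsonld",
        "id": f"{base_url}/annotation/{row['id']}",     # hypothetical URL layout
        "type": "Annotation",
        "body": f"{base_url}/entity/{row['entity_id']}",
        "target": {
            "source": f"{base_url}/entity/{row['source_id']}",
            "selector": {
                "type": "TextPositionSelector",
                "start": row["link_start"],
                "end": row["link_end"],
            },
        },
    }


row = {"id": 7, "source_id": 1, "entity_id": 2, "link_start": 10, "link_end": 18}
print(json.dumps(to_web_annotation(row, "https://example.org"), indent=2))
```

HTML or TEI output could be generated from the same rows in a similar way, because the offsets are stored independently of any markup.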

h2. Topics to discuss

* How to deal with text changes in already annotated text
* How to remove text annotations (maybe showing a list with delete buttons)?
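As input for the discussion about text changes, one possible approach is to shift the offsets of annotations that lie after an edit and to flag annotations whose text was touched by it. This is only a sketch of that one option, not a decision: @shift_annotations@ is a hypothetical helper, annotations are assumed to be @(start, end)@ pairs with an exclusive end, and the edit is assumed to be known as a position plus the number of removed and inserted characters.

```python
def shift_annotations(annotations, edit_pos, removed, inserted):
    """Return (kept, needs_review) after `removed` characters at `edit_pos`
    were replaced by `inserted` new characters."""
    delta = inserted - removed
    edit_end = edit_pos + removed
    kept, needs_review = [], []
    for start, end in annotations:
        if end <= edit_pos:        # entirely before the edit: unchanged
            kept.append((start, end))
        elif start >= edit_end:    # entirely after the edit: shift both offsets
            kept.append((start + delta, end + delta))
        else:                      # the edit touches the annotated text
            needs_review.append((start, end))
    return kept, needs_review


old_text = "Anna met Karl in Wien."
new_text = "Anna often met Karl in Wien."  # "often " inserted at position 5
annotations = [(0, 4), (9, 13), (17, 21)]  # Anna, Karl, Wien

kept, needs_review = shift_annotations(annotations, edit_pos=5, removed=0, inserted=6)
print([new_text[start:end] for start, end in kept])
print(needs_review)
```

Annotations that end up in @needs_review@ could, for example, be shown to the user in the list with delete buttons mentioned above.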

h2. Time frame

Ideally, we have a working basic version by the end of 2024 and can refine it in 2025.

h2. Ideas for future versions

* Offer links to external reference systems (e.g. GeoNames), which we would use to create new entities on the fly (using available meta information, creating links to the reference system, ...). Interesting, but a lot of work.
* A tool to find possible annotation candidates, providing a result list to annotate multiple occurrences in one go
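The candidate search from the last idea could start as a plain search for an entity name in the text. The sketch below is only meant to illustrate the idea: @find_candidates@ is a hypothetical function, matching is a simple whole-word search for the exact name, and already annotated spans are assumed to be @(start, end)@ pairs with an exclusive end.

```python
import re


def find_candidates(text, name, annotated=()):
    """Return (start, end) spans where `name` occurs as a whole word
    and does not overlap an already annotated span."""
    pattern = re.compile(r"\b" + re.escape(name) + r"\b")
    candidates = []
    for match in pattern.finditer(text):
        start, end = match.span()
        taken = any(start < a_end and a_start < end for a_start, a_end in annotated)
        if not taken:
            candidates.append((start, end))
    return candidates


text = "Vienna was visited. Later, Vienna grew."
for start, end in find_candidates(text, "Vienna", annotated=[(0, 6)]):
    print(start, end, text[start:end])
```

Name variants, inflected forms and names that end in punctuation are not covered by this simple matching and would need more work.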