Project

General

Profile

Text annotation » History » Version 17

Alexander Watzinger, 2024-09-11 15:03

1 12
h1. Text annotation
2
3
Issue #2079
4
5 13 Alexander Watzinger
Text annotation will be about linking entities (actors, places, ...) to specific parts of a text, instead of just linking them to the whole text like it is possible currently.
6 12
7
h2. Scope (for the first version)
8
9
* It would be used at *Source* (E33) and their translations 
10
* Only already linked entities are offered
11
* No overlapping annotations
12
13 17 Alexander Watzinger
h2. Topics to discuss
14
15
h3. Backend
16
17
* Which schema for annotation database table (model or web)?
18
* Label *text* ok for annotation column in database table? 
19
* How to deal with text changes in already annotated text?
20
21
h3. User interface
22
23
* Do we try to solve it in on form element or split in separate steps/views for text editing and text annotation?
24
* How to remove text annotations (maybe showing a list with delete buttons)?
25
26 12
h2. User interface
27
28 14 Alexander Watzinger
We need a tool for users to annotate. Although difficult, ideally text changes and annotations can be done in one form element. 
29 12
30
*Advantages*
31
* Easy to use
32
* It's not possible for users to "break" things
33
* Concurrent changes can be avoided with already implemented functionality
34
* Maybe take a look at APIS implementation: https://github.com/acdh-oeaw/apis-highlighter-ng
35 14 Alexander Watzinger
36
Otherwise we will have to split:
37
* *Text editing* - leaving as is but would be problematic when annotations can't be seen
38
* *Text annotation* - most likely easier to implement if "stand alone"
39 12
40
h2. Information storing
41
42
We will save the information in an extra database table. Draft for fields:
43
44
* *id* (int, required) generic internal database identifier 
45 15 Alexander Watzinger
* *source_id* (int, required)
46
* *entity_id* - (int, required) the entity is linked in the annotation, e.g. actor, place, artifact, ... 
47 12
* *link_start* (int, required)
48
* *link_end* - (int, required) or maybe the length?
49
* *user_id* - (int, not required) to track who added it 
50
* *text* - (text, not required) a kind of description field for text information, maybe think about the name, e.g. annotation, description, text, ... 
51
* *created* - timestamp of creation (set automatically at creation)
52
* *modified* - timestamp of modification (triggered automatically on change)
53
54 16 Alexander Watzinger
*Advantages*
55 12
* We can guarantee that there are no orphaned links
56
* It can than be used to e.g. be presented as HTML, TEI, "Web Annotation Data Model":https://www.w3.org/TR/annotation-model/, ...
57
58
h2. Time frame
59
60
Ideally we have a working basic version at the end of 2024 and can refine it in 2025.
61
62
h2. Ideas for future versions
63
64
* Offer links to external reference systems (e.g. GeoNames) which we would use to create new entities on the fly (using available meta information, creating links to reference system, ...). Interesting but a lot of work.
65
* A tool to find possible annotation candidates providing a result list to annotate multiple occurrences in one go