Merge request (merged): feature/#117-tokenize-tei into develop
This MR provides an in-memory tokenization of all relevant text nodes. Each word is wrapped in a
tei:seg with a unique ID which reflects its node identity.
This tokenization is applied before both the HTML creation and the annotation creation, so that words in the text panel can reference entities of the AnnotationAPI and vice versa.
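For readers who want a concrete picture of the tokenization described above, here is a minimal sketch in Python. It is not the code from this MR. The function names (`tokenize`, `_wrap`), the `w` ID prefix, and the running counter are hypothetical. The counter only stands in for the node-identity-based ID, and the sketch treats every text node as relevant, which is probably broader than the real selection.

```python
import itertools
import re
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"
TOKEN = re.compile(r"(\s+)|(\S+)")  # group 1: whitespace run, group 2: word


def _wrap(text, make_id):
    """Split one text node into (leading whitespace, list of tei:seg elements)."""
    lead, segs = "", []
    for match in TOKEN.finditer(text or ""):
        space, word = match.groups()
        if word is not None:
            seg = ET.Element(f"{{{TEI_NS}}}seg", {XML_ID: make_id()})
            seg.text = word
            segs.append(seg)
        elif segs:
            # Whitespace after a word becomes the tail of that word's seg.
            segs[-1].tail = (segs[-1].tail or "") + space
        else:
            lead += space
    return lead, segs


def tokenize(root, prefix="w"):
    """Wrap every word below root in a tei:seg, in memory and in place.

    Assumption: IDs come from a running counter (w1, w2, ...), used here
    as a placeholder for an ID derived from the node identity.
    """
    counter = itertools.count(1)

    def make_id():
        return f"{prefix}{next(counter)}"

    def visit(el):
        original_children = list(el)
        lead, rebuilt = _wrap(el.text, make_id)
        el.text = lead or None
        for child in original_children:
            visit(child)  # tokenize the child's own text first (document order)
            tail_lead, tail_segs = _wrap(child.tail, make_id)
            child.tail = tail_lead or None
            rebuilt.append(child)
            rebuilt.extend(tail_segs)
        el[:] = rebuilt  # replace the child list with words wrapped in segs

    visit(root)
    return root


if __name__ == "__main__":
    doc = ET.fromstring(f'<p xmlns="{TEI_NS}">Hello <hi>brave new</hi> world</p>')
    print(ET.tostring(tokenize(doc), encoding="unicode"))
```

Because whitespace is kept outside the segs, the concatenated text of the document is unchanged. Only the word boundaries gain an addressable ID that both the HTML and the annotations can point to.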
Compliance with “Definition of Done”
Unit tests passed
Product Owner accepts the User Story
I provided my functions with appropriate documentation
Are we able to test this new feature?
Yes, everything can be done via unit tests.
Yes, you can test by following these steps:
- build the repo locally
- navigate to
- investigate the HTML with your developer tools. Each relevant word is wrapped in a separate
xhtml:seg with an ID.
I added a statement to the CHANGELOG.
I bumped the version number in
Closes #117.