subugoe / ahiqar / backend / Merge requests / !88

Feature/#117 tokenize tei

Merged: Michelle Weidling requested to merge feature/#117-tokenize-tei into develop on Feb 19, 2021
  • Overview 3
  • Commits 14
  • Pipelines 0
  • Changes 8

Feature

Summary

This MR provides an in-memory tokenization of all relevant text nodes. Each word is wrapped in a tei:seg element with a unique ID that reflects its node identity.

The tokenization is applied before both the HTML creation and the annotation creation, so that words in the text panel can reference entities of the AnnotationAPI and vice versa.
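
To illustrate the idea, here is a minimal sketch in Python. It is not the code of this MR, and it makes several assumptions: every text node counts as "relevant", words are split on whitespace only, and the unique ID is a running counter (the hypothetical `t_w_N` scheme) standing in for whatever node identity the actual implementation uses. The function names `tokenize`, `_wrap` and `_make_segs` are illustrative.

```python
import copy
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"
ET.register_namespace("", TEI_NS)


def _make_segs(text, prefix, counter):
    """Build one tei:seg per whitespace-separated word in `text`."""
    segs = []
    for word in text.split():
        counter[0] += 1
        # Assumption: a running counter stands in for the node identity.
        seg = ET.Element(f"{{{TEI_NS}}}seg", {XML_ID: f"{prefix}_w_{counter[0]}"})
        seg.text = word
        seg.tail = " "  # whitespace is normalised to single spaces
        segs.append(seg)
    return segs


def _wrap(element, prefix, counter):
    """Replace the text and tail strings inside `element` with tei:seg children."""
    old_children = list(element)
    new_children = _make_segs(element.text or "", prefix, counter)
    for child in old_children:
        _wrap(child, prefix, counter)  # words inside the child come first
        tail_segs = _make_segs(child.tail or "", prefix, counter)
        child.tail = " " if tail_segs else None
        new_children.append(child)
        new_children.extend(tail_segs)
    element.text = None
    for child in old_children:
        element.remove(child)
    element.extend(new_children)


def tokenize(root, prefix="t"):
    """Return a tokenized copy of `root`; the input tree is left untouched."""
    result = copy.deepcopy(root)
    _wrap(result, prefix, [0])
    return result


if __name__ == "__main__":
    sample = ET.fromstring(f'<ab xmlns="{TEI_NS}">one <hi>two</hi> three</ab>')
    print(ET.tostring(tokenize(sample), encoding="unicode"))
```

Because the sketch works on a deep copy, the source tree stays unchanged, which mirrors the in-memory character of the tokenization described above. Running the same deterministic function before both downstream steps is what would make the IDs line up between them.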

Compliance with “Definition of Done”

  • Unit tests passed
  • Code reviewed
  • Product Owner accepts the User Story

Documentation

  • I provided my functions with appropriate documentation

Tests

Are we able to test this new feature?

  • Yes, everything can be done via unit tests.
  • Yes, you can test by following these steps:
    • build the repo locally
    • navigate to ${IP}/exist/restxq/textapi/ahikar/content/sample_teixml-82a.html
    • investigate the HTML with your developer tools. Each relevant word is wrapped in a separate xhtml:seg with an ID.
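
The manual inspection in the last step could also be scripted. The following is a rough sketch, assuming the page has been saved locally as well-formed XHTML and that each word wrapper carries a plain `id` attribute; the helper name `check_word_wrappers` and the default wrapper name `seg` are assumptions taken from the description above, not from the served markup.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag):
    """Strip a '{namespace}' prefix from an ElementTree tag name."""
    return tag.rsplit("}", 1)[-1]


def check_word_wrappers(xhtml, wrapper="seg"):
    """Count wrapper elements and report missing or duplicated IDs."""
    root = ET.fromstring(xhtml)
    ids = [
        el.get("id")
        for el in root.iter()
        if isinstance(el.tag, str) and local_name(el.tag) == wrapper
    ]
    counts = Counter(i for i in ids if i)
    return {
        "wrappers": len(ids),
        "missing_id": sum(1 for i in ids if not i),
        "duplicates": sorted(i for i, n in counts.items() if n > 1),
    }
```

A page that passes the check would report zero missing IDs and an empty list of duplicates.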

Changelog

  • I added a statement to the CHANGELOG.

Version number

  • I bumped the version number in build.properties.

Closes

Closes #117 (closed).

Logs and Screenshots

/cc Mathias Göbel, Frank Schneider, Michelle Weidling

Edited Feb 19, 2021 by Michelle Weidling
Source branch: feature/#117-tokenize-tei