- 05 Feb, 2019 40 commits
Restructuring the XML so that tei:milestone[@unit = "paragraph"] is expanded to tei:p is costly in terms of performance and adds little value, so we decided to drop this functionality and leave the milestones as they are.
To remove duplicate tei:handShift elements, we first add every missing attribute to each tei:handShift as an empty attribute node, which makes the comparison easier. Afterwards these empty attributes are removed again.
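The pad, compare, and strip approach for tei:handShift could look roughly like the sketch below. It is written in Python with the standard library only and is not the project's actual code. Two things are assumptions made for illustration: that a "duplicate" is a tei:handShift whose attributes equal those of the immediately preceding tei:handShift in document order, and the function name `dedupe_handshifts`.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
HANDSHIFT = f"{{{TEI_NS}}}handShift"


def _remove_keep_tail(parent, elem):
    """Remove elem from parent without losing the text that follows it."""
    idx = list(parent).index(elem)
    tail = elem.tail or ""
    if idx == 0:
        parent.text = (parent.text or "") + tail
    else:
        prev = parent[idx - 1]
        prev.tail = (prev.tail or "") + tail
    parent.remove(elem)


def dedupe_handshifts(root):
    """Hypothetical helper: drop handShifts that repeat the previous one."""
    shifts = list(root.iter(HANDSHIFT))

    # Step 1: pad every handShift with empty values for attributes it lacks,
    # so that all elements carry the same attribute set.
    all_names = sorted({name for hs in shifts for name in hs.attrib})
    for hs in shifts:
        for name in all_names:
            hs.attrib.setdefault(name, "")

    # Step 2: compare each handShift with the last one kept (assumed
    # definition of "duplicate") and remove it if the attributes match.
    parent_of = {child: parent for parent in root.iter() for child in parent}
    previous = None
    for hs in shifts:
        current = dict(hs.attrib)
        if current == previous:
            _remove_keep_tail(parent_of[hs], hs)
        else:
            previous = current

    # Step 3: strip the empty attributes again. Note that this also drops
    # attributes that were already empty in the input.
    for hs in root.iter(HANDSHIFT):
        for name in [n for n, v in hs.attrib.items() if v == ""]:
            del hs.attrib[name]
    return root
```

In Python the padding is not strictly required, since attribute dictionaries can be compared directly; it is kept here only to mirror the steps described in the commit message.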
This should make sorting out handShifts in the next processing step easier.
We noticed that the elimination of duplicate tei:handShift elements is buggy. We decided that the fix is better implemented in the second serialisation step, which tidies up the whole document, because the XML hierarchy is flatter at that point.