Make token IDs stable
It should be possible to reference a specific version of the XML files:
- input: current XML file(s)
- output: retokenized XML file(s)
- optional: reference XML file(s) The token IDs in the retokenized XML should then not be identical to the token IDs in the current XML, but to those of the reference XML.