Make token IDs stable

It should be possible to reference a specific version of the XML files:

  • input: current XML file(s)
  • output: retokenized XML file(s)
  • optional: reference XML file(s) The token IDs in the retokenized XML should then not be identical to the token IDs in the current XML, but to those of the reference XML.