plain text export and processing some documents
Introduced with !20 (merged) we made a limited subset of documents available for download. This function is for a specific task at a specific stage and MAY BE is subject to be removed later on. For a more generic approach i like to recommend the following:
- include the plain text serialization to the TextAPI, as we can provide derivatives like this as items with a specific MIME type. (This will also test the frontend to be ready for this use case!)
- load the documents for the collation separately via the API.
This will move the selection of the subset to the CollateX pipeline and will be more flexible when the selection changes. Also it will provide the plain texts to the world (an to those people usually dealing with this format, e.g. linguists and text miner).