Skip to content
Snippets Groups Projects

pipeline status

PROV@TOS

A W3C Prov Wrapper for Talend Open Studio for Data Integration Jobs

Getting and compiling the software

This project compiles best with Maven. To do so, clone this repository:

git clone https://gitlab.gwdg.de/medinfpub/tos/provAtTos

Afterwards, compile it using maven using mvn package in the cloned folder. It will compile all files to the target folder. There may be other ways to start it using maven, I just used this exact method to compile the project. Any suggestions (via an issue) to improve this method are appreciated.

Running PROV@TOS

To wrap around an existing Talend Open Studio for Data Integration job, export it using the "Build Job"-function (accessible in the context menu of a job under "Job Designs"). The "Context script" as well as the "Items" are mandatory, as they are used in the wrapper.

Run it with the command:

java -jar target/provAtTos.jar -o /some/output/dir /my/talend/job.zip

Without an output (specified with the -o or --output argument), the current working dir will be used to save the file that need to be created (e.g. a provenance file in the desired format, XML-files with MapperData from tMap components, ...). The format of the provenance data can be changed using the -f (or --format). Available formats are: xml, turtle, provn (which is the default):

java -jar target/provAtTos.jar -f turtle /my/talend/job.zip

It is possible to add additional arguments, which will be then provided to the Talend job itself, using the -t (or --tos) option:

java -jar target/provAtTos.jar -t "--context_param=Default" /my/talend/job.zip

A complete example could look like this:

java -jar target/provAtTos.jar -o /home/username/Documents/provenanceOutput -f xml -t "--context_param=Testing" /home/username/workspace/dataIntegrationJob.zip

Licence