Add reichsanzeiger gt (#5)
* feat: add Reichsanzeiger-GT to prepare.sh
* add Reichsanzeiger subsets
* add download for Reichsanzeiger subsets
* add first draft for reichsanzeiger gt
* extract sample bundles
* update .gitignore
* clean up
* tidy up project root
* tidy up volumes
* update README
* update data
* build: roll back to version where cis still runs
* build: remove default model mounting
parent cbe93933
Changed files:
- .gitignore: 2 additions, 1 deletion
- Dockerfile: 3 additions, 4 deletions
- README.md: 1 addition, 1 deletion
- data/workflows.json: 1301 additions, 7556 deletions
- data_srcs/default_data_sources.txt: 0 additions, 0 deletions
- data_srcs/reichsanzeiger_full.txt: 1 addition, 0 deletions
- data_srcs/reichsanzeiger_many_ads.list: 5 additions, 0 deletions
- data_srcs/reichsanzeiger_random.list: 6 additions, 0 deletions
- data_srcs/reichsanzeiger_tables.list: 5 additions, 0 deletions
- data_srcs/reichsanzeiger_title_pages.list: 5 additions, 0 deletions
- docker-compose.yml: 2 additions, 3 deletions
- prepare.sh: 0 additions, 41 deletions
- scripts/convert-yml-to-json.py: 22 additions, 0 deletions
- scripts/prepare.sh: 98 additions, 0 deletions
- scripts/prepare_reichsanzeiger_sets.sh: 68 additions, 0 deletions
- src/benchmark_extraction.py: 5 additions, 13 deletions
- workflows/execute_workflows.sh: 18 additions, 36 deletions