This is the top level Readme. For more information have a look at src/README.md
For convenience we offer a bash script to simplify using our system.
NOTE: this system requires CUDA, but it should be possible to deactivate this dependency, which will result in much longer runtimes.
- download the corpus files and place them in the data directory
- make script executable:
chmod +x run.sh
- Execute
./run.sh --help
. This will help you set up the environment and train or evaluate models.
This code belongs to the following paper and should be cited as the same:
Accepted at 27th International Conference on Text, Speech and Dialogue in Brno, Czech Republic, September 9-13, 2024
M. Schmidt, K. Harbusch and D. Memmesheimer (2024). Automatic Ellipsis Reconstruction in Coordinated German Sentences based on Text-To-Text Transfer Transformers.