To reproduce data collection, run the scripts in `data_pipeline/` in this order: `tempLama.py`, `addWikiDataIDs.py`, `fetch_most_common_name.py`, `offices_parser.mjs`, `sport_parser.mjs` (a minimal driver is sketched below).
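A minimal sketch of that order, assuming each script runs with no extra command-line arguments (check the individual scripts for required arguments or API keys); the `.mjs` parsers need a Node.js runtime:

```python
# Minimal sketch of the data-collection order; assumes each script needs no
# command-line arguments and runs inside data_pipeline/ — adjust the commands
# if a script expects arguments or credentials.
import subprocess

PIPELINE = [
    ["python", "tempLama.py"],
    ["python", "addWikiDataIDs.py"],
    ["python", "fetch_most_common_name.py"],
    ["node", "offices_parser.mjs"],   # the .mjs parsers need a Node.js runtime
    ["node", "sport_parser.mjs"],
]

for cmd in PIPELINE:
    print("Running:", " ".join(cmd))
    subprocess.run(cmd, cwd="data_pipeline", check=True)
```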
Inference scripts for the benchmark under multiple settings are in the `inference/` folder. Experiments used `ollama` for open-weight models and `litellm` for proprietary models; a sketch of both call patterns follows.
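A minimal sketch of the two call patterns, with `llama3`, `gpt-4o`, and the question string as placeholders (the actual prompts, few-shot examples, and model names are set in the scripts under `inference/`):

```python
# Hedged sketch of the two inference backends; model names and the question
# are placeholders, not the benchmark's actual prompts.
import ollama                    # open-weight models served by a local ollama instance
from litellm import completion   # proprietary models behind one OpenAI-style API

question = "List all positions held by Angela Merkel, with the years she held them."

# Open-weight model via ollama (requires a running ollama server and a pulled model)
open_weight = ollama.chat(
    model="llama3",
    messages=[{"role": "user", "content": question}],
)
print(open_weight["message"]["content"])

# Proprietary model via litellm (reads the provider's API key from the environment)
proprietary = completion(
    model="gpt-4o",
    messages=[{"role": "user", "content": question}],
)
print(proprietary.choices[0].message.content)
```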
The script `evaluateResults.py` in `evaluation/` reproduces the reported numbers and plots.
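For example, a hypothetical invocation from the repository root (the script may expect paths to the inference outputs, so check its argument parsing first):

```python
import subprocess

# Hypothetical invocation; add result-file paths as arguments if the script requires them.
subprocess.run(["python", "evaluation/evaluateResults.py"], check=True)
```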
The title-to-infobox mapping can be downloaded here: https://drive.google.com/drive/folders/1KyYiQEYob5SkK3zyUCXoPRT19dANy2-a?usp=sharing
```
├── LICENSE.MD
├── README.md
├── TLQA.png
├── TLQA_data_splits
│ ├── benchmark_v0.0.json
│ └── splits
│ ├── add_wikipedia_titles.py
│ ├── get_golden_evidence.py
│ ├── test_split_benchmark_v0.0.json
│ ├── test_split_benchmark_v0.0_golden_evidence.json
│ ├── test_split_benchmark_v0.0_updated_with_titles.json
│ ├── train_split_benchmark_v0.0.json
│ ├── train_split_benchmark_v0.0_golden_evidence.json
│ └── train_split_benchmark_v0.0_updated_with_titles.json
├── TempLama
│ ├── data
│ │ ├── templama_test.json
│ │ ├── templama_train.json
│ │ └── templama_val.json
│ ├── evaluateData.py
│ ├── evaluateResults.py
│ ├── experiment.py
│ ├── gpt3_responses.csv
│ ├── listqas.json
│ ├── output.json
│ ├── queryGPT.py
│ ├── tempLama.py
│ ├── templama_test.json
│ ├── templama_train.json
│ └── templama_val.json
├── data_pipeline
│ ├── offices_parser.mjs
│ ├── sport_parser.mjs
│ ├── tempLama.py
│ └── test.py
├── inference
│ ├── few_shot.py
│ ├── rag_title_summary.py
│ └── rag_golden_evidence.py
├── evaluation
│ └── evaluateResults.py
├── output.txt
├── test.py
├── tlqa_1.png
└── tlqa_2.png
```