ELI5: Long Form QA

We provide scripts to recreate the dataset. Please follow the provided instructions in the README.

We provide a ROUGE evaluation script. We use F1 ROUGE to evaluate the quality of the full generated answer and the final 20% of the answer.

Seq2Seq Multi-task