Task 1: Quantitative Understanding (English)
Training and Test sets are available.
Please download the dataset from https://drive.google.com/drive/folders/10uQI2BZrtzaUejtdqNU9Sp1h0H9zhLUE?usp=sharing
[1] Chen, Chung-Chi, et al. "Improving Numeracy by Input Reframing and Quantitative Pre-Finetuning Task." Findings of the Association for Computational Linguistics: EACL 2023. 2023.
Task 2: Reading Comprehension of the Numerals in Text (Chinese)
Training and Test sets are available.
Please download the dataset from https://drive.google.com/file/d/16-a6d8FtGp17W8_l4eYd-h42NlvQJW7w/view
[2] Chen, Chung-Chi, Hen-Hsen Huang, and Hsin-Hsi Chen. "NQuAD: 70,000+ Questions for Machine Comprehension of the Numerals in Text." Proceedings of the 30th ACM International Conference on Information & Knowledge Management (CIKM). 2021.
Task 3: Numeral-Aware Headline Generation (English)
Dataset for Dry Run: https://drive.google.com/drive/folders/1fOuUboXOMuLzZv38TXP_nRRZ_I5NXT2V?usp=sharing
Training Set: Please Register to get the dataset. Registration Form: https://forms.gle/LCEhpuRECyaggmsS6
Test Set will be released by 10 January 2024.
[3] Jian-Tao Huang, Chung-Chi Chen, Hen-Hsen Huang, Hsin-Hsi Chen. "NumHG: A Dataset for Number-Focused Headline Generation", arXiv, 2023.