Data
The training set is now available. To request it, please email us the agreement at finnum@nlg.csie.ntu.edu.tw
Please cite the following paper when referring to the FinNum-2 dataset in academic publications.
Chung-Chi Chen, Hen-Hsen Huang, and Hsin-Hsi Chen. 2019. Numeral Attachment with Auxiliary Tasks. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'19), Paris, France.
Data Format
The NumAttach dataset consists of the fields "tweet", "target_num", "target_cashtag", "relation", and "reason_type", stored in JSON format.
Example:
{
"tweet": "$SQ is $39 per share with a P/E of over 150 and losing money...should be at least as good as them with a P/E of 30 and making money!!",
"target_num": "39",
"target_cashtag": "SQ",
"relation": 1,
"offset": 8
}
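As a minimal sketch of how a record like the one above might be read, the following Python snippet parses the example and locates the target numeral in the tweet. It rests on two assumptions that this page does not state: that "offset" is a 0-based character index of "target_num" within "tweet", and that a "relation" value of 1 means the numeral is attached to the cashtag. The helper names numeral_at_offset and is_attached are hypothetical and are not part of any official tooling.

```python
import json

# One record, copied verbatim from the example above.
RECORD_JSON = r'''
{
"tweet": "$SQ is $39 per share with a P/E of over 150 and losing money...should be at least as good as them with a P/E of 30 and making money!!",
"target_num": "39",
"target_cashtag": "SQ",
"relation": 1,
"offset": 8
}
'''


def numeral_at_offset(record):
    """Return the tweet substring that starts at `offset` and is as long as `target_num`.

    Assumption: `offset` is a 0-based character index into `tweet`.
    """
    start = record["offset"]
    return record["tweet"][start:start + len(record["target_num"])]


def is_attached(record):
    """Assumption: relation == 1 marks the numeral as attached to the cashtag."""
    return record["relation"] == 1


record = json.loads(RECORD_JSON)

# Sanity check: the characters at `offset` should spell out the target numeral.
print(numeral_at_offset(record) == record["target_num"])
print(record["target_cashtag"], is_attached(record))
```

A check like this is useful when a tweet contains several numerals (here 39, 150, and 30), since the offset is what distinguishes the target from the others.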
License
The annotated dataset is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license.