Data

Training Set is available now. Please write us an email with the agreement. Mail: finnum@nlg.csie.ntu.edu.tw


Please cite the following paper when referring to the FinNum-2 dataset in academic publications and papers.

Chung-Chi Chen, Hen-Hsen Huang, and Hsin-Hsi Chen. 2019. Numeral Attachment with Auxiliary Tasks. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'19), Paris, France pdf

Data Format

The NumAttach dataset is consisted of "tweet", "target_num", "target_cashtag", "relation", and "reason_type" in json format.

Example:

{

"tweet": "$SQ is $39 per share with a P/E of over 150 and losing money...should be at least as good as them with a P/E of 30 and making money!!",

"target_num": "39",

"target_cashtag": "SQ",

"relation": 1,

"offset": 8

}

License

The annotated dataset is licensed under the Creative Commons Attribution-Non-Commercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license.