結果提交及評估公式

(English version goes here.)

結果提交

各系統的輸出結果存為一個結果檔 (run),檔名格式為

RITEVAL-(Team Name)-(Lang)-(Subtask Name)-(Run Number).txt
  • Team Name = 報名時所使用的 Group ID
  • Lang = CS, CT
  • Subtask Name = FV, SVBC, SVMC
  • Run Number = 01, 02, ... 05

例如:

RITEVAL-NTOUA-CT-SVBC-01.txt

結果檔內容格式

結果檔中每一行對應到每道題目,格式如下:

t2_ID [SPACE] Label [SPACE] Confidence

其中 Confidence 表示推測的信心度。

中文 FV 任務每道題目都應標出所屬三種標籤 (E, C, U 請參考任務定義) 之一。

中文 SV 的 BC 任務每道題目都應標出所屬兩種標籤 (Y, N 請參考任務定義) 之一。

中文 SV 的 MC 任務每道題目都應標出所屬四種標籤 (F, B, C, I 請參考任務定義) 之一。

FV 結果檔內容範例如下:

1 E 0.852
2 U 0.994
3 E 0.789
4 C 1.000
    :

SV 結果檔內容與 FV 相同,僅標籤集合不同。

系統描述

亦請提供各系統之簡短描述 (各結果所對應之系統需各提交一個系統描述檔),

包括策略、所使用之資訊或特徵、資源 (例如語言資源或 www)、以及所使用工具 (例 NLP 工具) 等等。

系統描述檔取名規則如下:

RITEVAL-(Team Name)-(Lang)-(Subtask Name)-(Run Number)-sysdesc.txt
    • Team Name = 報名時所使用的 Group ID
  • Lang = CS, CT
  • Subtask Name = FV, SVBC, SVMC
  • Run Number = 01, 02, ... 05

例如:

RITEVAL-NTOUA-CT-SVBC-01-sysdesc.txt

系統描述檔內容範例如下:

1. Approach
   [ ]rule: 
   [x]statistics: SVM
   [ ]hybrid
2. Feature/Information
   [ ]Overlapping
   [x]Alignment
   [ ]Transformation
   [x]Char/Word Overlapping
   [x]Syntactic Information
   [ ]Predicate-Argument Relationship
   [x]Named Entity
   [ ]Entity/Event
   [x]Temporal/Numeric Information
   [ ]Entailment
   [ ]Modality
   [ ]Polarity
   [x]Synonym/Antonym
   [x]Hypernym/Hyponym
   [ ]Meronym/Holonym
   [ ]Entity/Event Relationship
   [ ]Entailment Rule
3. Resources: word segmentation, syntactic parser
4. Tools: WordNet, Wikipedia

提交方式

請將所有檔案壓縮成一個檔案,以 e-mail 寄至 rite-val-organizers@nii.ac.jp , 期限是 2014/8/7 23:59 (已延長)

請在信件檔題標明所參加之任務以及語言為何。

收到結果檔後,我們會在 24 小時內回信確認。若您沒收到確認信,請與我們聯絡。

結果評估公式

Macro-F (所有類別的 F-measures 之 Macro-averaging 平均值)

Submission

Results from one system comprise a run. Name a run file in the following format:

RITEVAL-(Team Name)-(Lang)-(Subtask Name)-(Run Number).txt
  • Team Name = use the same short id used in NTCIR registration (Group ID).
  • Lang = CS, CT
  • Subtask Name = FV, SVBC, SVMC
  • Run Number = 01, 02, ... 05

Example:

RITEVAL-NTOUA-CT-SVBC-01.txt

Run Format

Each line contains the result for one t2 in the following format:

t2_ID [SPACE] Label [SPACE] Confidence

where Confidence is the confidence score.

In Chinese FV Subtasks, each t2 should be tagged in one of the three labels (E, C, U cf. Task Definition).

In Chinese SV-BC Subtasks, each t2 should be tagged in one of the two labels (Y, N cf. Task Definition).

In Chinese SV-MC Subtasks, each t2 should be tagged in one of the four labels (F, B, C, I cf. Task Definition).

Examples of one FV run:

1 E 0.852
2 U 0.994
3 E 0.789
4 C 1.000
    :

SV runs look like FV runs except that their label sets are different.

System Description

Please also create a text file (one file for each run) to provide brief system description,

including approaches, used information or features, resources (any language resource, web etc), and tools (NLP tools etc).

Name a run file in the following format:

RITEVAL-(Team Name)-(Lang)-(Subtask Name)-(Run Number)-sysdesc.txt
  • Team Name = use the same short id used in NTCIR registration (Group ID).
  • Lang = CS, CT
  • Subtask Name = FV, SVBC, SVMC
  • Run Number = 01, 02, ... 05

Example:

RITEVAL-NTOUA-CT-SVBC-01-sysdesc.txt

Example of a system description file is as follows:

1. Approach
   [ ]rule: 
   [x]statistics: SVM
   [ ]hybrid
2. Feature/Information
   [ ]Overlapping
   [x]Alignment
   [ ]Transformation
   [x]Char/Word Overlapping
   [x]Syntactic Information
   [ ]Predicate-Argument Relationship
   [x]Named Entity
   [ ]Entity/Event
   [x]Temporal/Numeric Information
   [ ]Entailment
   [ ]Modality
   [ ]Polarity
   [x]Synonym/Antonym
   [x]Hypernym/Hyponym
   [ ]Meronym/Holonym
   [ ]Entity/Event Relationship
   [ ]Entailment Rule
3. Resources: word segmentation, syntactic parser
4. Tools: WordNet, Wikipedia

Run Submission

Archive all files in zip, and send the result to rite-val-organizers@nii.ac.jp via email attachment by the end of the formal run period (2014/8/7 23:59, extended).

Specify which subtasks and languages you are participating in the mail title.

We'll email you a notification of acceptance as a reply within 24 hours of submission. Contact us if you did not receive the notification.

Evaluation Metrics

Macro-F (Macro-averaging F-measures over all labels)