Task B Dataset

The dataset for Task B is designed for trait-specific Arabic automated essay scoring (AES) and includes essays written by native Arabic-speaking first-year university students under test-like conditions. It contains unique essay IDs, prompt IDs, the full text of each essay, and scores for seven traits: Relevance, Organization, Vocabulary, Style, Development, Mechanics, and Grammar.

Dataset Format

The dataset for Task B includes three files:

TAQEEM2025_TaskB_train_prompts.json: This file contains the writing prompts provided to students.

Each entry includes:

prompt_id (Integer): Unique identifier for each prompt.
prompt_text (String): The actual prompt text given to the students.
prompt_type (String): The type of the writing task ("persuasive" or "explanatory").

Example:

{

"prompt_id": 1,

"prompt_text": " ... باتَ اِهْتمام وحماس المراهقين",

"prompt_type": "explanatory"

}

TAQEEM2025_TaskB_train_essays.json: This file contains the full text of student essays written in response to a specific prompt.

Each entry includes:

prompt_id (Integer): Indicates which prompt the essay responds to (e.g., 1).
- essay_id (String): A six-digit unique identifier for the essay (e.g., "010210").
essay (String): The full essay text.

Example:

{

"prompt_id": 1,

"essay_id": "010210",

"essay": " ... الصحة والجسم السليم من نعم الله على الإنسان"

}

TAQEEM2025_TaskB_train_human_scores.csv: This file contains the trait-specific scores assigned by the human raters to each essay in CSV format.

Each row includes the following columns:

prompt_id (Integer): The ID of the prompt associated with the essay.
essay_id (String): The unique identifier of the essay, matching the essay_id in the essays file.
relevance (Integer): The relevance score assigned to the essay (range: 0–2).
organization (Integer): The organization score assigned to the essay (range: 0–5).
vocabulary (Integer): The vocabulary score assigned to the essay (range: 0–5).
style (Integer): The style score assigned to the essay (range: 0–5).
development (Integer): The development score assigned to the essay (range: 0–5).
mechanics (Integer): The mechanics score assigned to the essay (range: 0–5).
grammar (Integer): The grammar score assigned to the essay (range: 0–5).

Example:

prompt_id,essay_id,relevance,organization,vocabulary,style,development,mechanics,grammar

1,010210,2,4,4,4,4,3,3

**Note**

You must read the essay_id as a String to preserve leading zeros.

Grading Rubrics

The rubrics used to annotate the essays in our dataset were developed for the Core Academic Skills Test (CAST) by the Qatar University Testing Center (QUTC). They are designed to assess students’ ability to write persuasive and explanatory essays. The rubrics were also reviewed by independent specialists to ensure their validity and reliability. For more details, you can refer to the CAST rubrics available here.

Download the Dataset

The training and dev sets of for Task B will be released on June 10, 2025. They are only available to registered teams.

We will release the test set on July 20, 2025.

Acknowledgments

We thank Qatar University for supporting the dataset collection and annotation, and the Ministry of Education and Higher Education (MoE) in Qatar for facilitating data collection from male and female high schools across Qatar.

Page updated

Google Sites

Report abuse