Combined Dataset 2009-10
This file combines
https://sites.google.com/site/assistmentsdata/home/assistment-2009-2010-data/non-skill-builder-data-2009-10
and
https://sites.google.com/site/assistmentsdata/home/assistment-2009-2010-data/skill-builder-data-2009-2010
----------------------------------------
In you are using this data set in a publication then please site this exact (https://sites.google.com/site/assistmentsdata/home/assistment-2009-2010-data/combined-dataset-2009-10) URL so folks can know which assistment data set you used.
If you are using it in a publication, please make reference to ASSISTments by citing our most commonly used paper to reference the ASSISTments system. (To be clear the paper below is from 2009 and does not use this data.)
Feng, M., Heffernan, N.T., & Koedinger, K.R. (2009). Addressing the assessment challenge in an Intelligent Tutoring System that tutors as it assesses. The Journal of User Modeling and User-Adapted Interaction, 19, 243-266. See more about this article here.
Get the data
It was stored here
http://web.cs.wpi.edu/~zpardos/datasets/assistments_2009_2010.zip
But is now hosted here
https://drive.google.com/open?id=0B2X0QD6q79ZJNEdiMHkyb0RNQlE
More info is available from this year school year here.
Column Headings
order_id
These id's are chronological, and refer to the id of the original problem log.
assignment_id
Two different assignments can have the same sequence id. Each assignment is specific to a single teacher/class.
user_id
The ID of the student doing the problem.
assistment_id
The ID of the ASSISTment. An ASSISTment consists of one or more problems.
problem_id
The ID of the problem.
original
1 = Main problem
0 = Scaffolding problem
correct
1 = Correct on first attempt
0 = Incorrect on first attempt, or asked for help.
attempt_count
Number of student attempts on this problem.
ms_first_response
The time in milliseconds for the student's first response.
tutor_mode
tutor, test mode, pre-test, or post-test
answer_type
choose_1: Multiple choice (radio buttons)
algebra: Math evaluated string (text box)
fill_in: Simple string-compared answer (text box)
open_response: Records student answer, but their response is always marked correct
sequence_id
The content id of the problem set. Different assignments that assign the same problem set will have the same sequence id.
student_class_id
The class ID.
position
Assignment position on the class assignments page.
problem_set_type
Linear - Student completes all problems in a predetermined order.
Random - Student completes all problems, but each student is presented with the problems in a different random order.
Mastery - Random order, and student must "master" the problem set by getting a certain number of questions correct in a row before being able to continue.
base_sequence_id
This is to account for if a sequence has been copied. This will point to the original copy, or be the same as sequence_id if it hasn't been copied.
list_skill_ids
A semi-colon-delimited list of the IDs of the skills associated with the problem.
list_skills
A semi-colon-delimited list of the skills associated with the problem.
teacher_id
The ID of the teacher who assigned the problem.
school_id
The ID of the school where the problem was assigned.
You might the extra detail on these columns elsewhere on this cite like here
https://sites.google.com/site/assistmentsdata/how-to-interpret