Combined Dataset 2009-10

This file combines

https://sites.google.com/site/assistmentsdata/home/assistment-2009-2010-data/non-skill-builder-data-2009-10

and

https://sites.google.com/site/assistmentsdata/home/assistment-2009-2010-data/skill-builder-data-2009-2010

----------------------------------------

In you are using this data set in a publication then please site this exact (https://sites.google.com/site/assistmentsdata/home/assistment-2009-2010-data/combined-dataset-2009-10) URL so folks can know which assistment data set you used.

If you are using it in a publication, please make reference to ASSISTments by citing our most commonly used paper to reference the ASSISTments system. (To be clear the paper below is from 2009 and does not use this data.)

    • Feng, M., Heffernan, N.T., & Koedinger, K.R. (2009). Addressing the assessment challenge in an Intelligent Tutoring System that tutors as it assesses. The Journal of User Modeling and User-Adapted Interaction, 19, 243-266. See more about this article here.

Get the data

It was stored here

http://web.cs.wpi.edu/~zpardos/datasets/assistments_2009_2010.zip

But is now hosted here

https://drive.google.com/open?id=0B2X0QD6q79ZJNEdiMHkyb0RNQlE

More info is available from this year school year here.

Column Headings

  • order_id

      • These id's are chronological, and refer to the id of the original problem log.

  • assignment_id

      • Two different assignments can have the same sequence id. Each assignment is specific to a single teacher/class.

  • user_id

      • The ID of the student doing the problem.

  • assistment_id

      • The ID of the ASSISTment. An ASSISTment consists of one or more problems.

  • problem_id

      • The ID of the problem.

  • original

      • 1 = Main problem

      • 0 = Scaffolding problem

  • correct

      • 1 = Correct on first attempt

      • 0 = Incorrect on first attempt, or asked for help.

  • attempt_count

      • Number of student attempts on this problem.

  • ms_first_response

      • The time in milliseconds for the student's first response.

  • tutor_mode

      • tutor, test mode, pre-test, or post-test

  • answer_type

      • choose_1: Multiple choice (radio buttons)

      • algebra: Math evaluated string (text box)

      • fill_in: Simple string-compared answer (text box)

      • open_response: Records student answer, but their response is always marked correct

  • sequence_id

      • The content id of the problem set. Different assignments that assign the same problem set will have the same sequence id.

  • student_class_id

      • The class ID.

  • position

      • Assignment position on the class assignments page.

  • problem_set_type

      • Linear - Student completes all problems in a predetermined order.

      • Random - Student completes all problems, but each student is presented with the problems in a different random order.

      • Mastery - Random order, and student must "master" the problem set by getting a certain number of questions correct in a row before being able to continue.

  • base_sequence_id

      • This is to account for if a sequence has been copied. This will point to the original copy, or be the same as sequence_id if it hasn't been copied.

  • list_skill_ids

      • A semi-colon-delimited list of the IDs of the skills associated with the problem.

  • list_skills

      • A semi-colon-delimited list of the skills associated with the problem.

  • teacher_id

      • The ID of the teacher who assigned the problem.

  • school_id

      • The ID of the school where the problem was assigned.

You might the extra detail on these columns elsewhere on this cite like here

https://sites.google.com/site/assistmentsdata/how-to-interpret