The research team aims to develop and pilot test the education technology called iCODE (improve source CODE comprehension), which takes place in universities and community colleges in urban Tennessee and Minnesota.
For the development and usability, and feasibility work, the research team will collect data from approximately 400 students and at least eight instructors across the different institutions, primarily at the Tennessee and Minnesota universities. During the pilot study, they will recruit 300 college-level students from the two universities and Tennessee and Minnesota community colleges. Students at the Tennessee and Minnesota universities will come from intro-to-programming and introductory psychology courses, and students from the Tennessee and Minnesota community colleges will come from a wide range of backgrounds, including CS and non-CS majors.
iCODE will be a web-based intervention that adaptively monitors, tracks, models, and scaffolds students' source code comprehension processes as they engage in various code-comprehension tasks. iCODE draws on and integrates theories of reading comprehension and source code comprehension, motivation theory, and frameworks of self-regulated learning with open prosocial learner models (OPLMs) and animated pedagogical agents (APAs) and adopts culturally responsive teaching models to address the fundamental difficulty of students in intro-to-programming classes to construct accurate mental models during source code comprehension. iCODE will focus on two coding languages, Java and Python, and will be a web-based supplemental tool, such as those commonly used in the lab component of computer science courses. The web-based environment will also include motivational elements such as mastery framing and self-assessment through an aspirational peer in the OPLM to increase students' self-regulated, mastery-oriented engagement with the iCODE system and the assigned instructional tasks. Furthermore, as a way to increase engagement and learning effectiveness, iCODE will adapt to students' background and career goals. The researchers will design the assignments and the APAs to follow a comprehenshive learner and instruction model.
The research will occur in three major phases: prototyping, formative evaluation and refinement, and summative evaluation. The iterative development work happens over the first two phases. During these phases, the researchers will engage with stakeholders, such as university instructors and students, through interviews. They will use cases and walkthroughs to gain insights from their knowledge and experience to develop and refine iCODE. This work will also address usability, feasibility, and cost issues to help guide revisions. Once the full iCODE is complete, the researchers will conduct a pilot study (the summative evaluation phase). They will collect data from students across multiple sites in the introduction to programming courses. They will compare CS and non-CS majors' outcomes and consider different underrepresented subgroups in CS (e.g., women and racial minorities).
During the pilot study, the researchers will compare iCODE to two control conditions: standard instruction (business-as-usual control) and another intelligent tutoring system, DeepTutor, which does not include the core reading strategies of iCODE but can still provide a one-to-one tutoring context for implementation.
During the development phase, the research team will use data from interviews, think-aloud, eye-tracking, and log files. During the pilot study, they will assess learning using the Foundational CS1 test or a similar one, course completion and course grades, and source code comprehension. They will evaluate motivation and beliefs using the Computer Science Attitude and Identity Survey, the Computer Programming Self-Efficacy Scale, and the Achievement-Goal Orientation Questionnaire-Revised. They will measure engagement using logfiles and measures such as time spent on tasks, self-explanation verbiage, and session time. To estimate retention, they will administer the Intentions for CS survey at the end of the semester, asking students if they are likely to continue with their CS degree (for majors) or study further and use programming in their work (non-CS majors) and course registration for up to two semesters.
The researchers will compare the output of the automated method with expert observations and judgments to analyze the accuracy of various iCODE components. For the summative final pilot study, the primary focus of the inferential analyses will be on testing the effectiveness of iCODE relative to the two control conditions. Statistical analyses will comprise descriptive and visual studies and fitting inferential models to each dependent variable separately. The researchers will fit a multiple regression model to each dependent variable individually and include pretest variables (covariates) in the model and two interaction terms testing if pretest status moderates the effect of iCODE.The models will consist of variables for CS major, gender, race/ethnicity, and first-generation status.
The researchers will conduct a cost analysis following the ingredients method, deriving the average unit cost to deliver the intervention at the student level based on the estimated total cost of each condition divided by the number of students served.