Class Time: Thursday 2:10-4:00 pm, Location: 963 Ext Schermerhorn Hall
Instructor: Baishakhi Ray, E-mail: rayb@cs.columbia.edu, Office: CEPSR 604, Office Hour: By Appointment
Head TA: Bowen Yang, E-mail: by2365@columbia.edu, Office Hour: By Appointment
Participation: 5%
Paper Presentations & Critiques: 35%
Course Project: 60%
Popular Generative Models For Code
Code Llama https://arxiv.org/abs/2308.12950
StarCoder https://arxiv.org/abs/2305.06161
DeepSeek Coder https://arxiv.org/pdf/2401.14196
Alternative Generative Models For Code
CodeSage https://arxiv.org/pdf/2402.01935
Llama 2 https://arxiv.org/abs/2307.09288
CodeFuse https://arxiv.org/abs/2310.06266
Causal Masking https://arxiv.org/abs/2201.07520
SantaCoder https://arxiv.org/abs/2301.03988
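The causal-masking paper above builds on the standard decoder-only objective, in which each position may attend only to itself and earlier positions. A minimal sketch of a plain causal attention mask (the paper's causal-masked infilling objective generalizes this):

```python
import numpy as np

def causal_mask(seq_len: int) -> np.ndarray:
    """Lower-triangular boolean mask: position i may attend to positions <= i.
    Attention scores at False entries are set to -inf before the softmax."""
    return np.tril(np.ones((seq_len, seq_len), dtype=bool))

m = causal_mask(4)
print(m.astype(int))  # 4x4 lower-triangular matrix of ones
```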
The Stack https://arxiv.org/pdf/2211.15533
8K token context length https://arxiv.org/html/2402.17463v2
Fill-in-the-middle https://arxiv.org/pdf/2207.14255
Multi-Query-Attention https://arxiv.org/pdf/1911.02150
DeepSeek Coder Repo https://deepseekcoder.github.io/
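Multi-query attention, linked above, keeps separate query heads but shares a single key/value head across all of them, shrinking the KV cache at inference time by a factor of the head count. A toy NumPy sketch (shapes and projections are illustrative, not any model's actual configuration):

```python
import numpy as np

def multi_query_attention(x, Wq, Wk, Wv, n_heads):
    """Multi-query attention: n_heads query projections, but one shared
    key head and one shared value head. x has shape (seq, d_model)."""
    seq, d_model = x.shape
    d_head = d_model // n_heads
    q = (x @ Wq).reshape(seq, n_heads, d_head)  # per-head queries
    k = x @ Wk                                  # single shared key head
    v = x @ Wv                                  # single shared value head
    out = np.empty_like(q)
    for h in range(n_heads):
        scores = q[:, h, :] @ k.T / np.sqrt(d_head)
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
        out[:, h, :] = weights @ v
    return out.reshape(seq, d_model)

rng = np.random.default_rng(0)
seq, d_model, n_heads = 4, 8, 2
x = rng.normal(size=(seq, d_model))
y = multi_query_attention(
    x,
    rng.normal(size=(d_model, d_model)),
    rng.normal(size=(d_model, d_model // n_heads)),  # K/V are d_head wide
    rng.normal(size=(d_model, d_model // n_heads)),
    n_heads,
)
print(y.shape)  # (4, 8)
```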
CodeFuse https://arxiv.org/abs/2310.06266
Fill-in-the-middle https://arxiv.org/pdf/2207.14255
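Fill-in-the-middle training, linked above, rearranges each document into (prefix, suffix, middle) order so a left-to-right model learns to infill code given both sides. A minimal sketch of the PSM transform (the sentinel strings here are illustrative; real models use tokenizer-specific special tokens such as StarCoder's <fim_prefix>):

```python
import random

# Illustrative sentinels, not any model's exact token strings.
PRE, SUF, MID = "<PRE>", "<SUF>", "<MID>"

def fim_transform(doc: str, rng: random.Random) -> str:
    """Split a document at two random points and emit it in
    prefix-suffix-middle order: the model sees prefix and suffix,
    then is trained to generate the middle."""
    i, j = sorted(rng.sample(range(len(doc) + 1), 2))
    prefix, middle, suffix = doc[:i], doc[i:j], doc[j:]
    return f"{PRE}{prefix}{SUF}{suffix}{MID}{middle}"

rng = random.Random(0)
print(fim_transform("def add(a, b):\n    return a + b\n", rng))
```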
CodeGen https://arxiv.org/abs/2203.13474
Large Language Models Meet NL2Code https://aclanthology.org/2023.acl-long.411/
A Survey on Language Models for Code https://arxiv.org/abs/2311.07989
Deep Learning for Source Code Modeling and Generation https://arxiv.org/abs/2002.05442
CodeT5+ (Encoder-Decoder Models) https://arxiv.org/abs/2305.07922
CodeFusion (Diffusion Models) https://www.microsoft.com/en-us/research/publication/codefusion-a-pre-trained-diffusion-model-for-code-generation/
DALL-E 2 https://arxiv.org/abs/2204.06125
LiveCodeBench https://arxiv.org/abs/2403.07974
SWE-bench: Can Language Models Resolve Real-World GitHub Issues? https://arxiv.org/abs/2310.06770
LiveCodeBench Repo https://livecodebench.github.io/
HumanEval/Codex (Accuracy) https://arxiv.org/abs/2107.03374
ReCode: Robustness Evaluation of Code Generation Models (Trustworthiness) https://arxiv.org/abs/2212.10264
DevBench: A Comprehensive Benchmark for Software Development https://arxiv.org/abs/2403.08604
DevEval: Evaluating Code Generation in Practical Software Projects https://arxiv.org/abs/2401.06401
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion https://arxiv.org/abs/2310.11248
Evaluating the Code Quality of AI-Assisted Code Generation Tools: An Empirical Study on GitHub Copilot, Amazon CodeWhisperer, and ChatGPT https://arxiv.org/abs/2304.10778
ReCode: Robustness Evaluation of Code Generation Models (Trustworthiness) https://arxiv.org/abs/2212.10264
CodeXGLUE https://arxiv.org/abs/2102.04664
CodeContest/AlphaCode https://arxiv.org/abs/2203.07814
DS-1000 https://arxiv.org/abs/2211.11501
xCodeEval https://arxiv.org/abs/2303.03004
BigCode Eval Harness https://github.com/bigcode-project/bigcode-evaluation-harness
BigCodeBench https://huggingface.co/blog/leaderboard-bigcodebench
LMSYS Coding https://lmarena.ai/?leaderboard
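Several of the benchmarks above (HumanEval among them) report pass@k. The unbiased estimator introduced in the Codex paper, given n sampled completions of which c pass the tests:

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k from the Codex paper: the probability that at
    least one of k samples drawn (without replacement) from the n
    generated completions passes, given that c of the n passed.
    pass@k = 1 - C(n-c, k) / C(n, k)."""
    if n - c < k:  # fewer than k failures: some draw must include a pass
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

print(pass_at_k(n=20, c=5, k=1))  # 0.25
```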
AutoCodeRover https://arxiv.org/abs/2404.05427
TBD
CODEDPO https://arxiv.org/pdf/2410.05605
LintSeq https://arxiv.org/pdf/2410.02749
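CODEDPO applies preference optimization to code generation. As a reference point, the generic DPO loss for a single (chosen, rejected) pair can be sketched as follows; this is the standard formulation, not the paper's exact recipe:

```python
import math

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """Direct Preference Optimization loss for one preference pair:
    logp_w / logp_l are the policy's log-probs of the chosen and
    rejected completions, ref_* the frozen reference model's. The loss
    pushes the policy's margin over the reference toward the winner."""
    margin = beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l))
    return -math.log(1.0 / (1.0 + math.exp(-margin)))  # -log sigmoid(margin)

print(round(dpo_loss(-5.0, -9.0, -6.0, -8.0), 4))  # 0.5981
```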
CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context https://arxiv.org/abs/2212.10007
RepoFusion: Training Code Models to Understand Your Repository https://arxiv.org/abs/2306.10998
Guiding Language Models of Code with Global Context using Monitors https://arxiv.org/abs/2306.10763
CodePlan: Repository-level Coding using LLMs and Planning https://arxiv.org/abs/2309.12499
A^3-CodGen: A Repository-Level Code Generation Framework for Code Reuse with Local-Aware, Global-Aware, and Third-Party-Library-Aware https://arxiv.org/abs/2312.05772
REPOFUSE: Repository-Level Code Completion with Fused Dual Context https://arxiv.org/abs/2402.14323
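A recurring recipe in the repository-level papers above is retrieving relevant cross-file context and prepending it to the completion prompt. A toy sketch using identifier overlap as the retrieval score (purely illustrative; the papers use richer retrieval, static analysis, and fusion strategies):

```python
import re

def retrieve_cross_file_context(query: str, repo_files: dict, top_k: int = 1):
    """Rank repository files by identifier overlap with the local
    (in-file) context and return snippets to prepend to the prompt."""
    tokens = lambda s: set(re.findall(r"\w+", s))
    q = tokens(query)
    scored = sorted(repo_files.items(),
                    key=lambda kv: len(q & tokens(kv[1])), reverse=True)
    return [f"# from {path}\n{src}" for path, src in scored[:top_k]]

repo = {
    "utils/math_ops.py": "def add(a, b):\n    return a + b",
    "utils/io_ops.py": "def read_file(path):\n    return open(path).read()",
}
ctx = retrieve_cross_file_context("total = add(x, y)", repo)
print(ctx[0].splitlines()[0])  # → # from utils/math_ops.py
```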
Generation-Augmented Retrieval for Open-domain Question Answering https://arxiv.org/abs/2009.08553
Query2doc: Query Expansion with Large Language Models https://arxiv.org/abs/2303.07678
Explainable AI https://www.mdpi.com/1099-4300/23/1/18
Explainable AI https://www.mdpi.com/1099-4300/23/1/18
Note: Explainable AI is a heavy paper; both presenting groups should focus on the same paper.
Rethinking Interpretability in the Era of Large Language Models https://arxiv.org/abs/2402.01761
Interpretable Machine Learning: Fundamental Principles and 10 Grand Challenges https://arxiv.org/abs/2103.11251
Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach https://arxiv.org/abs/2310.06680
Benchmarking Causal Study to Interpret Large Language Models for Source Code https://arxiv.org/abs/2308.12415
Towards Causal Deep Learning for Vulnerability Detection https://arxiv.org/abs/2310.07958
SafeCoder https://arxiv.org/pdf/2402.09497
LLM on Program Invariants https://drive.google.com/file/d/1t8Veh-JX7xCRtcHcHPmFtnfM38zXK31D/view