The Georgia Tech ACO Student Seminar is run by students in the Algorithms, Combinatorics, & Optimization program at Georgia Tech.
The purpose of this seminar is to keep students updated with current research, and to give students a venue to present their work. Any topic related (but not restricted) to algorithms, combinatorics, and optimization is very welcome. You can present research results, demo a work in progress, or just share something of general interest. There will also be occasional talks by ACO faculty and visitors. (Post-docs are welcome too!)
In Fall 2025, the seminar will meet on Fridays in Skiles 006 from 1-2pm. For more information, please refer to the announcement sent by the organizers (or contact them directly). Subscribe to the mailing list aco-announce (or send an email to sympa@lists.gatech.edu with the subject line "subscribe aco-announce") to receive the announcements regularly.
If you are interested in giving a talk, you can contact any of the organizers: Aiya Kuchukova, Albert Weng, Jade Lintott, Yuexing (April) Niu. (Thanks to Max Dabagia and Sam van der Poel for organizing the seminar with us previously.)
August 22nd: Hoang Huy Nguyen (Georgia Tech)
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2
Abstract: We present AlphaGeometry2, a significantly improved version of AlphaGeometry introduced in Trinh et al. (2024), which has now surpassed an average gold medalist in solving Olympiad geometry problems. To achieve this, we first extend the original AlphaGeometry language to tackle harder problems involving movements of objects, and problems containing linear equations of angles, ratios, and distances. This, together with support for non-constructive problems, has markedly improved the coverage rate of the AlphaGeometry language on International Math Olympiads (IMO) 2000-2024 geometry problems from 66% to 88%. The search process of AlphaGeometry2 has also been greatly improved through the use of Gemini architecture for better language modeling, and a novel knowledge-sharing mechanism that enables effective communication between search trees. Together with further enhancements to the symbolic engine and synthetic data generation, we have significantly boosted the overall solving rate of AlphaGeometry2 to 84% for all geometry problems over the last 25 years, compared to 54% previously. AlphaGeometry2 was also part of the system that achieved silver-medal standard at IMO 2024. Last but not least, we report progress towards using AlphaGeometry2 as a part of a fully automated system that reliably solves geometry problems directly from natural language input.
Joint work with the Google DeepMind team during the IMO run in 2024.
August 29th: Kalen Patton (Georgia Tech)
Online Allocation with Concave DR-Submodular Objectives
Abstract: Online resource allocation problems are central challenges in economics and computer science, modeling situations in which n items arriving one at a time must each be immediately allocated among m agents. In such problems, our objective is to maximize a monotone reward function f(x) over the allocation vector x, which describes the amount of each item given to each agent. In settings where f is concave and has "diminishing returns" (a monotone decreasing gradient), several lines of work over the past two decades have had great success designing constant-competitive algorithms, including the foundational work of Mehta et al. (2005) on the Adwords problem and its follow-ups. Notably, while a greedy algorithm is 1/2-competitive in such settings, these works have shown that one can often obtain a competitive ratio of 1−1/e≈0.632 when items are divisible (i.e. allowing fractional allocations). However, prior works have thus far used a variety of problem-specific techniques, leaving open the general question: does a (1−1/e)-competitive fractional algorithm always exist for online resource allocation problems with concave, diminishing-returns objectives?
In this work, we answer this question affirmatively, thereby unifying and generalizing prior results for special cases. Our algorithm is one which makes continuous greedy allocations with respect to an auxiliary objective U(x). Using the online primal-dual method, we show that if U satisfies a "balanced" property with respect to f, then one can bound the competitiveness of such an algorithm. Our crucial observation is that there is a simple expression for U which has this balanced property for any f, yielding our general (1−1/e)-competitive algorithm.
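To make the mechanism concrete, here is a minimal Python sketch (not the paper's algorithm; the function and parameter names are my own) of a discretized online allocation rule: each arriving unit-size item is routed in small increments toward the agent with the largest gradient coordinate of an auxiliary potential U at the current allocation.

```python
import numpy as np

def online_continuous_greedy(n_items, m, grad_U, delta=0.01):
    """Sketch of an online fractional allocation rule (illustration only).

    Items j = 0, ..., n_items-1 arrive one at a time and are assumed to have
    unit size. Each item is split into increments of size delta, and every
    increment is given to the agent with the largest gradient coordinate of
    an auxiliary potential U at the current allocation. grad_U(x) is assumed
    to return an (m x n_items) array of partial derivatives of U.
    """
    x = np.zeros((m, n_items))   # x[i, j] = fraction of item j given to agent i
    for j in range(n_items):     # item j is revealed; allocate it greedily
        allocated = 0.0
        while allocated < 1.0 - 1e-12:
            marginal = grad_U(x)[:, j]     # current marginal value of item j per agent
            i = int(np.argmax(marginal))   # greedy choice w.r.t. the potential U
            step = min(delta, 1.0 - allocated)
            x[i, j] += step
            allocated += step
    return x
```

With grad_U taken to be the gradient of f itself, this reduces to the plain greedy rule, which is 1/2-competitive; the point of the paper is that a suitably "balanced" choice of the auxiliary U makes such a scheme (1−1/e)-competitive for any concave, diminishing-returns f.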
September 5th: Ruben Ascoli (Georgia Tech)
Polynomial-to-Exponential Transition in Hypergraph Ramsey Numbers
Abstract: Let r_k(s, e; t) denote the smallest N such that any red/blue edge coloring of the complete k-uniform hypergraph on N vertices contains either e red edges among some s vertices, or a blue clique of size t. Erdős and Hajnal introduced the study of this Ramsey number in 1972 and conjectured that for fixed s > k, there is a well defined value h_k(s) such that r_k(s, h_k(s)-1; t) is polynomial in t, while r_k(s, h_k(s); t) is exponential in a power of t. Erdős later offered $500 for a proof.
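In symbols, the conjectured transition can be written as follows (a paraphrase of the statement above; the exponent c > 0 may depend on k and s):

```latex
% Conjectured polynomial-to-exponential transition (Erdős–Hajnal), paraphrased:
% for each fixed s > k there is a threshold h_k(s) such that
r_k\bigl(s,\, h_k(s)-1;\, t\bigr) \;\le\; t^{O(1)},
\qquad\text{while}\qquad
r_k\bigl(s,\, h_k(s);\, t\bigr) \;\ge\; 2^{\,t^{c}} \quad\text{for some } c = c(k,s) > 0 .
```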
Conlon, Fox, and Sudakov proved the conjecture for k=3 and 3-adically special values of s, and Mubayi and Razborov proved it for k at least 4. We prove the conjecture for k=3 and all s, settling all remaining cases of the problem.
Joint work with Xiaoyu He and Hans Yu.
September 19th: No Seminar
Skiles 005 and 006 are reserved for the Colloquium.
September 26th: Aiya Kuchukova (Georgia Tech)
Sampling Colorings with Fixed Color Class Sizes
Abstract: Classical results of Jerrum (1995) and Salas-Sokal (1997) show that proper colorings with $q \geq 2\Delta+1$ colors in graphs of maximum degree $\Delta$ can be efficiently approximately counted and uniformly sampled. For colorings with fixed color class sizes, Kierstead and Kostochka (2007) reproved the existence of equitable colorings (colorings in which class sizes differ by at most one) and gave a polynomial-time algorithm that produces such a coloring using only $q \geq \Delta+1$ colors. In this paper we provide efficient approximate counting and uniform sampling algorithms for colorings with fixed color class sizes that are equitable or close to equitable. The proof uses techniques such as zero-freeness of partition functions, Local Central Limit Theorems, and cluster expansion. We hope our result adds to the growing evidence that fundamental combinatorial objects with global constraints, such as colorings, can be sampled efficiently. Joint work with Will Perkins and Xavier Povill.
The talk will not assume any knowledge of sampling or statistical physics.
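As background for the classical counting/sampling results cited above, here is a minimal Python sketch of Glauber dynamics for proper q-colorings, the kind of Markov chain analyzed by Jerrum (1995). It does not handle the fixed color class sizes that are the subject of this talk, and the function and parameter names are my own.

```python
import random

def glauber_proper_colorings(adj, q, steps, seed=0):
    """Glauber dynamics for proper q-colorings of a graph.

    adj: dict mapping each vertex to a list of its neighbors.
    Starts from a greedy proper coloring (assumes q >= max degree + 1),
    then repeatedly recolors a random vertex with a color not used by
    any of its neighbors. Rapid mixing is known for q >= 2*Delta + 1.
    """
    rng = random.Random(seed)
    vertices = list(adj)
    # Greedy initial proper coloring.
    color = {}
    for v in vertices:
        used = {color[u] for u in adj[v] if u in color}
        color[v] = next(c for c in range(q) if c not in used)
    # Glauber updates: pick a vertex, resample its color among allowed ones.
    for _ in range(steps):
        v = rng.choice(vertices)
        forbidden = {color[u] for u in adj[v]}
        allowed = [c for c in range(q) if c not in forbidden]
        color[v] = rng.choice(allowed)
    return color

# Example: sample a 5-coloring of a 4-cycle.
cycle = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}
print(glauber_proper_colorings(cycle, q=5, steps=1000))
```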
October 3rd: Caleb McFarland (Georgia Tech)
What do signed graphs have to do with integer programming?
TBD
Let A be a matrix and A_p its best rank-p approximation. Computing A_p, for a relatively small p, is a task of fundamental importance. One often stores A_p and uses it as input in many downstream tasks. Typically, we want to choose p so that A_p captures most of the energy of A, namely $\|A - A_p\|_F < 0.1\,\|A\|_F$, say.
In practice, data is noisy and incomplete. Thus, one only has access to a matrix A' = A + E, where E represents noise. The question is to find a low-rank matrix B, given A', such that $\|A - B\|_F < 0.1\,\|A\|_F$, say. (We do not know p.)
We propose a very simple algorithm for this task, based on the idea of "energy shrinkage". The core of the method is a new perturbation bound for the difference $\|A'_p - A_p\|$, which significantly improves the bounds obtained using the Eckart-Young-Mirsky theorem. This is one of many recent perturbation bounds obtained via the contour expansion method, introduced by Tran and the speaker a few years ago.
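To make the "capture most of the energy" criterion concrete, here is a minimal numpy sketch (my own illustration, on the clean matrix A; the talk concerns the noisy setting) that computes the best rank-p approximation via the SVD and picks the smallest p meeting the 10% relative error threshold.

```python
import numpy as np

def best_rank_p(A, p):
    """Best rank-p approximation of A in Frobenius norm (Eckart-Young-Mirsky)."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return (U[:, :p] * s[:p]) @ Vt[:p, :]

def smallest_good_rank(A, rel_err=0.10):
    """Smallest p with ||A - A_p||_F <= rel_err * ||A||_F.

    Uses the identity ||A - A_p||_F^2 = sum of squared singular values
    beyond the p-th, so no explicit reconstruction is needed.
    """
    s = np.linalg.svd(A, compute_uv=False)
    total = np.sum(s**2)
    tail = total
    for p, sv in enumerate(s, start=1):
        tail -= sv**2
        if tail <= (rel_err**2) * total:
            return p
    return len(s)

# Example on a random 100 x 50 matrix.
A = np.random.default_rng(0).normal(size=(100, 50))
p = smallest_good_rank(A)
print(p, np.linalg.norm(A - best_rank_p(A, p)) / np.linalg.norm(A))
```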
October 24th: Abhishek Shetty, Ph.D. (MIT, future Georgia Tech)
Reasoning in/about Language Models: Perspectives from TCS
Abstract: The success of modern Large Language Models (LLMs) is a technological breakthrough, but the foundations on which it is built have remained largely a mystery. A foundational understanding would vastly improve our ability both to diagnose potential issues (e.g. from a safety perspective) and to push the capabilities of these models (e.g. in challenging settings such as reasoning and scientific discovery). Towards this broad goal, I will speak about two recent directions, both inspired by classical work in the theory of computation, with the aim of conveying the importance of algorithmic insights even in the setting of LLMs.
First, we ask how we can elicit new behavior from language models aimed at tasks such as reasoning. A paradigm that has recently gained popularity is process verification, which aims to use guidance during language model generation. Though effective in principle, standard methods of this kind suffer from error that compounds with the length of the generated sequence, an issue that is particularly problematic given the increasing horizon of modern reasoning tasks. We present a new perspective on this problem by connecting the guidance problem to fundamental problems in approximate counting and sampling, and present an algorithm, which we dub VGB, that circumvents the compounding of error with the horizon.
Second, we look at the problem of understanding the structure of language and of models thereof. Again drawing inspiration from a classical area of TCS, the study of latent variable models, we present a new perspective on sequence models, which we call (extended) low logit rank. We demonstrate empirically that models in practice exhibit this structure and use it to demonstrate surprising consequences, such as sampling from language models by querying unrelated prompts. Motivated by this, we present theoretical results such as equivalence to low-dimensional latent variable models, as well as expressivity and learning guarantees. In particular, we present this as an approach to theoretically understanding the structure in large language models.
In summary, through this talk I aim to convey how perspectives from the theory of computation can inspire principled approaches to fundamental and pressing problems in the study of large language models. This talk is based on joint work with Dhruv Rohatgi, Donya Saless, Yunchen Li, Ankur Moitra, Andrej Risteski, Dylan Foster, Noah Golowich, and Allen Liu.
Bio: Abhishek Shetty is the incoming Catherine M. and James E. Allchin Early-Career Assistant Professor in Computer Science at Georgia Tech, starting in Fall 2026. Currently, he is a FODSI postdoctoral fellow at the Massachusetts Institute of Technology, hosted by Costis Daskalakis, Ankur Moitra, and Sasha Rakhlin; previously, he was a PhD student at the University of California, Berkeley, advised by Nika Haghtalab. His research focuses on building fundamental connections between the theory of computation and machine learning, with particular interest in understanding the algorithmic and statistical role of data in generalization and sequential decision making. His research has been recognized with an Apple AI/ML fellowship and an ASA SCGS best student paper award.
October 31st: Jacob Platnick (Georgia Tech)
Redistricting on dense random graphs
TBD