Probability Seminar

FMI & IMAR

This seminar is intended as a working seminar and its purpose is to be accessible to anyone with some probability background.

It is primarily organized by the Faculty of Mathematics of University of Bucharest and the Institute of Mathematics "Simion Stoilow" of the Romanian Academy in hybrid format.

The main talks are geared toward people who are not specialists in the area but it does not exclude more specialized topics. The schedule is posted here.

The seminar is on Mondays 10-11am at IMAR, Ciprian Foias room (804, eight floor).

The google meet link is this: meet.google.com/zqf-wrka-cpq

Talks 2024

3/11/2024 (Dan Paraschiv IMAR) Title: Multiply connected Fatou components of rational maps

Historically, the only way to obtain Fatou components of finite connectivity higher than 2 has been by means of quasiconformal surgery. We provide a family of rational maps such that, for specific parameters, the corresponding dynamical planes contain Fatou components of arbitrarily large connectivity. We also introduce a family obtained by using the Chebyshev-Halley family of numerical methods. We show the existence of parameters such that all Fatou components are infinitely connected.

Talks 2023

12/18 (Martin Fronescu FMI) Magics of the Rubik's cube

Instead of abstract: Martin is a famous figure among the cubers in Romania for his incredible performance in solving the Rubiks cube. He will tell us about the various aspects of the Rubiks cube, including fast solvings, algorithms and the conjectures, now theorems on how many moves one needs to solve the cube. His profile on the the World Cube Association is here: Martin's WCA page. Here is the presentation of Martin.

12/11/2023 (Viorel Costeanu, JP Morgan) Analytical approximations with Strang Splitting NOTICE THE TIME 15:30 not at the usual time 10:00 AND THIS WILL BE ONLY ONLINE.

Strang splitting is the statement that for 2 non-commuting matrices or operators the approximation exp(t(A+B)) ~ exp(tA/2)exp(tB)exp(tA/2) is second order accurate. This approximation is used in the design of finite difference schemes in order to effectively lower the local dimensionality of a problem. We demonstrate a novel application of Strang splitting where we reduce the calculation of the kernel of a complex stochastic differential equation to the kernel of a simpler one and the solution of an ordinary differential equation. We show that in cases of interest we can arrange so that the simpler kernel is known in closed form as either the regular heat kernel or the heat kernel of a hyperbolic space, and the ODE is either analytically solvable or can be very efficiently solved numerically.

11/20/2023 (Ionel Popescu) The fundamental theory of machine learning, the regression case II

I will give an introduction to PAC learning and then introduce the most common version, namely the fundamental theorem of PAC learning for classification. This is well known. I will discuss also the case of less known case of regression model. On the way we will discuss a twist which is given by the McDiarmind inequality and how one can guarantee a good sample size is determined.

11/13/2023 (Ionel Popescu) The fundamental theory of machine learning, the regression case

I will give an introduction to PAC learning and then introduce the most common version, namely the fundamental theorem of PAC learning for classification. This is well known. I will discuss also the case of less known case of regression model. On the way we will discuss a twist which is given by the McDiarmind inequality and how one can guarantee a good sample size is determined.

11/06/2023 (Maximilian Sebastian Janisch, University of Zurich) Review of generative models from the mathematical and computer science perspective.

A branch of machine learning, generative models, has gained a lot of academic and public attention in the recent past. Notable cases of generative models are transformer-based large language models such as ChatGPT, as well as image-generating diffusion models such as Dall-E. Mathematically, generative models aim to “learn” a probability distribution after observing independent samples from this distribution. This talk will outline various methodologies to do so, notably (in order of recency) Restricted Boltzmann machines, Variational Auto-Encoders, Generative Adversarial Networks, and diffusion models,. I will also discuss applications to medical research as well as mathematical questions on the efficacy of such generative models.

10/16/2023 (Gabriel Majeri, Univ. of Bucharest) slides: here

Title: Introduction to Statistical Learning Theory: the Vapnik-Chervonenkis Theorem II

A fundamental problem in machine learning is that of generalization: how can we ensure the models we train on a finite sample perform well over the entire distribution of the data? One of the earliest theoretical results in this direction was given in 1971 by the Soviet mathematicians Vladimir Vapnik and Alexey Chervonenkis. This talk will cover the formal definition of the learning problem in the Empirical Risk Minimization (ERM) framework, the VC dimension of a hypothesis set and the statement and proof of the fundamental inequality of VC theory. If time permits, we will also discuss how these ideas lead to the implementation of Support Vector Machines (SVMs).

The only required prerequisites are basic measure and probability theory.

10/9/2023 (Gabriel Majeri, Univ. of Bucharest)

Title: Introduction to Statistical Learning Theory: the Vapnik-Chervonenkis Theorem I

A fundamental problem in machine learning is that of generalization: how can we ensure the models we train on a finite sample perform well over the entire distribution of the data? One of the earliest theoretical results in this direction was given in 1971 by the Soviet mathematicians Vladimir Vapnik and Alexey Chervonenkis. This talk will cover the formal definition of the learning problem in the Empirical Risk Minimization (ERM) framework, the VC dimension of a hypothesis set and the statement and proof of the fundamental inequality of VC theory. If time permits, we will also discuss how these ideas lead to the implementation of Support Vector Machines (SVMs).

The only required prerequisites are basic measure and probability theory.

References:

- "On The Uniform Convergence of Relative Frequencies of Events to Their Probabilities", V. N. Vapnik and A. Ya. Chervonenkis

- "A Probabilistic Theory of Pattern Recognition", L. Devroye, L. Györfi and G. Lugosi

- "Foundations of Machine Learning", M. Mohri, A. Rostamizadeh and A. Talwalkar

9/26/2023 at 12pm (Arnulf Jentzen, the Chinese Univ. of Hong Kong, China & Univ. of Münster, Germany) (Notice the date and time change) This is also the monthly conference at IMAR and it will take place in Miron Nicolescu Amphitheater at the ground level

Title: Overcoming the course of dimensionality: from nonlinear Monte Carlo to the training of neural networks

Partial differential equations (PDEs) are among the most universal tools used in modelling problems in nature and man-made complex systems. Nearly all traditional approximation algorithms for PDEs in the literature suffer from the so-called "curse of dimensionality" in the sense that the number of required computational operations of the approximation algorithm to achieve a given approximation accuracy grows exponentially in the dimension of the considered PDE. With such algorithms it is impossible to approximatively compute solutions of high-dimensional PDEs even when the fastest currently available computers are used. In the case of linear parabolic PDEs and approximations at a fixed space-time point, the curse of dimensionality can be overcome by means of Monte Carlo approximation algorithms and the Feynman-Kac formula. In this talk we present an efficient machine learning algorithm to approximate solutions of high-dimensional PDE and we also prove that deep artificial neural network (ANNs) do indeed overcome the curse of dimensionality in the case of a general class of semilinear parabolic PDEs. Moreover, we specify concrete examples of smooth functions which cannot be approximated by shallow ANNs without the curse of dimensionality, but which can be approximated by deep ANNs without the curse of dimensionality. In the final part of the talk we present some recent mathematical results on the training of neural networks.

9/18/2023 (Tushar Vaidya Singapore) Pathwise Quantum Lasso Regression

We study the high dimensional aspect of linear regression with an ℓ1 penalty. While classical, numerical algorithms are available for Lasso, our focus is on developing a hybrid quantum algorithm that offers new insights and speedup. Quadratic speedup is theoretically possible over the classical Homotopy (Least Angle Regression) method. In particular, we provide a general setup for Lasso solutions as the penalty term varies. Several challenges remain in creating such an algorithm. The task is fraught with difficulties. Still the pursuit is worthwhile and we will elucidate how to go about this. The talk should be accessible to those without knowledge of quantum computing!

6/12/2023 (Ionel Popescu) Accelerated methods for optimization

I will present some results of accelerated methods for optimization of a convex function using some accelerated methods. There are some classical, recent and very recent results which deserve attention. The papers I am going to base my talk:

Accelerated variational methods and the interesting paper by Convergence for Heavy Ball methody and also this FISTA for strongly convex functions

5/24/2023 (Max von Renesse, Leipzig University) NOTICE THE TIME: 4pm, Entropic Regularization for General Linear Programs

We revisit the problem of unbalanced optimal transport for which we introduce an analogue of the the Schr"odinger problem leading to its entropic regularization which then can be solved via by iterative scaling similar to the Sinkhorn algorithm from the standard balanced case. It turns out that entropic regularization and iterative scaling as computational tool can be applied to a much larger class of linear programs.

5/22/2023 (Heinrich Matzinger, Georgia Tech) NOTICE THE TIME CHANGE 11-12: Problems with deep neural networks for image recognition

We show how the Convolutional Neural Networks do not use global shape for recognizing objects but use micro-features, which can lead to errors in more involved industry related problems.

5/15/2023 (Evgnosia Kelesidis) FastICA

This is about some classical aspects of ICA (independent component analysis) with some by now standard algorithms.

5/08/2023 (Ionel Popescu) About ICA and related conjectures

I will discuss some conjectures related to independent component analysis with k speakers and p microphones. The interesting phenomena is when the number of microphones is much smaller than the number of speakers and we will see some conjectures around this.

3/27/2023 (Razvan Moraru) Bazele Cuantificării

Pe parcursul seminarului vor fi prezentate bazele cuantizării geometrice, cuprinzând definirea spațiului Hilbert precuantic, modul de utilizare al polarizărilor, precum și relevanța acestora în contextul teoriei probabilităților.

Odată ce a fost stabilită o modalitatea precisă de a asocia o dinamică unitară unui curent diferențial, vom încerca să înțelegem corespondența dinamicii cuantice cu dinamica stocastică rezultată prin aplicarea transformatei Wigner-Moyal.

3/13/2023 ( Rishabh Bhardwaj, Singapore) Language Models

Language Models (LMs) aim to model a probability distribution over a sequence of words. This simple setting and its variants have shown a huge potential in solving many AI applications dealing with natural language as well as other modalities. In this presentation, we will discuss how LMs evolved from the perspective of model architectures, including classical RRNs and advanced Transformers. We will cover various algorithms to transfer the learning of an LM to solve a given downstream task efficiently. We will also touch upon emerging approaches to prune large models, preserve user-private information, and de-biasing techniques. In the end, we will discuss the basics of training methods adopted by recent and widely famous systems such as ChatGPT with a large LM as its backbone.

3/06/2023 (Ionel Popescu, FMI and IMAR) The Fundamental Theorems of Mathematical Finance III

This is the continuation of the previous two lectures on mathematical finance. I will finally arrive at the continuous case and discuss the Black-Scholes equation appearence and eventually how one can solve it.

20/27/2023 (Andrei Comaneci, Berlin) Tropical Geometry in Data Analysis and Machine Learning

Tropical geometry deals with certain piecewise linear geometric

objects with rich combinatorial structure. Its algebraic roots can be

traced from combinatorial optimization and dynamic programming from

1960s, but the geometric viewpoint arose from algebraic geometry at

the beginning of 21st century. This led to various connections to

other subjects, including computational biology and machine learning.

In this talk we will present the basic notions from tropical geometry

that appear mostly in data science. On one hand, we focus on tropical

convexity, which are important in phylogenetics. The data consists of

evolutionary trees which can be seen as points in a certain tropically

convex set. On the other hand, we will focus on tropical hypersurfaces and its

combinatorial properties. They were recently studied in their

relationship to neural networks with ReLU activation function. This

led to complexity results in deep learning theory.

2/20/2023 (Ionel Popescu, FMI & IMAR) The Fundamental Theorems of Mathematical Finance II

I will present a simple introduction to the fundamental theorem of mathematical finance. I will do it first in the case of binomial model in the discrete case where the main concept of no arbitrage market plays the central role which in turn yields the main result. In the continous case, this is more involved, but the fundamental principle is almost the same. In this second round of the seminar, the goal is to see the how the Black-Sholes ecuation appears.

2/13/2023 (Ionel Popescu, FMI & IMAR) The Fundamental Theorems of Mathematical Finance I

I will present a simple introduction to the fundamental theorem of mathematical finance. I will do it first in the case of binomial model in the discrete case where the main concept of no arbitrage market plays the central role which in turn yields the main result. In the continous case, this is more involved, but the fundamental principle is almost the same.

2/06/2023 (Razvan Moraru, IMAR) Operatorii Lévy. Dinamica stocastică și dinamica Cuantică. III

Abstract:

Conținutul este structurat pe durata a două seminarii în modul următor:

Cu ocazia primului seminar vom pleca de la definiția Proceselor Lévy. Un exemplu de proces Lévy este și procesul de difuzie cu salturi, vom vedea că funcția sa caracteristică are forma prescrisă de Teorema Lévy-Kincin.

- O consecință a faptului că formula Lévy-Kincin caracterizează exponentul caracteristic al unei distribuții in(de)finit divizabile, iar procesele Lévy au distribuții de acest fel. Mai general, vom vedea cum formula Lévy-Itô de descompunere este inerent legată de Teorema Levy-Kincin

- De asemenea vom vedea faptul că procesele Lévy sunt procese Markov, având semigrupul de tranziție asociat chiar un semigrup de convoluție, al cărui generator infinitezimal este un Operator Lévy, rezolvă problema Martingalului și constituie un operator pseudodiferențial, al cărui simbol este întocmai exponentul caracteristic prescris de formula Lévy-Kincin.

O formă aparent identică cu cea a Operatorilor Lévy vom regăsi pe parcursul celui de-al doilea seminar pentru Operatorii Weyl ai Mecanicii Cuantice.

- În cel de-al doilea seminar vom prezenta legătura inerentă dintre formalismul Hamiltonian al Mecanicii Cuantice, geometria simplectică și operatorii Mecanicii Cuantice. Voi prezenta bazele Cuantificării Geometrice - mai precis, voi prezenta cuantificarea Weyl; astfel încât la sfârșitul celui de-al doilea seminar vom putea avea o imagine unitară a conexiunii dintre:

- semigrupul de simplectomorfisme (curentul diferențial) asociat unui sistem dinamic în geometria diferențială/mecanica clasică

- semigrupul de operatori unitari asociat evoluției în timp a unei observabile în Mecanica Cuantică

- semigrupul de tranziție asociat unui proces Markov în analiza stocastică.

Ca un pas intermediar, în primul seminar vom discuta și despre aplicarea metodei dilatărilor (P.Halmos) pentru a pune în corespondență dinamica unui lanț Markov și dinamica unui sistem cuantic reprezentat pe o sferă complexă. Aspectele acestei conexiuni nu au doar o valoare pur-teoretică, ele pot fi aplicate în modelarea de qubits - pe sfera Bloch, respectiv în calculul cuantic sau în procedee precum tomografia cuantică.

În plus pe această cale este devoalată o legătură subtilă între grafuri (asociate dinamicii unui lanț Markov) și spațiile proiective complexe, respectiv sferele cuantice (asociate dinamicii cuantice unitare).

1/30/2023 (Razvan Moraru, IMAR) Operatorii Lévy. Dinamica stocastică și dinamica Cuantică. II