Apr 21, 2025
Monday
|
09:15 AM - 09:30 AM
|
|
Welcome to SLMath
|
- Location
- SLMath: Eisenbud Auditorium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
09:30 AM - 10:30 AM
|
|
The broken sample problem revisited: Proof of a conjecture by Bai and Hsing and extensions
Yihong Wu (Yale University)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
--
- Abstract
Zoom Link
We revisit the classical broken sample problem: Two samples of iid data points X={X_1,...,X_n} and Y={Y_1,...,Y_m} are observed without correspondence, with m \leq n. The goal is to distinguish the following two hypotheses: Under the null hypothesis, X and Y are independent. Under the alternative hypothesis, X and Y are correlated in the sense that (X_{\pi(i)}, Y_i)'s are drawn independently from a bivariate distribution for some latent injection \pi: [m] \to [n]. Originally introduced by DeGroot, Feder, and Goel [AoMS 1971] to model matching records in census data, this problem has recently gained renewed interest due to its applications in data de-anonymization, data integration, and target tracking. Despite extensive research over the past decades, determining the precise detection threshold has remained an open problem even for equal sample sizes (m=n). Assuming m and n grow proportionally, in this work, we show that the sharp threshold is given by a spectral and an L_2 condition of the likelihood ratio operator, resolving a conjecture of Bai and Hsing [PTRF 2005] in the positive. Extensions to high dimensions and linear models are also discussed. This is based on joint work with Shuyang Gong (Peking University), Simiao Jiao (Duke), and Jiaming Xu (Duke).
- Supplements
-
--
|
10:30 AM - 11:00 AM
|
|
Morning Break
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
11:00 AM - 12:00 PM
|
|
Testing between planted distributions and connections to recovery
Fiona Skerman (Uppsala University)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
- Abstract
Zoom Link
- Supplements
-
--
|
12:00 PM - 02:00 PM
|
|
Lunch
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
02:00 PM - 03:00 PM
|
|
Inference on recursive trees with a community structure
Anna Ben-Hamou (Sorbonne Université)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
--
- Abstract
Zoom Link
- Supplements
-
--
|
03:00 PM - 03:30 PM
|
|
Afternoon Tea
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
03:30 PM - 04:30 PM
|
|
Detection and Estimation in Multi-Index models
Florent KRZAKALA (Ecole Polytechnique federale de Lausanne)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
- Abstract
Zoom Link
- Supplements
-
--
|
|
Apr 22, 2025
Tuesday
|
09:30 AM - 10:30 AM
|
|
Lecture
Morgane Austern (Harvard University)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
--
- Abstract
Zoom Link
- Supplements
-
--
|
10:30 AM - 11:00 AM
|
|
Morning Break
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
11:00 AM - 12:00 PM
|
|
Exact Label Recovery in Euclidean Random Graphs
Julia Gaudio (Northwestern University)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
- Abstract
Zoom Link
We propose a family of label recovery problems on weighted Euclidean random graphs. The vertices of a graph are embedded in R^d according to a Poisson point process, and are assigned to a discrete community label. Our goal is to infer the vertex labels, given edge weights whose distributions depend on the vertex labels as well as their geometric positions. Our general model provides a geometric extension of popular graph and matrix problems, including submatrix localization and Z_2-synchronization, and includes the Geometric Stochastic Block Model (proposed by Sankararaman and Baccelli) as a special case. We study the fundamental limits of exact recovery of the vertex labels. Under a mild distinctness of distributions assumption, we determine the information-theoretic threshold for exact label recovery, in terms of a Chernoff-Hellinger divergence criterion. Impossibility of recovery below the threshold is proven by a unified analysis using a Cramer lower bound. Achievability above the threshold is proven via an efficient two-phase algorithm, where the first phase computes an almost-exact labeling through a local propagation scheme, while the second phase refines the labels. The information-theoretic threshold is dictated by the performance of the so-called genie estimator, which decodes the label of a single vertex given all the other labels. This shows that our proposed models exhibit the local-to-global amplification phenomenon. Joint work with Charlie Guan, Xiaochun Niu, and Ermin Wei.
- Supplements
-
--
|
12:00 PM - 02:00 PM
|
|
Lunch
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
02:00 PM - 03:00 PM
|
|
Sharp Phase Transitions in Estimation with Low-Degree Polynomials
Alex Wein (University of California, Davis)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
- Abstract
Zoom Link
In the stochastic block model (a canonical model for finding hidden communities in random networks), it has long been conjectured that the so-called Kesten-Stigum (KS) threshold marks the boundary between the "easy" (polynomial-time solvable) and "hard" (computationally intractable) regimes. We substantiate this conjecture by proving a sharp transition at the KS bound for a restricted class of estimators, namely those that compute low-degree polynomials in the entries of the adjacency matrix. This builds on a long line of work that uses low-degree polynomials as a proxy for efficient computation, but our work is the first to establish "sharp" thresholds for the estimation (rather than detection) task. I will also discuss some unexpected phenomena that emerge when the number of communities grows with the problem size. This is based on multiple joint works with Youngtak Sohn, Byron Chin, and Elchanan Mossel.
- Supplements
-
--
|
03:00 PM - 03:30 PM
|
|
Afternoon Tea
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
03:30 PM - 04:30 PM
|
|
Attributed Stochastic Block Models as Benchmark Datasets for Theoretical Analysis of Graph Neural Networks
Lenka Zdeborova
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
- Abstract
Zoom Link
- Supplements
-
--
|
04:30 PM - 06:20 PM
|
|
Reception
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
|
Apr 23, 2025
Wednesday
|
09:30 AM - 10:30 AM
|
|
A Proof of The Changepoint Detection Threshold Conjecture in Preferential Attachment Models
Jiaming Xu (Duke University)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
- Abstract
Zoom Link
We investigate the problem of detecting and estimating a changepoint in the attachment function of a network evolving according to a preferential attachment model on $n$ vertices, using only a single final snapshot of the network. Bet et al. show that a simple test based on thresholding the number of vertices with minimum degrees can detect the changepoint when the change occurs at time $n-\Omega(\sqrt{n})$. They further make the striking conjecture that detection becomes impossible for any test if the change occurs at time $n-o(\sqrt{n}).$ Kaddouri et al. make a step forward by proving the detection is impossible if the change occurs at time $n-o(n^{1/3}).$ In this paper, we resolve the conjecture affirmatively, proving that detection is indeed impossible if the change occurs at time $n-o(\sqrt{n}).$ Furthermore, we establish that estimating the changepoint with an error smaller than $o(\sqrt{n})$ is also impossible, thereby confirming that the estimator proposed in Bhamidi et al. is order-optimal.
- Supplements
-
--
|
10:30 AM - 11:00 AM
|
|
Morning Break
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
11:00 AM - 12:00 PM
|
|
Algorithms to go back in time: network archaeology in random graphs
Simon Briend (Unidistance Suisse)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
- Abstract
Zoom Link
- Supplements
-
--
|
|
Apr 24, 2025
Thursday
|
09:30 AM - 10:30 AM
|
|
Barriers to online algorithms for optimization from the overlap gap property. Three case studies
David Gamarnik (Massachusetts Institute of Technology)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
- Abstract
Zoom Link
Many optimization problems over random structures (such as random graphs and some statistical physics models) exhibit a gap between the existential and algorithmically achievable values, dubbed as statistics-to-computation gap. For some of these problems the best known algorithms exhibit a certain online features: the decisions are made sequentially based on partially revealing information about the problem input step by step. We will exhibit three examples where we prove the following: subject to such online restrictions, these algorithms are best. The proof is based on establishing variants of the overlap gap property (ogp), inspired by the theory of spin glasses, tailored for each individual problem. Previously ogp was used to prove barriers to stable algorithms. While algorithms under the consideration are not stable, they are still obstructed by ogp, as we demonstrate. Three examples to be considered are (a) finding largest clique in dense random graphs, (b) finding a dense submatrix of a Gaussian matrix, and (c) the symmetric binary perceptron problem.
- Supplements
-
--
|
10:30 AM - 10:40 AM
|
|
Group Photo
|
- Location
- SLMath: Front Courtyard
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
10:40 AM - 11:00 AM
|
|
Morning Break
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
11:00 AM - 12:00 PM
|
|
Exchangeability and vertex arrival times
Peter Orbanz (University College London)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
--
- Abstract
Zoom Link
- Supplements
-
--
|
12:00 PM - 02:00 PM
|
|
Lunch
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
02:00 PM - 03:00 PM
|
|
Embedding in high-dimensional random geometric graphs
Shuangping Li (Stanford University)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
--
- Abstract
Zoom Link
- Supplements
-
--
|
03:00 PM - 03:30 PM
|
|
Afternoon Tea
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
03:30 PM - 04:30 PM
|
|
Causal effect estimation under inference using mean field methods
Subhabrata Sen (Harvard University)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
- Abstract
Zoom Link
We will discuss causal effect estimation from observational data under interference. We adopt the chain-graph formalism of Tchetgen-Tchetgen et. al. (2021). Under “mean-field” assumptions on the interaction networks, we will introduce novel algorithms for causal effect estimation using Naive Mean Field approximations and Approximate Message Passing. Our algorithms are provablyconsistent under a “high-temperature” assumption on the underlying model.Finally, we will discuss parameter estimation in these models using maximum pseudo-likelihood, and establish the consistency of the downstream plug-in estimator.
Based on joint work with Sohom Bhattacharya (U Florida).
- Supplements
-
--
|
|
Apr 25, 2025
Friday
|
09:30 AM - 10:30 AM
|
|
The reasonable effectiveness of continuous time branching processes in understanding evolving network models
Shankar Bhamidi (University of North Carolina)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
- Abstract
Zoom Link
A wide array of network growth models have been proposed across various domains as test beds to understand questions such as the effect of network change point (when a shock to the network changes the probabilistic rules of its evolution) or the role of attributes in driving the emergence of network structure and subsequent centrality measures in real world systems.
The goal of this talk will be to describe two specific settings where continuous time branching processes give mathematical insight into asymptotic properties of such models. In the first setting, a natural network change point model can be directly embedded into continuous time thus leading to an understanding of long range dependence of the initial network system on subsequent properties imply the difficulty in understanding and estimating network change point. In the second application, we will describe a notion of resolvability where convergence of a simple macroscopic functional in a model of networks with vertex attributes, coupled with stochastic approximation techniques implies local weak convergence of a standard model of nodal attribute driven network evolution to a limit infinite random structure driven by a multitype continuous time branching process. In the second setting, continuous time branching processes naturally emerge in the limit. If time permits, we will describe a final setting of network evolution with delay where once again such processes arise only in the limit.
- Supplements
-
--
|
10:30 AM - 11:00 AM
|
|
Morning Break
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
11:00 AM - 12:00 PM
|
|
Estimating the number of communities in networks
Andressa Cerqueira (Universidade Federal de São Carlos)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
- Abstract
Zoom Link
Network models have received increasing attention from the statistical community, in particular in the context of analyzing and describing the interactions of complex random systems. In this context, community structures can be observed in many networks, where the nodes are clustered into communities with the same connection patterns. In this talk, we will discuss the problem of estimating the number of communities for the Stochastic Block Model (SBM) and the Degree Corrected Stochastic Block Model (DCSBM). In general, we will discuss penalized methods used to infer the number of communities given a network.
- Supplements
-
--
|
12:00 PM - 02:00 PM
|
|
Lunch
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
02:00 PM - 03:00 PM
|
|
Coherence-free Entrywise Estimation of Eigenvectors in Low-rank Signal-plus-noise Matrix Models
Keith Levin (University of Wisconsin-Madison)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
- Abstract
Zoom Link
Spectral methods are widely used to estimate eigenvectors of a low-rank signal matrix subject to noise. These methods use the leading eigenspace of an observed matrix to estimate this low-rank signal. Typically, the entrywise estimation error of these methods depends on the coherence of the low-rank signal matrix with respect to the standard basis. In this work, we present a novel method for eigenvector estimation that avoids this dependence on coherence. Assuming a rank-one signal matrix, under mild technical conditions, the entrywise estimation error of our method provably has no dependence on the coherence under Gaussian noise (i.e., in the spiked Wigner model), and achieves the optimal estimation rate up to logarithmic factors. Simulations demonstrate that our method performs well under non-Gaussian noise and that an extension of our method to the case of a rank-r signal matrix has little to no dependence on the coherence. In addition, we derive new metric entropy bounds for rank-r singular subspaces under the two-to-infinity distance, which may be of independent interest. We use these new bounds to improve the best known lower bound for rank-r eigenspace estimation under two-to-infinity distance.
- Supplements
-
--
|
03:00 PM - 03:30 PM
|
|
Afternoon Tea
|
- Location
- SLMath: Atrium
- Video
-
--
- Abstract
- --
- Supplements
-
--
|
03:30 PM - 04:30 PM
|
|
Towards Interpretable and Trustworthy Network-Assisted Prediction
Elizaveta Levina (University of Michigan)
|
- Location
- SLMath: Eisenbud Auditorium, Online/Virtual
- Video
-
--
- Abstract
Zoom Link
When training data points for a prediction algorithm are connected by a network, it creates dependency, which reduces effective sample size but also creates an opportunity to improve prediction by leveraging information from neighbors. Multiple prediction methods on networks taking advantage of this opportunity have been developed, but they are rarely interpretable or have uncertainty measures available. This talk will cover two contributions bridging this gap. One is a conformal prediction method for network-assisted regression. The other is a family of flexible network-assisted models built upon a generalization of random forests (RF+), which both achieves competitive prediction accuracy and can be interpreted through feature importance measures. Importantly, it allows one to separate the importance of node covariates in prediction from the importance of the network itself. These tools help broaden the scope and applicability of network-assisted prediction to practical applications.
This talk is based on joint work with Robert Lunde, Tiffany Tang, and Ji Zhu.
- Supplements
-
--
|
|