Program Schedule

Wednesday

Speaker 1 9:00-10:00 Bastian Rieck: Shapes, Spaces, Simplices, and Structure: Geometry, Topology, and Machine Learning
Break 10:00-10:15
Oral A1 10:15-10:35 Tenorio et al.: Tracking Network Dynamics using Probabilistic State-Space Models
Oral A2 10:35-10:55 Misiakos & Püschel: Learning Graphs from Structural Vector Autoregressions with Sparse Input
Oral A3 10:55-11:15 Vlas et al.: Online Network Inference from Graph-Stationary Signals with Hidden Nodes
Speaker 2 11:30-12:30 Reihaneh Rabbany: Benchmarks and Evaluation in Temporal Graph Learning
Lunch 12:30-14:00
Oral B1 14:00-14:20 Holden & Ruiz: Leveraging Line Graph Representations to Improve GNN Convergence Rates
Oral B2 14:20-14:40 Bello et al.: HyperNATE: Scaling Tensor-Based Hypergraph Neural Networks Through Attention
Oral B3 14:40-15:00 Park et al.: FedBaF: Federated Learning Aggregation Biased by a Foundation Model
Break 15:00-15:15
Speaker 3 15:15-16:15 Danai Koutra: Not all Neighbors Agree: Graph Learning Beyond Homophily
Poster Session 1 16:30-17:30
Dinner 18:30-20:30

Thursday

Speaker 1 9:00-10:00 Mathilde Papillon: Make Any Graph Neural Network Go Topological with TopoTune
Break 10:00-10:15
Oral C1 10:15-10:35 Rhodes & Rustad: Multimodal Graphs for Diffusion-Based Manifold Alignment
Oral C2 10:35-10:55 Le et al.: Landmark-Based Node Embeddings and Graph Distance Approximations
Oral C3 10:55-11:15 Shen & Kolaczyk: Consistent Identification of Top-K Nodes in Noisy Networks
Lunch 11:15-13:00
Speaker 2 13:00-14:00 Melanie Weber: A Geometric Lens on Challenges in Graph Machine Learning: Insights and Remedies
Poster Session 2 14:00-15:00
Travel to Excursion 15:00-16:00
Excursion 16:00-19:00

Friday

Speaker 1 9:00-10:00 Dhananjay Bhaskar: Beyond Message Passing: Learning Representations from Dynamics on Graphs
Break 10:00-10:15
Oral D1 10:15-10:35 Shi & Moura: Graph Signal Processing: Frequency Analysis for Similar Matrices
Oral D2 10:35-10:55 Do et al.: Interpretable Lightweight Transformer via Unrolling of Learned Graph Smoothness Priors
Oral D3 10:55-11:15 Mutnuri et al.: Causal Analysis of Graph Signals for Brain Effectome Inference
Speaker 2 11:30-12:30 Naoki Saito: Generalized Haar-Walsh Dictionaries: Extensions and Applications
Lunch 12:30-14:00
TMLR 1 14:00-14:20 Skenderi et al.: Graph-level Representation Learning with Joint-Embedding Predictive Architectures
TMLR 2 14:20-14:40 Franks et al.: Towards Graph Foundation Models: A Study on the Generalization of Positional and Structural Encodings
TMLR 3 14:40-15:00 Hu et al.: Sparse Decomposition of Graph Neural Networks
TMLR 4 15:00-15:20 Rumiantsev & Coates: Graph Knowledge Distillation to Mixture of Experts
Break 15:20-15:30
Speaker 3 15:30-16:30 Dominique Beaini: How to Learn Molecules

General Information

Talks in the oral sessions are 20 minutes long (including Q&A). We will have two poster sessions: (a) Wednesday 16:30-17:30 and (b) Thursday 14:00-15:00. The poster format is flexible, but A0 size and portrait orientation are recommended.

Our main excursion will be at the Montreal Botanical Garden. Stay tuned for more info!


Talks

Wednesday Speaker 1, 9:00 - 10:00

Bastian Grossenbacher Rieck, University of Fribourg

Title: Shapes, Spaces, Simplices, and Structure: Geometry, Topology, and Machine Learning

Abstract: A large driver contributing to the undeniable success of deep-learning models is their ability to synthesise task-specific features from data. For a long time, the predominant belief was that ‘given enough data, all features can be learned.’ However, as large language models are hitting diminishing returns in output quality while requiring an ever-increasing amount of training data and compute, new approaches are required. One promising avenue is to focus more on modelling, i.e., on the development of novel inductive biases such as invariances that cannot be readily gleaned from the data. This approach is particularly useful for data sets that model real-world phenomena, as well as for applications where data is scarce. Given their dual nature, geometry and topology provide a rich source of potential inductive biases. In this talk, I will present novel advances in harnessing multi-scale geometrical-topological characteristics of data. A special focus will be given to showing how geometry and topology can improve representation learning tasks. Underscoring the generality of a hybrid geometrical-topological perspective, I will furthermore showcase applications from a diverse set of data domains, including point clouds, graphs, and higher-order combinatorial complexes.

Bio: Bastian Rieck is a Full Professor of Machine Learning at the University of Fribourg in Switzerland and the Principal Investigator of the AIDOS Lab, which focuses on developing novel machine learning methods driven by geometry and topology. His research has received multiple awards, including an ERC Starting Grant. Dr. Rieck is also a member of ELLIS, the European Laboratory for Learning and Intelligent Systems, and AI-LIFE, the International Network of Artificial Intelligence for the Life Sciences. Moreover, Dr. Rieck serves as the director of AATRN, the Applied Algebraic Topology Research Network. He received his M.Sc. degree in mathematics, as well as his Ph.D. in computer science, from Heidelberg University in Germany, graduating both times with distinction.

Wednesday Speaker 2, 11:30 - 12:30

Reihaneh Rabbany, McGill University, Mila – Quebec AI Institute

Title: Benchmarks and Evaluation in Temporal Graph Learning

Abstract: Learning from time-evolving graphs is crucial for understanding many real-world systems. However, effectively evaluating machine learning methods on temporal graphs presents significant challenges, including the lack of large, diverse datasets and inconsistencies in evaluation protocols. This talk will discuss novel benchmarks and evaluation strategies to address these challenges. First, we introduce new evaluation procedures and datasets for dynamic link prediction, focusing on more realistic negative sampling strategies and the importance of diverse data domains. Second, we present the Temporal Graph Benchmark (TGB), a collection of large-scale datasets and evaluation pipelines for temporal graph learning, covering both edge and node-level prediction tasks. Finally, we introduce TGB 2.0, which expands TGB to multi-relational temporal graphs, providing new benchmarks and evaluation protocols for temporal knowledge graphs and heterogeneous graphs. Through these benchmarks and evaluations, we aim to foster more rigorous and realistic assessments of temporal graph learning methods, ultimately driving advancements in the field.

Bio: Reihaneh Rabbany is a Canada CIFAR AI Chair, a core faculty member of Mila – Quebec’s artificial intelligence institute, and an assistant professor at the School of Computer Science at McGill University. Before that, she was a postdoctoral fellow in the School of Computer Science at Carnegie Mellon University, and she completed her Ph.D. in the Computing Science Department at the University of Alberta. Rabbany’s research is at the intersection of network science, data mining, and machine learning, with a focus on developing techniques for analyzing large-scale complex data that is interconnected, evolving, multi-modal, and noisy. She is particularly interested in data from online societies and applications to enhance the health and safety of online spaces.

Wednesday Speaker 3, 15:15 - 16:15

Danai Koutra, University of Michigan, Ann Arbor

Title: Not all Neighbors Agree: Graph Learning Beyond Homophily

Abstract: Graph neural networks (GNNs) have become a cornerstone of graph-based machine learning, demonstrating strong performance across a variety of applications spanning recommendation systems, molecular analysis, and social networks. While a wide variety of GNN models have been proposed, most of them perform best in graphs that exhibit the property of homophily, in which linked nodes often belong to the same class or have similar features, echoing the adage “birds of a feather flock together”. However, in the real world, there are also many settings where “opposites attract”, leading to networks that exhibit heterophily, in which linked nodes tend to be from different classes (e.g., protein-protein interaction networks or fraud detection scenarios). Moreover, even homophilic networks exhibit local variations in homophily, including strong heterophily.

In this talk, I will present our recent advances in understanding and improving GNNs in the presence of heterophily. I will introduce effective GNN designs for node classification and link prediction, and discuss how heterophily relates to core challenges such as oversmoothing and robustness. Moving beyond global homophily, I will show how local homophily variations can lead to performance disparities across node groups, ultimately resulting in unfair predictions. Finally, I will present the limitations of standard positional encodings in heterophilic graphs and introduce a variant that improves performance across a range of GNN architectures, including graph transformers.
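One common way to make the homophily/heterophily distinction precise (a gloss under standard conventions, not necessarily the exact definition used in the talk) is the edge homophily ratio

$$h(G) = \frac{\lvert \{ (u,v) \in E : y_u = y_v \} \rvert}{\lvert E \rvert},$$

where $y_u$ denotes the class label of node $u$: values of $h$ near 1 indicate strong homophily, while values near 0 indicate strong heterophily.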

Bio: Danai Koutra is an Associate Professor of Computer Science and Engineering at the University of Michigan and an Amazon Scholar. Her research interests include graph mining and learning, graph–LLM joint models, and graph summarization. Her work has been applied to social, collaboration, and web networks, as well as brain connectivity graphs. Danai has won the Presidential Early Career Award for Scientists and Engineers (PECASE), the 2024 IBM Early Career Data Mining Research Award, the 2023 Tao Li Award, an NSF CAREER Award, an ARO Young Investigator Award, the 2020 SIGKDD Rising Star Award, multiple industry-sponsored research faculty awards, a Precision Health Investigator Award, and the 2016 ACM SIGKDD Dissertation Award. She has also received nine paper awards and the 2022 IEEE ICDM Test-of-Time Award. In terms of service, she is currently Program Chair for the IJCAI 2025 Survey Track and has previously served as Program Chair for ACM SIGKDD, ECML/PKDD, and The Web Conference (track chair).

Thursday Speaker 1, 9:00 - 10:00

Mathilde Papillon, UC Santa Barbara

Title: Make Any Graph Neural Network Go Topological with TopoTune

Abstract: Graph Neural Networks (GNNs) excel at learning from relational datasets, processing node and edge features in a way that preserves the symmetries of the graph domain. However, many complex systems, such as biological or social networks, involve multiway interactions that go beyond the graph’s pairwise relationships. The emerging field of Topological Deep Learning (TDL) aims to accommodate and leverage these higher-order structures. Unlike the graph deep learning ecosystem, however, TDL lacks a standardized framework for easily defining new architectures, restricting its accessibility and applicability. In this talk, I will introduce TopoTune, a lightweight software framework that allows practitioners to define, build, and train general TDL models with unprecedented ease. Theoretical results show that TopoTune’s models generalize and subsume the current landscape of TDL models. Extensive experiments show that these architectures consistently match or outperform previous models, often with less model complexity.

Bio: Mathilde Papillon is a Physics PhD candidate in the Geometric Intelligence Lab at the University of California, Santa Barbara, where she develops novel deep learning methods leveraging geometry and topology. She harnesses these models to study relational data, with a special focus on full-body human movement. Mathilde obtained her BSc in Honours Physics from McGill University and was recently awarded Canada’s Postgraduate Doctoral Fellowship.

Thursday Speaker 2, 13:00 - 14:00

Melanie Weber, Harvard University

Title: A Geometric Lens on Challenges in Graph Machine Learning: Insights and Remedies

Abstract: Graph Neural Networks (GNNs) are a popular architecture for learning on graphs. While they have achieved notable success in areas such as biochemistry, drug discovery, and material sciences, GNNs are not without challenges: deeper GNNs exhibit instability due to the convergence of node representations (oversmoothing), which can reduce their effectiveness in learning long-range dependencies that are often crucial in applications. In addition, GNNs have limited expressivity, in that there are fundamental function classes that they cannot learn. In this talk, we will discuss both challenges from a geometric perspective. We propose and study unitary graph convolutions, which allow for deeper networks that provably avoid oversmoothing during training. Our experimental results confirm that Unitary GNNs achieve competitive performance on benchmark datasets. An effective remedy for limited expressivity is encodings, which augment the input graph with additional structural information. We propose novel encodings based on discrete Ricci curvature, which lead to significant gains in empirical performance and expressivity thanks to capturing higher-order relational information. We then consider the more general question of how higher-order relational information can be leveraged most effectively in graph learning. We propose a set of encodings that are computed on a hypergraph parametrization of the input graph and provide theoretical and empirical evidence for their effectiveness.

Bio: Melanie Weber is an Assistant Professor of Applied Mathematics and of Computer Science at Harvard University, where she leads the Geometric Machine Learning Group. Her research studies geometric structure in data and how to leverage such information for the design of new, efficient Machine Learning algorithms with provable guarantees. In 2021-2022, she was a Hooke Research Fellow at the Mathematical Institute in Oxford. Previously, she received her PhD from Princeton University (2021). She is a recipient of the IMA Leslie Fox Prize in Numerical Analysis and a Sloan Research Fellowship in Mathematics.

Friday Speaker 1, 9:00 - 10:00

Dhananjay Bhaskar, Yale University

Title: Beyond Message Passing: Learning Representations from Dynamics on Graphs

Abstract: This talk introduces two complementary frameworks that harness geometric and topological structure in dynamics on graphs for representation learning. I will first introduce Neurospectrum, a modular architecture that models brain activity as graph signals and learns latent neural trajectories shaped by multiscale spatial and temporal structure. By extracting geometric and topological invariants of these naturally occurring dynamics, such as curvature, path signatures, persistent homology, and recurrent dynamics, Neurospectrum reveals interpretable patterns in brain function, offering insights into neural synchrony, coordination, sensory processing, and aberrant dynamics associated with psychiatric disorders like OCD. Building on the idea of using dynamical behavior to reveal structure, I will next introduce DYMAG, a novel graph neural network that replaces traditional message passing with solutions to heat, wave, and chaotic partial differential equations defined over graphs. By treating graphs as continuous dynamical systems, DYMAG captures intrinsic geometric and topological features of the graph, enabling richer node and graph-level representations that improve performance on various benchmarks.
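To make the PDE idea concrete, here is a minimal sketch of the simplest case, the graph heat equation (the wave and chaotic PDEs mentioned in the abstract are analogous, with different dynamics): given a graph Laplacian $L$ and initial node features $x(0)$,

$$\frac{d x(t)}{d t} = -L\,x(t) \quad\Longrightarrow\quad x(t) = e^{-tL}\,x(0),$$

so evaluating $x(t)$ at several times $t$ yields multi-scale node representations driven by the dynamics rather than by message-passing layers.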

Bio: Dhananjay Bhaskar is a postdoctoral researcher at Yale University and visiting scholar at Brown Engineering, where he develops machine learning methods grounded in principles of geometry, topology, and biology to study complex systems, from molecular interactions to brain activity. He received his Ph.D. in Biomedical Engineering and M.S. in Data Science from Brown University, and his B.Sc. in Computer Science and Mathematics from the University of British Columbia. He is a recipient of the Boehringer Ingelheim Data Science Fellowship, Kavli Neuroscience Postdoctoral Fellowship, and DAAD Fellowship for Generative Models, and has been recognized by the Yale Postdoctoral Association for initiatives to advance professional development and interdisciplinary dialogue.

Friday Speaker 2, 11:30 - 12:30

Naoki Saito, UC Davis

Title: Generalized Haar-Walsh Dictionaries: Extensions and Applications

Abstract: I will discuss my group’s effort to generalize and extend the classical Haar-Walsh wavelet packet transforms for graphs, matrix data, and images. Using a recursive partitioning of the graph and successive averaging and differencing operations, we proposed the Generalized Haar-Walsh Transform (GHWT) in 2014, which generates an overcomplete dictionary of orthonormal bases of piecewise-constant nature. Then, in 2021, we lifted the GHWT to higher adaptability (the eGHWT) using the method originally developed by Thiele and Villemoes for 1D regular lattices (1996) and that of Lindberg and Villemoes for 2D regular lattices (2000). It significantly improved on the previous GHWT at a similar computational cost, O(n log n), where n is the number of vertices of an input graph. While the previous GHWT best-basis algorithm seeks the most suitable orthonormal basis for a given task among more than 1.5^n possible bases, the eGHWT best-basis algorithm can find a better one by searching through more than 0.618*(1.84)^n possible bases. The difference between these two is huge, especially for large n. As applications, we will discuss performing simultaneous segmentation, denoising, and compression of graph signals, texture images, and term-document matrices. If time permits, I will also describe our recent generalization of the scattering transform, a robust nonlinear signal feature extraction tool, using the GHWT/eGHWT.
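To give a rough sense of the scale of these two search spaces (our arithmetic, based only on the counts quoted above): for a graph with $n = 64$ vertices,

$$1.5^{64} \approx 1.9 \times 10^{11}, \qquad 0.618 \times 1.84^{64} \approx 5.5 \times 10^{16},$$

so the eGHWT best-basis search covers roughly $3 \times 10^{5}$ times as many bases at the same comparable $O(n \log n)$ cost.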

Bio: Naoki Saito is an applied mathematician specializing in applied and computational harmonic analysis. He studied at the University of Tokyo, receiving his BEng and MEng in 1982 and 1984, respectively. He joined Nippon Schlumberger K.K. in 1984 and in 1986 moved to Schlumberger-Doll Research, Ridgefield, CT, where he was a research scientist. He continued his studies, receiving his PhD in applied mathematics from Yale University in 1994. He began teaching in the Department of Mathematics at the University of California, Davis in 1997, where he is currently a professor and the director of the UC Davis TETRAPODS Institute of Data Science. His honors include: the Best Paper Awards from SPIE (1994); the Henri Doll Award from Schlumberger (1996); an ONR Young Investigator Award (2000); the Presidential Early Career Award for Scientists and Engineers [PECASE] (2000); the Best Paper Award from JSIAM (2016); and the Best Author Award from JSIAM (2016). He is a life senior member of IEEE. He also served as Chair of the SIAM Activity Group on Imaging Science from 2013 to 2015, and is a member of the editorial boards of three international journals: Applied and Computational Harmonic Analysis; Inverse Problems and Imaging; and the Journal of Mathematical Imaging and Vision.

Friday Speaker 3, 15:30 - 16:30

Dominique Beaini, Valence Labs, Université de Montréal, Mila – Quebec AI Institute

Title: How to Learn Molecules

Abstract: How can we build powerful representations of molecules for drug discovery? We’ll dive into the different algorithms, the challenges they face in terms of expressivity, and how to overcome them. We’ll discuss building positional and structural encodings, and how they can make GNNs quite expressive without algorithmic changes. Then, we’ll see how to build graph Transformers for pre-training on large molecular datasets, and how to scale them to infinity. Finally, we’ll step into the world of multi-modality to learn how molecules impact human cells from a morphological perspective.

Bio: Dominique Beaini is a research unit team lead at Valence Labs and also serves as an adjunct professor in the Department of Computer Science and Operations Research at Université de Montréal and at Mila – Quebec AI Institute. His research interests include graph neural networks, self-supervised learning, quantum mechanics, drug discovery, computer vision, and robotics. Dominique completed his PhD at Polytechnique Montréal, focusing on robotics and computer vision.


Orals: Extended Abstract Track

Oral ID Paper Title Authors Abstract
A1 Tracking Network Dynamics using Probabilistic State-Space Models Víctor Tenorio (King Juan Carlos University)*; Elvin Isufi (TU Delft); Geert Leus (TU Delft); Antonio G. Marques (King Juan Carlos University) This paper introduces a probabilistic approach for tracking the dynamics of unweighted and directed graphs using state-space models (SSMs). Unlike conventional topology inference methods that assume static graphs and generate point-wise estimates, our method accounts for dynamic changes in the network structure over time. We model the network at each timestep as the state of the SSM, and use observations to update the beliefs that quantify the probability of the network being in a particular state. Then, the proposed method incorporates the information of real-time graph signals into the beliefs. These beliefs provide a probability distribution of the network at each timestep, providing both an estimate of the network and the uncertainty it entails. Our approach is evaluated through experiments with synthetic and real-world networks. The results demonstrate that our method effectively estimates network states, outperforming traditional techniques such as recursive least squares.
A2 Learning Graphs from Structural Vector Autoregressions with Sparse Input Panagiotis Misiakos (ETH Zurich)*; Markus Püschel (ETH Zurich) We introduce SpinSVAR, a novel method for estimating a structural vector autoregression (SVAR) from time-series data under sparse input assumption. Unlike prior approaches using Gaussian noise, we model the input as independent Laplacian variables, enforcing sparsity and yielding a maximum likelihood estimator (MLE) based on least absolute error regression. We provide theoretical consistency guarantees for the MLE under mild assumptions. SpinSVAR is efficient: it can leverage GPU acceleration to scale to thousands of nodes. On synthetic data with Laplacian or Bernoulli-uniform inputs, SpinSVAR outperforms state-of-the-art methods in accuracy and runtime. When applied to S&P 500 data, it clusters stocks by sectors and identifies significant structural shocks linked to major price movements, demonstrating the viability of our sparse input assumption.
A3 Online Network Inference from Graph-Stationary Signals with Hidden Nodes Andrei Buciulea Vlas (Universidad Rey Juan Carlos)*; Madeline Navarro (Rice University); Samuel Rey Escudero (Universidad Rey Juan Carlos); Santiago Segarra (Rice University); Antonio García Marqués (Universidad Rey Juan Carlos) Graph learning is the fundamental task of estimating unknown graph connectivity from available data. Traditional methods assume full, simultaneous data availability, but real-world scenarios often involve incomplete, sequential data. We propose an online graph estimation method that accounts for hidden nodes. By leveraging signals stationary on the underlying graph, we model unknown connections to hidden nodes. We formulate a convex optimization problem for graph learning from streaming, incomplete signals and solve it via an efficient proximal gradient algorithm that operates in real-time. We establish theoretical conditions ensuring that our online method aligns with batch solutions. Experiments on synthetic and real data validate the effectiveness of our approach for online graph learning with missing observations.
B1 Leveraging Line Graph Representations to Improve GNN Convergence Rates Roxanne Holden (JHU)*; Luana Ruiz (JHU) A graphon is a representation of “families” of graphs and describes the limiting object of sequences of large, finite graphs; graphons are applicable to many areas, including Graph Neural Networks (GNNs). Specifically, graphons in convergent graph sequences can be used to analyze the convergence and transferability of GNNs. Currently, there is a tradeoff between restrictive continuity assumptions on the graphon and increased computational cost when studying the convergence of the spectra of the graph to the spectra of the graphon. In this work, we propose a study of the rates of convergence of a graph and its associated line graph to improve convergence bounds applicable to GNNs.
B2 HyperNATE: Scaling Tensor-Based Hypergraph Neural Networks Through Attention Nicolas Bello (University of Delaware)*; Fuli Wang (University of Delaware); Daniel Lau (University of Kentucky); Gonzalo Arce (University of Delaware) Hypergraphs are vital for modeling high-order relationships in complex systems across various domains. This has spurred significant research interest in hypergraph neural networks. Existing hypergraph neural networks face two key challenges: (1) tensor-based representations preserve rich structural information but incur heavy computational costs during training due to message passing operations, and (2) prevalent homophily assumptions fail to model the growing number of heterophilic datasets. To address these gaps, we introduce the Hypergraph Neighborhood Aggregation Transformer Encoder (HyperNATE), which decouples tensor-based message aggregation from training and leverages an attention mechanism to enable parallel multi-hop aggregation. Additionally, a high-pass filter is incorporated to effectively capture heterophilic features. Empirical evaluations on node classification tasks show that HyperNATE is 10-100x faster in training than tensor-based approaches.
B3 FedBaF: Federated Learning Aggregation Biased by a Foundation Model Jong-Ik Park (Carnegie Mellon University); Srinivasa Pranav (Carnegie Mellon University)*; José M. F. Moura (Carnegie Mellon University); Carlee Joe-Wong (Carnegie Mellon University) Foundation models are now a major focus because they generalize across diverse tasks. Existing approaches for adapting foundation models to new applications often rely on Federated Learning (FL) and disclose the foundation model weights to clients when using it to initialize the global model. These methods ensure client data privacy, but compromise model and information security. We introduce Federated Learning Aggregation Biased by a Foundation Model (FedBaF), a method for dynamically integrating pre-trained foundation model weights during the FL aggregation phase. FedBaF keeps the foundation model confidential while still leveraging it to train more accurate models, especially in non-IID and adversarial scenarios. Our comprehensive experiments use Pre-ResNet and foundation models like Vision Transformer to demonstrate that FedBaF matches and often surpasses traditional weight initialization methods by up to 11.4% in IID and up to 15.8% in non-IID settings.
C1 Multimodal Graphs for Diffusion-Based Manifold Alignment Jake Rhodes (Brigham Young University)*; Adam Rustad (Brigham Young University) Data from various sources are often intrinsically linked. Multimodal data integration enriches information content over single-source data. Manifold alignment seeks a shared low-dimensional representation of multiple data sources, emphasizing similarities. Semi-supervised manifold alignment relies on partial correspondences, either through shared features or known associations. We introduce two graph-based alignment methods: SPUD (Shortest Paths on the Union of Domains), which constructs a unified graph via known correspondences to learn inter-domain geodesic distances, and MASH (Manifold Alignment via Stochastic Hopping), which captures local geometry and iteratively refines inter-domain correspondences via a diffusion process. MASH forms a coupling matrix linking heterogeneous domains. We compare SPUD and MASH with existing methods, showing superior alignment and cross-domain classification, and demonstrate label transfer applications.
C2 Landmark-Based Node Embeddings and Graph Distance Approximations My Le (Johns Hopkins University)*; Luana Ruiz (Johns Hopkins University); Souvik Dhara (Purdue University) Learning node representations is a core problem in graph machine learning. Although existing embedding methods effectively capture local similarity measures, they often fall short when it comes to preserving global functions such as graph distances. Motivated by Bourgain's pioneering work on Hilbert space embeddings of metric spaces (1985), we examine the performance of local distance-preserving node embeddings. These embeddings, known as landmark-based algorithms, approximate pairwise distances by calculating shortest paths from a small subset of reference nodes, called landmarks. Our primary theoretical contribution demonstrates that random graphs, such as Erdős–Rényi random graphs, require lower dimensions for landmark-based embeddings compared to worst-case graphs. Additionally, we show empirically that the GNN-based approximations of distances to landmarks generalize effectively to larger networks, providing a scalable solution for graph representation learning.
C3 Consistent Identification of Top-K Nodes in Noisy Networks Hui Shen (McGill University)*; Eric Kolaczyk (McGill University) Identifying important nodes in a network, often measured by centrality, is a key challenge in network analysis. However, real-world networks are often noisy or incomplete, distorting rankings and misidentifying key nodes. In this paper, we examine how network noise impacts the accurate recovery of top-k nodes. Specifically, we consider a model where the observed network is a noisy version of a true network, with edges randomly added or removed according to a probabilistic noise model. Under this framework, we establish conditions for consistent recovery of top-k nodes and assess their feasibility across common network models. We also explore the challenges of detecting key nodes under high noise and discuss broader implications. For cases between consistency and infeasibility, we propose a confidence set approach to ensure key nodes are included with high probability. We also provide an initial analysis of eigenvector centrality and its robustness to spectral perturbations from noise.
D1 Graph Signal Processing: Frequency Analysis for Similar Matrices John Shi (Carnegie Mellon University)*; Jose Moura (Carnegie Mellon University) Spectral analysis is a fundamental part of both DSP and GSP. Many signal processing applications rely on performing a spectral analysis by taking the Fourier transform. In both DSP and GSP, the ordering of the frequencies (from low to high) is determined by the total variation of the eigenvectors. Additionally, eigenvectors associated with low frequencies have similar values between adjacent or neighboring nodes, while eigenvectors associated with high frequencies have larger oscillations between neighboring nodes. In this paper, we examine frequency analysis of “similar” graphs, i.e., graphs whose adjacency matrices are related by a similarity transformation. Similar matrices have the same eigenvalues, but different eigenvectors and entries. We show that the frequency content of a signal and the frequency ordering are preserved across similar graphs. We examine examples of these similar graphs through the GSP companion model ([1]), which shows that GSP is DSP with appropriate boundary conditions.
D2 Interpretable Lightweight Transformer via Unrolling of Learned Graph Smoothness Priors Tam Thuc Do (York University)*; Parham Eftekhar (York University); Seyed Alireza Hosseini (York University); Gene Cheung (York University); Philip A. Chou (packet.media) We build interpretable and lightweight transformer-like neural networks by unrolling iterative optimization algorithms that minimize graph smoothness priors, e.g., the $\ell_1$-norm graph total variation (GTV), subject to an interpolation constraint. The insight is that a normalized signal-dependent graph learning module amounts to a variant of the basic self-attention mechanism in conventional transformers. Unlike “black-box” transformers that require learning huge parameter sets to compute the affinities and subsequent output embeddings, our unrolled networks employ shallow CNNs to learn low-dimensional features per node to construct similarity graphs. At each layer, the target interpolated signal is a low-pass graph-filtered output derived from the minimization of an assumed graph smoothness prior, leading to a reduction in parameter count. Experiments on two image interpolation applications verify the restoration performance and parameter efficiency of our unrolled networks.
D3 Causal Analysis of Graph Signals for Brain Effectome Inference Srikar Mutnuri (University of Virginia)*; Aniruddha Adiga (University of Virginia); Srinivasan Venkatramanan (University of Virginia); Madhav V. Marathe (University of Virginia) Understanding the directed interactions between brain regions is critical for analyzing neuro-degenerative diseases like Alzheimer's (AD). Traditional functional connectivity (FC) methods capture statistical associations but fail to infer causal relationships, limiting their ability to reveal structural disruptions in disease progression. In this exploratory work, we propose a causal graph inference and spectral analysis framework for EEG signals. Specifically, we leverage Granger causality and spectral graph methods to construct and analyze the effective connectome (effectome) of the brain. Our work reveals that AD networks exhibit lower algebraic connectivity (λ2), reduced modularity (eigenvalue gaps), and increased structural sensitivity to perturbations, compared to healthy individuals. Additionally, we simulate diffusion processes on the inferred graph topology to model signal propagation, demonstrating disrupted information flow in AD-affected networks.

Posters: Extended Abstract Track

Session 1

Paper Title Authors Abstract
E(n) Equivariant Topological Neural Networks Claudio Battiloro (Harvard University)*; Ege Karaismailoğlu (Harvard University); Mauricio Tec (Harvard University); George Dasoulas (Harvard University); Michelle Audirac (Harvard University); Francesca Dominici (Harvard University) We introduce E(n)-Equivariant Topological Neural Networks (ETNNs), which are E(n)-equivariant message-passing networks operating on combinatorial complexes, formal objects unifying graphs, hypergraphs, simplicial, path, and cell complexes. ETNNs incorporate geometric node features while respecting rotation, reflection, and translation equivariance. ETNNs are natively ready for settings with heterogeneous interactions. We provide a theoretical analysis to show the improved expressiveness of ETNNs over architectures for geometric graphs. The broad applicability of ETNNs is demonstrated through two tasks of vastly different scales: i) molecular property prediction on the QM9 benchmark and ii) land-use regression for hyper-local estimation of air pollution with multi-resolution irregular geospatial data. The results indicate that ETNNs match or surpass SotA equivariant neural networks of the same class with a significantly smaller computational burden.
Tensor-Based Graph Neural Networks for Graph Topology Inference Maggie Cheng (Illinois Institute of Technology)* Graph neural networks have been the de facto tools for graph learning tasks. One basic function for graph learning is to learn the link topology of a graph. The ability to infer graph topology is imperative to many graph classification, regression, and even generation tasks. In this work, we propose a new set of orthogonal bases to construct linear layers for the graph neural networks. This set of orthogonal bases is defined for GNNs based on order-3 tensors and is enough to provide sufficient discriminative power for achieving $3$-WL expressiveness.
Sparse polynomial approximation of diffusion on community-structured graphs Giuseppe Alessio D'Inverno (International School for Advanced Studies, Trieste)*; Simone Brugiapaglia (Concordia University, Montréal); Kylian Ajavon (Concordia University, Montréal) Diffusion kernels over graphs have been widely utilized as effective tools in various applications due to their ability to accurately model the flow of information through nodes and edges. However, there is a notable gap in the literature regarding the development of surrogate models for diffusion processes on graphs. In this work, we fill this gap by proposing sparse polynomial-based surrogate models for parametric diffusion equations on graphs with community structure. In tandem, we provide convergence guarantees for both least squares and compressed sensing-based approximations by showing the holomorphic regularity of parametric solutions to these diffusion equations. Our theoretical findings are accompanied by a series of numerical experiments conducted on both synthetic and real-world graphs that demonstrate the applicability of our methodology.
Crop Classification in Satellite Images via Signed Graph Spectral Classifier Learning Parham Eftekhar (York University)*; Gene Cheung (York University); Tim Eadie (Growers Edge) Crop classification in satellite images is vital for remote crop monitoring. We propose a novel signed graph spectral algorithm with supervised feature learning for binary classification. During training, we extract pixel-wise feature vectors via a shallow CNN from spectral bands. Our contrastive learning objective minimizes feature distances for same-label pixels while maximizing them for different-label pairs. At inference, we compute feature vectors and map non-negative distances to signed edge weights via a shifted logistic function, forming a signed graph encoding pairwise (dis)similarities. We then compute the first eigenvector of a convex combination of the signed and combinatorial graph Laplacians in linear time, using its entry signs for classification. Experiments show our method achieves performance comparable to deep learning models with significantly fewer parameters.
Higher-Order Semi-Supervised Learning on Point Clouds Using Hypergraphs Adrien Weihs (University of California Los Angeles)* We propose a higher-order hypergraph method for semi-supervised learning on point clouds. This is motivated by the fact that the classical hypergraph learning algorithm is asymptotically equivalent to p-Laplace Learning on graphs. Our new framework includes additional hypergraph geometric information by penalizing higher-order derivatives on hyperedges. We also preserve the quadratic form structure of Laplace Learning which greatly simplifies numerical implementations and we can reduce computational complexity through spectral truncation. In addition, this allows us to formulate the learning problem in the Bayesian setting. We present numerical results demonstrating the effectiveness of our methodology compared to other graph-based semi-supervised learning methods.
Online Learning for State-dependent Causal Graph Process Model Yuzhe Li (Tsinghua University); Hong Zhao (Tsinghua University)* Graph Signal Processing (GSP) theory is a powerful framework for reconstructing graph structures from observed data. While significant progress has been made in the field of graph learning, the majority of existing studies focus on static graphs. However, dynamic graphs are more prevalent in real-world applications, necessitating the extension of GSP tools, theories, and algorithms to accommodate such scenarios. In this work, we extend the existing causal graph process model to dynamic weight scenarios. A state-dependent causal graph process model, along with its learning algorithm, is proposed for graph signal time series. Preliminary experimental results demonstrate the effectiveness of the proposed method.
Efficient Learning of Balanced Signed Graphs via Iterative Linear Programming Haruki Yokota (The University of Osaka)*; Hiroshi Higashi (The University of Osaka); Yuichi Tanaka (The University of Osaka); Gene Cheung (York University) Signed graphs incorporate both positive and negative edge weights to represent positive/negative correlations in data. When a signed graph is balanced, eigenvectors of its graph Laplacian matrix can be linearly transformed to those of a positive graph Laplacian. This allows the reuse of spectral filters designed for positive graphs to signed graphs. We propose an efficient algorithm to learn a balanced signed graph Laplacian directly from data, by extending a sparse inverse covariance estimation problem with linear constraints to enforce sign consistency of the edges. Experiments on real data demonstrate superior performance over existing methods.
Orthogonal L1 Digraph Fourier Basis via Total Variation Vedran Mihal (ETH Zurich)*; Markus Püschel (ETH Zurich) We propose a novel orthogonal Fourier basis for arbitrary directed graphs based on iterative maximization of the L1-norm total variation. We leverage algorithms from L1-norm principal component analysis for its efficient computation, which also implies that the basis always exists. We prove that the basis covers the range of total variations better than prior bases and demonstrate its effectiveness in a prototypical signal recovery experiment.
Enabling Out-of-Sample Extension in Semi-Supervised Manifold Alignment through Twin Autoencoders Jake Rhodes (Brigham Young University)*; Adam Rustad (Brigham Young University); Marshall Nielsen (Brigham Young University) Manifold alignment seeks a shared representation that captures inter-domain relationships while preserving intra-domain structures. A key limitation of traditional approaches is the lack of an out-of-sample extension mechanism, requiring full recomputation when new data is introduced. To address this, we propose an out-of-sample extension framework using a twin autoencoder (AE) architecture, where each AE is trained on a single modality and regularized with a pre-aligned joint embedding. This structure enables seamless extension of new data points while maintaining geometric consistency. We validate our method on bimodal datasets, demonstrating robust alignment preservation.
Identifying Adversarial Attacks in Crowdsourcing via Dense Subgraph Detection Abdullah Karaaslanli (Michigan State University)*; Panagiotis Traganitis (Michigan State University); Aritra Konar (KU Leuven) Crowdsourcing is becoming increasingly important for contemporary applications in machine learning and artificial intelligence. However, crowdsourcing systems may be susceptible to adversarial attacks where a subset of annotators deliberately provide erroneous responses. This paper introduces a novel algorithm for identifying adversarial attacks in crowdsourcing systems by recasting the problem as dense subgraph detection in bipartite graphs. In particular, we represent crowdsourced data as a weighted bipartite graph between workers and data points where edge weights are computed from annotators' responses. The bipartite graph is then analyzed with a peeling algorithm to detect the dense subgraph that includes adversarial attacks. Compared to previous methods where only adversaries are detected, our proposed method can simultaneously identify adversarial annotators as well as affected data points. Preliminary results on real datasets showcase the potential of this novel approach.
Differentially Private Decentralized PCA of Graph Signals for Graph Clustering Andrew Campbell (Cornell University)*; Anna Scaglione (Cornell University); Sean Peisert (Lawrence Berkeley National Laboratory) We address the problem of decentralized principal component analysis (DPCA) of graph signals and its application to graph clustering of graphical data under privacy constraints. Leveraging graph data smoothness, node clustering reduces to computing principal eigenvectors of the sample covariance, followed by $k$-means clustering. We develop a distributed mechanism for this computation with differential privacy (DP) guarantees. Unlike the existing approaches, we consider a setting where each agent can only access sub-graph signals. To handle this complexity, we propose a decentralized and DP power method for computing the eigenvectors of the entire graph signal sample correlation. Our method leverages consensus aggregation, while requiring each agent to track only its portion of the global eigenvectors. This method enhances the privacy guarantees while reducing communication overhead. Finally, we establish $(\epsilon, \delta)$-DP bounds and demonstrate its efficacy through a simulation.
Graph Signal Matching for Image Registration Hang Liu (Cornell University)*; Anna Scaglione (Cornell University); Urbashi Mitra (University of Southern California) We propose a graph signal processing (GSP) approach for simultaneous pose and correspondence estimation in image registration. By modeling point sets as graph signals on k-nearest-neighbor (kNN) graphs, we recast the correspondence estimation task into a graph signal matching problem. We employ a spectral matching step to recover the unknown permutation, followed by a least-squares solution for the rigid transformation. Numerical experiments on 3D scanning data demonstrate that our method achieves superior registration accuracy compared to baseline algorithms, thus validating its effectiveness in both 3D-2D image registration scenarios.
DAG structure learning based on non-negative weights Samuel Rey (King Juan Carlos University)*; Seyed Saman Saboksayr (University of Rochester); Gonzalo Mateos (University of Rochester) We address the problem of learning the topology of directed acyclic graphs (DAGs) from nodal observations in a linear structural equation model. Recent methods frame this task as a continuous optimization problem but struggle with non-convexity. To overcome this, we assume non-negative edge weights, allowing cycles to be characterized and prevented using a convex acyclicity function based on the log-determinant of the adjacency matrix. This reformulates the problem as a convex optimization task. We propose a DAG recovery algorithm using the method of multipliers, guaranteeing a global minimizer. Moreover, we prove that with infinite samples, our approach recovers the true DAG structure. Empirical validation on synthetic data shows our algorithm outperforms state-of-the-art alternatives.
Unrolled Denoising of Attributes on Graphs with Local-and-Global Smoothness Assumption Reina Kaneko (The University of Osaka)*; Junya Hara (The University of Osaka); Hiroshi Higashi (The University of Osaka); Yuichi Tanaka (The University of Osaka) This paper presents a denoising method for attributes on graphs using a local-and-global smoothness assumption. Restoration of graph signals is one of the most fundamental problems in graph signal processing. Much existing work assumes graphs are noiseless; however, both the graphs and the signal values may be noisy. In this paper, we address denoising of attributes by assuming a local-and-global smoothness nature of graph signals: they are locally smooth on the graph and globally smooth in an appropriate space regardless of the graph. We formulate a corresponding convex problem that can be iteratively solved with a monotone operator splitting algorithm. Moreover, the iterations can be unrolled to apply deep algorithm unrolling (DAU) for tuning internal parameters automatically from training data. In experiments, our method exhibits higher restoration performance in PSNR than existing model-based and DAU-based methods, even when coordinates are corrupted by noise.
Period Estimation for Time-Varying Graph Signals Based on Cyclic Graph Wide Sense Stationarity Tsutahiro Fukuhara (The University of Osaka)*; Junya Hara (The University of Osaka); Hiroshi Higashi (The University of Osaka); Yuichi Tanaka (The University of Osaka) This paper proposes a period estimation method for time-varying graph signals (TVGSs). In many network-based measurement systems, TVGSs are stationary in the spatial domain and also exhibit periodic stationarity in the time domain. Although accurate estimation of the period of TVGSs would result in accuracy gains in applications, most existing works are limited in addressing this issue. First, we introduce cyclic graph wide sense stationarity on TVGSs, where signals are stationary in the spatial domain and periodically stationary in the time domain. Second, we propose a period estimation method. It consists of two phases: 1) we extract periodic features from TVGSs by decomposing a set of TVGSs using the graph Fourier transform and a nested periodic dictionary, and 2) we estimate the period of TVGSs based on the extracted features. Numerical experiments on real-world data demonstrate that our method effectively estimates the period.
A Random Dot Product Graph Model for Weighted and Directed Networks Bernardo Marenco (Udelar)*; Paola Bermolen (Udelar); Marcelo Fiori (Udelar); Federico Larroca (Udelar); Gonzalo Mateos (University of Rochester) The Random Dot Product Graph (RDPG) model assigns a low-dimensional vector to each vertex, and postulates that an edge between any two nodes exists with probability given by the inner product of said vectors. Recently, this latent position model has been extended to account for weighted graphs (the so-called Weighted (W-)RDPG), now embedding each node with a sequence of vectors. However, graphs adhering to this nonparametric W-RDPG model are constrained to be undirected and homophilic. In this work, we extend the model’s expressivity by proposing a variant for directed graphs, which may also include heterophilic nodes. To this end, we endow each vertex with two sequences, respectively modeling the node’s incoming and outgoing connectivity behavior. The effectiveness of the novel weighted and directed RDPG model is illustrated via several test cases, including both synthetic and real-life networks.
Accelerated graph clustering with PASCO: Parallel structured coarsening and partition alignment by optimal transport Etienne Lasalle (Inria, ENS de Lyon, CNRS, UCB Lyon 1, LIP, UMR 5668); Rémi Vaudaine (Inria, ENS de Lyon, CNRS, UCB Lyon 1, LIP, UMR 5668); Titouan Vayer (Inria, ENS de Lyon, CNRS, UCB Lyon 1, LIP, UMR 5668); Pierre Borgnat (CNRS, ENS Lyon)*; Paulo Gonçalves (Inria, ENS de Lyon, CNRS, UCB Lyon 1, LIP, UMR 5668); Rémi Gribonval (Inria, ENS de Lyon, CNRS, UCB Lyon 1, LIP, UMR 5668); Marton Karsai (Department of Network and Data Science, Central European University, Vienna) We develop a method, called PASCO, to accelerate clustering algorithms. It has three steps: (1) we compute several independent small graphs representing the input graph by applying an efficient and structure-preserving coarsening algorithm; (2) a clustering algorithm is run in parallel on each small graph, providing several partitions of the initial graph; (3) these partitions are aligned and combined with an optimal transport method to output the final partition. This framework relies on two key contributions: a novel global algorithm structure designed to enable parallelization, and a fast, empirically validated graph coarsening algorithm that preserves structural properties. We demonstrate the strong performance of PASCO in terms of computational efficiency, structural preservation, and output partition quality, evaluated on both synthetic and real-world graph datasets.
Graph Pooling via Ricci Flow Amy Feng (Harvard University), Melanie Weber (Harvard University) Graph Machine Learning often involves the clustering of nodes based on similarity structure encoded in the graph's topology and the nodes' attributes. On homophilous graphs, the integration of pooling layers has been shown to enhance the performance of Graph Neural Networks by accounting for inherent multi-scale structure. Here, similar nodes are grouped together to coarsen the graph and reduce the input size in subsequent layers in deeper architectures. In both settings, the underlying clustering approach can be implemented via graph pooling operators, which often rely on classical tools from Graph Theory. In this work, we introduce a graph pooling operator (ORC-Pool), which utilizes a characterization of the graph's geometry via Ollivier's discrete Ricci curvature and an associated geometric flow. Previous Ricci flow based clustering approaches have shown great promise across several domains, but are by construction unable to account for similarity structure encoded in the node attributes. However, in many ML applications, such information is vital for downstream tasks. ORC-Pool extends such clustering approaches to attributed graphs, allowing for the integration of geometric coarsening into Graph Neural Networks as a pooling layer.

Session 2

Paper Title Authors Abstract
Estimating fair graphs with unbiased connections Madeline Navarro (Rice University)*; Samuel Rey (King Juan Carlos University); Andrei Buciulea (King Juan Carlos University); Antonio Marques (King Juan Carlos University); Santiago Segarra (Rice University) We propose estimating graphs that are fair with respect to sensitive nodal attributes. Many real-world models exhibit unfair discriminatory behavior due to biases in data. Such discrimination is known to be exacerbated when data is equipped with pairwise relationships encoded in a graph. We thus propose an optimization-based approach to accurately infer graphs while discouraging biased edges. To this end, we present bias metrics that measure topological demographic parity to be applied as convex penalties. We also propose an efficient proximal gradient algorithm to obtain the estimates. Theoretically, we express the tradeoff between fair and accurate estimated graphs. Critically, this includes demonstrating when accuracy can be preserved in the presence of a fairness regularizer. Our empirical validation includes synthetic and real-world simulations that illustrate the value and effectiveness of our proposed optimization problem and iterative algorithm.
Bilevel Optimization for Scalable and Robust Graph Neural Network Training Chenyue Zhang (The Chinese University of Hong Kong)*; Anna Scaglione (Cornell University); Tianyi Chen (Rensselaer Polytechnic Institute); Hoi-To Wai (The Chinese University of Hong Kong) In this paper, we propose a bilevel optimization framework to robustly train graph neural networks (GNNs) from multiple subgraph or multi-graph datasets. Our approach jointly learns the GNN parameters and subgraph selection weights, identifying and discarding noisy or distributionally misaligned graph datasets that could hinder model performance. By reformulating the training objective under a bilevel setup, we leverage a penalty-based algorithm that converges to a stationary point with manageable computation complexity. Experiments confirm that our method effectively boosts prediction accuracy for various tasks, highlighting its scalability and robustness in real-world graph machine learning scenarios.
Data-Driven Graph Filters via Adaptive Spectral Shaping Dylan Sandfelder (University of Oxford)*; Mihai Cucuringu (University of California, Los Angeles); Xiaowen Dong (University of Oxford) We present a data-driven framework for learning multi-scale filters on graphs via adaptive spectral shaping, addressing the pervasive challenge of heterogeneous frequency content in graph signals. Our model learns a data-driven parametric kernel that can adapt to different types of spectral regions. By combining this mechanism with a multi-scale extension, we capture signals that exhibit multiple peaks across the graph spectrum. Our implementation leverages polynomial approximations of the Laplacian to avoid costly eigendecompositions, making the approach scalable to larger graphs. Experimental results on synthetic datasets show that our adaptive, multi-scale filters outperform standard single-scale filters in reconstructing graph signals with complex spectra. These findings highlight the potential of incorporating data-driven designs into multi-scale graph signal processing to get more accurate analysis in networked systems, such as sensor arrays, communication networks, and beyond.
Graph Wavelet Neural Networks for EEG-Based Seizure Detection Wei Cao (Bishop's University)*; Rachid Hedjam (Bishop's University) Electroencephalography (EEG) is a widely used, non-invasive tool for seizure detection, but manual annotation is time-consuming and requires medical expertise. This study explores the use of Adaptive Graph Wavelet Neural Networks (AGWNNs) for EEG-based seizure detection, leveraging their ability to efficiently capture localized and multi-scale electrical signal patterns in the brain. With a proper selection of wavelet kernel functions, we devised an attention mechanism to flexibly adjust the weights of different scales of graph wavelets. Unlike traditional deep learning models that fail to incorporate spatial dependencies between EEG electrodes, AGWNNs provide an effective way to model EEG signals as graphs, preserving multi-context relationships. Experiments on the Temple University Hospital Seizure Detection (TUSZ) dataset illustrate that AGWNNs outperform other state-of-the-art models.
Forest Proximity Graphs for Time Series Ben Shaw (Utah State University)*; Jake Rhodes (Brigham Young University); Soukaina Boubrahimi (Utah State University); Kevin Moon (Utah State University) RF-GAP has recently been introduced as an improved random forest proximity measure. In this paper, we present PF-GAP, an extension of RF-GAP proximity graphs to proximity forests, an accurate and efficient time series classification model. We use the forest proximity graphs alongside Local Outlier Factors to investigate the connection between misclassified points and outliers, comparing with nearest neighbor classifiers which use time series distance measures. We show that the forest proximities seem to exhibit a stronger connection between misclassified points and outliers than nearest neighbor classifiers.
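A minimal sketch of the comparison described above, assuming a precomputed forest-proximity matrix `prox` with entries in [0, 1]; it converts proximities to dissimilarities and scores outliers with scikit-learn's Local Outlier Factor:

```python
import numpy as np
from sklearn.neighbors import LocalOutlierFactor

def outlier_vs_misclassified(prox, y_true, y_pred, k=10):
    """prox: (n, n) forest proximities in [0, 1]; returns mean LOF scores."""
    dist = 1.0 - prox                   # proximity -> dissimilarity
    np.fill_diagonal(dist, 0.0)
    lof = LocalOutlierFactor(n_neighbors=k, metric="precomputed")
    lof.fit(dist)
    scores = -lof.negative_outlier_factor_          # larger = more outlying
    wrong = np.asarray(y_true) != np.asarray(y_pred)
    return scores[wrong].mean(), scores[~wrong].mean()
```

A large gap between the two means would indicate the connection between misclassified points and outliers that the abstract describes.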
Online Learning Of Expanding Graphs Samuel Rey (King Juan Carlos University)*; Bishwadeep Das (TU Delft); Elvin Isufi (TU Delft) This paper addresses online network topology inference for expanding graphs using spatiotemporal signal streams. Unlike prior works that assume a fixed set of nodes, we consider dynamically growing graphs, which introduce challenges in modeling temporal dynamics and increase computational complexity. We propose an online algorithm based on projected proximal gradient descent that adapts to graph expansion by recursively updating the sample covariance matrix. Our method differentiates updates for new and existing nodes. We specialize our approach to Gaussian Markov random fields, analyzing its computational complexity and dynamic cumulative regret. Finally, we validate our method through controlled experiments and real-world datasets from epidemic and financial networks.
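One building block, sketched under the assumption of zero-mean signals: a recursive sample-covariance update that pads the estimate when a new node joins the graph.

```python
import numpy as np

def update_covariance(C, count, x_new):
    """C: running (n, n) covariance of zero-mean signals; x_new: (m,) snapshot, m >= n."""
    n, m = C.shape[0], x_new.shape[0]
    if m > n:                                      # graph expanded: pad the old estimate
        C = np.pad(C, ((0, m - n), (0, m - n)))
    count += 1
    C = C + (np.outer(x_new, x_new) - C) / count   # recursive running-mean update
    return C, count
```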
Peer-to-Peer Learning Dynamics of Wide Neural Networks Shreyas Chaudhari (Carnegie Mellon University); Srinivasa Pranav (Carnegie Mellon University)*; Emile Anand (Georgia Institute of Technology); José M. F. Moura (Carnegie Mellon University) Peer-to-peer learning is a framework that enables beyond-5G distributed edge devices to collaboratively train deep neural networks in a privacy-preserving manner without the aid of a central server. These training algorithms have many design considerations that are difficult to tune in deployment settings, such as the neural network architecture and hyperparameters. This presents a need for characterizing the training dynamics of distributed optimization algorithms used to train nonconvex neural networks in peer-to-peer learning environments. In this work, we provide an explicit characterization of the learning dynamics of wide neural networks trained using popular distributed gradient descent (DGD) algorithms. Our results leverage recent advancements in neural tangent kernel (NTK) theory and extensive previous work on distributed learning and consensus. Our methods accurately predict parameter and error dynamics of wide neural networks trained for classification tasks.
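For reference, the DGD update analyzed in this line of work has a compact form: each agent averages its parameters with its neighbors' (via a doubly stochastic mixing matrix) and takes a local gradient step. A minimal sketch:

```python
import numpy as np

def dgd(params, W, grad_fn, step, rounds):
    """params: (n_agents, d) stacked parameters; W: (n_agents, n_agents) doubly
    stochastic mixing matrix matching the communication graph."""
    for _ in range(rounds):
        grads = np.stack([grad_fn(i, params[i]) for i in range(len(params))])
        params = W @ params - step * grads   # neighbor averaging + local gradient step
    return params
```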
Accelerating Research with Graph Signal Processing and AI-Enhanced Knowledge Graphs Thomas Kerby (Utah State University); Benjamin Fuller (Utah State University); Kevin Moon (Utah State University)* In an era of exponential research growth and rapid technological advances, traditional literature review and synthesis methods face significant challenges. We present an open source software package that leverages graph signal processing (GSP) and advanced natural language processing (NLP) techniques. Our tool builds a dynamic knowledge/citation graph from a topic, a set of academic papers, or a BibTeX file using the Semantic Scholar API with the Neo4j graph database. By integrating Neo4j vector databases with LangGraph, it seamlessly connects to large language models (LLMs) to create interactive research agents. This framework streamlines navigation of complex research landscapes and introduces a novel method to compare traditional citation graphs with semantic graphs derived from embedding distances, offering fresh insights into paper influence and revealing latent connections overlooked by conventional bibliometrics.
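A hedged sketch, independent of the authors' package, of the citation-graph construction step; the endpoint and field names are assumptions based on the public Semantic Scholar Graph API documentation.

```python
import requests
import networkx as nx

def add_references(G, paper_id):
    """Add paper_id -> cited-paper edges to the citation graph G."""
    url = f"https://api.semanticscholar.org/graph/v1/paper/{paper_id}/references"
    data = requests.get(url, params={"fields": "title,paperId"}, timeout=30).json()
    for ref in data.get("data", []):
        cited = ref.get("citedPaper") or {}
        if cited.get("paperId"):
            G.add_edge(paper_id, cited["paperId"], title=cited.get("title"))
    return G

# Usage: G = add_references(nx.DiGraph(), "<paper_id>")
```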
Gradient Networks Shreyas Chaudhari (Carnegie Mellon University); Srinivasa Pranav (Carnegie Mellon University)*; José M. F. Moura (Carnegie Mellon University) Directly parameterizing and learning gradients of functions has applications in inverse problems, generative modeling, and optimal transport. We introduce gradient networks (GradNets): neural network architectures that parameterize gradients of various function classes. We provide a GradNet design framework that includes methods for transforming them into monotone gradient networks (mGradNets), which represent gradients of convex functions. Our results establish that GradNets (and mGradNets) universally approximate gradients of (convex) functions. Furthermore, GradNets can be customized to correspond to specific spaces of potential functions, including transformed sums of (convex) ridge functions. We propose two distinct GradNet architectures and describe the corresponding monotone versions. Our empirical results demonstrate that these architectures outperform existing methods by up to 15 dB in gradient field tasks and by up to 11 dB in Hamiltonian dynamics learning tasks.
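A minimal sketch of the simplest instance mentioned above: for $f(\mathbf{x}) = \sum_i g(\mathbf{w}_i^\top \mathbf{x} + b_i)$ with $g$ convex, the gradient is $\mathbf{W}^\top \sigma(\mathbf{W}\mathbf{x} + \mathbf{b})$ with $\sigma = g'$ nondecreasing, so the network below is a monotone gradient field by construction (a stand-in, not either of the paper's two architectures).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MonotoneGradientNet(nn.Module):
    """Parameterizes grad f for f(x) = sum_i g(w_i^T x + b_i), g convex."""
    def __init__(self, dim, hidden):
        super().__init__()
        self.lin = nn.Linear(dim, hidden)

    def forward(self, x):          # x: (batch, dim) -> (batch, dim)
        # softplus (= g') is nondecreasing, so this map is the gradient of a
        # convex potential and hence monotone by construction
        return F.softplus(self.lin(x)) @ self.lin.weight
```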
Inferring the Graph Structure of Images for Graph Neural Networks Mayur Gowda (Carnegie Mellon University); John Shi (Carnegie Mellon University)*; Augusto Santos (Instituto de Telecomunicações (IT)); José M. F. Moura (Carnegie Mellon University) Image datasets such as MNIST are a key benchmark for testing GNN architectures. The images are traditionally represented as a grid graph with each node representing a pixel and edges connecting neighboring pixels (vertically and horizontally). The graph signal is the values (intensities) of each pixel in the image. The graphs are used as input to a GNN to classify the images.
In this work, we improve the accuracy of downstream graph neural network tasks by finding alternative graphs to the grid graph to represent the dataset images, following the approach in [4, 5]. We find row and column correlation graphs for each image in MNIST and Fashion-MNIST from correlations between the pixel values, using the method in [4, 5]. We then form the graph representing each image from its row and column graphs. We show that using these alternative graph representations as input to graph CNNs and GATs improves accuracy over the traditional grid graph and other graph constructions in the literature.
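A minimal sketch of the construction (details differ from [4, 5]): estimate row and column correlation graphs from an image's pixel values and threshold weak correlations.

```python
import numpy as np

def correlation_graphs(img, thresh=0.5):
    """img: (H, W) pixel array. Returns thresholded row- and column-graph adjacencies."""
    A_row = np.abs(np.corrcoef(img))        # correlations between rows of the image
    A_col = np.abs(np.corrcoef(img.T))      # correlations between columns
    for A in (A_row, A_col):
        np.fill_diagonal(A, 0.0)
        A[A < thresh] = 0.0                 # keep only strong correlations as edges
    return A_row, A_col
```

A per-pixel image graph can then be assembled from the two factors, for instance via a Kronecker-type product of the row and column adjacencies.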
Robustness of Graph Topology Learning with Smooth Signals under Partial Observations Hoang-Son Nguyen (Oregon State University)*; Hoi-To Wai (The Chinese University of Hong Kong) Recently, many sophisticated algorithms have been proposed for graph topology learning from partial observations. Most rely on additional structure such as low-rankness and sparsity, or else require the number of unobserved nodes to be significantly smaller than the graph size. The aim of this ongoing work is to demonstrate theoretically that simple graph topology learning methods are implicitly robust to partial observations of low-pass filtered graph signals. We achieve this result by extending a restricted isometry property (RIP) to the Dirichlet energy function. We show that the smoothness-based graph learning formulation, applied to partial observations, learns the ground-truth graph topology restricted to the observed nodes.
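For concreteness, the Dirichlet energy at the heart of such smoothness-based formulations is $\operatorname{tr}(\mathbf{X}^\top \mathbf{L} \mathbf{X})$; a minimal sketch:

```python
import numpy as np

def dirichlet_energy(W, X):
    """W: (n, n) adjacency over observed nodes; X: (n, T) observed signals."""
    L = np.diag(W.sum(axis=1)) - W      # combinatorial Laplacian
    return np.trace(X.T @ L @ X)        # small value <=> signals smooth on the graph
```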
Joint Time-varying Graph Estimation / Signal Interpolation via Smoothness and Low-Rank Priors Saghar Bagheri (York University)*; Gene Cheung (York University); Tim Eadie (Growers Edge); Antonio Ortega (University of Southern California) In graph signal processing scenarios where node similarities evolve over time, the underlying graph structure must adapt.
We model small changes between consecutive adjacency matrices as a low-rank matrix. Specifically, given an initial adjacency $\mathbf{W}_1$ at $t=1$, we jointly interpolate a signal $\mathbf{x}_2$ and estimate $\mathbf{W}_2$ at $t=2$ by leveraging a graph signal smoothness prior and a low-rank prior on $\mathbf{W}_2 - \mathbf{W}_1$. Our approach alternates between computing $\mathbf{x}_2$ with fixed $\mathbf{W}_2$ using conjugate gradient and updating $\mathbf{W}_2$ with fixed $\mathbf{x}_2$ via a variant of proximal gradient descent. We show that the proximal mapping for $\operatorname{rank}(\mathbf{W}_2 - \mathbf{W}_1)$ can be efficiently approximated in linear time using a fast greedy algorithm, where a few rank-1 update matrices capture key node-to-node interactions.
By unrolling algorithm iterations into layers, we construct a lightweight neural network for data-driven parameter tuning. Experiments demonstrate that our joint optimization outperforms existing graph learning methods in signal interpolation.
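A hedged skeleton of the alternation described above; `cg_solve` and `lowrank_prox` are placeholders for the conjugate-gradient interpolation step and the fast greedy rank-1 approximation of the rank proximal mapping.

```python
import numpy as np

def smoothness_grad(x):
    """d/dW of x^T L(W) x = 0.5 * sum_ij W_ij (x_i - x_j)^2, for L = D - W."""
    d = x[:, None] - x[None, :]
    return 0.5 * d ** 2

def joint_estimate(W1, y, cg_solve, lowrank_prox, step=0.05, iters=20):
    """y: partial observations; cg_solve and lowrank_prox are placeholders."""
    W2 = W1.copy()
    for _ in range(iters):
        x2 = cg_solve(W2, y)                     # interpolate x2 with W2 fixed
        step_mat = W2 - step * smoothness_grad(x2) - W1
        W2 = W1 + lowrank_prox(step_mat)         # proximal step on the delta W2 - W1
    return W2, x2
```

Unrolling a fixed number of these iterations into network layers, with `step` as a learnable parameter, yields the lightweight network mentioned above.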
InfoGain: Furthering the Design of Diffusion Wavelets for Graph-Structured Data David Johnson (Boise State University)*; Michael Perlmutter (Boise State University); Smita Krishnaswamy (Yale University) Diffusion wavelets extract information from graph signals at different scales of resolution by utilizing graph diffusion operators raised to various powers, known as diffusion scales. Traditionally, the diffusion scales are chosen to be dyadic integers, $\mathbf{2^j}$. Here, we propose a novel, unsupervised method for selecting the diffusion scales based on ideas from information theory. We then show that our method can be incorporated into wavelet-based GNNs via graph classification experiments.
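For reference, the dyadic baseline that InfoGain generalizes computes wavelet coefficients $(\mathbf{P}^{2^{j-1}} - \mathbf{P}^{2^j})\mathbf{x}$ for a lazy random-walk operator $\mathbf{P}$; a minimal sketch (assuming no isolated nodes):

```python
import numpy as np

def diffusion_wavelets(W, x, J=4):
    """W: (n, n) adjacency with no isolated nodes; x: (n,) signal."""
    d = W.sum(axis=1)
    P = 0.5 * (np.eye(len(d)) + W / d[:, None])   # lazy random-walk operator
    coeffs, Px = [], x
    for j in range(J):
        Px_next = np.linalg.matrix_power(P, 2 ** j) @ x   # P^{2^j} x
        coeffs.append(Px - Px_next)   # j=0 gives (I - P)x, then (P^{2^{j-1}} - P^{2^j})x
        Px = Px_next
    return coeffs
```

The proposed method replaces the hard-coded exponents $2^j$ with scales selected by an information-theoretic criterion.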
Learning Graph Geometry and Topology Using Dynamical Systems Based Message-Passing Dhananjay Bhaskar (Yale University); Xingzhi Sun (Yale University)*; Yanlei Zhang (Mila - Quebec AI Institute); Charles Xu (Massachusetts Institute of Technology); Oluwadamilola Fasina (Yale University); Guy Wolf (Université de Montréal); Maximilian Nickel (Meta AI Research (FAIR)); Michael Perlmutter (Boise State University); Smita Krishnaswamy (Yale University) We introduce DYMAG, a novel, efficient message-passing paradigm for Graph Neural Networks (GNNs) that leverages continuous multiscale dynamics on graphs. Unlike traditional discrete methods that rely on simple diffusion and aggregation, DYMAG employs complex dynamics—drawing on heat, wave, and chaotic equations—to capture deeper topological properties. By approximating continuous dynamics with Chebyshev polynomials of a graph Laplacian, it computes multiscale representations embodying key topological and spectral features without explicit topological computation. We demonstrate DYMAG's superior performance in recovering random graph parameters and predicting persistent homology on synthetic and citation graphs, and show that it outperforms existing methods on tasks such as predicting protein shape, total polar surface area, aromaticity, and material band gaps, underscoring its value in molecular research.
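A hedged illustration of the underlying idea only; the paper approximates these dynamics with Chebyshev polynomials, whereas this sketch uses dense matrix functions for clarity.

```python
import numpy as np
from scipy.linalg import expm, cosm, sqrtm

def dynamics_features(L, x, times=(0.5, 1.0, 2.0)):
    """L: (n, n) graph Laplacian; x: (n,) initial signal."""
    sqrtL = np.real(sqrtm(L))
    feats = []
    for t in times:
        feats.append(expm(-t * L) @ x)               # heat equation: u' = -L u
        feats.append(np.real(cosm(t * sqrtL)) @ x)   # wave equation: u'' = -L u, u'(0) = 0
    return np.stack(feats, axis=-1)                  # multiscale features per node
```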
HiPoNet: A Topology-Preserving Multi-View Neural Network For High Dimensional Point Cloud and Single-Cell Data Siddharth Viswanath (Yale University)*; Hiren Madhu (Yale University); Dhananjay Bhaskar (Yale University); Jake Kovalic (Yale University); David Johnson (Boise State University); Rex Ying (Yale University); Christopher Tape (University College London); Ian Adelstein (Yale University); Michael Perlmutter (Boise State University); Smita Krishnaswamy (Yale University) We propose HiPoNet, an end-to-end differentiable neural network for regression, classification, and representation learning on high dimensional point clouds. Existing point-cloud methods are tailored to 3D data and hence struggle with the scale and dimensionality of modern single-cell and spatial data. Most current approaches build a single nearest-neighbor graph, discarding important geometric information. In contrast, HiPoNet forms higher-order simplicial complexes through learnable feature reweighting, generating multiple data views that disentangle distinct biological processes. It then employs simplicial wavelet transforms to extract multi-scale features—capturing both local and global topology. We show that HiPoNet preserves topological information in the learned representations, and significantly outperforms state-of-the-art models on single-cell and spatial transcriptomics datasets. Overall, HiPoNet offers a robust and scalable solution for high-dimensional data analysis.
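A hedged sketch of the view-generation step only: learnable feature weights induce one neighborhood structure per view, from which higher-order complexes could then be built (a simplified stand-in for the paper's construction).

```python
import torch
import torch.nn as nn

class ViewReweighting(nn.Module):
    """Learnable feature weights that induce one kNN 'view' per head."""
    def __init__(self, n_features, n_views):
        super().__init__()
        self.logits = nn.Parameter(torch.zeros(n_views, n_features))

    def forward(self, X, k=10):                      # X: (n_points, n_features)
        views = []
        for w in torch.softmax(self.logits, dim=1):  # one feature weighting per view
            D = torch.cdist(X * w, X * w)            # distances in the reweighted space
            idx = D.topk(k + 1, largest=False).indices[:, 1:]  # drop self-neighbor
            views.append(idx)                        # neighborhoods defining each view
        return views
```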
STAGED: A Multi-Agent Neural Network for Learning Cellular Interaction Dynamics João Felipe Rocha (Yale University)*; Xingzhi Sun (Yale University); Ke Xu (Yale University); Jake Kovalic (Yale University); Ananya Krishna (Yale University); Smita Krishnaswamy (Yale University); Mark Gerstein (Yale University); Andrei Chupakhin (Lomonosov Moscow State University) The advent of single-cell technology has improved our understanding of cellular states and subpopulations in various tissues under normal and diseased conditions by using data-driven approaches like clustering and trajectory inference. However, these methods treat cells as independent data points. Spatial transcriptomics allows us to model cells as a dynamic system with interactions influencing cell states and communication. While agent-based modeling (ABM) provides a framework, traditional approaches rely on handcrafted rules rather than data. To address this, we introduce Spatial Temporal Agent-Based Graph Evolution Dynamics (STAGED), integrating ABM with deep learning. Using graph neural networks (GNNs) with shared weights per cell type, our model represents genes as vertices and interactions as edges, learning strengths via attention. Trained on time-lapse spatial data as a graph neural ODE (GDE), it captures intercellular and intracellular interactions adaptively.
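A hedged sketch of the continuous-time backbone only: a message-passing vector field integrated over time. The paper trains a graph neural ODE with learned, attention-weighted interactions; this stand-in uses forward-Euler integration and a plain linear update for brevity.

```python
import torch
import torch.nn as nn

class GraphODE(nn.Module):
    """dh/dt = f_theta(A h): message passing as a vector field on node states."""
    def __init__(self, dim):
        super().__init__()
        self.f = nn.Sequential(nn.Linear(dim, dim), nn.Tanh())

    def forward(self, A, h, t_span=(0.0, 1.0), steps=20):
        dt = (t_span[1] - t_span[0]) / steps
        for _ in range(steps):              # forward-Euler integration of the dynamics
            h = h + dt * self.f(A @ h)      # aggregate neighbor states, then update
        return h
```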
Spatial Contrastive Pre-Training of Transformer Encoders for sEEG-Based Seizure Onset Zone Detection Paulo Goncalves (Inria); Pierre Borgnat (ENS de Lyon)*; Zacharie Rodière (ENS de Lyon) For the clinical study of epilepsy, we develop a transformer encoder for detecting the Seizure Onset Zone (SOZ) from stereo-EEG (sEEG). It integrates clinically grounded time-frequency features with spatial contrastive pre-training. While prior spatial transformer approaches analyze learned representations, our method uniquely combines: (1) engineered time-frequency representations (TFRs) encoding epileptic spikes and oscillations, and (2) a contrastive objective leveraging anatomical relationships between electrode contacts inside and outside the SOZ.

The model processes heterogeneous sEEG recordings from different patients, using both ictal and interictal data. This contrastive strategy minimizes representational similarity between contact pairs on either side of the SOZ boundary while maximizing intra-SOZ similarity.

Attention heads provide interpretable connectivity patterns, bridging data-driven learning with the study of functional connectivity networks.
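A hedged sketch of a contrastive objective matching the description above: pull embeddings of intra-SOZ contact pairs together and push pairs across the SOZ boundary apart (a simplified stand-in for the authors' loss).

```python
import torch
import torch.nn.functional as F

def soz_contrastive_loss(z, is_soz, margin=1.0):
    """z: (n_contacts, d) contact embeddings; is_soz: (n_contacts,) bool mask."""
    z = F.normalize(z, dim=1)
    sim = z @ z.t()                                       # cosine similarities
    eye = torch.eye(len(z), dtype=torch.bool, device=z.device)
    pos_mask = is_soz[:, None] & is_soz[None, :] & ~eye   # intra-SOZ pairs
    neg_mask = is_soz[:, None] ^ is_soz[None, :]          # pairs across the SOZ boundary
    pos, neg = sim[pos_mask].mean(), sim[neg_mask].mean()
    return F.relu(margin - pos + neg)   # maximize intra-SOZ, minimize cross-boundary
```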

Accepted Papers: TMLR Track

Click to expand/collapse
Paper Title Authors
Graph-level Representation Learning with Joint-Embedding Predictive Architectures Geri Skenderi (Bocconi University), Hang Li (Michigan State University), Jiliang Tang (Michigan State University), Marco Cristani (University of Verona)
Towards Graph Foundation Models: A Study on the Generalization of Positional and Structural Encodings Billy Joe Franks (University of Kaiserslautern-Landau), Moshe Eliasof (University of Cambridge), Semih Cantürk (Université de Montréal & Mila), Guy Wolf (Université de Montréal & Mila), Carola-Bibiane Schönlieb (University of Cambridge), Sophie Fellenz (University of Kaiserslautern-Landau), Marius Kloft (University of Kaiserslautern-Landau)
Sparse Decomposition of Graph Neural Networks Yaochen Hu (Huawei), Mai Zeng (McGill University & Mila), Ge Zhang (Huawei), Pavel Rumiantsev (McGill University & Mila), Liheng Ma (McGill University & Mila), Yingxue Zhang (Huawei), Mark Coates (McGill University & Mila)
Graph Knowledge Distillation to Mixture of Experts Pavel Rumiantsev (McGill University & Mila), Mark Coates (McGill University & Mila)