Colloquium - Pan Li, "Challenges and Opportunities in Graph Machine Learning: A Study on Graph Data Distribution Shifts"
From cs-speakerseries
views
comments
From cs-speakerseries
Abstract:
Graphs have been widely applied to model intricate relationships among entities. The application of Graph Machine Learning (GML) to enhance prediction capabilities for graph-structured data is prevalent in several scientific disciplines, such as particle physics, material science, and biology. However, applications in these domains often present changes in data distributions due to the label-collection process they employ. Specifically, the data used for model training often comes from thoroughly investigated regimes, whose distributions often do not align well with the under-explored regime of scientific interest. Furthermore, the interconnected nature of entities in a graph presents an additional level of complexity, making current distributionally robust methods suboptimal when being applied to graph data.
This presentation will focus on our recent studies on GML under distribution shifts. Our studies are motivated by the observation of the data distribution shift between simulation and real experiments in particle physics. We propose a method named graph structure alignment. The key idea of our approach is to estimate and quantify shifts in entity connection patterns from the training phase to real-world evaluation. Consequently, the influence of neighboring entities on a central node can be appropriately calibrated based on prior estimations, serving to mitigate the distribution shift in graph data.
Bio:
Pan Li is an assistant professor at Georgia Tech. ECE department and holds an adjunct position at Purdue CS department. Pan's research interest lies broadly in the area of machine learning and optimization with graph data. His recent work includes algorithms and analysis for graph neural networks, hypergraph spectral methods, and their applications in physics and circuit design. Pan Li's work has been recognized by several awards including NSF Early Career Award, several industry research awards, and the Best Paper award at the Learning on Graph conference 2022.