CAREER: Sparse Linear Algebra as a Scalable Computational Paradigm

NSF Award Search · 01002930DB NSF RESEARCH & RELATED ACTIVIT · $735,079 · view on nsf.gov ↗

Abstract

Scientific discovery increasingly relies on the ability to analyze datasets of unprecedented scale and complexity. In genomics, studies that once examined a few thousand individuals now aim to analyze millions. Computing systems capable of handling this scale exist but using them effectively requires expertise in parallel programming and modern accelerator architectures, which most scientists lack. As a result, important scientific questions remain unanswered because computations would take weeks or months to complete. This award addresses that gap by establishing sparse linear algebra as a unifying abstraction for scientists to express their computations in a standard form, enabling portable execution on evolving computing hardware. By lowering the barrier to advanced computing systems, the project will accelerate discovery in data-intensive sciences while strengthening the United States workforce in high-performance computing. A central component is an undergraduate training program that recruits students from across the United States to participate in international supercomputing competitions, creating a pipeline of talent prepared to use the computing facilities in which the nation has invested. This project develops a systematic methodology for mapping irregular scientific computations onto sparse linear algebra primitives, enabling portable execution across heterogeneous systems, including CPUs, GPUs, and emerging architectures. The project advances three research thrusts. The first develops methods to automatically translate domain code into sparse linear algebra, using a semantic-signature layer that abstracts over syntactic variations of common computational motifs and equality saturation to rewrite code into canonical sparse-primitive form. The second identifies sparsity structure and algebraic provenance from code and uses these properties to specialize sparse primitives. The third extends these application-aware primitives to distributed memory throug

Key facts

NSF award ID: 2542947
Awardee: Cornell University (NY)
SAM.gov UEI: G56PUALJ3KT5
PI: Giulia Guidi
Primary program: 01002930DB NSF RESEARCH & RELATED ACTIVIT
All programs: Artificial Intelligence (AI), CAREER-Faculty Erly Career Dev
Estimated total: $735,079
Funds obligated: $425,666
Transaction type: Continuing Grant
Period: 07/01/2026 → 06/30/2031