Projects •

Projects

My research spans nuclear and particle physics, machine learning applications, and the intersection of physics with artificial intelligence. This work demonstrates how fundamental physics insights can drive innovations in AI, while advanced computational methods open new frontiers in experimental physics.


Generative AI: Research and Applied Projects

The Physics of Transformers

2024 - Present
Independent Research

This project treats transformer weights as a dynamical system and applies methods from statistical physics to provide insight into their behavior. The work develops a statistical analysis pipeline that ingests trained models and extracts a set of interpretable metrics to characterize their behavior and track its evolution during training. The framework supports both an empirical characterization of the weights and a physics-motivated interpretation grounded in the correspondence between transformer self-attention and the statistical mechanics of spin systems. The analysis spans multiple architectures (e.g. GPT-2, LLaMA, Mistral) at scales from 70M to 12B parameters, with a temporal study across training checkpoints, and produces open-source code, HuggingFace datasets, and interactive visualization dashboards.

Resources:

📝 Read the series  

Code Dashboard Data

Spectral Structure in Neural Network Solutions of the KdV Equation

2026 - Present Independent Research

This project trains physics-informed neural networks to solve the Korteweg–de Vries equation, a prototypical integrable nonlinear wave equation whose soliton solutions arise from a precise balance between nonlinear steepening and dispersion. Rather than imposing conventional boundary conditions on the field, the PINN is driven by scattering data from the inverse scattering transform — eigenvalues and norming constants of an associated Schrödinger operator — making the boundary value problem an inverse spectral problem. The KdV equation’s integrability provides a powerful validation framework: an infinite hierarchy of conservation laws, each computable via autograd, serves as unsupervised diagnostics that are independent of the training loss. The learned solutions preserve these conservation laws locally throughout the domain and, more strikingly, retain the full spectral structure of the Lax pair — isospectrality and eigenfunction dynamics — without any explicit spectral inductive bias in the architecture or loss function. The approach scales to multi-soliton configurations, with an interactive explorer for visualizing soliton interactions, eigenvalue recovery, and eigenfunction evolution in real time.

Resources:

📝 Read the post   Code Dashboard

Generative Modeling of Grateful Dead Setlists

2025 - Present
Independent Research

This project builds generative models of concert setlists using supervised fine-tuning with GPT-2, treating the Grateful Dead’s 30-year performance history as a natural language corpus. Each setlist becomes a sequence and each song a token, yielding a vocabulary of approximately 417 unique songs. Much like natural language, setlists exhibit patterns like opener conventions, set-closing sequences, and thematic pairings that experienced listeners recognize intuitively but that emerge statistically from the data. The data pipeline initially processed Archive.org’s 17,000+ concert recordings, providing an opportunity to develop fuzzy matching and vocabulary canonicalization techniques for messy real-world data. Production training uses cleaned data from setlist.fm for reliability. The availability of original recordings on Archive.org offers a compelling path to extend this work into audio modality, connecting setlist structure to the musical content itself.

Resources:

Skills: SFT NLP Gen AI LLM

Generative Modeling of Baseball Game States

2025 - Present
Independent Research

This project applies transformer language models to sequential game state prediction, treating baseball games as sequences of discrete state transitions. The system processes 3.3 million pitch sequences from MLB’s Statcast (2015–present) and Retrosheet’s historical archives (1871–present), converting games into tokenized sequences for language model training. The research addresses a question in mechanistic interpretability: how do traditional baseball statistics emerge as learned properties from underlying game dynamics? The evaluation framework assesses not just prediction accuracy but whether models learn actual baseball rules—detecting illegal transitions that violate game constraints. State representations range from simple 24-state models encoding outs and baserunners to complex 57,000-state encodings that capture detailed game context.

Resources:

Skills: Pretraining NLP Gen AI LLM


Machine Learning & AI Applications in Physics

AI-Informed Detector Design

Jan 2021 - Jan 2024
Lawrence Livermore National Laboratory

Using deep learning as a new tool to guide the design of detectors for collider physics experiments. This work represents a paradigm shift from traditional engineering approaches to ML-optimized detector configurations.

Key Achievements:

Skills: Machine LearningDeep LearningGenerative AI ToolsExperimental PhysicsData Analysis

Publications:

Improving Particle Reconstruction with Deep Learning

Jan 2019 - Jan 2022
Lawrence Livermore National Laboratory · ATLAS Experiment at CERN

We utilized deep learning methods to improve particle identification and reconstruction using the ATLAS calorimeter. We studied convolutional, graph and transformer architectures and compared their results, achieving major improvements in energy calibration by using full spatial information from electromagnetic and hadronic showers.

Key Achievements:

Skills: Experimental PhysicsData AnalysisMachine Learning

Code Repositories:

Publications:


Heavy-Ion Physics & QCD

Jet Energy Loss and Substructure

Jan 2019 - Jan 2022
Lawrence Livermore National Laboratory · ATLAS Experiment at CERN

Can we experimentally observe whether wide jets lose more energy than narrow ones?

This project explored fundamental questions about how jets interact with the quark-gluon plasma, providing new insights into the substructure dependence of energy loss mechanisms.

Skills: Experimental PhysicsData AnalysisResearch ProjectsAnalytical SkillsUncertainty Quantification

Publications:

Recognition:

New Approaches to Ultra-Peripheral Collisions

Lawrence Livermore National Laboratory · ATLAS Experiment at CERN

Advanced studies of ultra-peripheral heavy-ion collisions, exploring photonuclear processes and novel QCD phenomena in extreme electromagnetic field environments.

Skills: Experimental PhysicsData AnalysisAnalytical SkillsUncertainty QuantificationMonte Carlo SimulationHigh Performance Computing

Publications:

Observation of Jet Quenching at the LHC

Columbia University · ATLAS Experiment at CERN

Landmark discovery that ushered in the LHC era of heavy-ion physics

In the first Pb+Pb collisions at the LHC, the ATLAS experiment observed highly imbalanced dijet pairs, providing the first direct evidence of jet quenching in the quark-gluon plasma at unprecedented energies.

Skills: Independent ResearchData Analysis

Publications:

Precision Measurements of Jet Quenching

Columbia University · ATLAS Experiment at CERN

Systematic studies of jet suppression phenomena, establishing quantitative frameworks for understanding energy loss in the quark-gluon plasma.

Skills: Experimental PhysicsData AnalysisUncertainty QuantificationMonte Carlo SimulationAnalytical Skills

Publications: