Projects •
Projects
My research spans nuclear and particle physics, machine learning applications, and the intersection of physics with artificial intelligence. This work demonstrates how fundamental physics insights can drive innovations in AI, while advanced computational methods open new frontiers in experimental physics.
Generative AI: Research and Applied Projects
The Physics of Transformers
2024 - Present
Independent Research
This project treats transformer weights as a dynamical system and applies methods from statistical physics to provide insight into their behavior. The work develops a statistical analysis pipeline that ingests trained models and extracts a set of interpretable metrics to characterize their behavior and track its evolution during training. The framework supports both an empirical characterization of the weights and a physics-motivated interpretation grounded in the correspondence between transformer self-attention and the statistical mechanics of spin systems. The analysis spans multiple architectures (e.g. GPT-2, LLaMA, Mistral) at scales from 70M to 12B parameters, with a temporal study across training checkpoints, and produces open-source code, HuggingFace datasets, and interactive visualization dashboards.
Resources:
Spectral Structure in Neural Network Solutions of the KdV Equation
2026 - Present Independent Research
This project trains physics-informed neural networks to solve the Korteweg–de Vries equation, a prototypical integrable nonlinear wave equation whose soliton solutions arise from a precise balance between nonlinear steepening and dispersion. Rather than imposing conventional boundary conditions on the field, the PINN is driven by scattering data from the inverse scattering transform — eigenvalues and norming constants of an associated Schrödinger operator — making the boundary value problem an inverse spectral problem. The KdV equation’s integrability provides a powerful validation framework: an infinite hierarchy of conservation laws, each computable via autograd, serves as unsupervised diagnostics that are independent of the training loss. The learned solutions preserve these conservation laws locally throughout the domain and, more strikingly, retain the full spectral structure of the Lax pair — isospectrality and eigenfunction dynamics — without any explicit spectral inductive bias in the architecture or loss function. The approach scales to multi-soliton configurations, with an interactive explorer for visualizing soliton interactions, eigenvalue recovery, and eigenfunction evolution in real time.
Resources:
Generative Modeling of Grateful Dead Setlists
2025 - Present
Independent Research
This project builds generative models of concert setlists using supervised fine-tuning with GPT-2, treating the Grateful Dead’s 30-year performance history as a natural language corpus. Each setlist becomes a sequence and each song a token, yielding a vocabulary of approximately 417 unique songs. Much like natural language, setlists exhibit patterns like opener conventions, set-closing sequences, and thematic pairings that experienced listeners recognize intuitively but that emerge statistically from the data. The data pipeline initially processed Archive.org’s 17,000+ concert recordings, providing an opportunity to develop fuzzy matching and vocabulary canonicalization techniques for messy real-world data. Production training uses cleaned data from setlist.fm for reliability. The availability of original recordings on Archive.org offers a compelling path to extend this work into audio modality, connecting setlist structure to the musical content itself.
Resources:
- 💻 Code repository <!– - 📝 Read the post (coming soon)
- 🎮 Demo –>
Skills: SFT NLP Gen AI LLM
Generative Modeling of Baseball Game States
2025 - Present
Independent Research
This project applies transformer language models to sequential game state prediction, treating baseball games as sequences of discrete state transitions. The system processes 3.3 million pitch sequences from MLB’s Statcast (2015–present) and Retrosheet’s historical archives (1871–present), converting games into tokenized sequences for language model training. The research addresses a question in mechanistic interpretability: how do traditional baseball statistics emerge as learned properties from underlying game dynamics? The evaluation framework assesses not just prediction accuracy but whether models learn actual baseball rules—detecting illegal transitions that violate game constraints. State representations range from simple 24-state models encoding outs and baserunners to complex 57,000-state encodings that capture detailed game context.
Resources:
Skills: Pretraining NLP Gen AI LLM
Machine Learning & AI Applications in Physics
AI-Informed Detector Design
Jan 2021 - Jan 2024
Lawrence Livermore National Laboratory
Using deep learning as a new tool to guide the design of detectors for collider physics experiments. This work represents a paradigm shift from traditional engineering approaches to ML-optimized detector configurations.
Key Achievements:
- Improved energy resolution by 40% over baseline designs
- Established optimization framework for future detector systems
- Demonstrated 50% improvement in particle identification accuracy
Skills: Machine LearningDeep LearningGenerative AI ToolsExperimental PhysicsData Analysis
Publications:
- Design of a SiPM-on-Tile ZDC for the future EIC and its Performance with Graph Neural Networks
- The Optimal use of Segmentation for Sampling Calorimeters
- Comparison of Point Cloud and Image-based Models for Calorimeter Fast Simulation
Improving Particle Reconstruction with Deep Learning
Jan 2019 - Jan 2022
Lawrence Livermore National Laboratory · ATLAS Experiment at CERN
We utilized deep learning methods to improve particle identification and reconstruction using the ATLAS calorimeter. We studied convolutional, graph and transformer architectures and compared their results, achieving major improvements in energy calibration by using full spatial information from electromagnetic and hadronic showers.
Key Achievements:
- 50% improvement in energy resolution
- 10x reduction in false positive rate while maintaining >95% recall
- Demonstrated superiority of graph neural networks for particle reconstruction
Skills: Experimental PhysicsData AnalysisMachine Learning
Code Repositories:
- GitHub - atlas-calo-ml/MLTree
- GitHub - atlas-calo-ml/gn4pions_eastbay: Using graph_nets for pion classification and energy
Publications:
Heavy-Ion Physics & QCD
Jet Energy Loss and Substructure
Jan 2019 - Jan 2022
Lawrence Livermore National Laboratory · ATLAS Experiment at CERN
Can we experimentally observe whether wide jets lose more energy than narrow ones?
This project explored fundamental questions about how jets interact with the quark-gluon plasma, providing new insights into the substructure dependence of energy loss mechanisms.
Skills: Experimental PhysicsData AnalysisResearch ProjectsAnalytical SkillsUncertainty Quantification
Publications:
Recognition:
New Approaches to Ultra-Peripheral Collisions
Lawrence Livermore National Laboratory · ATLAS Experiment at CERN
Advanced studies of ultra-peripheral heavy-ion collisions, exploring photonuclear processes and novel QCD phenomena in extreme electromagnetic field environments.
Skills: Experimental PhysicsData AnalysisAnalytical SkillsUncertainty QuantificationMonte Carlo SimulationHigh Performance Computing
Publications:
- Measurement of photonuclear jet production in ultra-peripheral Pb+Pb collisions
- Observation of centrality-dependent acoplanarity for muon pairs produced via two-photon scattering
Observation of Jet Quenching at the LHC
Columbia University · ATLAS Experiment at CERN
Landmark discovery that ushered in the LHC era of heavy-ion physics
In the first Pb+Pb collisions at the LHC, the ATLAS experiment observed highly imbalanced dijet pairs, providing the first direct evidence of jet quenching in the quark-gluon plasma at unprecedented energies.
Skills: Independent ResearchData Analysis
Publications:
- Observation of a Centrality-Dependent Dijet Asymmetry in Lead-Lead Collisions (First jet measurement in heavy-ion collisions - field-defining result)
Precision Measurements of Jet Quenching
Columbia University · ATLAS Experiment at CERN
Systematic studies of jet suppression phenomena, establishing quantitative frameworks for understanding energy loss in the quark-gluon plasma.
Skills: Experimental PhysicsData AnalysisUncertainty QuantificationMonte Carlo SimulationAnalytical Skills
Publications:
- Measurement of jet pT correlations in Pb+Pb and pp collisions
- Measurements of the Nuclear Modification Factor for Jets in Pb+Pb Collisions
- Jet size dependence of single jet suppression in lead-lead collisions