Model-Based Co-Training for Multi-Agent RL
Opponent-aware, model-based co-training for multi-agent reinforcement learning — a learned model of how the other agents behave, planned against as they adapt. Co-authored manuscript in preparation.
I work on multi-agent reinforcement learning and physics-informed machine learning — systems that respect structure: multi-agent dynamics, physical law, and calibrated uncertainty. Recent work spans opponent-aware model-based co-training, probabilistic hyperdimensional computing, and safety for autonomous UAV swarms.
Opponent-aware, model-based co-training for multi-agent reinforcement learning — a learned model of how the other agents behave, planned against as they adapt. Co-authored manuscript in preparation.
A probabilistic library that makes every hypervector a posterior distribution — an algebra of uncertainty for hyperdimensional computing, with closed-form moment propagation, calibrated predictions, and coverage-guaranteed prediction sets.
A survey of safety and verification methods for AI-driven UAV swarms — reliability, anomaly detection, formation control, and test strategies. NSF-funded; accepted at the IEEE S&P SecureTrans Workshop.
Real-time telemetry for USC's Formula SAE racecar — live sensor data off the car's MoTeC data-acquisition hardware, over its telemetry radio, to a live dashboard; plus role-scoped config and an LDX watcher that re-injects values MoTeC strips from its .ldx files. Hardware to software: DAQ, telemetry, Next.js/FastAPI/WebSockets.
A BCI that decodes error-related potentials (ErrP) from EEG and turns them into a reinforcement signal — so an agent can learn to map neural activity to actions. EEG detection at ~76% within-subject; built with NeuroTech@USC, runner-up at the 2026 intercollegiate BCI Competition at UC Berkeley.
Incoming summer research with Prof. Saikat Dutta (Cornell).
Robotic Embedded Systems Lab — multi-agent reinforcement learning: opponent-aware, model-based co-training (a latent model of other agents' strategies, planned against as they adapt). Also robustness/safety testing for autonomous-driving perception (anomaly detection on Waymo/KITTI/nuScenes, LiDAR–camera fusion).
FastAPI backend and Next.js frontend serving 100+ restaurant chains; built the automated testing and CI/CD pipeline.
AI Explainability & Accountability Lab — interpretability and accountability methods for machine-learning systems.
Algorithmic safety for autonomous drone swarms with Prof. Yongxin Liu — the work behind the UAV-swarm safety survey above.
Led a 6-person team building a cable impedance tester; optimized quantum-cascade-laser controller SDKs.
Why PDE constraints work as inductive bias, and where the framing strains.
On rare events, fragility, and what models miss.
Systems thinking through the lens of what the model leaves out.