Graduate Researcher @ Stanford AI Lab
LLM Post-Training • Agentic AI • Reinforcement Learning • Wireless ML
I am a researcher at Stanford AI Lab (SAIL), co-advised by Dr. Emily Fox and Dr. John M. Cioffi, focusing on:
- LLM Post-Training & Inference: preference optimization, alignment, and reasoning calibration
- Internet of Evolving Agents: emergent, self-organizing multi-agent ecosystems
- Reinforcement Learning & Wireless ML: dynamic decision-making and optimization
- Adaptive Test-Time Compute for Reasoning Models
- Continuous-Utility Direct Preference Optimization (CU-DPO)
- Active Bayesian Preference Models
- Adaptive inference strategies for reasoning accuracy
- RL for high-diversity generation
- Self-evolving multi-agent systems
- Bayesian reputation and dynamic team formation
- Social graph-based coordination
- Emergent specialization (NeurIPS 2026, In Progress)
- GNN-accelerated SDP solvers
- Neural Gaussian Radio Fields
- RL for non-stationary decision systems
- Channel estimation & optimization
| Year | Title | Venue |
|---|---|---|
| 2026 (In Progress) | Internet of Evolving Agents | NeurIPS 2026 (In Progress) |
| 2026 (Submitted) | Detecting & Removing Sycophancy in LLMs | CoLM 2026 |
| 2026 (Submitted) | Continuous-Utility Direct Preference Optimization | ICML 2026 |
| 2026 (Submitted) | Adaptive Test-Time Compute Strategies | ICML 2026 |
| 2026 (Submitted) | Active Alignment with Bayesian General Preference Models | CoLM 2026 |
| 2026 | GNN for Accelerating Low-Rank SDP Solvers | KDD 2026 |
| 2026 | Neural Gaussian Radio Fields for Channel Estimation | TMLR 2026 |
| 2025 | Structured Prompting for Robust Evaluation | (co-authored paper) |
Full list available on my website & CV.
- Stanford Graduate Fellowship
- Knight-Hennessy Fellowship Finalist
- IEEE Best Workshop Paper Award
- IEEE FIT Best Main Conference Paper Award
- Area Chair: NeurIPS 2025, ICASSP 2026
- Reviewer: ICML, NeurIPS, ICLR, KDD, AAAI
Sep 2025: Technical Program Committee Member & Reviewer @ NeurIPS 2025
Aug 2025: Recognized as Exemplary Reviewer (IEEE Wireless Communications Magazine)
Jul 2025: Founding Member (IEEE SIG on AI-Driven TN-NTN Networks)
Jul 2025: ICML 2025 paper accepted + Student Travel Grant award
May 2025: ICC Student Travel Grant + Best Workshop Paper Award
Jan 2025: 2 papers accepted @ AAAI 2025
Dec 2024: 2 papers accepted @ IEEE ICASSP 2025
Apr 2024: Rector's Gold Medal (Best Undergraduate Thesis)
Jan 2024: PhD Admission (Stanford Graduate Fellowship)
- Reinforcement Learning
- LLM Alignment & Preference Optimization
- Bayesian Inference
- Graph Neural Networks
- Dynamic Agent Planning
- CUDA & GPU Acceleration
- Linux, Git, VS Code
- Large-Scale Training Pipelines
I am always open to research collaborations and impactful projects in:
- LLM reasoning and alignment
- Multi-agent systems & meta-learning
- Reinforcement learning for dynamic systems
- Wireless + machine learning innovation
Email: muahmed@stanford.edu
Portfolio: https://ahmd-mohsin.github.io/
"Building adaptive intelligence for the next generation of AI systems."



