Skip to content
View ahmd-mohsin's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report ahmd-mohsin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ahmd-mohsin/README.md

πŸ‘‹ Hi, I'm Muhammad Ahmed Mohsin

Graduate Researcher @ Stanford AI Lab
LLM Post-Training β€’ Agentic AI β€’ Reinforcement Learning β€’ Wireless ML


πŸ“ About Me

I am a researcher at Stanford AI Lab (SAIL)** co-advised by Dr. Emily Fox and Dr. John M. Cioffi, focusing on:

  • 🧠 LLM Post-Training & Inference β€” preference optimization, alignment, and reasoning calibration
  • πŸ€– Internet of Evolving Agents β€” emergent, self-organizing multi-agent ecosystems
  • πŸ“‘ Reinforcement Learning & Wireless ML β€” dynamic decision-making and optimization
  • βš™οΈ Adaptive Test-Time Compute for Reasoning Models

πŸ”¬ Research Focus

🧠 LLM Post-Training & Alignment

  • Continuous-Utility Direct Preference Optimization (CU-DPO)
  • Active Bayesian Preference Models
  • Adaptive inference strategies for reasoning accuracy
  • RL for high-diversity generation

πŸ€– Internet of Evolving Agents

  • Self-evolving multi-agent systems
  • Bayesian reputation and dynamic team formation
  • Social graph-based coordination
  • Emergent specialization (NeurIPS 2026, In Progress)

πŸ“‘ Applied Reinforcement Learning & Wireless ML

  • GNN-accelerated SDP solvers
  • Neural Gaussian Radio Fields
  • RL for non-stationary decision systems
  • Channel estimation & optimization

πŸ“„ Selected Publications πŸš€

Year Title Venue
2026 (In Progress) Internet of Evolving Agents NeurIPS 2026 (In Progress)
2026 (Submitted) Detecting & Removing Sycophancy in LLMs CoLM 2026
2026 (Submitted) Continuous-Utility Direct Preference Optimization ICML 2026
2026 (Submitted) Adaptive Test-Time Compute Strategies ICML 2026
2026 (Submitted) Active Alignment with Bayesian General Preference Models CoLM 2026
2026 GNN for Accelerating Low-Rank SDP Solvers KDD 2026
2026 Neural Gaussian Radio Fields for Channel Estimation TMLR 2026
2025 Structured Prompting for Robust Evaluation (co-authored paper)

πŸ”Ž Full list available on my website & CV.


πŸ† Honors & Recognition ✨

  • Stanford Graduate Fellowship
  • Knight-Hennessy Fellowship Finalist
  • IEEE Best Workshop Paper Award
  • IEEE FIT Best Main Conference Paper Award
  • Area Chair: NeurIPS 2025, ICASSP 2026
  • Reviewer: ICML, NeurIPS, ICLR, KDD, AAAI

πŸ“° News Highlights πŸ—žοΈ

πŸ—“ Sep 2025 β€” Technical Program Committee Member & Reviewer @ NeurIPS 2025
πŸ—“ Aug 2025 β€” Recognized as Exemplary Reviewer (IEEE Wireless Communications Magazine)
πŸ—“ Jul 2025 β€” Founding Member (IEEE SIG on AI-Driven TN-NTN Networks)
πŸ—“ Jul 2025 β€” ICML 2025 paper accepted + Student Travel Grant award
πŸ—“ May 2025 β€” ICC Student Travel Grant + Best Workshop Paper Award
πŸ—“ Jan 2025 β€” 2 papers accepted @ AAAI 2025
πŸ—“ Dec 2024 β€” 2 papers accepted @ IEEE ICASSP 2025
πŸ—“ Apr 2024 β€” Rector’s Gold Medal (Best Undergraduate Thesis)
πŸ—“ Jan 2024 β€” PhD Admission (Stanford Graduate Fellowship)


πŸ’» Technical Skills

🧠 Core Languages & Frameworks

πŸ€– ML/AI

  • Reinforcement Learning
  • LLM Alignment & Preference Optimization
  • Bayesian Inference
  • Graph Neural Networks
  • Dynamic Agent Planning

πŸ›  Tools & Systems

  • CUDA & GPU Acceleration
  • Linux, Git, VS Code
  • Large-Scale Training Pipelines

πŸ“Š GitHub Stats


🀝 Let’s Collaborate!

I am always open to research collaborations and impactful projects in:

  • LLM reasoning and alignment
  • Multi-agent systems & meta-learning
  • Reinforcement learning for dynamic systems
  • Wireless + machine learning innovation

πŸ“« Email: muahmed@stanford.edu
πŸ”— Portfolio: https://ahmd-mohsin.github.io/


β€œBuilding adaptive intelligence for the next generation of AI systems.” πŸš€

Pinned Loading

  1. Hierarchical-Deep-Reinforcement-Learning-for-Adaptive-Resource-Management-in-Integrated-Terrestrial- Hierarchical-Deep-Reinforcement-Learning-for-Adaptive-Resource-Management-in-Integrated-Terrestrial- Public

    Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial Networks

    Python 11 4

  2. DRL-Active-RIS- DRL-Active-RIS- Public

    Deep Reinforcement Learning Optimization for Active RIS

    Python 10 3

  3. DTIC-Digital-Twin-Induction-Motor DTIC-Digital-Twin-Induction-Motor Public

    Digital Twin of an Induction Motor: Fault Analysis and Predictive Maintenance

    MATLAB 16 1

  4. ahmd-mohsin.github.io ahmd-mohsin.github.io Public

    About me

    TypeScript 7

  5. NUST-Semester-6 NUST-Semester-6 Public template

    My Files for Semester 6

    HTML 8