AI engineer, researcher, and builder — translating cutting-edge ML research into systems that work in production.
I'm Muhammad Nouman Khan — an AI Engineer and MLOps Specialist with a deep passion for deploying machine learning at scale. My work spans the full AI stack: from fine-tuning foundation models and designing multi-agent architectures, to provisioning NVIDIA DGX clusters, managing Kubernetes environments, and optimizing inference pipelines for real-world production loads.
I hold a Bachelor of Science in Artificial Intelligence from Pak-Austria Fachhochschule Institute of Applied Sciences and Technology (2021–2025). I'm also an IEEE-published researcher, with work on graph-based Retrieval-Augmented Generation carried out in collaboration with the University of Hull, UK.
Whether it's managing Kubernetes clusters with Terraform, kubeadm, and BCM, fine-tuning Stable Diffusion models on custom datasets, or architecting RAG systems for US-based e-learning platforms, I bridge the gap between AI research and production reality.
I believe great AI engineering isn't just about writing code — it's about building systems that are reliable, scalable, and observable long after the model is deployed.
I created interactive simulations to raise awareness of environmental sustainability. They show visually how small everyday actions, multiplied across millions of people, add up to significant global consequences, and they encourage concrete steps toward a healthier planet.