Iβm an AI / ML Engineer with 2.5+ years of hands-on experience building production-grade AI systems across Machine Learning, NLP, Generative AI, and Agentic AI.
I specialize in taking AI solutions from idea β prototype β scalable production.
Currently, I work as a Specialist, Software Engineering (Data & AI) at Accelerate People (Blenheim Chalcot), where I design and deploy LLM-powered, agentic, and multimodal AI systems used at scale.
- Agentic AI & RAG Systems (LangChain, LangGraph, AutoGen)
- LLM-based Applications (OpenAI, Azure-hosted models)
- Production ML Systems (FastAPI, Docker, CI/CD)
- Multimodal AI (OCR, Vision Transformers, NLP)
- MLOps (model monitoring, drift detection, retraining pipelines)
- Cloud AI Platforms (Azure, AWS)
π Impact Highlight:
Reduced assessment turnaround time from 72 hours to under 3 hours by architecting an AI-backed assessment platform used by the UKβs largest apprenticeship body.
Specialist, Software Engineering (Data & AI)
π Accelerate People (Blenheim Chalcot) | Mumbai, India
- Architected LLM-powered and Agentic RAG systems enabling natural-language analytics
- Built scalable inference microservices using FastAPI, Docker, and cloud platforms
- Fine-tuned LLMs using prompt tuning & LoRA, improving accuracy by 40%
- Designed multimodal AI pipelines improving contextual extraction accuracy by 65%
- Implemented MLOps workflows (monitoring, drift detection, CI/CD)
- Delivered AI solutions processing thousands of assessments monthly
- Python, SQL, R
- FastAPI, Flask
- Git, Docker
- PyTorch, TensorFlow, Scikit-learn
- NLP, Statistical Learning
- Supervised, Unsupervised & Reinforcement Learning
- LLMs (OpenAI, Azure-hosted)
- RAG & Agentic RAG
- LangChain, LangGraph, LlamaIndex, AutoGen
- Prompt & Context Engineering
- Embeddings & Vector Search
- Azure (ML, Data Factory, AI Search)
- AWS (S3, Lambda)
- CI/CD pipelines, Model monitoring
- PostgreSQL, MongoDB
- Power BI, Tableau
- Data validation & pipeline optimization
Some of my repositories focus on:
- πΉ LLM-based chat systems & RAG pipelines
- πΉ Agentic AI workflows
- πΉ ML / NLP experimentation
- πΉ Production-ready API services
- πΉ Local open-source LLM setups (Ollama, OpenWebUI)
π Explore here: https://github.com/Bprs68?tab=repositories
- Advanced Agentic AI patterns
- Evaluation frameworks for LLMs
- Cost optimization & latency tuning for GenAI systems
- Knowledge Graphs + RAG
I regularly write about AI engineering, Generative AI, Agentic systems, and production MLβsharing practical insights from real-world deployments.
- π Medium: https://medium.com/@bhaskards6869
Topics I write about:
- LLMs, RAG & Agentic AI
- Production ML & MLOps
- Debugging & system design for AI
- Open-source AI tooling (Ollama, OpenWebUI, MCP)
- AI / ML Engineer roles
- Generative AI & Agentic AI projects
- Production ML & MLOps challenges
- Open-source collaboration
- π LinkedIn: https://www.linkedin.com/in/bhaskar-kumar-/
- βοΈ Email: bhaskards6869@gmail.com

