GenAI Engineer | Multi-Agent Systems & RAG Architect | AI Infrastructure & Observability Expert

Location: United States
Rate, USD: $52 / hour
Work schedule: Full Time,
Links
Language skills: English
Available for Hire: Yes

About me

I am a GenAI Engineer with over 3 years of experience transforming advanced AI concepts into production-ready solutions that reduce costs, speed delivery, and boost accuracy. My expertise lies in building safe and observable AI workflows that deliver measurable ROI and earn trust from executives and stakeholders through strong end-user adoption. I excel at collaborating across functions, translating deep technical skills into impactful business outcomes that resonate with clients and leadership.

Throughout my career, I have optimized multi-agent workflows and AI infrastructure to significantly improve latency, throughput, and operational costs at scale. I have developed robust MCP servers with audit trails and orchestration tools that reduce integration lead times and enforce safety guardrails, ensuring zero PII incidents in high-volume environments.

I have launched Agentic RAG systems that enhance accuracy and groundedness while deflecting support tickets, saving operational expenses. My work includes creating LLM-driven pipelines and analytics copilots that reduce forecast errors, automate workflows, and improve data integrity with AI observability tools.

I am skilled in deploying scalable machine learning models using TensorFlow, Docker, and CI/CD pipelines, improving inference latency and experiment cycle times. I have experience building microservices for batch inference and benchmarking, which streamline analyst reviews and improve model deployment.

My projects include developing multi-agent travel planners, autonomous delivery agents, and commerce copilots that leverage hybrid retrieval, policy-aware RAG, and audit trails to deliver high accuracy, cost savings, and user-friendly automation. I am passionate about advancing AI infrastructure and observability to create safe, efficient, and impactful AI systems.

Professional area

Developer & Engineer

Skills

AWS CI/CD Docker Flask GCP Kubernetes Machine Learning SQL TensorFlow

Education

2022-2024 masters in computer science @ pace university

Experience

July 2025 - Present GenAI Engineer @ KGS Technology Group, Inc.: Optimized vLLM+KV-cache behind LangServe and redesigned multi-agent workflows, cutting P95 latency 45%, doubling throughput, and reducing task cost 33% at production scale. Developed MCP servers with audit trails and n8n orchestration, reducing integration lead time from 3 weeks to 3 days with zero PII incidents across 60k+ tool calls; enforced safety guardrails. Launched Agentic RAG, improving groundedness +12 pts, accuracy +9 pts, and deflecting 22% of tickets, saving $3.8k/month in OpEx.
August 2024 - July 2025 Data Analyst @ CloudData Technology: Created LLM-driven pipelines with LangGraph and LangSmith tracing, reducing forecast error 18%, false positives 27%, and boosting precision +11 pts. Shipped NL→SQL analytics copilot automating Power BI workflows, eliminating 12 hrs/week manual ELT, cutting time-to-insight 55%, and increasing adoption +35%. Implemented Great Expectations, AI observability, and LlamaIndex RAG assistant, improving data integrity +23% and cutting incidents 60% while maintaining SLA 99.7%.
Aug 2021 - April 2022 Jr. Data Scientist @ Bharat Soft Solutions Pvt. Ltd.: Boosted F1 score 0.71→0.83 and reduced inference latency 35% using TensorFlow pipelines, transfer learning, augmentation, and quantization. Scaled dataset 5x and reduced experiment cycle time 7→3 days with reproducible MLflow workflows and parameterized training scripts. Packaged models as Dockerized Flask microservices with batch inference and CI benchmarking, reducing analyst review time 30% across 12 variants.
Sept 2020 - June 2021 Jr. Data Scientist @ Connection Loops Pvt. Ltd.: Owned SQL pipelines with validation checks, reducing prep time 58% and stabilizing refresh reliability to 99.4% for flagship product. Built scikit-learn baselines with targeted feature engineering, improving AUC 0.74→0.82 and enabling earlier case detection. Delivered KPI dashboards for clinical stakeholders, shortening decision lead-time by 3 days and enabling 3 new research cohorts.

Recommend this talent

Related resumes

More Resumes

GenAI Engineer | Multi-Agent Systems & RAG Architect | AI Infrastructure & Observability Expert

About me

Professional area

Developer & Engineer

Skills

AWS CI/CD Docker Flask GCP Kubernetes Machine Learning SQL TensorFlow

Education

Experience

Recommend this talent

Related resumes

Jobicy

Community

Discover

Online Tools

Hire me

About me

Professional area

Developer & Engineer

Skills

AWSCI/CDDockerFlaskGCPKubernetesMachine LearningSQLTensorFlow

Education

Experience

Recommend this talent

Related resumes

QA Engineer

Data Ops / DevOps Engineer & Database Reliability Engineer

Economist

Application Support Engineer

Software Engineer

Recommend this specialist

Jobicy+ Subscription

AWS CI/CD Docker Flask GCP Kubernetes Machine Learning SQL TensorFlow