AI/ML Architect (Senior) with skills AI/ML Development, TensorFlow, Large Language Models (LLM), Retrieval-Augmented Generation (RAG), Python for location Gurugram, India
ROLES & RESPONSIBILITIES

12–15+ years of experience in Analytics, Data Science, AI/ML, or advanced engineering roles.

Extensive hands-on experience with Generative AI and LLM-based systems.

Strong proficiency in Python for AI/ML and GenAI application development.

Proven experience designing and deploying cloud-native solutions on AWS.

Solid understanding of data engineering, model lifecycle management, and production deployment.

________________________________________

Core Technical Skills (Required)

Generative AI & LLM Architecture

• Expertise in:

o Prompt design and prompt engineering using Python frameworks

o Retrieval-Augmented Generation (RAG), Corrective RAG, and hybrid retrieval approaches

o Knowledge Graph–based RAG implementations

o Multi-agent and agentic frameworks in Python

o Model evaluation, hallucination mitigation, and response quality measurement

• Experience with:

o Fine-tuning techniques (LoRA / QLoRA) using Python ML stacks

o Embedding strategies, reranking, and semantic search optimization

o Automated and human-in-the-loop evaluation frameworks

Python & AI/ML Development

• Advanced Python expertise for:

o Building GenAI services, APIs, and orchestration layers

o Integrating LLMs with vector databases, search engines, and enterprise data sources

o Developing reusable libraries and modular AI components

• Experience with:

o LangChain, LangGraph, LlamaIndex, or similar frameworks

o FastAPI or Flask for AI service deployment

Multimodal AI

• Experience or strong architectural exposure to:

o Text, image, audio, and video AI workflows

o Multimodal generation pipelines orchestrated via Python services

o Content structuring, summarization, and transformation using GenAI

AWS & Cloud Architecture

• Strong hands-on experience with AWS services, including:

o Amazon Bedrock, SageMaker

o Lambda, ECS/EKS

o S3, OpenSearch, DynamoDB, RDS

o IAM, VPC, KMS, CloudWatch

• Proven ability to design:

o Secure, scalable, and cost-optimized GenAI architectures

o VPC-isolated AI workloads with governed access models

o CI/CD pipelines and Python-based MLOps workflows

Data & Platform Engineering

• Strong understanding of:

o Python-based ETL pipelines

o Structured and unstructured data processing

o SQL and relational/non-relational databases

• Exposure to Big Data and distributed processing frameworks is a plus.

________________________________________

Key Responsibilities

Architecture & Solution Design

• Design end-to-end architectures for Python-driven Generative AI and LLM solutions.

• Define reusable Python frameworks for prompt management, RAG pipelines, agent orchestration, and evaluation.

• Ensure architectures meet scalability, security, governance, and compliance standards.

Technical Leadership & Delivery

• Lead GenAI solution development from design through production deployment.

• Guide engineering teams on best practices for Python-based GenAI development.

• Ensure delivery of high-quality, production-ready AI systems.

Governance, Quality & Risk Management

• Implement governance, safety controls, and evaluation frameworks using Python-based tooling.

• Monitor performance, reliability, cost, and response quality of deployed GenAI systems.

Mentoring & Best Practices

• Mentor engineers and data scientists on GenAI architecture, Python development, and LLM integration patterns.

• Conduct code and architecture reviews to ensure maintainability, scalability, and quality.

Innovation & Continuous Learning

• Stay current with advancements in Generative AI, LLMs, and Python-based AI ecosystems.

• Evaluate and adopt new tools, frameworks, and methodologies where appropriate.

EXPERIENCE
  • 12-14 Years
SKILLS
  • Primary Skill: AI/ML Development
  • Sub Skill(s): AI/ML Development
  • Additional Skill(s): TensorFlow, Large Language Models (LLM), Retrieval-Augmented Generation (RAG), Python
ABOUT THE COMPANY

Infogain is a human-centered digital platform and software engineering company based out of Silicon Valley. We engineer business outcomes for Fortune 500 companies and digital natives in the technology, healthcare, insurance, travel, telecom, and retail & CPG industries using technologies such as cloud, microservices, automation, IoT, and artificial intelligence. We accelerate experience-led transformation in the delivery of digital platforms. Infogain is also a Microsoft (NASDAQ: MSFT) Gold Partner and Azure Expert Managed Services Provider (MSP).

Infogain, an Apax Funds portfolio company, has offices in California, Washington, Texas, the UK, the UAE, and Singapore, with delivery centers in Seattle, Houston, Austin, Kraków, Noida, Gurgaon, Mumbai, Pune, and Bengaluru.

Express Application
Upload Microsoft word, PDF file upto 500KB.
Recent Jobs
Posted on January 25, 2026
Cloud Native App Developer (Lead) | 8-11 Years | CNA Development - ReactJS, Core Java, Java Webservices, Spring Boot, GCP-Apps...
Posted on January 25, 2026
AWS Data Engineer (Standard) | 3-4.5 Years | Data Engineering - Python, ETL , SQL, AWS Glue
Posted on January 25, 2026
Technical Architect (Standard) | 12-14 Years | Application Architecture - Systems Architecture
Posted on January 25, 2026
Azure Data Engineer (Senior) | 6-8 Years | Data Engineering - Python, Databricks, SQL, Azure Data Factory