AI Engineer with strong Python skills and hands-on experience building production-grade ML/LLM-enabled systems.
About the job
We are seeking an AI Engineer with strong Python skills and hands-on experience building production-grade ML/LLM-enabled systems. You will design advanced RAG pipelines, develop agentic workflows using LangGraph/LangChain, and deploy scalable AI services via FastAPI. This role also includes computer vision and multimodal capabilities—image preprocessing with OpenCV/PIL and working with image generation models such as Stable Diffusion and FLUX for enterprise use cases.
Key Responsibilities
- Design and implement RAG pipelines end-to-end: ingestion, chunking, embedding, retrieval, reranking, grounding, and evaluation for knowledge-intensive applications.
- Design and implement end-to-end image generation pipelines using Hugging Face models and the Diffusers ecosystem, covering dataset/input preparation, prompt/workflow orchestration, inference serving, safety/quality checks, and production monitoring.
- Build autonomous/agentic AI workflows using LangGraph and frameworks like LangChain (tool calling, routing, state/memory, multi-step orchestration, guardrails).
- Develop and maintain Python REST APIs for AI services using FastAPI, including request/response schemas, validation, error handling, API versioning, and documentation.
- Design, train, and validate supervised and unsupervised ML models for business use-cases (classification, regression, ranking).
- Productionize models: containerize ML artifacts/services with Docker, define CI/CD pipelines, and deploy services to cloud-based infrastructure.
- Build and maintain reliable data ingestion and feature engineering pipelines using SQL and Python data-processing libraries.
- Implement model monitoring and quality controls: evaluation metrics, offline/online testing, A/B testing, drift detection, latency/cost monitoring, and rollback strategies.
- Integrate AI/ML services into backend systems; collaborate with engineers and product owners to deliver end-to-end solutions.
- Work on computer vision pipelines: preprocessing, augmentation, and transformation using OpenCV and PIL; integrate CV modules into ML workflows.
- Document experiments, maintain reproducible workflows, and contribute to engineering best practices and knowledge sharing.
Requirements
Must-Have Skills & Qualifications
- 5+ years of professional experience building and deploying ML/AI solutions.
- Strong programming skills in Python with solid software engineering practices.
- Hands-on experience with ML/deep learning frameworks: PyTorch, TensorFlow, scikit-learn.
- Strong applied statistics and ML fundamentals (evaluation, regularization, feature engineering, error analysis).
- Strong SQL skills and experience building data/feature pipelines.
- Experience with LLMs and NLP techniques for enterprise applications (prompting, embeddings, RAG patterns, evaluation).
- Strong experience building REST APIs using FastAPI.
- Experience with containerization using Docker.
- Excellent communication skills. Ability to clearly explain technical concepts to non-technical stakeholders, write clean documentation, and collaborate effectively across engineering, data, and product teams.
Preferred Skills
- Experience with AWS (or Azure/GCP) and deploying inference services in cloud environments.
- Kubernetes for orchestration, autoscaling, and production deployments.
- MLOps practices: CI/CD for ML, experiment tracking, model registry, automated evaluation, monitoring, and governance.
- Experience with agentic frameworks: LangGraph, LangChain, AutoGen (or similar), including tool integration and multi-agent orchestration.
- Experience with RAG infrastructure components (vector databases, embedding models, rerankers) and evaluation strategies.
- Inference optimization (batching, caching, concurrency tuning) and cost-performance trade-offs.
- Familiarity with enterprise security/compliance (PII handling, access controls, auditability).
Nice-to-Have
- Practical knowledge of OpenCV and PIL for image processing integrated into ML pipelines.
- Experience implementing image generation workflows using Stable Diffusion and FLUX (pipelines, prompt engineering, safety filtering, evaluation).
- Experience building multimodal applications (text + image) and deploying them as reliable services.
- GenAI testing and evaluation experience: building/using evaluation harnesses for RAG/agentic systems, defining quality metrics, regression testing prompts/workflows, and monitoring quality drift in production.
