Distributed Training & Inference Optimization Engineer

Google India · Mumbai, Maharashtra, India

Location
Mumbai
Job Type
Full-time
Posted
June 04, 2026

Job Description

Overview
Join a highly advanced AI infrastructure team focused on building and optimizing large-scale machine learning systems. This environment leverages cutting-edge technologies to enable high-performance experimentation, scalable model deployment, and efficient processing of large datasets.
The team operates globally, bringing together engineers and researchers to push the boundaries of deep learning, distributed systems, and next-generation compute platforms.
About the Role
This position is centered on

maximizing the efficiency and scalability of GPU-based machine learning workloads , particularly for large language models (LLMs) and generative AI systems.
You will work on improving both

training performance and inference efficiency , ensuring optimal utilization of hardware resources, reduced latency, and faster model iteration cycles. The role requires hands-on expertise in deep learning frameworks, distributed systems, and performance optimizati...

Ready to Apply?

Submit your application for Distributed Training & Inference Optimization Engineer at Google India

Apply Now