About the job
Googles software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. Were looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
Google Kubernetes Engine (GKE) is a managed, production-ready environment for deploying containerized workloads and services. As a GKE AI Training team our mission is to become a top platform to train and fine tune one of the largest and most complex GenAI models, offering industry leading performance, scalability and obtainability. The team is developing OSS APIs to make Kubernetes a framework of choice for AI/ML/HPC workloads as well as building GKE-specific features and differentiators.
Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.
Responsibilities:
- Utilize your expertise to engage in and enhance the whole lifecycle of software development - design, develop, test, deploy, maintain, and improve software.
- Manage individual project priorities and deliverables. Play a crucial role in shaping the future of AI training features in Kubernetes, both in the open-source community and within our GKE offering.
- Collaborate within and across teams and learn as you will be working on cutting-edge cloud technologies designed for large scale, high performance and high reliability
- Contribute to Google-internal, but also OSS projects. Be part of a team that values innovation, collaboration, and the pursuit of excellence in a rapidly evolving tech landscape.
- Contribute to creating a robust, scalable infrastructure that enables some of the most significant and complex LLM training projects in the industry.
Minimum qualifications:
- Bachelors degree or equivalent practical experience.
- 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
- 2 years of experience with data structures or algorithms.
Preferred qualifications:
- Masters degree or PhD in Computer Science or related technical fields.
- Experience developing accessible technologies.
- Experience with one or more general purpose programming languages including: Java, C/C++, C#, Objective-C, Python, JavaScript, Go, etc.
Benefits:
- Health and Wellbeing (Medical, dental, and vision insurance for employees and dependents)
- Financial wellbeing (Competitive compensation, regular bonus and equity refresh opportunities)
- Flexibility and time off (Paid time off, including vacation, bereavement, jury duty, sick leave, parental leave, disability, and holidays)
- Family support and care (Fertility and growing family support, parental leave and baby bonding leave)
- Community and personal development (Educational reimbursement)
- Googley extras (Inspiring spaces to work, recharge, and collaborate with fellow Googlers)