1 day ago Be among the first 25 applicants
Get AI-powered advice on this job and more exclusive features.
The ROCm Communication Collectives Library (RCCL) is a stand‑alone library that provides multi‑GPU and multi‑node collective communication primitives optimized for AMD GPUs. It uses PCIe and xGMI high‑speed interconnects.
Responsibilities
- Provide deep technical leadership and guidance for GPU communication technologies, define the technical vision and direction for the GPU communication software stack.
- Engage with executives and key stakeholders to provide insight into industry trends and recommend strategic initiatives. Influence the future direction of the company’s technical portfolio.
- Represent AMD in leadership positions at industry organizations and standards bodies.
- Engage with clients and industry partners to deeply understand technical needs, ensuring their satisfaction with tailored solutions that leverage your experience in strategic customer engagements and architectural wins.
- Collaborate with hardware and software architects, system engineers and business teams in identifying requirements and building roadmaps for future products.
- Mentor engineers and technical leaders, fostering a culture of innovation and excellence. Help develop the next generation of leaders through coaching, training, and feedback.
Mandatory Skills Description
- Experience architecting and developing communication software solutions for accelerators using RDMA and accelerator‑to‑accelerator fabrics (e.g. Infinity Fabric, UALink), from low‑level device drivers and OS internals up through applications and AI/ML frameworks.
- Deep expertise with distributed programming models (MPI, SHMEM), and the implementation and optimization of collective communication algorithms.
- Deep expertise with RoCE, RDMA, and network topologies.
- Experience with system software development in C/C++, and GPU software development and parallel programming.
- Analytical and performance analysis skills.
- Effective communication and problem‑solving skills.
- Proven history of communication software thought leadership, backed with patents, publications, and participation in industry standards bodies.
Mode of work: Hybrid (2 days per week in the office)
What do we offer our employees
- Higher net salary for developers.
- Fully remote recruitment process.
- Stable employment based on an employment contract.
- MyBenefit program (sports card, well‑being program, etc.).
- Employee assistance program for you and your family (consultations with a psychologist, coaching sessions, assistance in private life).
- LuxTalent platform (webinars, training, courses with certificates).
- Internal Mobility program – possibility of rotation between projects, locations, accounts.
- And even more!