Join our AI Emerging Technology & External Collaborations team. Our work spans across multiple high-impact initiatives, from developing intelligent data products to enabling end-to-end AI workflows.
A unique opportunity to help shape the future of data and AI by directly supporting key strategic decisions across our global Partnering organization.
Data Scientist
Your responsibilities
- Drive the entire machine learning lifecycle, from exploratory data analysis (EDA) and advanced feature engineering to model training, validation, deployment, and post-launch monitoring for performance and concept drift.
- Translate ambiguous business requirements and domain challenges into well-defined technical problems, testable hypotheses, and robust machine learning solutions.
- Design, test, and validate multiple modeling approaches to find the optimal solution, establishing clear and relevant evaluation metrics that directly align with business goals.
- Utilize our Triple AI SageMaker environment to efficiently train, deploy, and manage scalable models in a production setting.
- Communicate complex model outputs and data-driven insights through compelling storytelling and clear visualizations, empowering business stakeholders to make informed, data-backed decisions.
- Develop a deep understanding of the business domain and product vision, ensuring that your work delivers tangible and measurable value to the end-user.
- Actively collaborate with engineers, product managers, and business leaders, fostering a culture of shared knowledge, open feedback, and continuous improvement.
- Proactively identify opportunities for impact and focus on delivering concrete business results and outcomes over exhaustive documentation.
Our requirements
- Master's degree in Computer Science, Statistics, Mathematics, Engineering, or a related quantitative field.
- 3+ years of hands-on professional experience in a data science role focused on building and deploying machine learning models.
- Strong proficiency in Python and its core data science libraries (e.g., pandas, NumPy, scikit-learn, Matplotlib/Seaborn).
- Solid proficiency in SQL for complex data querying, transformation, and analysis.
- Experience building models for business applications such as forecasting, classification, clustering, or regression.
- Familiarity with at least one major cloud platform (AWS, GCP, Azure).
- 6+ years of experience in a product-focused data science environment.
- Direct hands-on experience using Amazon SageMaker for model development, training, and deployment.
- Proven experience implementing and managing model monitoring systems to detect data and model drift in a production environment.
- A forward-looking interest in the application of Generative AI, with an enthusiasm to learn how to combine LLMs and other generative techniques with traditional machine learning.
- Hands-on experience with MLOps principles and tools (e.g., MLflow, Kubeflow, feature stores).
- Exceptional communication and data storytelling skills, with a proven ability to listen, understand business context, and influence both technical and non-technical audiences.
- A strong portfolio of completed data science projects that demonstrate a focus on delivering business impact.
What we offer
- Be at the forefront of innovation in AI and emerging technologies.
- Help shape a scalable and intelligent data foundation to support Roche's strategic partnering decisions.
- Collaborate with world-class researchers, engineers, and external partners.
- Work in an agile, mission-driven environment with autonomy and purpose.