.
Senior Machine Learning /AI Engineer (RL)
  • Warsaw
Senior Machine Learning /AI Engineer (RL)
Warszawa, Warsaw, Masovian Voivodeship, Polska
ACAISOFT POLAND Sp. z o.o.
27. 1. 2026
Informacje o stanowisku

Senior Machine Learning /AI Engineer (RL) Miejsce pracy: Warszawa Technologies we use Expected Python Langchain Langraph mcp-server Operating system macOS About the project You will be cooperating with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models. In this role, you’ll help develop advanced reinforcement learning (RL) environments and scalable evaluation systems that guide and shape the behavior of cutting-edge AI models. The company’s mission is to enable safe, verifiable, and aligned AGI through rigorous, real-world agent evaluation. Due to the client’s time zone, we would appreciate a candidate who can work 2 p.m. - 10 p.m. Join us and make a real impact! If you’re ready to broaden your horizons and work with an innovative company at the forefront of AI, we’d love to hear from you. You’ll help build the environments that shape how future AI systems are trained, evaluated, and aligned - and collaborate with world-class engineers and researchers on one of the most important technical challenges of our time. Your responsibilities Design and implement RL environments that support large-scale agent evaluation and reinforcement learning experiments. Build task generation pipelines, dynamic datasets, and scripted environments with controlled complexity and stochasticity. Develop verifiers and reward models to automatically score trajectories and evaluate model reasoning. Collaborate with infrastructure and systems engineers to ensure environments are scalable, reproducible, and instrumented for detailed telemetry. Design APIs and orchestration frameworks for running, resetting, and evaluating agents across environments. Optimize environment performance, logging, and reward reproducibility across distributed setups. Our requirements 6 years of experience in Python software engineering. Minimum 3 years in a Machine Learning/Environment Engineering, Data Scientist position. Practical knowledge of AI frameworks (Langchain, Langraph, mcp-server ). Extensive practical experience in working with AI, including prompt engineering and vibe coding. Experience in working with business requirements (analysis, summarizing, responding to changes). Expertise in planning your own work or that of a small team. Being able to work 2 p.m. - 10 p.m. Optional Knowledge of Codex or Claude Code. Experience in integrating AI with a system would be an advantage. Understanding of RL concepts - reward modeling, environment dynamics, verifiability, evaluation, and agent interaction loops. Familiarity with instrumentation, metrics, and data pipelines for RL evaluation. This is how we organize our work This is how we work at the clients site you focus on a single project at a time you can change the project you focus on product development agile Development opportunities we offer industry-specific e-learning platforms space for experimenting substantive support from technological leaders time for development of your ideas What we offer Great atmosphere - we value a friendly, informal atmosphere, and direct contact with everyone in the company. Outstanding People - we understand that great teams are about personalities, not just skills. Therefore our team accommodates a fantastic blend of individuals and management that removes roadblocks. Modern technologies - we use proven technologies that are currently up-to-date. Even if you have not used all of them, you can make up for it with us! Unlimited possibilities - you’ll get the opportunity to develop your qualifications thanks to sponsorship for industry meetups and conferences and working on challenging international projects with the latest technologies. Private medical care and Multisport - we care about your health and wellbeing so you’ll get access to private medical care for you and your family, and partial funding for a sports card. Benefits sharing the costs of sports activities private medical care sharing the costs of professional training & courses remote work opportunities flexible working time integration events corporate sports team no dress code video games at work coffee / tea drinks parking space for employees leisure zone extra social benefits baby layette school layette employee referral program charity initiatives company sports team Gift vouchers for kids (birthdays, Christmas, Childs Day) Recruitment stages HR call- max15min A short call with our Technical Delivery Manager- max 15min Technical interview with our client- max 30min* ACAISOFT POLAND Sp. z o.o. At Acaisoft we specialize in cloud-native application development and transformations from legacy to cloud-native environments. We provide end-to-end software solutions, from business analysis, through project evaluation, to UI/UX, Frontend, and Backend design and implementation. We integrate manual and automated QA finest practices, to make sure that the final product is top-notch. Our customers range from startups to large enterprises based in the US, mainly Silicon Valley, and Western Europe. Since technology is constantly being developed at such a fast pace, we always strive to be one step ahead of the market and keep up with the latest solutions. Wszystkie informacje o przetwarzaniu danych osobowych w tej rekrutacji znajdziesz w formularzu aplikacyjnym, po kliknięciu w przycisk "Aplikuj Teraz".

  • Praca Warszawa
  • Warszawa - Oferty pracy w okolicznych lokalizacjach


    122 580
    18 317