.
Mid Machine Learning /AI Engineer (RL) @ Acaisoft
  • Warsaw
Mid Machine Learning /AI Engineer (RL) @ Acaisoft
Warszawa, Warsaw, Masovian Voivodeship, Polska
Acaisoft
18. 12. 2025
Informacje o stanowisku

You will be cooperating with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models.
In this role, you’ll help develop advanced reinforcement learning (RL) environments and scalable evaluation systems that guide and shape the behavior of cutting-edge AI models. The company’s mission is to enable safe, verifiable, and aligned AGI through rigorous, real-world agent evaluation.

Due to the client’s time zone, we would appreciate a candidate who can work 12 p.m. - 8 p.m.

This is a 100% remote position, but if you enjoy working from an office, you’re warmly welcome to join us there too ?


 Join us and make a real impact! 

If you’re ready to broaden your horizons and work with an innovative company at the forefront of AI, we’d love to hear from you. You’ll help build the environments that shape how future AI systems are trained, evaluated, and aligned - and collaborate with world-class engineers and researchers on one of the most important technical challenges of our time.


  • 4+ years of experience in Python software engineering.
  • Minimum 2 years in Machine Learning/Environment Engineering, Data Scientist roles.
  • Practical knowledge of AI frameworks (Langchain, Langraph, mcp-server ).
  • Practical experience in working with AI, including prompt engineering and vibe coding.
  • Experience in working with business requirements (analysis, summarizing, responding to changes).
  • Expertise in planning your own work or that of a small team.

Nice to have:

  • Knowledge of Codex or Claude Code.
  • Experience in integrating AI with a system would be an advantage.
  • Understanding of RL concepts - reward modeling, environment dynamics, verifiability, evaluation, and agent interaction loops.
  • Familiarity with instrumentation, metrics, and data pipelines for RL evaluation.

You will be cooperating with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models.
In this role, you’ll help develop advanced reinforcement learning (RL) environments and scalable evaluation systems that guide and shape the behavior of cutting-edge AI models. The company’s mission is to enable safe, verifiable, and aligned AGI through rigorous, real-world agent evaluation.

Due to the client’s time zone, we would appreciate a candidate who can work 12 p.m. - 8 p.m.

This is a 100% remote position, but if you enjoy working from an office, you’re warmly welcome to join us there too ?


 Join us and make a real impact! 

If you’re ready to broaden your horizons and work with an innovative company at the forefront of AI, we’d love to hear from you. You’ll help build the environments that shape how future AI systems are trained, evaluated, and aligned - and collaborate with world-class engineers and researchers on one of the most important technical challenges of our time.

,[Designing and deploying large-scale, fault-tolerant AI inference services and distributed systems to support real-time voice agents., Leveraging and orchestrating Foundational Models using tools like LangChain, LangGraph, including state-of-the-art prompting, agent design, and RAG (Retrieval-Augmented Generation) techniques., Working with Reinforcement Learning (RL) techniques and implementing Continuous Learning/Online Optimization systems for production., Architecting solutions using microservices and asynchronous messaging technologies like Kafka, Azure ServiceBus, etc. (critical for high-volume, real-time interactions). Requirements: Python, Machine learning, AI frameworks, Langchain, Langraph, mcp-server, AI, Codex, Claude Code, RL concepts Additionally: Sport subscription, Private healthcare, Flat structure, Small teams, International projects, Free coffee, Bike parking, Free snacks, Free beverages, Free parking, Modern office, Startup atmosphere, No dress code.

  • Praca Warszawa
  • Warszawa - Oferty pracy w okolicznych lokalizacjach


    127 348
    18 839