Mid Machine Learning/AI Engineer (RL) @ Acaisoft

Warsaw

Position: Mid Machine Learning/AI Engineer (RL) @ Acaisoft

Location: Warsaw, Masovian Voivodeship, Poland

Company: Acaisoft

Posted: 18 December 2025

Job description

You will be cooperating with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models.
In this role, you’ll help develop advanced reinforcement learning (RL) environments and scalable evaluation systems that guide and shape the behavior of cutting-edge AI models. The company’s mission is to enable safe, verifiable, and aligned AGI through rigorous, real-world agent evaluation.

Due to the client’s time zone, we would appreciate a candidate who can work 12 p.m. - 8 p.m.

This is a 100% remote position, but if you enjoy working from an office, you’re warmly welcome to join us there too.

Join us and make a real impact!

If you’re ready to broaden your horizons and work with an innovative company at the forefront of AI, we’d love to hear from you. You’ll help build the environments that shape how future AI systems are trained, evaluated, and aligned - and collaborate with world-class engineers and researchers on one of the most important technical challenges of our time.

Requirements:

4+ years of experience in Python software engineering.
At least 2 years in Machine Learning/Environment Engineering or Data Scientist roles.
Practical knowledge of AI frameworks (LangChain, LangGraph, mcp-server).
Practical experience in working with AI, including prompt engineering and vibe coding.
Experience in working with business requirements (analysis, summarizing, responding to changes).
Expertise in planning your own work or that of a small team.

Nice to have:

Knowledge of Codex or Claude Code.
Experience integrating AI into existing systems.
Understanding of RL concepts - reward modeling, environment dynamics, verifiability, evaluation, and agent interaction loops.
Familiarity with instrumentation, metrics, and data pipelines for RL evaluation.

Responsibilities:

Designing and deploying large-scale, fault-tolerant AI inference services and distributed systems to support real-time voice agents.
Leveraging and orchestrating Foundational Models using tools like LangChain and LangGraph, including state-of-the-art prompting, agent design, and RAG (Retrieval-Augmented Generation) techniques.
Working with Reinforcement Learning (RL) techniques and implementing Continuous Learning/Online Optimization systems for production.
Architecting solutions using microservices and asynchronous messaging technologies like Kafka, Azure Service Bus, etc. (critical for high-volume, real-time interactions).

Requirements: Python, Machine learning, AI frameworks, LangChain, LangGraph, mcp-server, AI, Codex, Claude Code, RL concepts

Additionally: Sport subscription, Private healthcare, Flat structure, Small teams, International projects, Free coffee, Bike parking, Free snacks, Free beverages, Free parking, Modern office, Startup atmosphere, No dress code.
