You will be cooperating with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models.
In this role, you’ll help develop advanced reinforcement learning (RL) environments and scalable evaluation systems that guide and shape the behavior of cutting-edge AI models. The company’s mission is to enable safe, verifiable, and aligned AGI through rigorous, real-world agent evaluation.
Due to the client’s time zone, we would appreciate a candidate who can work 12 p.m. - 8 p.m.
This is a 100% remote position, but if you enjoy working from an office, you’re warmly welcome to join us there too ?
If you’re ready to broaden your horizons and work with an innovative company at the forefront of AI, we’d love to hear from you. You’ll help build the environments that shape how future AI systems are trained, evaluated, and aligned - and collaborate with world-class engineers and researchers on one of the most important technical challenges of our time.
Nice to have:
You will be cooperating with a leading provider of AI evaluation and optimization solutions, trusted by multinational companies to optimize AI agents and detect performance issues in large language models.
In this role, you’ll help develop advanced reinforcement learning (RL) environments and scalable evaluation systems that guide and shape the behavior of cutting-edge AI models. The company’s mission is to enable safe, verifiable, and aligned AGI through rigorous, real-world agent evaluation.
Due to the client’s time zone, we would appreciate a candidate who can work 12 p.m. - 8 p.m.
This is a 100% remote position, but if you enjoy working from an office, you’re warmly welcome to join us there too ?
If you’re ready to broaden your horizons and work with an innovative company at the forefront of AI, we’d love to hear from you. You’ll help build the environments that shape how future AI systems are trained, evaluated, and aligned - and collaborate with world-class engineers and researchers on one of the most important technical challenges of our time.
,[Designing and deploying large-scale, fault-tolerant AI inference services and distributed systems to support real-time voice agents., Leveraging and orchestrating Foundational Models using tools like LangChain, LangGraph, including state-of-the-art prompting, agent design, and RAG (Retrieval-Augmented Generation) techniques., Working with Reinforcement Learning (RL) techniques and implementing Continuous Learning/Online Optimization systems for production., Architecting solutions using microservices and asynchronous messaging technologies like Kafka, Azure ServiceBus, etc. (critical for high-volume, real-time interactions). Requirements: Python, Machine learning, AI frameworks, Langchain, Langraph, mcp-server, AI, Codex, Claude Code, RL concepts Additionally: Sport subscription, Private healthcare, Flat structure, Small teams, International projects, Free coffee, Bike parking, Free snacks, Free beverages, Free parking, Modern office, Startup atmosphere, No dress code.