Data Scientist/ML (Agentic AI)
Warsaw, Masovian Voivodeship, Poland
emagine Polska
29. 3. 2026
About the position

Location: The role involves occasional travel to the Warsaw office and possible international travel to Germany, approximately once a quarter.

Start: Preferably ASAP, or with a notice period of at most one month

Industry: Pharmaceuticals / Consumer Health

The Mission: You are the "brain" designer. Moving beyond classic ML models, you will design complex, multi-agent workflows. Your mission is to build the cognitive architecture of our Co-pilot for sales representatives, ranging from strict Text-to-SQL routing to human-like conversational interviews, while ensuring compliance and continuous improvement.

Who You Are & What You'll Do:

  • Agentic Workflows: You have deep, practical experience building complex agent routing and state management using LangChain, LangGraph, or tools like n8n.

  • Multimodal & Conversational AI: You have deployed advanced RAG systems and have experience integrating TTS/STT (Text-to-Speech/Speech-to-Text) pipelines for asynchronous "Human-AI interview" conversational flows.

  • Text-to-SQL & Business Logic Routing: You excel at mapping natural language to exact SQL parameters. You understand that the AI shouldn't "guess" financial math; instead, it must flawlessly route intents to deterministic business logic/SQL views provided by our data teams.

  • Continuous Evaluation & Guardrails: You know that standard testing fails with GenAI. You will design systemic validation pipelines (e.g., LLM-as-a-judge) to monitor hallucination rates, using tools like Langfuse to establish a continuous improvement loop.

  • Multilingual Evaluation: You know standard English testing fails in localized markets. You will design systemic validation pipelines to evaluate RAG, intent classification, and transcription accuracy in Italian (handling medical/pharmaceutical jargon).

  • Optimization & Fallbacks: You understand how to optimize AI processes by using prompt caching, managing the context window, and configuring faster fallback models for when primary LLMs time out, ensuring a seamless user experience.

Must Haves:

  • Deep, practical experience with LangChain, LangGraph, or similar tools.

  • Knowledge of integration concepts for TTS/STT systems.

  • Strong Text-to-SQL skills for accurate data routing.

  • Experience with validation of generative AI and RAG pipelines.

  • Proficiency in Python, SQL, and Spark programming.

Nice to Haves:

  • Familiarity with GitHub for version control.

  • Experience with FastAPI for application development.

  • Awareness of data solutions like Databricks.

  • Understanding of Azure cloud services.
