.
VLA Researcher
  • Warsaw
VLA Researcher
Warszawa, Warsaw, Masovian Voivodeship, Polska
TechTree
22. 1. 2026
Informacje o stanowisku

Join to apply for the VLA Researcher role at TechTree

This range is provided by TechTree. Your actual pay will be based on your skills and experience—talk with your recruiter to learn more.

Base pay range

$80,000.00/yr - $100,000.00/yr

About The Role

Expert in deep learning, robotics, and multi‑modal models with strong PyTorch skills. 5‑8+ years in ML research/engineering, 2+ years in robotics or VLA‑related work. Warsaw, $100k USD + equity.

The job involves working with deep learning technologies, specifically focusing on transformers, self‑supervised learning, and multi‑modal models. The role requires strong proficiency in PyTorch and experience with large‑scale distributed training. A deep understanding of vision‑language‑action model design, including policy transformers, diffusion policies, behaviour cloning, and temporal modelling, is essential.

The position also requires knowledge of robotics fundamentals, such as kinematics/dynamics, manipulation, teleoperation data, and policy deployment on real robots. Experience with sensor fusion, including RGB, depth, and proprioception, as well as multi‑modal representation learning, is necessary.

The job involves developing data pipelines for video‑language datasets, robot demonstrations, and simulation data using tools like Isaac Sim and MuJoCo. Expertise in pre‑training and post‑training processes, including fine‑tuning, evaluation, and model optimisation for inference, is required.

Strong engineering capabilities in Python, machine learning infrastructure, ROS/ROS2, and simulation tools are needed. The role includes leading or significantly contributing to training large‑scale vision‑language models, language models, or multi‑modal foundation models. Hands‑on experience in training action‑conditioned models, building and maintaining large‑scale multi‑modal datasets, and deploying policies on real robot hardware is expected.

The job also involves designing and running large‑scale GPU training jobs and publishing or shipping work related to robotics learning, vision‑language‑action, or multi‑modal modelling. Experience in a startup or fast‑paced research environment, delivering end‑to‑end experiments and rapid iterations, is beneficial. The position is based in Warsaw, with a salary of $100k USD plus equity.

Requirements

  • 5–8+ years in machine learning research/engineering
  • At least 2+ years in robotics or VLA‑related work
  • Led or significantly contributed to training large‑scale VLM/LLM or multi‑modal foundation models
  • Hands‑on experience training action‑conditioned models
  • Built and maintained large‑scale multi‑modal datasets
  • Proven track record deploying policies on real robot hardwarePrior experience designing and running large‑scale GPU training jobs
  • Published or shipped work related to robotics learning, VLA, or multi‑modal modelling
  • Experience in a startup or fast‑paced research environment

Required Skills

  • Deep learning (transformers, self‑supervised learning, multi‑modal models)
  • PyTorch
  • Large‑scale distributed training
  • Vision‑language‑action model design
  • Robotics fundamentals
  • Sensor fusion
  • Data pipeline development
  • Pre‑training and post‑training expertise
  • Engineering capabilities (Python, ML infra, ROS/ROS2, simulation tools)

Salary

80000 - 100000 USD

Equity

equity, negotiable

Seniority level

Mid‑Senior level

Employment type

Full‑time

Job function

Research, Analyst, and Information Technology

Industries

Software Development

Referrals increase your chances of interviewing at TechTree by 2x

Get notified about new Researcher jobs in Warsaw, Mazowieckie, Poland.

#J-18808-Ljbffr

  • Praca Warszawa
  • Warszawa - Oferty pracy w okolicznych lokalizacjach


    138 684
    20 326