Data Engineer for Voice Assistant
Warsaw, Masovian Voivodeship, Poland
Samsung R&D Institute Poland
3. 12. 2023
About the position

technologies-expected :


  • Python
  • DevOps
  • Linux
  • Bash
  • git
  • Jenkins
  • Docker
  • Openstack
  • nginx
  • Ansible
  • PostgreSQL
  • InfluxDB
  • Voila
  • Dash
  • Grafana
  • Flask
  • Jupyter
  • Python visualization stack

about-project :


  • We invite you to join one of the largest speech and language processing teams in Europe. We work closely with other R&D teams to develop and test our next-generation personal Intelligent Assistant. In our lab, engineers, researchers, and linguists work together on innovative products for the multilingual European market. We define the way users access, explore, and interact with devices, knowledge, information, and services. With us, you have a unique opportunity to work on a product that is available on a wide range of devices and used by millions of users.
  • Technologies in use:
      • Python
      • DevOps (Linux, Bash, git, Jenkins, Docker, Openstack, nginx, Ansible)
      • Data Engineering & Data Science (a variety of libraries for training and test data collection, data augmentation, and text corpus processing)
      • Databases (PostgreSQL, InfluxDB)
      • Data visualization and dashboarding tools (Voila, Dash, Grafana, Flask, Jupyter, Python visualization stack)

responsibilities :


  • Development and maintenance of dashboards and internal web services used to present, access, and annotate text and to visualize usage data related to the Voice Assistant.
  • Management of Linux servers used for data acquisition and processing.
  • Development and maintenance of data processing pipelines used for language analytics tasks.
  • Automation of repetitive tasks for Natural Language Processing (NLP), such as: retrieval of text data, text corpora management, text corpora annotation.
  • Exploration of available text data to create meaningful reports (e.g. trend reports, usage pattern reports) and define metrics (e.g. end-to-end success rate) for other development teams.
  • Significant influence on the direction of the team's work, with the opportunity to participate in the creation of project proposals, research, and patent applications (especially in the field of data processing and analytics).
  • Significant impact on the technology stack: this is an R&D team, so we can choose our technologies more freely than regular development teams.

requirements-expected :


  • Bachelor's or master's degree in Computer Science, Mathematics, Telecommunications, or a related field.
  • Proficiency in Python.
  • Practical knowledge of the Linux environment and Bash scripting.
  • Experience with Git, GitHub, Jenkins, Grafana, Docker, or similar tools.
  • Knowledge of English at a level that allows for easy communication.
  • Creativity, an open mind, and the ability to apply knowledge in innovative ways are a plus.

offered :


  • Friendly working atmosphere.
  • Wide range of training (technical, soft skills, e-learning platform).
  • Opportunity to work on multiple projects.
  • Multidisciplinary and multicultural team.
  • Working with the latest technologies on the market.
  • Monthly integration budget.
  • Possibility to attend local and foreign conferences.
  • Opportunity to participate in scientific research (scientific papers, project proposals, patent applications, development of your own side projects).

benefits :


  • sharing the costs of sports activities
  • private medical care
  • sharing the costs of foreign language classes
  • life insurance
  • corporate products and services at discounted prices
  • integration events
  • dental care
  • no dress code
  • leisure zone
  • pre-paid cards
  • redeployment package
  • baby layette
  • employee referral program
  • charity initiatives
  • unlimited free access to Copernicus Science Center
  • mentoring program
  • psychological support
  • possibility to test new Samsung products
  • work in Korea as a part of our Mobility Program
