.
Middle Big Data Engineer
  • Warsaw
Middle Big Data Engineer
Warszawa, Warsaw, Masovian Voivodeship, Polska
KITRUM
31. 5. 2024
Informacje o stanowisku

technologies-expected :


  • Spark
  • SQL
  • Python

technologies-optional :


  • Kafka
  • Terraform
  • Airflow
  • Jira
  • JetBrains IDEs
  • Git
  • GitLab
  • Docker
  • Jenkins

about-project :


  • Client is an American e-book and audiobook subscription service that includes one million titles. Platform hosts 60 million documents on its open publishing platform.
  • The platform allows:
  • — anyone to share his/her ideas with the world;
  • — access to audio books;
  • — access to world’s composers who publish their music;
  • — incorporates articles from private publishers and world magazines;
  • — allows access to exclusive content.
  • Core Platform provides robust and foundational software, increasing operational excellence to scale apps and data. We are focused on building, testing, deploying apps and infrastructure which will help other teams rapidly scale, inter-operate, integrate with real-time data, and incorporate machine learning into their products. Working with our customers in the Data Science and Content Engineering, and our peers in Internal Tools and Infrastructure teams we bring systems-level visibility and focus to our projects.
  • Client’s goal is not total architectural or design perfection, but rather choosing the right trade-offs to strike a balance between speed, quality and cost.

responsibilities :


  • Manage data quality and integrity
  • Assist with building tools and technology to ensure that downstream customers can have faith in the data they’re consuming
  • Cross-functional work with the Data Science or Content Engineering teams to troubleshoot, process, or optimize business-critical pipelines
  • Work with Core Platform to implement better processing jobs for scaling the consumption of streaming data sets

requirements-expected :


  • 3+ years of experience in data engineering creating or managing end-to-end data pipelines on large complex datasets.
  • Proficiency in Spark
  • Expertise in Scala, and/or Python
  • Fluency with at least one dialect of SQL
  • Level of English: Upper-Intermediate

offered :


  • High compensation according to your technical skills
  • Long-term projects (12m+) with great Customers
  • 5-day working week, 8-hour working day, flexible schedule
  • Democratic management style & friendly environment
  • WFH mode
  • Annual Paid vacation — 30 b/days + unpaid vacation
  • Paid sick leaves — 6 b/days per year
  • Corporate Perks (external training, English course, business speaking club, corporate events/team buildings)
  • Professional and personal growth

  • Praca Warszawa
  • Warszawa - Oferty pracy w okolicznych lokalizacjach


    79 366
    15 289