.
Data Science Engineer
  • Wysokie Mazowieckie
Data Science Engineer
Wysokie Mazowieckie, Wysokie Mazowieckie, Podlaskie Voivodeship, Polska
Accuris
29. 9. 2024
Informacje o stanowisku

Project & Team: Join our R&D team in developing the Core Intelligence platform, an integral part of Accuris products – Engineering Workbench, Goldfire and Parts Intelligence. Our mission is to drive digital transformation in engineering organizations by unlocking valuable insights from the unstructured content of corporate repositories and industry sources, seamlessly integrating them into daily engineering workflows. Our solutions serve over 7,500 companies, empowering engineers and knowledge workers with cutting-edge tools and industry content from 450 Standards Development Organizations. The project focuses on creating intelligent mechanisms for the extraction, decomposition, analysis, and retrieval of relevant engineering data, utilizing advanced Machine Learning and Deep Learning models within a scalable and optimized cloud infrastructure. The Impact: This role is critical for enhancing, developing new features, and maintaining of an existing in-house developed solution for decomposition and structural parsing of documents in PDF, scans, MS Office, and other formats, including intelligent processing with ML models and specialized Natural Language Processing and Computer Vision algorithms. This solution is a core component of data processing pipelines across multiple products of our organization. What We Offer: Engaging and innovative tasks with a dedicated team focused on developing proprietary solutions utilizing advanced state-of-the-art Machine Learning models and data-driven algorithms. Close collaboration with experienced software developers, data scientists, data analysts and researchers. Comprehensive support for personal growth and career development at the corporate level. A fully remote work environment. Provision of all necessary equipment. Health and life insurance. Wellness program. Multisport card. Medical Package. Role & Responsibilities: Develop and maintain core components of data processing pipelines focused on parsing and intelligent analysis of unstructured content (e.G., PDF, MS Office formats). Design, implement, and optimize machine learning models, ensuring seamless integration and deployment within the continuous release cycle of the unstructured document processing pipeline. Oversee MLOps functions, including dataset management, maintenance of training pipelines, solution packaging, and management of third-party dependencies. Create and maintain comprehensive documentation, and actively collaborate with other development teams to promote updates and enhancements. Job Requirements: Experience : 3 years as a Python developer and Data Scientist with a focus on developing and deploying code and ML models in production, particularly for unstructured data processing projects. Python Proficiency : Advanced Python programming and engineering skills, including environment management, library packaging, code analysis, and unit testing. Deep Learning : Hands-on experience in training deep learning models for NLP and/or Computer Vision such as transformers, RNNs, CNNs. Algorithmic Expertise : Strong understanding of algorithms and data structures. Communication : Fluent in English, with excellent communication and collaboration abilities. Nice-to-Have: Skills in shell scripting and Linux. Experience with data extraction from documents.

  • Praca Wysokie Mazowieckie
  • Wysokie Mazowieckie - Oferty pracy w okolicznych lokalizacjach


    105 511
    20 291