As a Senior Data Engineer – AI Healthcare Analytics, you will be working for our client, a leader in healthcare data solutions dedicated to transforming complex health data into actionable insights. You’ll build scalable data pipelines and infrastructure that fuel AI and analytics projects, empowering smarter patient care and operational excellence.
Unleash the Power of Data — Shape the Future of Healthcare Intelligence!
Poland or Portugal-based opportunity with a fully remote work model (5 days per week).
Only candidates with an existing legal right to work in Europe will be considered for this role.
responsibilities :
Design and develop production ETL/ELT pipelines from diverse healthcare data sources, including claims, patient records, and referrals, into S3 data lakes and data warehouses.
Implement and manage batch workflows with orchestration, error handling, retries, and lineage tracking.
Write modern Python-based data processing jobs using the latest libraries to transform and merge healthcare datasets.
Build and optimize AWS data infrastructure, including S3, Athena, IAM, and Glue, focusing on cost efficiency and query performance.
Integrate healthcare-specific data such as ICD/CPT codes, NPI data, and demographic details, normalizing and processing them for analytical use.
Develop and optimize complex SQL queries for healthcare metrics, cohort analysis, patient segmentation, and performance dashboards.
Ensure data quality through validation, schema enforcement, anomaly detection, and automated validation pipelines.
Profile data for completeness, accuracy, and duplication, preparing comprehensive data quality reports.
Monitor and optimize data pipelines for throughput, memory efficiency, and cost — implementing caching, partitioning, and cost-tracking strategies.
Collaborate using Git, maintain detailed documentation, and develop clear technical specifications and data dictionaries.
requirements-expected :
At least 5 years of professional experience building production-grade ETL/ELT pipelines processing large volumes of healthcare data.
Strong proficiency in Python (3.12+) and experience with modern data processing libraries.
Hands-on AWS experience with S3, Athena, IAM, and Glue, including cost management and schema design.
Expertise in writing and optimizing advanced SQL queries on datasets with millions or billions of rows, utilizing CTEs and window functions.
Experience working with large-scale datasets and performance trade-offs.
Good understanding of development practices such as Git workflows, code reviews, and documentation.
Fluent English
offered :
Stable and long-term cooperation with very good conditions
Enhance your skills and develop your expertise in the financial industry
Work on the most strategic projects available in the market
Define your career roadmap and develop yourself in the best and fastest possible way by delivering strategic projects for different clients of ITDS over several years
Participate in Social Events, training, and work in an international environment
Access to attractive Medical Package
Access to Multisport Program
Access to Pluralsight
Flexible hours
benefits :
sharing the costs of sports activities
private medical care
remote work opportunities
flexible working time
fruits
integration events
corporate gym
saving & investment scheme
no dress code
coffee / tea
drinks
christmas gifts
birthday celebration
sharing the costs of a streaming platform subscription