Senior Data Engineer (Python/PySpark/Kafka) - Full Remote Portugal
Senior Data Engineer (Python/PySpark/Kafka) - Full Remote Portugal - HumanIT Digital Consulting | Career Page (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start': new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0], j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src= 'https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f); })(window,document,'script','dataLayer','GTM-WNGRBS6'); .primary-color { color: #051b19; } .bg-primary-color { background-color: #051b19; } .btn-info, .btn-info:hover { background-color: #051b19; } .btn-apply, .btn-apply:hover { background-color: #051b19 !important; border-color: #051b19 !important; } .search-form { width: fit-content; margin: 0px auto 20px; padding: 10px; width: 60%; } .select2-container { width: 100% !important; } .search-dropdown-options { position: absolute; margin: 5px -12px; width: calc(100% - 17px); border: 1px solid #cacaca; border-top: 0px; z-index: 1; background: #fff; max-height: 200px; overflow: auto; border-radius: 0px 0px 6px 6px; } .search-dropdown-options li { cursor: pointer; } .search-dropdown-options ul li:hover { background-color: #5897fb; color: white; } .search-dropdown-options label { width: calc(100% - 22px); font-size: 14px; } .search-dropdown-placeholder { font-size: 14px; margin: 3px; cursor: pointer; } .select2-container--default .select2-selection--single { height: 38px !important; } .select2-container--default .select2-selection--single { height: 38px !important; border: 1px solid #ced4da !important; } .select2-selection__arrow { height: 36px !important; } .select2-results__option { font-size: 14px; } .select2-selection__rendered { line-height: 38px !important; font-size: 14px; color: #969696; } .width-100 { width: 100%; } ::-webkit-input-placeholder { /* Chrome/Opera/Safari */ font-size: 14px; color: #969696; } ::-moz-placeholder { /* Firefox 19+ */ font-size: 14px; color: #969696; } :-ms-input-placeholder { /* IE 10+ */ font-size: 14px; color: #969696; } :-moz-placeholder { /* Firefox 18- */ font-size: 14px; color: #969696; } @media (max-width: 575px) { .search-form { width: 100%; } .display-4{ font-size: 2.5rem; } } .positions { font-size: 16px; color: #808080; } .serach_count { padding: 4px; } .empty-result { color: #808080; } .fa-chevron-right { padding: 0px 8px; }
Senior Data Engineer (Python/PySpark/Kafka) - Full Remote Portugal
Apply for Position Or refer someone
Job Openings Senior Data Engineer (Python/PySpark/Kafka) - Full Remote Portugal
About the job Senior Data Engineer (Python/PySpark/Kafka) - Full Remote Portugal
===
ABOUT THE OPPORTUNITY
Join a digital healthcare company revolutionizing physical therapy through AI and wearable technology. As a Senior Data Engineer, you'll architect the lakehouse infrastructure that powers virtual physical therapy platforms helping patients recover from musculoskeletal conditions through personalized, remotely-guided exercise programs. This full remote position from Portugal offers the chance to build mission-critical data systems that directly improve patient outcomes, reduce pain, and lower healthcare costs. You'll spearhead the migration to Apache Iceberg format, establish robust data pipelines, and create AI-ready data infrastructure that powers machine learning models across the platform — all while working with cutting-edge technologies in a healthcare environment where data quality and governance are paramount.
PROJECT & CONTEXT
You'll lead the migration of existing workloads to the Iceberg format, establishing and maturing the foundational lakehouse architecture that will serve as the backbone for data-driven decision making. Your responsibilities include architecting and building robust batch and streaming data pipelines using Spark and Flink, collaborating closely with Backend Engineering teams on API integrations and formal data contract establishment, and contributing to a unified lineage and governance framework using DataHub. You'll provide comprehensive support to the Core Team in adopting new data platform capabilities, ensuring solutions are platform-oriented and designed for broad organizational use. Building AI-ready data infrastructure is central — you'll ensure clean, governed, and accessible data pipelines that power machine learning models and AI-driven products across the platform, while leveraging AI coding assistants and LLMs to accelerate development and improve code quality.
WHAT WE'RE LOOKING FOR (Required)
- Demonstrated proficiency with Python and PySpark for data processing at scale
- Hands-on experience with data lake formats: Iceberg, Delta Lake, or Hudi
- Solid understanding of Kafka and event-driven architectures
- Proven experience building and orchestrating data pipelines at scale
- Strong SQL proficiency with comprehensive data modeling knowledge
- Familiarity with workflow orchestration tools: Airflow, Dagster, or similar
- Platform-oriented mindset: developing solutions for broad organizational use, not individual purposes
- Ownership mentality: committed to seeing problems through to resolution
- Clear communication skills: ability to articulate complex technical concepts to non-technical stakeholders
- Highly collaborative: excels working alongside backend engineers, data engineers, and analysts
- Pragmatic approach: balancing ideal solutions with practical delivery timelines
- Experience building and maintaining AI-ready data infrastructure
- Ability to leverage AI coding assistants and LLMs to accelerate development
- English proficiency at B2 Upper Intermediate level minimum
NICE TO HAVE (Preferred)
- Demonstrated expertise with Flink or comparable streaming frameworks
- Proficiency in DBT and familiarity with the modern data stack
- Experience with modern data platforms: BigQuery, Trino, Snowflake, or Databricks
- Proven background developing self-service data platforms
- Experience working in regulated healthcare or compliance-sensitive environments
- Knowledge of data governance frameworks and metadata management
- Understanding of healthcare data standards (HL7, FHIR)
- Familiarity with DataHub or similar data catalog/lineage tools
- Experience with infrastructure-as-code and CI/CD for data pipelines
Languages Required: English (B2 Upper Intermediate minimum)
Work Model: Full Remote — must be based in Portugal
Experience Level: Senior
Apply for Position
Or refer someone
Share
- Line
- [ LinkedIn](https://www.linkedin.com/shareArticle?mini=true&url=https://www.careers-page.com/humanit/job/93RW8RX4&title=Senior Data Engineer (Python/PySpark/Kafka) - Full Remote Portugal)
- X (Formerly Twitter)
- [ Email](https://www.careers-page.com/humanitmailto://?&subject=Job: Senior%20Data%20Engineer%20%28Python/PySpark/Kafka%29%20-%20Full%20Remote%20Portugal&body=Hi there,%0D%0A %0D%0A I would like to share with you this job:%0D%0A %0D%0A https://www.careers-page.com/humanit/job/93RW8RX4%0D%0A %0D%0A Best regards%0D%0A)
.redactor-styles { font-family: -apple-system,BlinkMacSystemFont,"Segoe UI",Roboto,"Helvetica Neue",Arial,"Noto Sans",sans-serif,"Apple Color Emoji","Segoe UI Emoji","Segoe UI Symbol","Noto Color Emoji"; }