Data Engineer (Microsoft Azure/Storage/ETL, Ukraine)
At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world’s most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology experts think outside the box as they provide unique R&D and engineering services across all industries. Join us for a career full of opportunities. Where you can make a difference. Where no two days are the same.
Your Client
You’ll be working with a globally recognized leader in the beverage and consumer goods industry, known for its commitment to innovation, sustainability, and data-driven decision-making. The client is undergoing a digital transformation, leveraging advanced data platforms and cloud technologies to optimize operations and enhance customer experiences across markets.
Your Role
Semantic & Knowledge Graph Engineering
Implement RDF mapping using RML/YARRRML/RDF-star and tools like Morph-KGC.
Integrate SHACL validation for semantic rule enforcement.
Ingest and query knowledge graphs using Apache Jena and SPARQL.
Data Architecture & Modeling
Design and implement scalable data models using Azure Synapse Analytics, Delta Lake, and SQL (T‑SQL).
Apply normalisation techniques and dimensional modelling to support analytical workloads.
Ensure data structures align with business requirements and performance goals.
Data Pipeline Development
Build and maintain robust ETL/ELT pipelines using Azure Data Factory (ADF) and Spark (PySpark/Scala/Spark SQL).
Implement incremental/delta processing strategies to optimise data refresh cycles.
Integrate CI/CD pipelines for data workflows using GitHub Actions or Azure DevOps.
Data Quality & Governance
Define and enforce data quality rules, validation checks, and cleansing logic.
Collaborate with stakeholders to implement data governance policies.
Leverage Azure Purview for metadata cataloguing, lineage tracking, and compliance.
Storage & Infrastructure
Manage and optimise data storage in Azure Data Lake Storage Gen2 (ADLS Gen2).
Configure and monitor GraphDB, OpenSearch, and other infrastructure components as needed.
Monitoring & Performance
Conduct performance tuning of queries and pipelines to ensure efficiency.
Set up monitoring and alerting using App Insights and Log Analytics for proactive issue detection.
Collaboration & Delivery
Work closely with data scientists, analysts, and business users to understand data needs.
Translate business requirements into technical specifications and deliverables.
Document data flows, transformations, and architecture decisions.
Your Skillset
RML/YARRRML/RDF-star for RDF Mapping, SHACL Validation Integration, Knowledge of RDF/SPARQL for Knowledge Graph Ingestion, Morph-KGC, Apache Jena, Azure Synapse Analytics, Spark (PySpark/Scala/Spark SQL), Delta Lake, SQL (T‑SQL), Data Modeling & Normalization, Data Quality & Governance, Azure Data Factory (ADF), Azure Data Lake Storage Gen2 (ADLS Gen2), Azure Purview (catalog & lineage), ETL/ELT Pipeline Design, Performance Tuning & Optimization, Incremental/Delta Processing, CI/CD for Data Pipelines (GitHub Actions/Azure DevOps), Monitoring & Alerting (App Insights/Log Analytics)
What You Will Love About Working Here
- We care about all our employees and want them to feel as comfortable as possible. That's why we offer health insurance from the first days of employment, regardless of the probationary period.
- A gift from the company: Christmas holidays from 25 December to 31 December.
- Cooperation with the Superhumans Center and Veteran Hub. Capgemini Engineering supported the launch of the psychological rehabilitation department at Superhumans. Our team also donated over UAH 500,000 for prosthetics for three Ukrainian defenders. Currently, we support psychological counseling provided by the Veteran Hub, and with the Hub's assistance we have implemented an internal policy that makes the company friendly to military personnel and veterans.
Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With a strong heritage of over 55 years, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market-leading capabilities in AI, generative AI, cloud and data, combined with its deep industry expertise and partner ecosystem.
Odesa, UA; Kyiv, UA; Lviv, UA; Rivne, UA