Digital Continuity & Manuf Engineer
At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world’s most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology experts think outside the box as they provide unique R&D and engineering services across all industries. Join us for a career full of opportunities. Where you can make a difference. Where no two days are the same.
Job Description
We are seeking a skilled and motivated Data Engineer to design, build, and maintain scalable data pipelines and infrastructure. The ideal candidate will have hands-on experience with big data technologies, cloud platforms, and programming languages, and will play a key role in enabling data-driven decision-making across the organization.
Key Responsibilities:
- Design, develop, and optimize data pipelines for ETL processes using Apache Hadoop, Spark, and other big data tools.
- Implement and manage data workflows in cloud environments, primarily Microsoft Azure.
- Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver robust solutions.
- Ensure data quality, integrity, and security across all stages of data processing.
- Develop and maintain scalable data architectures for structured and unstructured data.
- Write efficient SQL queries for data extraction, transformation, and analysis.
- Monitor and troubleshoot data pipeline performance and reliability.
- Document data engineering processes and best practices.
Primary Skills:
- Big Data Technologies: Apache Hadoop, Spark
- Cloud Platforms: Microsoft Azure (Data Factory, Synapse, Blob Storage, etc.)
- Programming Languages: Python, Java
- ETL Tools & Techniques: Data ingestion, transformation, and loading
- SQL & Data Querying: Advanced SQL for data manipulation and analysis
- Data Processing & Management: Batch and real-time data processing
- Data Analysis & Business Intelligence: Integration with BI tools and dashboards
Secondary Skills:
- Cloud Computing Concepts: Public cloud, hybrid cloud, cloud security
- Multi-Paradigm Programming: Functional and object-oriented programming
- Software Development Practices: Version control, CI/CD, testing
- Data Science Fundamentals: Understanding of statistical methods and machine learning workflows
- Information Technology: General IT knowledge including networking, storage, and system architecture
- Cloud Providers: Familiarity with AWS or Google Cloud Platform is a plus
- Communication & Collaboration: Ability to work cross-functionally and explain technical concepts to non-technical stakeholders
Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With its strong over 55-year heritage, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market leading capabilities in AI, generative AI, cloud and data, combined with its deep industry expertise and partner ecosystem.
Bangalore, IN