Senior Data Pipeline Engineer and Data Integration Engineer / API Developer
Job Description
Role Overview: |
Develop efficient SQL queries and maintain views, models, and data structures across federated and transactional DB to support analytics and reporting. |
• SQL (Advanced) |
• Python – for data exploration and scripting |
• Shell scripting – for lightweight automation |
Key Responsibilities: |
• Write complex SQL queries for data extraction and transformations |
• Build and maintain views, materialized views, and data models |
• Enable efficient federated queries and optimize joins across databases |
• Support performance tuning, indexing, and query optimization efforts |
Primary: |
• Expertise in MS SQL Server / Oracle DB / PostgresSQL , Columnar DBs like DuckDB , and federated data access |
• Good understanding of Apache Arrow columnar data format, Flight SQL, Apache Calcite |
• Secondary: Experience with data modelling, ER diagrams, and schema design |
• Familiarity with reporting layer backend (e.g., Power BI datasets) |
• Familiarity with utility operations and power distribution is preferred |
• Experience with cloud-hosted databases is preferred |
• Exposure to data lake in cloud ecosystems is a plus |
Optional |
• Familiar with Grid CIM (Common Information Model; IEC 61970, IEC 61968) |
• Familiarity with GE ADMS DNOM (Distribution Network Object Model) |
• GE GridOS Data Fabric |
Hyderabad, IN