Data Engineering Lead
Your Role
In this role, you will:
- Architect and implement modern, scalable data pipelines supporting batch, streaming, and real-time ingestion.
- Define and institutionalize data modeling best practices for analytical, AI, and retrieval-ready datasets.
- Build and enforce robust data quality frameworks ensuring high trust in data across systems.
- Drive performance optimization initiatives across ingestion, transformation, and storage layers.
- Establish strong data governance, access control, classification, and compliance mechanisms.
- Lead technical decision-making for large-scale data platform architecture.
- Mentor and grow a high-performing data engineering team, fostering engineering excellence and ownership.
- Collaborate cross-functionally with ML, RAG, MLOps, Security, and Product teams to deliver impactful data solutions.
- Continuously evaluate and adopt new technologies to improve data platform capabilities and efficiency.
Your Profile
- Strong hands-on experience with Python, SQL, and ETL/ELT pipeline development.
- Experience with AWS data services: S3, Glue, Glue Data Catalog, Athena, Lambda, Step Functions, CloudWatch, IAM, KMS.
- Working knowledge of Amazon Bedrock Knowledge Bases, Agents, Guardrails, and model inference APIs. Bedrock Guardrails can be associated with agents to validate inputs and outputs for safer GenAI applications.
- Experience with RAG data pipelines, including document ingestion, chunking, metadata tagging, embedding generation, vector indexing, and retrieval validation.
- Experience with alarm/event data processing, network telemetry, logs, ticket data, and topology/inventory datasets.
- Ability to parse semi-structured data from CSV, JSON, logs, SNMP trap payloads, and alarm text.
- Understanding of network topology concepts: device, interface, link, LAG, site, region, upstream/downstream, service dependency.
- Experience with data quality frameworks, schema validation, deduplication, and anomaly detection.
- Familiarity with OpenSearch, DynamoDB, Redshift, PostgreSQL, or vector databases.
- Good understanding of security, access control, encryption, and audit logging in AWS.
What You’ll Love About Working Here
- Opportunity to lead and shape next-generation data platforms powering AI and analytics at scale.
- Work on cutting-edge technologies including AI-ready data systems, RAG pipelines, and intelligent automation.
- Collaborative environment with highly skilled teams across data, AI, and product engineering.
- Strong focus on innovation, engineering excellence, and continuous learning.
- Flexible work culture that supports work-life balance and professional growth.
- Visibility and impact — your work directly influences critical business decisions and AI capabilities.
About Us
Gurgaon, IN