
Metrics Platform Site Reliability Engineer

Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues around the world, and where you’ll be able to reimagine what’s possible. Join us and help the world’s leading organizations unlock the value of technology and build a more sustainable, more inclusive world. 

Job Location - Atlanta, GA

Job Description

Key Responsibilities

Manage and mentor a team of Site Reliability Engineers

Define and implement SRE strategies and best practices in alignment with organizational objectives

Monitor clients' service level agreements (SLAs), service level objectives (SLOs), and service level indicators (SLIs)

Lead initiatives to improve system reliability, availability, scalability, and performance

Collaborate with development and operations teams to ensure reliability and resiliency goals are met

Implement and improve incident management processes to minimize downtime and ensure timely resolutions

Review and contribute to the architecture of critical systems, ensuring they meet reliability and performance goals

Drive observability practices by implementing robust monitoring, logging, and alerting systems

 

Skills required

Proficiency in writing Splunk queries and alerts is a must

Hands-on experience with at least one APM tool (New Relic, AppDynamics, Honeycomb, Datadog) is a must

Expertise in automation tools and scripting languages (Python or JavaScript) is a must

Proficiency in scripting languages (Python or Node.js) is a must

Proficiency in any cloud platform (AWS, GCP, Azure) is a must

Strong understanding of distributed systems, microservices architecture, and container orchestration tools (e.g., Kubernetes)

Experience with monitoring tools like Prometheus and Grafana is a must

 

Job Description

Monitoring and Alerting

Implement and maintain monitoring systems to proactively identify potential issues and alert engineers to problems before they impact users

Incident Response

Respond to incidents and outages, diagnose problems, and implement solutions to minimize downtime and restore service

Automation

Automate repetitive tasks and processes to improve efficiency and reduce manual effort

Performance Optimization

Identify and address performance bottlenecks to ensure systems run efficiently and effectively

Infrastructure Management

Manage and maintain the underlying infrastructure, including servers, networks, and cloud resources

Capacity Planning

Plan for future capacity needs to ensure systems can handle anticipated workloads

Release Engineering

Develop and maintain processes for deploying software updates and releases

Collaboration

Work closely with developers, operations teams, and other stakeholders to ensure system reliability and availability

Documentation

Maintain clear and concise documentation of systems, processes, and procedures

Continuous Improvement

Identify areas for improvement and implement changes to enhance system reliability and performance

Life at Capgemini

Capgemini supports all aspects of your well-being throughout the changing stages of your life and career. For eligible employees, we offer:

  • Flexible work
  • Healthcare including dental, vision, mental health, and well-being programs
  • Financial well-being programs such as 401(k) and Employee Share Ownership Plan
  • Paid time off and paid holidays
  • Paid parental leave
  • Family building benefits like adoption assistance, surrogacy, and cryopreservation
  • Social well-being benefits like subsidized back-up child/elder care and tutoring
  • Mentoring, coaching and learning programs
  • Employee Resource Groups
  • Disaster Relief

Disclaimer:

Capgemini is an Equal Opportunity Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law.
This is a general description of the Duties, Responsibilities and Qualifications required for this position. Physical, mental, sensory or environmental demands may be referenced in an attempt to communicate the manner in which this position traditionally is performed. Whenever necessary to provide individuals with disabilities an equal employment opportunity, Capgemini will consider reasonable accommodations that might involve varying job requirements and/or changing the way this job is performed, provided that such accommodations do not pose an undue hardship.
Capgemini is committed to providing reasonable accommodations during our recruitment process. If you need assistance or accommodation, please reach out to your recruiting contact.

Click the following link for more information on your rights as an applicant: http://www.capgemini.com/resources/equal-employment-opportunity-is-the-law

Salary Transparency:

Capgemini discloses salary range information in compliance with state and local pay transparency obligations. The disclosed range represents the lowest to highest salary we, in good faith, believe we would pay for this role at the time of this posting, although we may ultimately pay more or less than the disclosed range, and the range may be modified in the future. The disclosed range takes into account the wide range of factors that are considered in making compensation decisions including, but not limited to, geographic location, relevant education, qualifications, certifications, experience, skills, seniority, performance, sales or revenue-based metrics, and business or organizational needs. At Capgemini, it is not typical for an individual to be hired at or near the top of the range for their role. The base salary range for the tagged location is $100,000 - $130,000 / year.

This role may be eligible for other compensation including variable compensation, bonus, or commission. Full-time regular employees are eligible for paid time off, medical/dental/vision insurance, 401(k), and any other benefits offered to eligible employees.

Note: No amount of pay is considered to be wages or compensation until such amount is earned, vested, and determinable. The amount and availability of any bonus, commission, or any other form of compensation allocable to a particular employee remain in the Company’s sole discretion unless and until paid and may be modified at the Company’s sole discretion, consistent with the law.

Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With a strong heritage of more than 55 years, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market-leading capabilities in AI, generative AI, cloud and data, combined with its deep industry expertise and partner ecosystem.

Ref. code:  360290
Posted on:  Nov 19, 2025
Experience Level:  Experienced Professionals
Contract Type:  Permanent
Location:  Atlanta, GA, US; New York, US; New York, NY, US

Brand:  Capgemini
Professional Community:  Software Engineering


Nearest Major Market: Atlanta
