Software-Development Operations

Senior Site Reliability Engineer, ISBN Cloud Ops

We help the world run better

Our company culture is focused on helping our employees enable innovation by building breakthroughs together. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and future-focused work. We offer a highly collaborative, caring team environment with a strong focus on learning and development, recognition for your individual contributions, and a variety of benefit options for you to choose from. Apply now!

As a Senior Site Reliability Engineer (SRE), you will be to part of a high-performance team which continuously improves the reliability of critical systems, working closely with development and operations teams. The SRE is responsible for monitoring, troubleshooting, and developing tolling and automation to optimize system performance and efficiency.


The ideal candidate will challenge the status quo. To be successful in this role, you will need to thrive in an agile environment where teams work together toward a common goal.

· Identify engineering defects in the existing code base and continuously improve the code quality.

· Collaborate with development teams to implement and deploy new features and enhancements, ensuring they meet reliability and performance standards.

· Performs code reviews and pair program with other engineers on the team.

· Define and implement efficient end-to-end provisioning of automation solutions.

· Build CI/CD pipeline configurations to orchestrate provisioning and deployment.

· Automate monitoring tools to monitor system health and reliability to support high uptime requirements.

· Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.

· Collaborate with cross-functional teams to define and establish service level indicators (SLIs), service level objectives (SLOs) and key software engineering metrics.

· Automate infrastructure in AWS and in private data centers with CloudFormation, Terraform, Ansible, and AWS DevOps tools.

· Conduct post-incident analyses to identify root causes and implement preventive measures to avoid future incidents.

· Perform capacity planning and resource allocation to ensure optimal system performance and scalability.

· Stay up to date with industry best practices, new technologies, and emerging trends in site reliability engineering.

· Create and maintain documentation for system architecture, configuration, and troubleshooting procedures.



· Full understanding of DevOps, SRE and agile software development roles and concepts.

· Senior level ability to use one or more of these languages: Phyton, Typescript/Javascript, Golang, Java, or C#.

· Full understanding of Git (code version control) and software development best practices (GitOps).

· Strong knowledge of Linux/Unix systems and command line tools.

· Senior level knowledge of IaC and Configuration Management, using technologies such as Cloud Formation, Terraform, Puppet and Ansible.

· Senior level knowledge of AWS main resources (VPC, EC2, IAM, API Gateway, autoscaling, availability zones, Lambda...) and deploying and running systems at scale.

· Full understanding of microservices architecture (concepts).

· Full understanding of observability best practices, and monitoring and logging tools such as Dynatrace, New Relic, Prometheus, Grafana, ELK stack, Splunk…

· Senior level knowledge of Jenkins, AWS Code Deploy or similar CI/CD tools and pipelines.

· Prior experience with containerized deployments

We build breakthroughs together

SAP innovations help more than 400,000 customers worldwide work together more efficiently and use business insight more effectively. Originally known for leadership in enterprise resource planning (ERP) software, SAP has evolved to become a market leader in end-to-end business application software and related services for database, analytics, intelligent technologies, and experience management. As a cloud company with 200 million users and more than 100,000 employees worldwide, we are purpose-driven and future-focused, with a highly collaborative team ethic and commitment to personal development. Whether connecting global industries, people, or platforms, we help ensure every challenge gets the solution it deserves. At SAP, we build breakthroughs, together.

We win with inclusion

SAP’s culture of inclusion, focus on health and well-being, and flexible working models help ensure that everyone – regardless of background – feels included and can run at their best. At SAP, we believe we are made stronger by the unique capabilities and qualities that each person brings to our company, and we invest in our employees to inspire confidence and help everyone realize their full potential. We ultimately believe in unleashing all talent and creating a better and more equitable world.
SAP is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to the values of Equal Employment Opportunity and provide accessibility accommodations to applicants with physical and/or mental disabilities. If you are interested in applying for employment with SAP and are in need of accommodation or special assistance to navigate our website or to complete your application, please send an e-mail with your request to Recruiting Operations Team:
For SAP employees: Only permanent roles are eligible for the SAP Employee Referral Program, according to the eligibility rules set in the SAP Referral Policy. Specific conditions may apply for roles in Vocational Training.

EOE AA M/F/Vet/Disability:

Qualified applicants will receive consideration for employment without regard to their age, race, religion, national origin, ethnicity, age, gender (including pregnancy, childbirth, et al), sexual orientation, gender identity or expression, protected veteran status, or disability.
Successful candidates might be required to undergo a background verification with an external vendor.

Requisition ID: 393471  | Work Area: Software-Development Operations  | Expected Travel: 0 - 10%  | Career Status: Professional  | Employment Type: Regular Full Time   | Additional Locations: #LI-Hybrid.

Requisition ID:  393471
Posted Date:  Jun 21, 2024
Work Area:  Software-Development Operations
Career Status:  Professional
Employment Type:  Regular Full Time
Expected Travel:  0 - 10%

São Leopoldo, BR, 93022-718

Job alert

Job Segment: ERP, Cloud, SAP, Software Engineer, Unix, Technology, Engineering