Share this Job
Information Technology

Senior Service Reliability Engineer (SRE De - Escalation Infrastructure Expert)

What we offer

Our company culture is focused on helping our employees enable innovation by building breakthroughs together. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and future-focused work. We offer a highly collaborative, caring team environment with a strong focus on learning and development, recognition for your individual contributions, and a variety of benefit options for you to choose from. Apply now!




Global Cloud Services (GCS) is responsible for SAP’s Infrastructure & Technical foundation, including state-of-the-art data centers, public cloud, and associated platforms. This serves as the underpinning for SAP’s Cloud Solutions, including internal development, training, and demo landscapes. 

Service Reliability Engineering (SRE) is a team within the GCS organization. It contributes to ensure the reliability and availability of SAP cloud services (internal or external) by building and running tools that helps to either prevent or isolate an incident. SRE’s proactively help automate and optimize processes. The primary goal of the team is to reduce MTTD/MTTR and contribute to technical troubleshooting for major incidents, develop the Monitor of Monitors tool, integrations for ServiceNow/PagerDuty and support technical RCA follow-ups to move technical service quality forward. The SRE team runs globally in a follow the sun model. 

We are looking for a Senior Service Reliability Engineer (SRE De-Escalation Infrastructure Expert) focusing on both soft and physical layers of our global operations.  



In this role you will have visibility into all tiers of the service, from infrastructure to application lifecycle and code. You will be a core driver in identifying technical gaps, while defining and driving preventive measures. You will have not only the freedom but the directive to own the situation and control the resources if necessary, to resolve the issue. 


Everything you work on is geared to the big picture of SAP’s Cloud Solutions availability. You are semi-autonomous of the other Engineering, Infrastructure and Delivery teams. This means providing an independent perspective on technical operations across GCS with a focus on availability, performance, and risk through our team driven solutions. 

Do you love to solve infrastructure related problems and troubleshoot using tools you build? Are you a network and/or compute expert? Do you analyze data/problems in such a way that helps to automate processes to avoid incidents? Have you developed integrations for monitoring tools? Do you have an SRE / DevOps mindset with strong infrastructure skills? Would you like to work in an agile environment? 

What makes this position unique is the need for a strong technical background, combined with leadership and problem-solving capabilities. Your mission will be to proactively own high severity technical escalations, lead the technical engagement across teams and help restore services as quickly as possible. This results in a primary emphasis to reduce incident resolution time on service/business impacting events.




  • Bachelors or Masters Degree in Computer Science or a related technical field – or equivalent applied experience 
  • 10+ years professional experience out of which 6+ years experience with networking, compute, storage and other infrastructure platforms, technical analysis (code or infrastructure) and/or software development 
  • Self Starter who acts with a Sense of Urgency to quickly move issues forward efficiently and effectively.  
  • Fast learner, with initiative to learn a new skill on your own by studying resources and practicing independently. 
  • Excellent communication and interpersonal skills, a trustworthy team player. 
  • Calm and composed in critical situations to interact with stakeholders and make timely decisions 
  • Rotational weekend and/or holiday coverage with allowance and time compensation in accordance with local policies 
  • Knowledgeable in (with one area of expertise) programmatic language development, networking, SaaS, PaaS, Software Cloud architecture, CI/CD, large-scale distributed systems 
  • Solid Understanding of Enterprise / Service Provider Data Center Architecture (high density servers, backbone routers/switches, load balancers, SAN/NAS), IT security principles and disaster recovery 
  • Strong familiarity Enterprise class Fault Monitoring and Performance Management tools 
  • Kubernetes, Python (or other scripting language), Public cloud (AWS, GCP, Azure) 
  • Elastic, Logstash, Kibana experience a plus
  • Industry Technical Certifications (CCNA, CCNP, OCA, RHCE, etc.) and ITIL related courseware a plus



We are SAP

SAP innovations help more than 400,000 customers worldwide work together more efficiently and use business insight more effectively. Originally known for leadership in enterprise resource planning (ERP) software, SAP has evolved to become a market leader in end-to-end business application software and related services for database, analytics, intelligent technologies, and experience management. As a cloud company with 200 million users and more than 100,000 employees worldwide, we are purpose-driven and future-focused, with a highly collaborative team ethic and commitment to personal development. Whether connecting global industries, people, or platforms, we help ensure every challenge gets the solution it deserves. At SAP, we build breakthroughs, together.


Our inclusion promise

SAP’s culture of inclusion, focus on health and well-being, and flexible working models help ensure that everyone – regardless of background – feels included and can run at their best. At SAP, we believe we are made stronger by the unique capabilities and qualities that each person brings to our company, and we invest in our employees to inspire confidence and help everyone realize their full potential. We ultimately believe in unleashing all talent and creating a better and more equitable world.


SAP is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to the values of Equal Employment Opportunity and provide accessibility accommodations to applicants with physical and/or mental disabilities. If you are interested in applying for employment with SAP and are in need of accommodation or special assistance to navigate our website or to complete your application, please send an e-mail with your request to Recruiting Operations Team: Americas: Careers.NorthAmerica@sap.com or Careers.LatinAmerica@sap.com, APJ: Careers.APJ@sap.com, EMEA: Careers@sap.com.


EOE AA M/F/Vet/Disability:

Qualified applicants will receive consideration for employment without regard to their age, race, religion, national origin, ethnicity, age, gender (including pregnancy, childbirth, et al), sexual orientation, gender identity or expression, protected veteran status, or disability.

Successful candidates might be required to undergo a background verification with an external vendor.

 Requisition ID:297942 | Work Area: Information Technology | Expected Travel: 0 - 10% | Career Status: Professional | Employment Type: Regular Full Time  | Additional Locations: 

Senior Service Reliability Engineer (SRE De - Escalation Infrastructure Expert)

Facility:  297942
Posted Date:  Oct 16, 2021
Work Area:  Information Technology
Career Status:  Professional
Employment Type:  Regular Full Time
Expected Travel:  0 - 10%

Budapest, HU, 1031

Job Segment: ERP, Engineer, Cisco, Cloud, SAP, Technology, Engineering