Share this Job
Software-Design and Development

Senior Site Reliability Engineer

What we offer

Our company culture is focused on helping our employees enable innovation by building breakthroughs together. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and future-focused work. We offer a highly collaborative, caring team environment with a strong focus on learning and development, recognition for your individual contributions, and a variety of benefit options for you to choose from. Apply now!




The Site Reliability Engineering (SRE) team at Ariba (an SAP Company) is a highly motivated technical group determined to triage and restore mission critical services with a passion for improving and maintaining uptime. The primary responsibility of the SRE team is to monitor critical infrastructure/applications, manage fault tolerance across an enterprise cloud business and provide the necessary coverage to protect Ariba's Business Commerce 24x7 within the Cloud Operations organization. The successful candidate will possess the necessary experience to have strong knowledge of Unix systems, networking protocols, desire to build the necessary tools in order to accomplish the task at hand. The candidate must understand incident management and methodologies possess excellent verbal and written communication skills and be able to interact effectively with engineering and other operations teams.


Primary Responsibilities:

  • Proactively monitor availability and performance of the Ariba Cloud using key performance tools.
  • Effectively and quickly respond to monitoring alerts, incident tickets and overall technical support for the Ariba product suite
  • Perform extensive application and web site troubleshooting to quickly resolve issues.
  • Work closely with subject matter experts within various Engineering teams
  • Ensure user tickets and monitoring alerts are handled according to pre-defined SLA's for response time, updates and closure.
  • Develop and automate manual tasks to improve day-to-day monitoring and scalability of time critical operations.
  • Handle communication and notification on major site issues to executive management teams.
  • Document standard operating procedures to effectively utilize ITIL best practices.
  • Ensure effective shift turnovers for continuous 24/7 support.
  • Experience with Application performance monitoring and Real user monitoring tools like Datadog, Dynatrace.
  • Experience in provisioning and managing Public cloud Environment, preferably GCP, AWS
  • Experience in Cloud provisioning tools, preferably Terraform
  • Experience in CI/CD and Devops tools like Jenkins, Artifactory, Docker, Vault
  • Basic knowledge in Kubernetes
  • Basic knowledge of Hana database administration




Minimum Qualifications

  • 5-7 years of experience working in a Unix environment
  • Experience working in a 24 x 7 enterprise environment
  • Experience with Application performance monitoring and Real user monitoring tools like Datadog, Dynatrace.
  • Experience in provisioning and managing Public cloud Environment, preferably GCP
  • Experience in Cloud provisioning tools, preferably Terraform
  • Experience in CI/CD and devops tools like Jenkins, Artifactory, Docker, Vault
  • Triage and support system applications including but not limited to Apache, DNS, Sendmail, SSH, TCP/IP, NFS and common Internet protocols.
  • Excellent knowledge of operating system internals, file system structures and machine architectures in a Linux operating environment.
  • Basic knowledge of Hana database administration
  • Ability to write and maintain Perl and Shell scripts to automate processes and enhance productivity.
  • Experienced working in a dynamic, fast-paced environment with well-developed practices and procedures.
  • Outstanding interpersonal, analytical, and communication skills
  • Must be reliable and dependable with ability to multi-task across various functions
  • BA/BS degree in MIS/CS or equivalent experience




We are SAP

SAP innovations help more than 400,000 customers worldwide work together more efficiently and use business insight more effectively. Originally known for leadership in enterprise resource planning (ERP) software, SAP has evolved to become a market leader in end-to-end business application software and related services for database, analytics, intelligent technologies, and experience management. As a cloud company with 200 million users and more than 100,000 employees worldwide, we are purpose-driven and future-focused, with a highly collaborative team ethic and commitment to personal development. Whether connecting global industries, people, or platforms, we help ensure every challenge gets the solution it deserves. At SAP, we build breakthroughs, together.


Our inclusion promise

SAP’s culture of inclusion, focus on health and well-being, and flexible working models help ensure that everyone – regardless of background – feels included and can run at their best. At SAP, we believe we are made stronger by the unique capabilities and qualities that each person brings to our company, and we invest in our employees to inspire confidence and help everyone realize their full potential. We ultimately believe in unleashing all talent and creating a better and more equitable world.


SAP is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to the values of Equal Employment Opportunity and provide accessibility accommodations to applicants with physical and/or mental disabilities. If you are interested in applying for employment with SAP and are in need of accommodation or special assistance to navigate our website or to complete your application, please send an e-mail with your request to Recruiting Operations Team: Americas: Careers.NorthAmerica@sap.com or Careers.LatinAmerica@sap.com, APJ: Careers.APJ@sap.com, EMEA: Careers@sap.com.


EOE AA M/F/Vet/Disability:

Qualified applicants will receive consideration for employment without regard to their age, race, religion, national origin, ethnicity, age, gender (including pregnancy, childbirth, et al), sexual orientation, gender identity or expression, protected veteran status, or disability.

Successful candidates might be required to undergo a background verification with an external vendor.

 Requisition ID:292326 | Work Area: Software-Design and Development | Expected Travel: 0 - 10% | Career Status: Professional | Employment Type: Regular Full Time  | Additional Locations: 

Senior Site Reliability Engineer

Facility:  292326
Posted Date:  Oct 3, 2021
Work Area:  Software-Design and Development
Career Status:  Professional
Employment Type:  Regular Full Time
Expected Travel:  0 - 10%

Palo Alto, CA, US, 94304

Nearest Major Market: San Jose
Nearest Secondary Market: Palo Alto

Job Segment: Engineer, ERP, SAP, Database, Unix, Engineering, Technology