Site Reliability Engineer III



Software Engineering, IT
Atlanta, GA, USA
Posted on Thursday, June 8, 2023

Your work days are brighter here.

At Workday, it all began with a conversation over breakfast. When our founders met at a sunny California diner, they came up with an idea to revolutionize the enterprise software market. And when we began to rise, one thing that really set us apart was our culture, a culture driven by our value of putting our people first. Ever since, the happiness, development, and contribution of every Workmate have been central to who we are. Our Workmates believe a healthy, employee-centric, collaborative culture is the essential mix of ingredients for success in business. That’s why we look after our people, communities, and the planet while still being profitable. Feel encouraged to shine, however that manifests: you don’t need to hide who you are. You can feel the energy and the passion; it's what makes us unique. Inspired to make a brighter work day for all and transform with us to the next stage of our growth journey? Bring your brightest version of you and have a brighter work day here.

About the Team

At Workday, we help the world’s largest organizations adapt to what’s next by bringing finance, HR, and planning into a single enterprise cloud. We work hard, and we’re serious about what we do. But we like to have fun, too. We put people first, celebrate diversity, drive innovation, and do good in the communities where we live and work.

The Service Reliability Engineering team at Workday relentlessly pursues reliability and availability of customer environments by employing a culture of learning, continuous improvement, and an engineering focus.

The team builds and crafts innovative solutions to scale our day-to-day operations. We support our Workday customers with high energy, attention to detail, and strong collaboration.

About the Role

Are you a creative SRE who is looking for more opportunities to improve reliability and who enjoys building solutions to reduce toil and manual effort?

With constant attention and focus on our customers (both internal and external), you will deliver quickly on a wide range of daily tasks, from environment provisioning, performance monitoring, and environment problem solving to ad-hoc requests and automation efforts, while providing transparency into the work being performed.
This role requires a good understanding of Linux systems in a production environment. You will be part of a team that supports customers located in public and private cloud environments.

About You

We would love to hear from you if you have been part of an Operations-to-SRE transformation in a previous role, like trying new techniques and approaches on sophisticated problems, love to learn new technologies, and are a natural collaborator and an extraordinary teammate who brings out the best in everyone around you.

You understand that availability of Workday Service is paramount, are able to support a daytime-only shift pattern that includes some weekends, provide careful planning of changes, write detailed runbooks, share knowledge with colleagues, and engage in effective teamwork. You respond to impactful issues promptly and can handle an incident through to completion.
If the work performed is manual and repeated often, you like to find a way to automate the task. What's more, you deliver!

Basic Qualifications for Site Reliability Engineer:

  • 3+ years of experience running and maintaining a 24x7 large-scale production environment, preferably across multiple data centers

  • BS or MS degree in Computer Science, Engineering, or related technical field, or equivalent experience

Other Qualifications for Site Reliability Engineer:

    • Experience deploying and operating Apache Tomcat, HTTPd, MySQL, and Java web applications, preferably with source control

    • Proven expertise with Linux and debugging fundamentals, and a solid understanding of how to isolate issues quickly

    • Experience with many tool sets: Chef, Puppet, OSSEC, Splunk, Elasticsearch, BladeLogic, Ansible, Jira, Confluence, Wavefront, Grafana, Kubernetes, Prometheus

    • Strong understanding of enterprise-level thinking on a few levels: documentation, runbooks, root cause analysis, capacity trending, bug fixes, and scripting

    • A secret passion for monitoring. When false positives show up on your radar, you quickly address them. The top item on your inner wish list is to "make monitoring phenomenal again".

    • Can balance multiple tasks, make the right business decisions, tackle problems under pressure, and prioritize and organize effectively

    • Ability to work some weekends is required as part of the on-call support and production update rotation

    • Experience with CentOS, SunOS, Solaris, Linux, and DevOps is a plus

Our Approach to Flexible Work

With Flex Work, we’re combining the best of both worlds: in-person time and remote. Our approach enables our teams to deepen connections, maintain a strong community, and do their best work. We know that flexibility can take shape in many ways, so rather than a number of required days in-office each week, we simply spend at least half (50%) of our time each quarter in the office or in the field with our customers, prospects, and partners (depending on role). This means you'll have the freedom to create a flexible schedule that caters to your business, team, and personal needs, while being intentional to make the most of time spent together. Those in our remote "home office" roles also have the opportunity to come together in our offices for important moments that matter.

Workday Pay Transparency Statement - United States

Workday pay ranges vary based on work location. As a part of the total compensation package, this role may be eligible for the Workday Bonus Plan or a role-specific commission/bonus, as well as annual refresh stock grants. Recruiters can share more detail during the hiring process. Each candidate’s compensation offer will be based on multiple factors including, but not limited to, geography, experience, skills, job duties, and business need, among other things.

Pursuant to applicable Fair Chance law, Workday will consider for employment qualified applicants with arrest and conviction records.

Workday is an Equal Opportunity Employer including individuals with disabilities and protected veterans.

Are you being referred to one of our roles? If so, ask your connection at Workday about our Employee Referral process!