12 April

Site Reliability Engineer

SAI Global - Sydney, NSW

IT
Source: uWorkin

JOB DESCRIPTION

Site Reliability Engineer

Sydney

Risk/ Applications

Full Time

SAI Global’s Cloud Operations team is expanding its Network Engineering team and is looking to add a seasoned systems automation engineer with a broad skillset across all things cloud automation to further build out our automation tooling. Site Reliability Engineers (SREs) are responsible for keeping all user-facing services and other SAI Global production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople who apply sound engineering principles, operational discipline, and mature automation to our environments and the SAI Global codebase. We specialize in systems, whether that means networking, the Linux kernel, or a more specific interest in scaling, algorithms, or distributed systems.

As An SRE You Will

  • Join an on-call rotation to respond to SAI Global production application availability incidents and support service engineers with customer incidents.
  • Use your on-call shift to prevent incidents from ever happening.
  • Run our infrastructure with Ansible, Puppet, Terraform and Kubernetes.
  • Make monitoring and alerting fire on symptoms rather than on outages.
  • Document every action so your findings turn into repeatable actions, and then into automation.
  • Improve the deployment process to make it as boring as possible.
  • Design, build and maintain core infrastructure pieces that allow SAI Global production applications to scale to support hundreds of thousands of concurrent users.
  • Debug production issues across services and levels of the stack.
  • Plan the growth of SAI Global's infrastructure.

You may be a fit for this role if you:

  • Think with a cloud-first mindset, regardless of the flavor of public cloud.
  • Think with a security-centric mindset; we do sell compliance software!
  • Think about systems: edge cases, failure modes, behaviors, specific implementations.
  • Know your way around Linux and Windows.
  • Know what config management systems like Ansible or Puppet are used for.
  • Have strong programming skills: Python, Java, Golang, Node.js.
  • Have an urge to collaborate and communicate asynchronously.
  • Have an urge to document all the things so you don't need to learn the same thing twice.
  • Have an enthusiastic, go-for-it attitude. When you see something broken, you can't help but fix it.
  • Have an urge to deliver quickly and iterate fast.
  • Share our values, and work in accordance with those values.
  • Have experience with Nginx, HAProxy, Docker, Kubernetes, Terraform, or similar technologies

Projects You Could Work On

  • Coding infrastructure automation with Ansible and Terraform
  • Improving our Prometheus monitoring or building new metrics
  • Helping release managers deploy and fix new versions of our application software
  • Planning, preparing for, and executing the migration from virtual machines running on AWS to cloud-native, container-based deployments with Kubernetes using Amazon Elastic Kubernetes Service (EKS)
  • Developing a relationship with a product group and defining their SRE KPIs, as SAI Global is early in its SRE journey

SAI Global is a recognized leading provider of integrated risk management solutions and assurance solutions. We help organizations protect their brands by proactively managing risk to achieve business excellence, growth, sustainability, and trust.

We have a history rich in the development of innovative business solutions, and today this tradition of innovation continues with industry-leading products and solutions in our core business areas of risk management software, standards aggregation, regulatory content, ethics and compliance learning, risk assessments, certification, testing and audits.

SAI Global acquired BWise from Nasdaq in 2019. The combination of BWise’s award-winning risk management, internal audit and regulatory compliance platform with SAI Global’s industry-leading SAI360 risk and compliance solution created the most complete integrated approach to risk management in the marketplace.

For more information, visit our company site at www.saiglobal.com or our career site.