3 Easy Steps

  • 1Search for courses by Study Area, Level and Location
  • 2We deliver you all the matched results
  • 3Choose one or more course providers to contact you
Industry

Distance from location (kms)

Exact 5 10 25 50 100

Posted since

All 2 Days 1 Week 2 Weeks 1 Month

Sort results by

Relevance Date

20

April

Site Reliability Engineer - Cloud

Flexera - Melbourne, VIC

IT
Source: uWorkin

JOB DESCRIPTION

This is a unique opportunity for a degreed developer or System Administrator that is passionate about operational excellence to ramp into a SRE role that supports our SaaS offerings. This team works with product development to define our Service Level Objectives and performs the work required to ensure we meet those SLOs. These teams employ agile and lean principles in a culture of constant learning and improving. 

As a SRE you will be tasked with everything from helping with product design, to diagnosing issues, and writing automated scripts for mediating issues that occur in our production systems. You will be driven to build fault tolerant, scalable systems and automate away as much operational toil as is possible. You align with the goals of the DevOps movement in improving collaboration between the development and operations disciplines.


Responsibilities:

·

  • Work with teams across several continents, buils relationships with our engineers by listening and understanding their needs and balancing this with the needs of our business.
  • Help to eliminate operational toil - seek to automate repetitive operations work
  • Work with product development teams to ensure that our new features are able to meet SLAs
  • Help mature the delivery process for teams; defining/managing automated deployment pipelines such as Jenkins pipelines, designing canary release deploys, building in automated fallbacks or optimizing the build chain, Infrastructure & pipeline as code, you help craft the appropriate solution for the product
  • Optimize product service code to ensure that it is secure, scalable and performant
  • Improve the fault detection for our services
  • Work with product engineering teams to understand performance and behavior patterns
  • Be part of an on-call rotation for alerts that require engineering expertise to diagnose
  • Help carry out root cause analysis for incidents, and design solutions (both software and human processes) that will help to ensure the same problem doesn't happen in the same way again
  • Contribute to platform security

Minimum Qualifications:

  • Developer experience and a passion for operational excellence in the development and delivery of enterprise software


  • A desire to learn Go
  • Excellent communication skills including experience in writing good documentation and running workshops
  • Knowledge of tools and patterns around CI/CD (familiar with Travis CI, Circle CI, Buildkite or similar)
  • AWS, Google Cloud, or Azure experience (AWS preferred)
  • Bachelor’s in computer science or MIS
  • 1+ years of related experience

Preferred:

  •  Experience with DevOps
  • Knowledge of containers (Docker) container orchestration (Kubernetes) 
  • Knowledge of operations including incident management, immutable infrastructure as code (esp. Terraform or CloudFormation), problem solving
  • Observability knowledge; Logs, Tracing, Metrics and experience in a few of Elastic Stack, XRay, Jaeger, Zipkin, Prometheus, Honeycomb or LightStep