We are looking for a site reliability engineer with relevant experience to apply the concepts of site reliability engineering to our growing platform of cloud and edge devices.
- Experience in an SRE Position
- Expertise in designing, analyzing, and troubleshooting distributed systems.
- Ability to debug, optimize code, and automate routine tasks.
- Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
- Understanding of Unix/Linux operating systems.
- Release Engineering (Test suite + automated deployment workflow)
- Alerting (Alerts are good, but only alert reliably and for critical reasons.)
- Outages (continual improvement) eg. Why did it happen and can we avoid it happening again?
- Work on "eliminating toil"
- Build an outage tracker
You need to be hands on and comfortable programming.
- Kubernetes on prem
- Docker
- Python
- Prometheus
- Google cloud platform
- Javascript
- Firebase
- BigQuery
- Experience with IoT and MLOps
- Golang
- Equipment of your own choice
- Flexible work environment (remote-friendly) but also onsite offices in Norway
If the above seems interesting, and you feel you have what it takes to join our team and help us build an extraordinary product and platform then feel free to email me at alastair@createview.no
For more information about us please refer to our website