About the Role

In the automotive industry, safety and reliability go hand in hand. One key differentiator that Sibros’ products will offer is reliability. As our products scale across millions of vehicles, we also need to ensure the overall system is highly secure, reliable and available. Sibros’ products operate in multiple clouds. As part of the team, your mission is to help others move faster without breaking things. You will work across the major cloud environments, leveraging open-source tools and cloud-native solutions to harden the reliability of the solution and continue improving our SLA. You will also help other engineers on the team move faster by optimizing the end-to-end development flow. Your responsibilities include, but are not limited to:

  • Design and implement robust monitoring and alerting systems
  • Automate every possible aspect of the system, and remove human intervention from day-to-day operations
  • Work with the rest of the engineering team to design and implement a CI/CD system
  • Own the cloud infrastructure and optimize it for cost and operations
  • Document operational flows and runbooks to provide in-depth insight into the system
  • Handle on-call duties and critical infrastructure issues, and provide 24x7 support as a team

Minimum Qualifications

  • 3+ years of experience leading a site reliability engineering team
  • 5+ years of experience as a site reliability engineer
  • Expert in at least one of the public cloud providers, e.g. AWS, Azure, GCP
  • Good track record as a site reliability engineer in a production-ready cloud environment. Familiar with operations that help achieve high scalability, reliability and availability, e.g. backup and restore, disaster recovery, failover, sharding
  • Experience in managing infrastructure as code, and familiar with Terraform
  • Experience in setting up monitoring and alerting in production environments
  • Familiar with different tools for CI/CD pipelines, with hands-on experience in automating pipelines
  • Passionate about the vision and mission of the company, and interested in solving challenging problems in the automotive IoT domain

Preferred Qualifications

  • 5+ years of experience leading a site reliability engineering team
  • 7+ years of experience as a site reliability engineer
  • Experience in managing large-scale fleets of IoT devices, and familiar with IoT offerings from different cloud providers
  • Experience in building a production environment from scratch by leveraging open-source / off-the-shelf technologies
  • Experience in cost optimization in well-known cloud providers like AWS, Azure, GCP
  • Familiar with release management for frontend, mobile, backend and infrastructure
  • Expert in cyber security, and familiar with how to set up secure production environments
  • Experience in working on compliance and regulation-related efforts, especially in the automotive and IoT industries, e.g. GDPR, SOC Type II

Equal Employment Opportunity

Sibros is committed to a policy of equal employment opportunity. We recruit, employ, train, compensate, and promote without regard to race, color, age, sex, ancestry, marital status, religion, national origin, disability, sexual orientation, veteran status, present or past history of mental disability, genetic information or any other classification protected by state or federal law.