Company

Sibros delivers an IoT software and data management platform that connects any vehicle to the cloud to provide real-time software management and data analytics, helping automakers build better, safer, and more reliable connected products at scale. Our embedded SaaS platform, Deep Connected Platform (DCP), is leverageable in every sector of the automotive industry from agriculture to commercial vehicles and light vehicles to two-wheelers, RVs, and beyond.

About the Role

In the automotive industry, safety and reliability always go hand-in-hand. One key differentiator that sets Sibros’ product apart is the high reliability we provide. Our products scale across millions of vehicles operating in multiple clouds. Ensuring our overall system is highly secure, reliable, and available is what has made our Deep Connected Platform an award-winning solution.

As part of the SRE team, your mission is to help others move quickly without breaking things. You will be working on all the well-known cloud environments, leveraging various open-source tools and cloud-native solutions to fortify the reliability of the solution, and continue improving SLA. You will also help other engineers in the team move faster by optimizing the end-to-end development flow. Your responsibilities include, but are not limited to:

  • Designing and implementing robust monitoring and alerting systems
  • Automating every possible perspective of the system, and removing human intervention from the day-to-day operations
  • Working with the broader Engineering Team to design and implement the CI/CD system
  • Owning the cloud infrastructure and providing optimization in terms of cost and operations
  • Documenting operation flow and runbook, in order to provide in-depth insight into the system
  • Handling the on-call and critical issues within the infrastructure, and providing 24/7 support as a team

Minimum Qualifications

  • 3+ years of experience leading a Site Reliability Engineering team
  • 5+ years of experience as a Site Reliability Engineer
  • Expert in at least on the public cloud providers (e.g. AWS, Azure, GCP)
  • Good track record of being a Site Reliability Engineer in a production-ready cloud environment
  • Familiar with operations that help achieve high scalability, reliability and availability (e.g. backup and restore, disaster recovery, failover, sharding)
  • Experience in managing infrastructure as a code and familiar with Terraform
  • Experience in setting up monitoring and alerting in production environments
  • Familiar with different tools for CI/CD pipeline, and hands-on experience in automating pipelines in the past
  • Passionate about the vision and mission of the company, and interested in solving challenging problems in the automotive IoT domain

Preferred Qualifications

  • 5+ years of experience leading Site Reliability Engineering team
  • 7+ years of experience as a Site Reliability Engineer
  • Experience in managing large scale IoT devices, and familiar with IoT offerings from different cloud providers
  • Experience in building a production environment from scratch by leveraging open-source / off-shelf technologies
  • Experience in cost optimization in well-known cloud providers such as AWS, Azure, and GCP
  • Familiar with release management for frontend, mobile, backend and infrastructure
  • Expert in cyber security, and familiar with how to set up secure production environments
  • Experience in working on compliance and regulation-related efforts, especially in automotive and IoT industries (e.g. GDPR, SOC Type II)

Equal Employment Opportunity

Sibros is committed to a policy of equal employment opportunity. We recruit, employ, train, compensate, and promote without regard to race, color, age, sex, ancestry, marital status, religion, national origin, disability, sexual orientation, veteran status, present or past history of mental disability, genetic information or any other classification protected by state or federal law.