Founding Site Reliability Engineer
Location: San Francisco, CA
About the Role

Company Overview

The next generation of voice assistants is poised to transform how billions of people interact with technology. As conversational AI becomes the default interface, the demand for seamless voice integration is skyrocketing. Yet many businesses and products are unprepared for this shift. We're developing the infrastructure that bridges the gap between advanced language models and real-world voice AI applications. Our platform enables developers to deploy voice solutions at scale. Experience it firsthand here.

Since our launch, we've achieved significant revenue growth and secured recent funding. Our close-knit team of six is expanding, and we're seeking passionate individuals to join our founding team on-site in San Francisco to amplify our success.

Mission

As a Founding Site Reliability Engineer, you'll be instrumental in maintaining the health of our real-time distributed systems. Your efforts will directly impact our ability to deliver consistent, high-quality voice AI services, bridging the gap between legacy systems and cutting-edge AI technologies.

Key Responsibilities

- Participate in an on-call rotation to ensure 24/7 system availability and swift incident response.
- Maintain 99.99% uptime for our real-time distributed infrastructure.
- Automate infrastructure management to significantly reduce deployment and recovery times.
- Optimize system performance for seamless integration with AI models.
- Develop and implement robust incident response protocols to minimize downtime and production issues.
- Design infrastructure and processes that scale efficiently with the company's rapid growth.

Desired Skills & Experience

- Proficiency with tools and technologies such as Kubernetes, Terraform, Ansible, and cloud platforms like AWS, GCP, or Azure.
- Strong scripting or programming skills in languages like Python, Go, or Bash.
- Familiarity with observability tools like Prometheus, Grafana, or the ELK stack.
- Excellent communication skills to convey complex system statuses and issues to diverse stakeholders.
- Demonstrated ability to troubleshoot and resolve complex infrastructure challenges.
- Organizational skills to build and maintain scalable, reliable systems.
- Adaptability to thrive in a startup environment with evolving challenges.
- Proactive mindset to generate impactful ideas and drive them to completion.

Qualifications

- Proven experience with distributed systems, real-time infrastructure, and infrastructure-as-code practices.
- At least 2 years of experience in companies with fewer than 100 employees.
- A minimum of 7 years in engineering roles.
- Over 2 years specializing in SRE and/or DevOps.
- Bonus: Previous experience as a technical founder in a B2B SaaS company or a background in telecommunications.

Ideally familiar with the following:

- AWS
- GitHub Actions
- Datadog
- Telephony or video streaming infrastructure
- Go
- TypeScript
- Distributed systems