ABOUT THE CLIENT:Our Client is a free online television service broadcasting over 75 live channels full of TV shows, movies, and Internet videos, all on one dial. The company was founded in 2013 and is based in Los Angeles, California. TechHive, a home technology review site, is referred to as the best cord-cutting application. The product runs as an app on AppleTV, Android TV, Amazon Fire Stick, Roku, and PlayStation. The company currently has around 75 content partners. The company also as of October 2016 has about 5 million monthly active users up from 500,000 in early 2015. We generate revenue from video advertisements between programming and have a goal to continue to stay as free television.
$ads={1}
PROJECT DETAILS:Our product is led by a seasoned team of entrepreneurs, TV and technology executives, and top-of-class team members.$43M+ funding with investors like US Venture Partners, Sky, Chicago Ventures, Universal Music Group, United Talent Agency, and Pritzker Group. Top app with millions of users across platforms like web, iOS, Android, Roku, Apple TV, Amazon Fire TV, Android TV, Vizio, Samsung, and more.
One of the hottest startups in Los Angeles and one of the leading video platforms in the industry -often covered in NY Times, Forbes, Techcrunch, Variety, USA Today, CNN, Mashable, and more!
High energy culture. Passionate entrepreneurial team. Massive market. Employing the best of the best in the industry. We’re building the future of television.
YOUR TEAM:This critical Platform Reliability Engineering role in our team includes responsibilities for system development and cloud resource management, configuration management, troubleshooting, preventative and corrective maintenance, performance monitoring, and enhancement for our cloud / hosted large-scale consumer video service.
The Engineer in this role will enable the Platform Reliability group to increase reliability across various tools and technologies.
We’re looking for a star Engineer who has proficiency in backend systems and is confident with high volume and high-performance systems, as well as a vast knowledge of the infrastructure to apply to both short and long-term fixes for code and infrastructure.
- WHAT’S IN FOR YOU:Analyze and improve system design to reduce failure modes and promote self-healing systems
- Develop reliability tools and frameworks for use by all engineers
- Work with development partners to shape the architecture, design, and implementations of new and existing systems to enhance their reliability, performance, efficiency, and scalability
- Ability to work both independently as well as part of a geographically dispersed yet integrated team.
- Ability to balance multiple priorities in a fast-paced environment, demonstrable experience supporting large-scale projects.
- Ability to identify measures or indicators of application performance and the actions needed to improve or correct application performance.
- Ability to deal with ambiguity, uncertainty, and incomplete information when evaluating alternatives and making recommendations.
- Ability to work seamlessly within a team as well as manage individual tasks
- Build and maintain observability pipelines & resources – logging, monitoring, distributed tracing, alerting and offline test tools needed.
- Respond to emerging incidents, solve critical issues, and follow through with a plan for resolution or future mitigation
- Act as an SME on the Engineering Operations team, partnering with backend services teams and application teams to overcome challenges across all the platforms where we stream our service
- RESPONSIBILITIES:5+ years experience in software development
- Degree in Computer Science or related or equivalent work experience
- You have solid engineering and coding skills, data structure knowledge, and the ability to write high-performance production quality code.
- Experience building service-oriented APIs and cloud services (preferable against AWS)
- Experience writing for and deploying to AWS leveraging micro-services
- Extremely technical hands-on server software experience
- Good programming skills in one TypeScript, Golang and an ability to pick up new ones.
- Experience in the Linux environment and a good understanding of its fundamentals and internals: filesystems and modern memory management, threads and processes, the user/kernel-space divide, etc.
- A good understanding of large-scale distributed systems in practice, including multi-tier architectures, application security, monitoring, and storage systems.
- Working knowledge of the TCP/IP stack, internet routing, and load balancing.
- Experience with infrastructure as code tools (e.g. Terraform)
- Experience with Kubernetes and/or DataDog are a plus