Are you an engineer interested in new media, social activities and television? Paramount+ is developing the next generation of multimedia and cross-platform entertainment! The team at Paramount+ values software that exemplifies simplicity of design, maintainability and foundational robustness.
Role Details:
Paramount Streaming seeks a Senior Site Reliability Engineer for our online television and media-focused web properties. In this role, you will support the Kubernetes platform that serves our streaming products in the cloud. Our team seeks to produce Observability infrastructure that’s fast, self-healing, and able to operate at a global scale. We aim to produce a platform that is opinionated about reliability best practices while also providing best-in-class tooling for our engineering organization. This is a great opportunity for a seasoned Site Reliability Engineer to build systems that have global reach and impact millions of users.
About You:
You have a passion for data and seek to quantify all things! You thrive on designing systems with scale, self-healing, and automation as your guiding principles. You love the challenges of monitoring at large scale and are compelled by problems of analysis and large-scale data collection. You are at home with system-engineering challenges and service-based architecture. You have on-call experience and seek out ways to improve the on-call rotation for the team. You can plan project lifecycles and evangelize best practices across teams. You are passionate about mentorship and seek to promote a culture of collaboration.
Required Qualifications:
- Experience with Thanos, ArgoCD, Kafka, and/or Kibana
- Provide support and guidance of the Observability platform, integrations, and best practices across multiple engineering teams.
- Build and manage Observability infrastructure for a global scale.
- Build self-healing and automated systems on Kubernetes.
- Design and build systems to collect, visualize, and store service health indicators.
- Design Observability tooling utilizing a hybrid of open-source and enterprise solutions.
- Additional other duties and responsibilities, as assigned.
- Implementing log collection and storage via Elasticsearch.
- Building visualizations for multiple services, utilizing different types of data sources.
- Working with Prometheus time-series data, producing metrics and integrations.
- Building and supporting robust event queues via Kafka.
- Work with our development teams to instrument their applications and capture events that support our global product deployment.
- Bachelor’s degree or equivalent experience
- 5+ years managing and monitoring Linux systems
- 2+ years leading the design and implementation of Cloud systems in AWS/GCP using tools like Terraform and Kubernetes
- Experience with CI/CD tooling such as Jenkins
- 4+ years’ experience working with monitoring, logging, and visualization tooling, such as Prometheus, Elasticsearch, and Grafana
- 2+ years’ experience programming in a language such as Java, Python, or Go
- On-call experience
- Ability to manage the lifecycle of multiple projects
- Ability to collaborate across teams
- Experience mentoring junior engineers and writing onboarding documentation
Pay:
$82.00 – $95.00 per hour. No Corp to Corp contracts.