$2,213.00 Fixed
NexGen Media
Contract · Flexible hours
About the role
NexGen Media is scaling its real‑time video delivery platform and needs a senior Site Reliability Engineer to design and implement resilient, automated operations. You will own reliability for critical services, ensure high performance, and support rapid feature delivery.
Key responsibilities
- Design and implement monitoring, alerting, and incident response for a microservice architecture.
- Automate infrastructure provisioning and configuration using IaC tools.
- Optimize CI/CD pipelines for faster, reliable deployments.
- Collaborate with developers to improve application observability and fault tolerance.
- Conduct capacity planning and performance tuning on cloud resources.
- Document runbooks and standard operating procedures.
Must-have skills
- Deep experience with AWS services and networking.
- Strong proficiency in Docker and Kubernetes orchestration.
- Extensive background in Linux system administration.
- Expertise in Terraform or CloudFormation for IaC.
- Proven track record of incident management and root‑cause analysis.
Nice to have
- Experience with Prometheus/Grafana monitoring stacks.
- Familiarity with serverless architectures.
- Proposals: 0
- Less than 3 months
Robert Burnam, member since Oct 28, 2025