An independent platform for cutting-edge, progressive, legal, and political opinion.
Senior Site Reliability Engineer
Location
Florida
Posted
29 days ago
Salary
Not specified
Job Description
Job Requirements
- Degree in computer science or a related field, or equivalent work experience
- 5+ years in SRE, DevOps, or similar Infrastructure roles
- Experience managing large-scale, high-availability production systems
- Track record of incident response and post-mortem processes
- Experience with capacity planning and performance optimization
- 3+ years hands-on experience managing production Kubernetes clusters
- Deep understanding of k8s architecture, networking, storage, and security
- Experience with cluster scaling (Karpenter), upgrades, and multi-cluster management
- Proficiency with kubectl, Helm, and Kubernetes operators
- Container orchestration and troubleshooting expertise
- Advanced expertise with the Grafana stack for dashboards, alerting, and visualization
- Hands-on experience with Grafana Alloy for telemetry data collection
- Proficiency in PromQL
- Experience with Loki for log aggregation and analysis
- Experience building comprehensive monitoring and alerting strategies
- Hands-on experience managing Java-based applications in large-scale, distributed environments, with a focus on JVM tuning and application optimization.
- Cloud Platform expertise (AWS, GCP, or Azure)
- Familiarity with infrastructure as code (IAC) tools like Terraform/Terragrunt or Ansible.
- ArgoCD proficiency for GitOps workflows and continuous deployment
- Strong scripting abilities in Bash, Python, or Go
- Experience with CI/CD pipleines and automation tools
- Configuration Management and deployment automation
- Strong troubleshooting skills, with a proactive approach to diagnosing and resolving performance bottlenecks.
- Proven experience managing on-call rotations, incident response, and root cause analysis.
- Ability to mentor junior team members
- Strong communication skills (both written and verbal), positive attitude, and ability to receive constructive feedback.
Benefits
- Competitive pay and benefits
- Flexible vacation allowance
- A hybrid / remote working environment
- Startup culture backed by a secure, global brand
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Site Reliability Engineer improving application reliability for Adobe
Senior DevOps Engineer
YassirYassir is the leading super App in the Maghreb region set to changing the way daily services are provided. It currently operates in 45 cities across Algeria, Morocco and Tunisia with recent expansions into France, Canada and Sub-Saharan Africa. It is backed (~$200M in funding) by VCs from Silicon Valley, Europe and other parts of the world. We offer on-demand services such as ride-hailing and last-mile delivery. Building on this infrastructure, we are now introducing financial services to help our users pay, save and borrow digitally. Helping usher the continent into a digital economy era. We’re not just about serving people - we’re about creating a marketplace to bring people what they need while infusing social values
As a Senior DevOps Engineer, you will build and maintain scalable cloud systems, improve CI/CD processes, automate deployments, and support engineering teams.
Integration and DevSecOps Engineer supporting cloud-native system integration
Lead DevOps Engineer managing a team of DevOps Engineers remotely in the United States