Jobgether

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best!

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Staff Software Engineer, Compute

Software Engineer, Full Time, Remote

Location

United States

Posted

3 days ago

Salary

$230,000 to $275,000 (see Benefits)

Distributed Systems, Go, Kubernetes, Cloud Infrastructure, Multi Tenancy, Autoscaling, Observability, IAM, Reliability Engineering, System Design

Job Description

This description is a summary of our understanding of the job description.

Role Description

This role offers an exciting opportunity to design and build the compute foundations that power large-scale distributed systems used by modern AI and enterprise applications. As a Staff Software Engineer focused on cloud compute infrastructure, you will develop scalable platform primitives that enable reliable, elastic, and secure execution environments. Working at the intersection of control plane and data plane architecture, you will tackle complex challenges such as autoscaling, multi-tenancy, observability, and cross-cloud orchestration. The position requires deep technical expertise, strong system design skills, and the ability to influence platform architecture across engineering teams. You will help create infrastructure abstractions that simplify development while maintaining performance, reliability, and operational excellence. This is an ideal opportunity for engineers passionate about distributed systems, cloud platforms, and building developer-focused infrastructure at scale.

  • Design and build managed compute primitives that power scalable execution environments for cloud-based applications and distributed systems.
  • Architect and implement autoscaling systems that dynamically optimize resource allocation while maintaining reliability, performance, and safety.
  • Develop and operate services on critical execution paths where performance, stability, and correctness directly impact users.
  • Define and evolve architecture boundaries between open-source server components and managed cloud platform capabilities.
  • Build secure integrations with cloud providers, including handling IAM boundaries, credentials management, networking constraints, and operational safeguards.
  • Ensure platform observability and operational excellence through monitoring, tracing, service-level objectives (SLOs), and reliability testing.
  • Lead the full lifecycle of platform features, including API design, rollout strategies, backward compatibility, and long-term maintenance.
  • Provide technical leadership through architecture discussions, design documentation, code reviews, and mentorship of engineers across teams.
  • Collaborate cross-functionally with teams working on server infrastructure, SDKs, security, and control plane systems to deliver cohesive platform improvements.

Qualifications

  • Extensive experience designing and building distributed systems or multi-tenant platform services in production environments.
  • Strong understanding of core systems engineering principles including concurrency, performance optimization, reliability engineering, and failure-mode analysis.
  • Proven track record of delivering infrastructure or platform capabilities used by other developers, including APIs, control planes, or data plane services.
  • Experience owning production services with responsibility for reliability, monitoring, incident response, and continuous improvement of operational quality.
  • Strong written and verbal communication skills with the ability to document architectural decisions and technical trade-offs clearly.
  • Experience with cloud infrastructure platforms and scalable compute systems is highly desirable.
  • Familiarity with identity and access management (IAM) models and secure cross-account execution environments is a plus.
  • Experience building Kubernetes controllers, managing containerized workloads, or operating heterogeneous compute fleets is beneficial.
  • Proficiency in systems programming languages such as Go is advantageous, though strong architectural judgment and system design expertise are most important.

Benefits

  • Competitive salary ranging from $230,000 to $275,000 based on experience and qualifications.
  • Eligibility to participate in a company equity program.
  • Unlimited paid time off plus 12 company holidays and 2 floating holidays.
  • Comprehensive health coverage with 100% premium coverage for medical, dental, and vision plans.
  • Life insurance, disability insurance, and additional financial protection benefits.
  • 401(k) retirement savings plan.
  • Annual stipends including $3,600 for work-from-home meals, $1,800 for professional development, and $1,200 for lifestyle spending.
  • Home office setup allowance and company-provided equipment.
  • Monthly internet reimbursement and additional remote work support.
  • Access to wellness resources including a mental health app subscription.
  • Opportunities for global collaboration and occasional travel to team offsites and company events.
