Metsi Technologies

Global Systems Integrator | Digital Maturity | Data Center Automation | Hybrid Multicloud | Anything-as-a-Service

Senior GenAI, High Performance Computing Delivery Engineer

EngineerEngineerFull TimeRemoteTeam 51-200H1B No SponsorCompany SiteLinkedIn

Location

Texas

Posted

7 days ago

Salary

$153.9K - $199.1K / year

Bachelor Degree7 yrs expEnglishDockerKubernetesLinuxNode.js

Job Description

• Deploy, configure, and validate GPU accelerated compute clusters for AI, ML, and HPC with NVIDIA Base Command Manager (Warewulf and OpenHPC knowledge are a plus) • Perform benchmarking with HPL GPU, HPL MxP, STREAM, NCCL, RCCL, OSU Microbenchmarks, and related tools • Produce as-built documentation, performance reports, and share best practices amongst the team. • Configure and secure RHEL, Ubuntu, Rocky for GenAI or HPC workloads • Work directly with customers onsite (travel both regionally and across the U.S.)

Job Requirements

  • 7+ years with HPC or GenAI clusters, GPU based systems, AI infrastructure, or related fields
  • Deep hands on experience with GPU deployment, configuration, and multi-node testing using NVIDIA Base Command Manager
  • Proficiency with benchmarking tools: HPL, STREAM, NCCL, RCCL, MxP, OSU Microbenchmarks
  • Red Hat certification (RHCSA/RHCE) or 7+ years of relevant RH distros experience
  • Experience with GenAI/HPC networking (InfiniBand and/or RoCE)
  • Experience working in Linux based parallel computing environments at scale
  • Experience with containers/orchestration (Docker, Singularity/Apptainer, Kubernetes, Slurm)
  • Ability to travel up to 70% of the time across the U.S. as needed for projects
  • Strong customer facing and communication skills

Benefits

  • Health insurance
  • Paid time off
  • Flexible work arrangements
  • Professional development

Related Categories

Related Job Pages

More Engineer Jobs

Engineer7 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

Senior Electrical Engineer leading high voltage protection and control systems design

United States
$133.3K - $168.9K / year

Sharepoint Engineer

Accenture Federal Services

We believe in the power of change, harnessed in ways that matter for our country and communities.

Engineer7 days ago
Full TimeRemoteTeam 10,001+Since 2017H1B No Sponsor

SharePoint Engineer designing and maintaining environments for federal EHRM program

District of Columbia + 1 moreAll locations: District of Columbia, Washington
$106.3K - $221.1K / year

Aerospace Engineering Intern

American Systems

AMERICAN SYSTEMS is committed to pay transparency for our applicants and employee-owners. The salary range for this position is USD $47,300.00/Yr. - USD $78,900.00/Yr. Actual compensation will be determined based on several factors permitted by law.

Engineer7 days ago
Full TimeRemote

Are you a highly motivated, engaged student currently enrolled in a Bachelor of Science program in Aerospace Engineering or similar field? Are you looking to do meaningful, exciting work during your summer break? Then come aboard as AMERICAN SYSTEMS' next Aerospace Engineering In...

United States

Core Client Engineer

Tailscale

Simple, secure networks for teams of any scale. Built on WireGuard.

Engineer7 days ago
Full TimeRemoteTeam 51-200Since 2020H1B No Sponsor

Go Core Client Engineer designing and implementing Go-based client code at Tailscale

Distributed SystemsGo
United States
$163K - $226K / year