Powering growth. Expanding possibilities.

At SGH, we design and develop high-performance, high-availability, enterprise solutions that help our customers solve for the future. Across our computing, memory, and LED lines of business, we focus on serving our customers by providing deep technical knowledge and expertise, custom design engineering, build-to-order flexibility, and a commitment to best-in-class quality.

We come from a broad collection of experiences and diverse backgrounds, but we’re united by a drive to raise the bar for the impactful technologies we design and manufacture, our customers, and each other. With an open and inclusive culture, we help one another think creatively and look beyond – because when we’re each at our best, we’re even more powerful together.

Powering growth. Expanding possibilities.

At SGH, we design and develop high-performance, high-availability, enterprise solutions that help our customers solve for the future. Across our computing, memory, and LED lines of business, we focus on serving our customers by providing deep technical knowledge and expertise, custom design engineering, build-to-order flexibility, and a commitment to best-in-class quality.

We come from a broad collection of experiences and diverse backgrounds, but we’re united by a drive to raise the bar for the impactful technologies we design and manufacture, our customers, and each other. With an open and inclusive culture, we help one another think creatively and look beyond – because when we’re each at our best, we’re even more powerful together.

Sr. Managed Services Engineer

Date Posted:  Apr 25, 2024
Requisition ID:  1098
Location: 

VA, US MD, US

Brand:  PenguinSolutions

The Penguin Solutions™ portfolio, which includes Penguin Computing™ and Penguin Edge™, accelerates customers’ digital transformation with the power of emerging technologies in HPC, AI, and IoT with solutions and services that span the continuum of edge, core, and cloud. By designing highly advanced infrastructure, machines, and networked systems we enable the world’s most innovative enterprises and government institutions to build the autonomous future, drive discovery and amplify human potential.

 

Overview

Penguin Solutions Managed Services provides dedicated, remote, Linux systems administration for complex, integrated environments involving high-performance computing, cloud, and enterprise systems. This position requires technical skills and the ability to understand, document, configure, administer, troubleshoot, and resolve issues in Linux environments. This is a customer-facing position.

 

Responsibilities

  • Support a Linux-based, high-performance computing (HPC) and artificial intelligence (AI) environment featuring a wide range of technologies.
  • Manage and maintain system infrastructure (hardware and software).
  • Install, configure, and manage compute, storage, and networking infrastructure of Linux servers
  • Render professional, timely, and expert user support.
  • Troubleshoot software and hardware issues.
  • Fully document processes, procedures, and all work performed.

 

Qualifications

  • Bachelor’s Degree in Computer Science, Computer/Electrical Engineering, or a related field (or equivalent experience)
  • Top Secret cleared; or willing and able to pursue
  • Must be a US Citizen
  • 8+ years of hands-on experience with UNIX/Linux server environments
  • 8+ years of proven software development experience focused on performance and scale
  • Strong Ansible scripting
  • Strong Linux systems administration skills and experience with open-source technologies
  • Understanding of Linux networking implementation and protocols
  • HPC/AI Performance Specialist and practical knowledge of the administration of High-Performance Computing (HPC) technologies, including cluster resource management, job scheduling, Ethernet networking, InfiniBand, etc.
  • Proven expertise in solving Linux OS and user environment performance issues
  • HPC Systems Management knowledge (Scyld Clusterware preferred)
  • Ability to run scaling benchmark codes on large HPC clusters
  • Familiarity with several cpu and gpu compilers including gcc, Intel, AMD (AOCC, ROCm) and NVIDIA (PGI OpenACC,CUDA)
  • HPC Scheduler knowledge (SLURM, PBS, LSF)
  • Data: High-Performance Storage and Parallel file systems used in HPC/AI and Cloud
  • Datacenter infrastructure knowledge
  • In-depth knowledge of Linux cluster technologies and optimization techniques
  • Linux Certifications (e.g., RHCSA, RHCE)
  • Able to install, configure, and tune software applications and provide overall support
  • Will take initiative to refer to Application OEM/Vendor for Application operations, features, functions, and questions
  • Will take initiative to refer with customer application experts for collaborative support
  • Ability to communicate clearly and effectively with team members and clients

 

Location

This is a hybrid position in the Metro D.C. area with approximately 25% travel.

 

Compensation & Benefits

The pay range that the Company reasonably expects to pay for this position in Washington, D.C. is $121,000 - $166,000; the pay ultimately offered within the expected range may vary based on business considerations, including job-related knowledge, skills, experience, and education. The position is bonus-eligible, and there are medical, dental, and vision benefits available. There is a 401k saving plan and other benefits, such as Paid Time Off, Life Insurance, and an Employee Assistance Plan.   

 

Diversity and Inclusion Statement

SGH, together with its affiliates, is committed to creating a diverse environment that embraces differences and fosters inclusion.

 

Equal Opportunity Statement

We are an Affirmative Action/Equal Opportunity Employer and strongly committed to all policies which will afford equal opportunity employment to all qualified persons without regard to age, national origin, race, ethnicity, creed, gender, disability, veteran status, or any other characteristic protected by law.