HPC Application Manager
Martek Global Services, Inc. (“Martek”) has been awarded several long-term Federal contract to provide a wide range of IT talent. Our team is seeking a HPC Application Manager to join our team supporting the Department of Defense (DoD) with High Performance Computing (HPC) Moderation Program and Army Corps of Engineers, Engineer Research and Development Center (ERDC) DoD Supercomputing Resource Center (DSRC) with the required education and experiences outlined below.
This position is located Wright-Patterson Air Force Base, Ohio.
Hourly Rate: $43.00 - $45.00 per hour.
Responsibilities & Duties
As an HPC Application Manager, you will be responsible for the full lifecycle management of a defined portfolio of scientific and engineering software. Your primary focus will be ensuring the stability, performance, and availability of this software for users within a large-scale High-Performance Computing (HPC) environment. This position directly supports the research and development activities of the DoD High Performance Computing Modernization Program (HPCMP). The HPCMP provides large-scale systems and environments to the DoD RDT&E community which supports the warfighter.
- Software Management: Install, configure, and maintain complex software packages on multiple Linux-based HPC systems.
- User Support: Provide direct technical support by responding to and resolving user-submitted ServiceNow tickets for the assigned application portfolio.
- Troubleshooting: Diagnose and resolve complex application failures, including OS-level issues such as library dependencies and driver conflicts.
- Validation & Testing: Test and validate application functionality through system changes like maintenance, OS upgrades, and new system deployments.
- Maintenance: Deploy software patches and updates to resolve bugs and address security vulnerabilities.
- Compliance: Ensure all software management tasks adhere to established Standard Operating Procedures (SOPs), including formal Request for Change (RFC) processes.
- Collaboration: Collaborate with internal technical teams and external software vendors as necessary to resolve complex application-related issues.
Educational Requirements
- Must hold an active in scope SECRET clearance with the ability to obtain and maintain a top-secret security clearance. (US Citizenship required)
- Bachelor’s degree and 2+ years of related experience, OR an equivalent combination of education and experience including familiarity of an HPC environment.
- Active 8570.01M/IAT-II certification.
Additional Required Knowledge and Skills
Preferred experience in various HPC Systems and various HPC Technologies.
- Cray/Cray Ex, IBM, SGI, Penguin, or customer architecture
- Red Hat Enterprise Linux (RHEL), CentOS, or Linux variants operating systems (OS)
- InfiniBand (IB), Intel Omni-Path interconnects
- Portable Batch System (PBS), Simple Linux Utility for Resource Management (SLURM), or IBM Platform Load Sharing Facility (LSF) schedulers
- Intel Xeon, AMD EPYC, or ARM CPUs
- Nvidia graphic processing units (GPU)s
- ITLI Foundation