High Performance Computing Systems Engineer

Clearance Level
None
Category
Systems Engineering
Locations
Remote, Working from Maryland
Remote, Based in District of Columbia
Remote, Based in Virginia
Key Skills For Success

Ansible (Software)

Git

Linux

Scripting

Slurm Workload Manager

REQ#: RQ183396
Public Trust: NACI (T1)
Requisition Type: Regular
Your Impact

Own your opportunity to work alongside federal civilian agencies. Make an impact by providing services that help the government ensure the well-being of U.S. citizens.

Job Description

As an HPC Software Support Engineer, you will bridge the gap between our researchers and the computing resources. You will be one of the faces of our High Performance Compute (HPC) clusters to the NIAID research community, who will rely on you to help them get their important research work done. You will focus on installing scientific applications, optimizing submission scripts and running jobs, and monitoring the health of NIAID’s HPC clusters: a GPU-focused cluster with 4,000+ cores and a second cluster with 1,500+ cores.

Work Visa sponsorship is not provided for this position.

HOW AN HPC SOFTWARE SUPPORT ENGINEER WILL MAKE AN IMPACT:

  • Work with a GPU-focused 4,000+ core HPC cluster and a 1,500+ core HPC cluster, including installing and supporting bioinformatics applications for a large and diverse research community with needs in genomics, cryo-electron microscopy, and AI/ML
  • Monitor the portfolio of software applications and be proactive in planning upgrades and license renewals
  • Monitor and report on cluster performance and generate data to show usage and trends
  • Triage support requests from the research community and work with others in the Scientific Infrastructure team to resolve issues and complete service requests
  • Collaborate with researchers to guide them in effective use of the HPC resources, such as job scheduler submission, data formats, and building data workflows
  • Engage with researchers to understand their HPC needs, including data life cycle management, integration of scientific instruments with HPC, and storage capacity and compute requirements
  • Provide input to the Scientific Infrastructure team leader for setting priorities for cluster operations, scheduling policies, resources needed, etc.
  • Attend and actively participate in daily standup meetings to provide updates on progress, discuss obstacles, and coordinate tasks with other team members
  • Work collaboratively in a team environment to achieve project goals
  • Engage in open communication, share knowledge, and support fellow teammates
  • Provide feedback and contribute to the continuous improvement of team processes


WHAT YOU’LL NEED TO SUCCEED:

Required Experience:  Minimum of five years of related experience

Education:  BS/BA (or equivalent)

Required Technical Skills:

  • Minimum of five years of experience with HPC technologies
  • Experience with the Spack package manager, including making packages from PyPI, R, and GitHub
  • Experience with Slurm job scheduling, including troubleshooting job status and optimizing submission scripts
  • Experience installing and packaging GPU applications and optimizing job submission scripts that are used for ML model training, data mining operations, or high-res graphics rendering
  • Experience with Python scripting
  • Experience using Git to manage shared software configuration code

Security Clearance Level:  Must be able to obtain an NIH Public Trust

Required Skills and Abilities:

  • Ability to translate technical concepts in HPC and research computing to scientists and other non-technical personnel
  • Ability to determine meaningful metrics and usage data for leadership

Location:  This position is primarily remote. However, you must be able to commute at your own expense to NIAID’s data center in Rockville, Maryland once a month, and possibly more often, to meet contractual obligations.


GDIT IS YOUR PLACE:

  • 401K with company match
  • Comprehensive health and wellness packages
  • Internal mobility team dedicated to helping you own your career
  • Professional growth opportunities including paid education and certifications
  • Cutting-edge technology you can learn from
  • Rest and recharge with paid vacation and holidays

Work Requirements
Years of Experience

5+ years of related experience

* may vary based on technical training, certification(s), or degree

Certification

Travel Required

None

Salary and Benefit Information

The likely salary range for this position is $110,500 - $149,500. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range.

About Our Work

We are GDIT, a global technology and professional services company that delivers consulting, technology and mission services to every major agency across the U.S. government, defense and intelligence community. Our 30,000 experts extract the power of technology to create immediate value and deliver solutions at the edge of innovation. We operate across 30 countries worldwide, offering leading capabilities in digital modernization, AI/ML, Cloud, Cyber and application development. Together with our clients, we strive to create a safer, smarter world by harnessing the power of deep expertise and advanced technology.

GDIT is an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other protected class.