Close

Presentation

Senior HPC Engineer
·
Texas A&M University
·
College Station
DescriptionHere’s a Glimpse of the Job
We are making a bold leap into the future of artificial intelligence with a $45 million
investment in an NVIDIA DGX SuperPOD. This investment underscores our commitment
to all Texas A&M System members’ faculty and staff providing cutting-edge research
and super computing needs. As a Senior High Performance Computing Engineer (HPC),
you will provide technical expertise and consultation for the design and deployment of
HPC systems. Get in on the ground floor with a team that is shaping the next
generation of innovation.

This position is security sensitive requiring U.S. Citizenship.

Opportunities to Contribute
• Manage large-scale HPC cluster operations, including OS upgrades, firmware
patching, and performance tuning.
• Oversee networking, security, and infrastructure for HPC systems.
• Lead the development of specialized HPC computing clouds and scalable storage
systems.
• Collaborate with stakeholders to develop service-based solutions.
• Serve as a strategic technical resource across departments.
• Lead enterprise-wide HPC projects using established project management
protocols.
• Mentor junior system administrators and enforce performance standards.

What you need to know
Salary: $125-136K
Location: In-person role in College Station, Texas
Schedule: This role may require working outside of standard office hours, including
evenings, weekends, and holidays, to support the demands of technology services and
ensure the seamless operation of essential systems.
Citizenship: Must be a United States citizen, permanent resident, or a person granted
asylum or refugee status in accordance with 15 CFR, Part 762; 22 CFR §§122.5, 123.22
and 123.26; and 31 CFR § 501.601
RequirementsQualifications • Bachelor’s degree in applicable field or equivalent combination of education and experience • 12 years of related experience A well-qualified candidate should possess one or more of the following: • Experience with High Performance Computing (HPC) environments • Advanced Linux system administration skills • Familiarity with computer networking concepts and protocols • Experience with container orchestration tools such as Kubernetes • Knowledge of Run:ai for AI workload management • Proficiency with Slurm workload manager • Experience working with NVIDIA DGX systems • Understanding of virtualization technologies • Familiarity with Infrastructure as a Service (IaaS) platforms • Experience with DDN storage solutions • Knowledge of network-attached storage systems
Company DescriptionAt Texas A&M University, we stand to be the best for the world. Since 1876, Aggies have answered the call to lead with character and serve with compassion — for Texas, the nation and beyond. Our research drives solutions that matter: from disaster preparedness and agriculture to immersive technology, robotics, leadership and civics. Our students leave prepared to create anything possible — equipped with knowledge, shaped by values and committed to serving for the good of our communities. This is what it means to be an Aggie: to stand together as a force for good.
·
·
2025-11-12
Event Type
Job Posting
TimeMonday, 17 November 20254:42pm - 4:42pm CST
LocationHall 6
Countries
United States of America
Companies
Texas A&M University
In-Person / Remotes
In-person
Part Time / Full Times
Full Time
Position Types
Permanent