HPC Engineer at Tufts University!

Overview

Tufts Technology Services (TTS) is a university-wide service organization committed to delivering technology solutions in support of Tufts’ mission of teaching, learning, research, innovation, and sustainability. With staff across all of Tufts’ campuses, as well as a 24x7 IT Service Desk, we collaborate with schools and divisions to meet the demands of a global, mobile community. We promote a collaborative, flexible work environment, embrace diversity and inclusion, and encourage personal and professional development. Learn more about TTS on our website.

What You’ll Do

The HPC Engineer works with directorates within TTS to support, refine and advance the system administration/management of Tufts High Performance (HPC) compute cluster. The HPC system is critical to researchers and users (faculty, student, staff) at all levels across the university and is maintained with enterprise level expectations. The role will also assist the larger team within Research Technology to assess and evaluate ongoing innovation and cutting-edge solutions to meet research computing needs.

What We’re Looking For

Minimum Required Experience, Education, Background, And Certifications/Licenses

• Knowledge and experience typically acquired through a Bachelor’s Degree in related field with two years related experience or High School diploma plus 3 or more years related experience in a higher education, research, scientific or technical computing environment.
• Understanding of and experience with high performance computing, scientific gateways from both an architecture, subsystems and networking perspective as well as daily usage, support, and application-level knowledge.
• Experience maintaining specific technologies used in research and high-performance computing such as job schedulers (Slurm), Containers (singularity, docker), RDMA over ethernet, Infiniband, GPUDirect, etc.
• Experience with scripting basics (e.g., Shell, Batch, Perl, Python, etc.).
• Experience with modern system administration devops and design patterns to automate Linux HPC clusters, operating system, software installation via scripting as well as configuration management systems such as ansible, puppet.
• Experience installing, maintaining open source and commercial research computing web gateway solutions such as OpenOnDemand, OpenXDMod, FastX, Airvata, HubZero, NanoHub, or Taverna.
• Experience installing, configuring, maintaining, troubleshooting common frameworks and software used in research and high-performance computing such as scikit-learn, TensorFlow/TensorBoard, Keras, Theano, Caffe, Pytorch, MXNet, DGL, GPU libraries such as NVIDIA RAPIDS suite (cuDF, cuML, cuGraph, cuDNN). on both GPU and CPU architectures.
• Experience and resourcefulness with all aspects of the system management and development cycle from analysis through evaluation and documentation when approaching system engineering challenges.
• Willing and able to learn technologies and required domain knowledge at a rapid pace.
• Background supporting academic researchers (e.g., faculty, staff, students, etc.).
• Strong communication, presentation, customer service, problem-solving skills in pursuit of system management and innovation.
• Demonstrated ability to work effectively in a dynamic, collaborative environment with colleagues and build partnerships across technical disciplines, job functions and departments.

Additional Preferred Experience, Education, etc.
• Master’s Degree in science or engineering field plus 2 or more years related experience in a higher education, research, scientific or technical computing environment.
• Familiarity and experience with resources at private or public sector HPC research computing environments, national centers or XSEDE (eXtreme Science and Engineering Discovery Environment) beneficial.
• Knowledge of the continuum of research computing and scalability from desktop to HPC to cloud and grid solutions.
• Experience with relational databases such as mariadb, mysql or postgres.

An employee in this position must complete all appropriate background checks at the time of hire, promotion, or transfer.

To apply or see job full description, visit https://jobs.tufts.edu/jobs/17638?lang=en-us