HPC Simulation Software & Machine Learning: Air Force Research Laboratory (AFRL) Summer Internship

Organization
U.S. Department of Defense (DOD)
Reference Code
ERDC-ITL-2023-0003
How to Apply

Click on Apply now to start your application. 

Description

The composites performance team at the Air Force Research Laboratory Materials and Manufacturing Directorate uses a combination of novel and high-impact experiments, in-house high-fidelity HPC simulation software, and machine learning to characterize and predict the performance of current and emerging materials.

Exquisite supercomputers have been the target of high-fidelity scientific simulation for the past three decades, due to their extreme throughput, interconnect, and I/O. However, Kubernetes clusters designed for machine learning tasks offer significantly greater computational capacity than high-end workstations, though significantly less than a "Top500" system. This project explores the challenges and suitability of using containerized Kubernetes jobs to perform communication-heavy scientific simulations at various scales.

What will I be doing?

This project will provide useful insights that may impact future plans for acquiring and building computational systems at AFRL, such as,  1) high-end workstations with a few GPUs for research scientists/engineers and 2) modest, in-house HPC systems with high-performance interconnect (InfiniBand) and I/O.  Outside of AFRL, the DoD HPCMP provides the highest-end systems available at the time. However, the increasing importance of machine learning (ML) and ML-specific workloads have prompted a few groups in AFRL to invest in performant Kubernetes clusters composed of many GPUs and many-core compute nodes, albeit with a lower investment into the interconnect and I/O compared to exquisite HPC systems.  These Kubernetes clusters provide significantly more compute capacity than a single workstation can provide, software portability/reproducibility through containerization, job scheduling infrastructure, and scalability. They leverage the massive advances made for enterprise computing and blur the lines between the workstation/in-house HPC/DoD HPC segmentation of compute capacity that is followed at AFRL. Performant enterprise-like clusters can critically meet the needs of the ML community and flexibly fill in the continuum of compute capacity between a workstation and a DoD HPC system. This project aims to explore the challenges and establish guidelines for using these clusters for distributed scientific simulation workloads that were designed for DoD HPC systems. 

Under the guidance of a mentor, you will research in a range of activities typical of a research scientist using simulation tools. You will consist of an update meeting where they will present a few slides summarizing progress and receive guidance from the mentors, additional technical meetings as needed to troubleshoot issues that arise, documenting processes and configurations, creating simulation input files, creating Volcano job .yml files, and measuring performance under varying compute conditions.  Any missing skills will be learned throughout the summer using a combination of direct meetings/lectures with the mentor and guided reading/exercises.  Throughout the project you will exercise technical communication skills by writing a final report and presenting your research to the composites research team at AFRL, gain experience running simulation software designed for HPC, and become familiar with containerization technologies. Additionally, you will have the opportunity to attend multiple seminars at AFRL each week on topics ranging from machine learning to material science. Tours of laboratories within RX, including those used for material characterization, polymer science, and composite manufacturing, can be arranged if you have an interest in the material science and engineering aspect of the work in AFRL/RX.  Finally, you will have regular discussions with multiple researchers at AFRL and various institutions contributing to ACE, including the University of Dayton Research Institute.

Why should I apply?  This fellowship provides the opportunity to independently utilize your skills and engage with experts in innovative ideas to move the proposed research forward.

Where will I be located?  Wright-Patterson AFB, Ohio

What is the anticipated start date? June 2023 - Exact start dates will be determined at the time of selection and in coordination with the selected candidate.

What is the appointment length?  This appointment is a summer research appointment. Appointments may be extended depending on funding availability, project assignment, program rules, and availability of the participant.

What are the benefits?  You will receive a stipend to be determined by the sponsor. Stipends are typically based on a participant’s academic standing, discipline, experience, and research facility location. Other benefits may include the following:

  • Health Insurance Supplement (Participants are eligible to purchase health insurance through ORISE)
  • Relocation Allowance
  • Training and Travel Allowance

About ORISE

This program, administered by Oak Ridge Associated Universities (ORAU) through its contract with the U.S. Department of Energy (DOE) to manage the Oak Ridge Institute for Science and Education (ORISE), was established through an interagency agreement between DOE and DoD. Participants do not enter into an employee/employer relationship with ORISE, ORAU, DoD or any other office or agency. Instead, you will be affiliated with ORISE for the administration of the appointment through the ORISE appointment letter and Terms of Appointment.  Proof of health insurance is required for participation in this program. Health insurance can be obtained through ORISE.  For more information, visit the ORISE Research Participation Program at the U.S. Department of Defense.

Qualifications

A candidate should:
- Have some prior experience in technical communication (including school projects)
- Be familiar with software development in some language (Python is preferred)
- Be organized and able to document processes clearly
- Have completed at least a few computer science courses

An understanding of Kubernetes, Volcano, MPI, and InfiniBand would be helpful but certainly not a requirement.

Security Investigation: Applicants should be able to pass a National Agency Check and Inquiries (NACI) security investigation should they be selected and accept the internship offer.

Application Requirements

A complete application consists of:

  • Zintellect Profile
  • Educational and Employment History
  • Essay Questions (goals, experiences, and skills relevant to the opportunity)
  • Resume (PDF)
  • Transcripts/Academic Records - For this opportunity, an official transcript or copy of the student academic records printed by the applicant or by academic advisors from internal institution systems may be submitted. Click here for detailed information about acceptable transcripts.
  • One recommendation. Your application will be considered incomplete and will not be reviewed until one recommendation is submitted. We encourage you to contact your recommender(s) as soon as you start your application to ensure they are able to complete the recommendation form and to let them know to expect a message from Zintellect. Recommenders will be asked to rate your scientific capabilities, personal characteristics, and describe how they know you. You can always log back in to your Zintellect account and check the status of your application.

If you have questions, send an email to USACE@orise.orau.gov. Please list the reference code of this opportunity in the subject line of the email. Please understand that ORISE does not review applications or select applicants; selections are made by the sponsoring agency identified on this opportunity. All application materials should be submitted via the “Apply” button at the bottom of this opportunity listing.  Please do not send application materials to the email address above.

Connect with ORISE...on the GO! Download the new ORISE GO mobile app in the Apple App Store or Google Play Store to help you stay engaged, connected, and informed during your ORISE experience and beyond!

Eligibility Requirements
  • Citizenship: U.S. Citizen Only
  • Degree: Bachelor's Degree, Master's Degree, or Doctoral Degree received within the last 60 months or currently pursuing.
  • Overall GPA: 3.00
  • Discipline(s):
    • Computer, Information, and Data Sciences (17 )
    • Engineering (27 )
    • Mathematics and Statistics (11 )
    • Physics (16 )
  • Age: Must be 18 years of age
  • Veteran Status: Veterans Preference, degree received within the last 120 month(s).
ORISE
ORISE ORISE GO
ORISE

The ORISE GO mobile app helps you stay engaged, connected and informed during your ORISE experience – from application, to offer, through your appointment and even as an ORISE alum!