Description

AI/ML R&D Infrastructure & Systems Engineer

Job Category: Information Technology

Time Type: Full time

Minimum Clearance Required to Start: None

Employee Type: Regular

Percentage of Travel Required: None

Type of Travel: None

Anticipated Posting End: 11/15/2024

The Opportunity:

As a key member of our dynamic and forward-thinking AI Research and Development team, you will collaborate closely with our world-class scientists and engineers to design, deploy, and maintain a cutting-edge Linux-based research and development (R&D) infrastructure. You will maintain and improve the hardware and software infrastructure that supports our R&D projects, primarily in machine learning and artificial intelligence, including computer vision, natural language processing (NLP), and large language models (LLMs). We are looking for a skilled infrastructure engineer with expertise in managing a GPU-enabled compute cluster to support our R&D team. Join us in shaping the future of AI in the defense industry!

Responsibilities:

  • Design, procure, build, implement, and maintain on-prem servers, workstations, and software tooling across multiple development environments to support and enable cutting-edge AI/ML research

  • Implement infrastructure components to support distributed compute, storage, and dataset management

  • Ensure system compliance with security requirements, including keeping systems updated and developing and managing a system security plan (SSP)

  • Manage access to and integration with commercial cloud providers such as AWS

  • Coordinate with data scientists and software engineers to understand and plan infrastructure requirements

  • Support other hardware, such as edge devices, as required by projects/customers

Qualifications:

Required:

  • 5+ years of Linux systems administration experience

  • Bachelor’s degree or equivalent experience

  • Expertise in scripting languages and automation tools, such as Bash, Python, and/or Ansible, for automation and orchestration of Linux systems

  • Docker and Kubernetes knowledge, including deployment, scaling, and management of containerized applications

  • Experience with on-prem servers, including hardware selection, deployment, maintenance, and troubleshooting

  • Networking skills, including knowledge of network protocols, routing, and switching

Desired:

  • Ability to obtain a Top Secret or Top Secret/SCI clearance

  • Experience managing a GPU-enabled compute cluster

  • Experience with high-availability data solutions such as Longhorn or MinIO

  • Experience with cross-domain/hybrid solutions, such as connecting on-prem and cloud infrastructure or connecting infrastructure of different security levels

  • Experience with DevOps tools such as CI/CD pipelines and logging and monitoring tools

  • Experience with virtualization, including VMware or KVM

  • Understanding of AI/ML concepts

  • Experience with the AI/ML development lifecycle

  • Experience working on proposals

  • Security+ Certification

  • Experience working with hardware in classified environments


What You Can Expect:

A culture of integrity.

At CACI, we place character and innovation at the center of everything we do. As a valued team member, you’ll be part of a high-performing group dedicated to our customers’ missions and driven by a higher purpose – to ensure the safety of our nation.

An environment of trust.

CACI takes pride in fostering a diverse and accessible culture where every individual feels supported to chart their own path. You’ll have the autonomy to take the time you need through a unique flexible time off benefit and have access to robust learning resources to make your ambitions a reality.

A focus on continuous growth.

Together, we will advance our nation’s most critical missions, build on our lengthy track record of business success, and find opportunities to break new ground — in your career and in our legacy.

Your potential is limitless. So is ours.

Learn more about CACI here: https://careers.caci.com/global/en/life-at-caci


Pay Range: There are a host of factors that can influence final salary, including, but not limited to, geographic location, Federal Government contract labor categories and contract wage rates, relevant prior work experience, specific skills and competencies, education, and certifications. Our employees value the flexibility at CACI that allows them to balance quality work and their personal lives. We offer competitive compensation, benefits, and learning and development opportunities. Our broad and competitive mix of benefits options is designed to support and protect employees and their families. At CACI, you will receive comprehensive benefits such as healthcare, wellness, financial, retirement, family support, continuing education, and time off benefits. Learn more here: https://careers.caci.com/global/en/employee-benefits

The proposed salary range for this position is:

$90,300 – $189,600 USD

CACI is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, age, national origin, disability, status as a protected veteran, or any other protected characteristic.