Share this Job

DevOps Engineer

Date: Nov 22, 2021

Location: Oak Ridge, TN, US, 37830

Company: Oak Ridge National Laboratory

Requisition Id 6465 

The Information Technology Services Division in the Business Services Directorate at the Oak Ridge National Laboratory is seeking qualified applicants for a DevOps Engineer position in the Research and Development Systems Engineering group. The R&D Systems Engineering group exists to facilitate lab goals through systems engineering, integration, and support for the research community at ORNL.  

 

Our Team 

The DevOps Team is responsible for facilitating DevOps for R&D projects.  We work as team to provide deployment, automation, monitoring, and management tool infrastructure for researchers. We advocate and promote DevOps practices to researchers who develop code as a part of their project.  We operate within an Agile Scrum workflow and work with researchers to provide automation solutions. 

 

Our Stack and Workflow 

  • Work planning and documentation - Jira/Confluence 

  • Code repository, CI/CD - Gitlab 

  • Configuration and Package Management – Helm, Ansible 

  • Container Orchestration - Kubernetes, Rancher 

  • Monitoring, Analytics and Visualization - Prometheus, Grafana, Elasticsearch, Fluentd 

  • Other technologies utilized/supported include, but aren’t limited to Rook/Ceph, Selenium, nginx, Guard, and Checkmarx depending on supported program needs 

 

What does our ideal teammate look like and what will you be doing? 

Our primary goal is to partner with research organizations to enable research excellence and delivery at ORNL. The DevOps team is going into its 3rd year of operation, so we are still growing and maturing our stack and capabilities. In that time our team has grown annually 100% year-over-year, so the ability to document, collaborate and integrate new team members into our environment is important to maintain velocity and consistency of delivery.  

 

Our team views success through the lens of what additional capabilities, cost savings and optimizations we bring to our research partners. As we deliver success, we continue to add roles and offer new capabilities. We want researchers and their projects focused on their delivery and worrying less about their IT.  

 

You should enjoy evaluating and documenting new tools to pitch to the rest of the team, with an eye toward improved service and delivery. You will troubleshoot and debug various Kubernetes workloads, CI/CD pipelines, and implement solutions to optimize performance and scalability both on-premise and in the cloud. You will work with researchers and developers to encourage and facilitate Kubernetes best practices into their applications, write Helm charts, configure complex pipelines to streamline multiple code repositories for automated deployments/upgrades, and work with others across the organization to ensure that we are delivering secure solutions in compliance with Internal Operating Procedures. 

 

We optimize our workflows and monitoring solutions to take advantage of our 24/7 operations staff, which significantly reduces the need for off-hours support. We also offer a flexible work schedule and utilize Email, Jira, Confluence, Teams, Slack, and other collaboration solutions to stay in contact. Also, we know it’s tough, but please try to avoid the confidence gap. You don’t have to match all the listed requirements exactly to be considered for this role. 

 

Basic Requirements 

  • Bachelor’s degree in a scientific field or equivalent combination of education and experience 

  • 5 years of experience managing UNIX/Linux Systems 

  • 1 year experience utilizing Kubernetes for container orchestration 

  • Experience utilizing configuration management and automation tools such as Git, Jenkins, Ansible, Puppet, or other CI/CD pipeline tools. 

  • Moderate fluency in at least one scripting language such as Bash, Python, Go or equivalent 

 

Preferred Qualifications  

 

  • Kubernetes certifications such as CKA and CKAD and 2+ years building and maintaining Kubernetes environments 

  • Experiencing building and managing Kubernetes infrastructure in a production environment 

  • Experience managing virtual infrastructure on public clouds (AWS, Azure, GCP, etc) 

  • Strong knowledge of multiple operating systems 

  • Experience with performance and diagnostic tools for benchmarking, analysis and tuning of systems, networking, and storage 

  • Previous experience working in a government, scientific, or other highly technical environment. 

  • Excellent interpersonal skills suitable for user support and ability to work well with peers. 

  • Demonstrated ability to balance complex research and security requirements 

  • Technical documentation skills, including ability to prepare simple documentation web pages. 

  • RHSA, VCP, AWS Certified DevOps Engineer 

 

Relocation:  Moving can be overwhelming and expensive. UT-Battelle offers a generous relocation package to ease the transition process. Domestic and international relocation assistance is available for certain positions. If invited to interview, be sure to ask your Recruiter (Talent Acquisition Partner) for details. 

 

This position will remain open for a minimum of 5 days after which it will close when a qualified candidate is identified and/or hired.

We accept Word (.doc, .docx), Adobe (unsecured .pdf), Rich Text Format (.rtf), and HTML (.htm, .html) up to 5MB in size. Resumes from third party vendors will not be accepted; these resumes will be deleted and the candidates submitted will not be considered for employment.


If you have trouble applying for a position, please email ORNLRecruiting@ornl.gov.


ORNL is an equal opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply.  UT-Battelle is an E-Verify employer.


Nearest Major Market: Knoxville