Senior DevOps Engineer Job at Oak Ridge National Laboratory

Oak Ridge National Laboratory Oak Ridge, TN 37830

Requisition Id 10227

Our Team:

The DevOps Team is responsible for facilitating DevOps for R&D projects. We work as team to provide deployment, automation, monitoring, and management tool infrastructure for researchers. We advocate and promote DevOps practices to researchers who develop code as a part of their project. We operate within an Agile Scrum workflow and work with researchers to provide automation solutions. Our team is part of the Information Technology Services Division which supports IT services for Oak Ridge National Laboratory.


Our Stack and Workflow:

  • Work planning and documentation - Jira/Confluence
  • Code repository, CI/CD - GitLab and GitHub
  • Configuration and Package Management – Helm, Ansible and Conda
  • Container Orchestration - Kubernetes, Rancher
  • Monitoring, Analytics and Visualization - Prometheus, Grafana, Elasticsearch, Fluentd
  • Other technologies utilized/supported include, but aren’t limited to Rook/Ceph, Selenium, nginx, Guard, and Checkmarx depending on supported program needs


What does our ideal teammate look like and what will you be doing?

Our primary goal is to partner with research organizations to enable research excellence and delivery for client. The DevOps team is going into its 3rd year of operation, so we are still growing and maturing our stack and capabilities. In that time our team has grown annually 100% year-over-year, so the ability to document, collaborate and integrate new team members into our environment is important to maintain velocity and consistency of delivery.


Our team views success through the lens of what additional capabilities, cost savings and optimizations we bring to our research partners. As we deliver success, we continue to add roles and offer new capabilities. We want researchers and their projects focused on their delivery and worrying less about their IT.


You should enjoy evaluating and documenting new tools to pitch to the rest of the team, with an eye toward improved service and delivery. You will troubleshoot and debug various Kubernetes workloads, CI/CD pipelines, and implement solutions to optimize performance and scalability both on-premise and in the cloud. You will work with researchers and developers to encourage and facilitate Kubernetes best practices into their applications, write Helm charts, configure complex pipelines to streamline multiple code repositories for automated deployments/upgrades, and work with others across the organization to ensure that we are delivering secure solutions in compliance with Internal Operating Procedures.


We optimize our workflows and monitoring solutions to take advantage of our 24/7 operations staff, which significantly reduces the need for off-hours support. We also offer a flexible work schedule and utilize Email, Jira, Confluence, Teams, Slack, and other collaboration solutions to stay in contact. Also, we know it’s tough, but please try to avoid the confidence gap. You don’t have to match all the listed requirements exactly to be considered for this role.


Basic Requirements:

  • Bachelor’s degree in a scientific field and a minimum of 8 years of relevant experience. An equivalent combination of education and experience will also be considered.
  • A minimum of 5 years of experience managing UNIX/Linux Systems.
  • At least 1 year of experience utilizing Kubernetes for container orchestration.
  • Experience utilizing configuration management and automation tools such as Git, Jenkins, Ansible, Puppet, or other CI/CD pipeline tools.
  • Moderate fluency in at least one scripting language such as Bash, Python, Go or equivalent.


Preferred Qualifications:

  • Kubernetes certifications such as CKA and CKAD and 2+ years building and maintaining Kubernetes environments.
  • Experiencing building and managing Kubernetes infrastructure in a production environment.
  • Experience managing virtual infrastructure on public clouds (AWS, Azure, GCP, etc).
  • Strong knowledge of multiple operating systems.
  • Experience with performance and diagnostic tools for benchmarking, analysis and tuning of systems, networking, and storage
  • Previous experience working in a government, scientific, or other highly technical environment.
  • Excellent interpersonal skills suitable for user support and ability to work well with peers.
  • Demonstrated ability to balance complex research and security requirements.
  • Technical documentation skills, including ability to prepare simple documentation web pages.
  • RHSA, VCP, AWS Certified DevOps Engineer.


Benefits at ORNL:

ORNL offers competitive pay and benefits programs to attract and retain talented people. The laboratory offers many employee benefits, including medical and retirement plans and flexible work hours, to help you and your family live happy and healthy. Employee amenities such as on-site fitness, banking, and cafeteria facilities are also provided for convenience.


Other benefits include: Prescription Drug Plan, Dental Plan, Vision Plan, 401(k) Retirement Plan, Contributory Pension Plan, Life Insurance, Disability Benefits, Generous Vacation and Holidays, Parental Leave, Legal Insurance with Identity Theft Protection, Employee Assistance Plan, Flexible Spending Accounts, Health Savings Accounts, Wellness Programs, Educational Assistance, Relocation Assistance, and Employee Discounts.


#LI-DC1


This position will remain open for a minimum of 5 days after which it will close when a qualified candidate is identified and/or hired.

We accept Word (.doc, .docx), Adobe (unsecured .pdf), Rich Text Format (.rtf), and HTML (.htm, .html) up to 5MB in size. Resumes from third party vendors will not be accepted; these resumes will be deleted and the candidates submitted will not be considered for employment.


If you have trouble applying for a position, please email ORNLRecruiting@ornl.gov.


ORNL is an equal opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply. UT-Battelle is an E-Verify employer.




Please Note :
ajayjain.com is the go-to platform for job seekers looking for the best job postings from around the web. With a focus on quality, the platform guarantees that all job postings are from reliable sources and are up-to-date. It also offers a variety of tools to help users find the perfect job for them, such as searching by location and filtering by industry. Furthermore, ajayjain.com provides helpful resources like resume tips and career advice to give job seekers an edge in their search. With its commitment to quality and user-friendliness, Site.com is the ideal place to find your next job.