What you'll do
IT’s Compute and Devices Group is looking for a computing engineer to take over responsibility for the onsite High-Performance Compute (HPC) SLURM clusters, which have established use cases in the Organisation, including for the ATS sector and the Theory department.
This position will be in the Compute and Configuration section, which is responsible for large-scale compute, from the HPC farm through to High Throughput Compute, Volunteer Computing, and Configuration & Secret Management services.
Your responsibilities
- Ensure service delivery for the SLURM HPC clusters to the user community as the primary service manager and technical lead of the service.
- Serve as the escalation point for user community support requests, and help gather requirements and usage best practices.
- Configure, upgrade and monitor the clusters, and provide ongoing maintenance.
- Ensure high utilisation of the resources by interfacing with HTCondor-brokered backfill of resources.
- Define procedures and best practices for the wider team in order to promote operational support coverage.
- Look for synergies with other team members and teams in the management of compute resources, or in access to high-performance compute resources for the community.
Still here? Let's make a quick check about
Your profile
- Experience supporting HPC or batch systems, ideally SLURM; knowledge of HTCondor or similar would be an advantage.
- Demonstrated knowledge of configuration management systems such as Puppet, Chef, Ansible or Terraform, and monitoring of distributed systems.
- Knowledge of system administration, in particular Linux environments.
- Experience with user relations, user support and user requirements definition.
- Knowledge of programming techniques and languages, in particular Python or Go.
- Master's degree or equivalent relevant experience in the field of Computer Science or a related field.
Your skills
- Knowledge of operating systems (Linux).
- Knowledge of system configuration tools (Puppet, Ansible, Terraform).
- Architecture and design of ICT systems.
- Identification and selection of relevant emerging ICT technologies.
- Knowledge and application of software life-cycle tools and procedures.
- Works well in groups and readily fits into a team; participates fully and takes an active role in team activities.
- Addresses complex problems by breaking them down into manageable components.
- Takes initiative beyond regular tasks and makes things happen.
- Shows appreciation for the ideas and contributions of others and encourages others to express their views, even if controversial.
- Spoken and written English, with a commitment to learn French.
Employment conditions
- Stand-by duty, and work during nights, Sundays and official holidays, when required by the needs of the Organisation.
Global Benefits at CERN
Let's get you ready
Be sure to meet the eligibility criteria
- You are a national of a CERN Member State or Associate Member State. Currently, we cannot consider applications from Pakistani and Lithuanian nationals for positions with a 2026 start date, as the ceiling defined under Article II.5 of the Associate Membership Agreement has been reached.
- You have relevant qualifications and professional experience.
- If you have previously held a Staff contract at CERN, you will not be eligible for these positions.
- Please pay attention to the additional criteria and requirements for this specific position mentioned above.
You will need these documents to complete your application
- Your CV (English or French)
- Any document you consider relevant to your application