
Careers
at the University Hospital & Faculty of Medicine

The Faculty of Medicine is one of the four founding faculties of the Eberhard Karls University of Tübingen. With its non-clinical facilities as well as its research and teaching area corresponding to the organisational units of the University Hospital, it is one of the largest medical training and research institutions in Baden-Württemberg.
The Hertie Institute for AI in Brain Health is looking for a

AI HPC Cluster Engineer (f/m/d)

for the Central Office, full-time and fixed-term until 31.01.2028, with a strong prospect of extension.
The "Hertie Institute for AI in Brain Health" (Hertie AI) is a research institute of the Faculty of Medicine, funded by the Gemeinnützige Hertie Stiftung, with the aim of detecting diseases of the nervous system earlier and treating them better with the help of artificial intelligence. Hertie AI is currently in a dynamic build-up phase. It cooperates with the strong and innovative AI ecosystem in Tübingen (e.g. Cyber Valley, Cluster of Excellence “Machine Learning in Science”, Tübingen AI Center). Hertie AI uses and benefits greatly from infrastructures shared with these initiatives, such as the Machine Learning Cloud (ML Cloud), but has special compute requirements due to its goal of analyzing brain data and simulating neural circuits. The ML Cloud is a state-of-the-art compute infrastructure with powerful AI CPU and GPU compute capacities and petabyte-scale storage volumes, used by more than 400 researchers and engineers.
High-performance computing (HPC) is essential for modern science. You will play a pivotal role in developing specialized solutions for research at the intersection of neuroscience and AI, working closely with scientists to make use of state-of-the-art computing hardware, administering infrastructure and developing tools for scalable AI-focused scientific computing on heterogeneous systems. As part of a motivated team, you will help maintain and improve the ML Cloud infrastructure (AI accelerators, networking and storage technologies).
 

What You'll Do:

  • Communicate with scientists at Hertie AI to understand their project-specific scientific needs, and advise on and support the implementation of their projects on the ML Cloud
  • Develop efficient solutions to deploy high-level libraries such as JAX/PyTorch on state-of-the-art computing hardware to enable scalable brain simulation and analysis
  • Establish, monitor and further develop the ML Cloud infrastructure, including administration of GPU/CPU servers, storage and network infrastructure and automation of daily activities
  • Help end-users with troubleshooting and resolution of their problems with the HPC infrastructure
  • Coordinate with the IT service provider

What you will bring:

  • Specialist knowledge and professional experience in information technology, applied computer science or computer engineering equivalent to the level of a Master's degree
  • Experience with HPC cluster management and job scheduling software (e.g. Slurm, PBS)
  • Experience in working with scientists is a plus
  • Administration experience with Linux operating systems (e.g. SLES, RHEL, CentOS, Ubuntu)
  • Experience with authentication/authorization (e.g. LDAP, Shibboleth, Keycloak)
  • Good knowledge of the scripting languages Bash and/or Python
  • Experience with parallel file systems such as GPFS, Lustre, Ceph, BeeGFS or Weka
  • Knowledge of common deep learning frameworks (e.g. JAX, PyTorch)
  • Independent, results-driven working style with ownership and accountability
  • English proficiency
  • Interest in artificial intelligence and motivation to collaborate with scientists and professionals in the field of AI research

Relevant experience in some of the following technologies:

  • Experience with automation tools for configuration management (e.g. Ansible, Puppet, Chef) and revision control systems (e.g. Git)
  • Experience with containers (Docker, Singularity, Podman, Kubernetes)
  • HPC system troubleshooting and support
  • Experience with Ethernet and/or InfiniBand network technologies

We offer:

  • Collaboration in the multifaceted environment of a modern university hospital, which in addition to patient care, also focuses on medical research and teaching
  • Future-proof workplace and location, attractive remuneration including a company pension scheme (VBL), and the most flexible working hours possible
  • Subsidization of the job ticket for public transport and attractive discounts on employee offer platforms
  • Structured onboarding phase, clinic's own academy to develop professional, social and methodological skills
  • Preventive health care through a wide range of sports activities
We offer remuneration in accordance with TV-L (collective wage agreement for the Public Service of the German Federal States). Severely disabled persons with equal qualifications are given preferential consideration. Interview expenses are not covered. Please note the applicable vaccination regulations.
If you have any questions, please contact:
Dr. Kristina Kapanova
kristina.kapanova@uni-tuebingen.de


Closing date for applications:
29.09.2024
We look forward to receiving your application, quoting index number 5079.
Please also indicate your salary expectations and possible starting date in the following questionnaire.

For more information, please visit:
www.medizin.uni-tuebingen.de/karriere
