EDA IT Engineer

EDA IT Engineer

EDA IT Engineer

Fractile

11 hours ago

No application

About

  • Fractile’s mission is to enable a new chapter in the AI revolution. We’re pioneering AI innovation where hardware and software join to create something truly extraordinary, unlocking the power of the world’s largest language models with speed increases of x100. Our team is rapidly expanding, and we're searching for visionary engineers, scientists, and thinkers who share our passion for pushing boundaries and redefining what's possible. If you're ready to join a dynamic group of innovators shaping AI's future, we want to hear from you!
  • We are seeking an IT engineer experienced in managing shared compute infrastructure for EDA workloads. In this role you will be responsible for setting up and maintaining a compute farm to support a variety of workloads from both our frontend and backend silicon engineering teams. You will collaborate closely with our infrastructure team to resolve bottlenecks, optimise workflows, and provide a productive environment that will enable our engineering team to execute with scale and speed.

Key Responsibilities

  • Assist with the specification and deployment of scalable compute infrastructure on a cloud provider (AWS/GCP), and perform regular maintenance and security updates.
  • Maintain infrastructure-as-code specifications for deployment and automation of changes using Terraform, Ansible, or similar.
  • Manage centralised authentication for users using LDAP or similar.
  • Setup centralised storage and associated backups for both filesystem and object storage.
  • Configure a grid computing scheduler able to handle complex resource requests (CPU/RAM/licenses) that is compatible with EDA tools (e.g. Slurm/LSF/SGE).
  • Maintain license servers for EDA tools.
  • Setup and monitor observation tooling for resource utilisation, machine failures, and more (e.g. Prometheus/Zabbix).
  • Work with engineering teams to optimise workloads and resolve bottlenecks in scheduling/disk activity/network traffic.
  • Work with senior leadership to balance compute capability against expenditure.

Preferred Qualifications

  • 5+ years experience in managing infrastructure, ideally related to EDA workloads.
  • Strong proficiency in use and administration of Linux/Unix systems.
  • Experience managing shared compute infrastructure and related tools (LDAP, NFS, autofs, …).
  • Experience with identifying network/storage/CPU/RAM bottlenecks across complex workloads.
  • Experience deploying and managing a grid compute system (Slurm/LSF/SGE).
  • Proficiency in modern software development language(s) and infrastructure-as-code frameworks.
  • Proficiency with containerisation frameworks (Docker/Singularity).