Job Title: HPC Systems Administrator (AI Infrastructure)
Client: Elite Technology Firm
Salary: Up to $300k CAD Base + Exceptional Bonus
Location: Montreal (Hybrid)
Sells- Build and scale one of the most advanced AI research computing environments in the industry.
- Huge technical ownership, exceptional resources, a genuinely collaborative culture, and stunning Montreal offices with hybrid working
- Best benefits in the industry
An Elite Technology Firm is expanding its AI infrastructure team and looking for exceptional Systems Engineers who thrive on solving complex problems across large-scale Linux environments.
Role- Building out a large-scale Linux-based AI research compute environment with an unlimited budget
- Working on performance and availability issues across distributed systems
- Optimising high-performance storage platforms
- Working closely with developers and researchers to improve system performance
- Designing and implementing next-generation compute and storage infrastructure
- Working with a DevOps mindset
Skills- Strong Linux/HPC Engineering background with a DevOps mindset
- Experience with distributed resource management systems (Kubernetes, Slurm etc.)
- Experience with high-performance storage technologies (Lustre, Weka, GPFS, Ceph, VAST etc.)
- Strong scripting ability, ideally Python
- Experience working on LLM/AI Infrastructure
- A genuine passion for tech, both in and outside of work
Sells- Flexible hours/work options
- The chance to work with the most cutting-edge technology in the industry
- Technologists only report to technologists
- Techies are treated as the top commodity, so they are ‘spoiled’
- Small team in a growing company, so a real opportunity to make a significant impact
- Constantly exciting greenfield projects in an ever-evolving environment
- No red tape
- Beautiful offices