Go to content
Job description

Site Reliability Expert (SRE)

IT

Québec

Simons Campus - IT

Full time

Want to be part of our IT team in a unique role that contributes to the production environment’s optimal maintenance? Join the Simons family as a Site Reliability Expert (SRE).
The incumbent plays a critical role in ensuring the smooth operation of the production environment through a proactive and software engineering-focused approach. Reporting to the Director of Solution Architecture and Software Engineering, the Expert is responsible for ensuring the continued availability of widely distributed software applications, while maintaining a high level of performance and reliability.

Key Responsibilities:
• Provide primary operational support for several widely distributed software applications.
• Gather and analyze metrics from operating systems and applications to help optimize performance and troubleshoot.
• Measure and optimize the system’s performance.
• Deliver infrastructure services using IaC (Infrastructure as Code).
• Implement and support continuous integration and deployment (CI/CD) tools.
• Automate IT operations tasks using Ansible.
• Participate in system design consultations, platform management, and capacity planning.
• Balance the speed of features’ development and the reliability with well-defined service-level targets.
• Work with the development teams to improve services through rigorous testing procedures.
• Create sustainable systems and services through automation and continuous improvement.
• Develop software and systems to manage platform infrastructure and applications.

Desired Profile:
• Hold a Bachelor’s degree in Computer Science, Software Engineering, IT Engineering, Electrical Engineering, or any other training deemed relevant.   
• Have at least two (2) years of experience in a role related to DevOps, SRE, platform engineering, or software engineering.
• Have experience with full-stack observability platforms, such as Datadog and New Relic.
• Have working knowledge of coding beyond simple scripts.
• Have experience with Kubernetes, preferably in OpenShift.
• Have significant knowledge of the native cloud approach.
• Have advanced programming capabilities (structured and object-oriented) by using one or more high-level languages such as Java, Python, C/C++, Go, and JavaScript.
• Demonstrate proactivity by identifying issues, performance bottlenecks, and areas to improve.
• Be a team player and know how to properly communicate with different stakeholders in a constantly changing environment.
• Be fluent in English and French, both written and spoken, in order to use systems and tools, and to perform various tasks in English.  

Benefits Available:  
• Possibility of hybrid work.
• A telemedicine service and Employee and Family Assistance Program.  
• Group insurance plan and RRSP.  
• Up to 40% off Simons purchases.  
• Fitness area with changing rooms, group classes, and kinesiology services. 
• Cafeteria service offering an extensive and affordable menu. 

 

Simons Campus - IT
9205 John-Simons Street
Quebec (Quebec) G2B 0S6
1-877-666-1840 ext. 1498
--
Area not accessible to the public

km

Monday 8 a.m. to 4:30 p.m.
Tuesday 8 a.m. to 4:30 p.m.
Wednesday 8 a.m. to 4:30 p.m.
Thursday 8 a.m. to 4:30 p.m.
Friday 8 a.m. to 4:30 p.m.
Saturday 8 a.m. to 4:30 p.m.
Sunday 8 a.m. to 4:30 p.m.