Staff SRE (Site Reliability Engineer)
Gigster
This job is no longer accepting applications
Software Engineering
Chile
Posted on Sep 8, 2024
Do you want to work on cutting-edge projects with the world’s best developers? Do you wish you could control which projects to work on and choose your own pay rate? Are you interested in the future of work and how the cloud will form teams? If so, the Gigster Talent Network is for you.
At Gigster, whether working with entrepreneurs to realize 'the next great vision' or with Fortune 500 companies to deliver a big product launch, we build really cool solutions that make a difference! From blockchain to AI/ML to VR and more, Gigster builds enterprise software on cutting-edge technology.
We are seeking highly skilled and experienced Staff Site Reliability Engineers (SRE) to join our dynamic team. As a member of the Gigster Network, you will have the chance to become a member of amazing teams where you'll be responsible for ensuring the reliability, scalability, and performance of our critical systems and services. As a Staff SRE, you will play a pivotal role in shaping infrastructure for our client and driving initiatives that improve the overall service quality.
Requirements
System Design and Architecture:
- Design, build, and maintain scalable and reliable infrastructure.
- Collaborate with engineering teams to ensure systems are designed with reliability and scalability in mind.
- Evaluate and integrate new technologies to enhance our infrastructure.
- Implement and maintain monitoring and alerting systems to detect and respond to issues promptly.
- Lead incident response efforts, ensuring quick resolution and effective communication.
- Conduct post-incident reviews and drive improvements based on findings.
- Architect and build innovative automation projects from scratch (preferably in Python or Go) to help reduce day-to-day SRE toil.
- Create Bash scripts to automate mundane manual activities such as upgrades, status checks, and deployments.
- Develop and maintain infrastructure as code (IaC) using tools such as Terraform, Ansible, or similar.
- Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
- Collaborate with cross-functional teams to deliver high-quality products and services.
- Mentor and guide junior SREs and other team members.
- Advocate for best practices in reliability engineering across the organization.
- Drive initiatives to improve service reliability, capacity, and performance.
- Participate in capacity planning and disaster recovery exercises.
- Stay current with industry trends and emerging technologies.
- Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience).
- 8+ years of industry experience as a Software Engineer, SRE, or Platform Engineer.
- At least 3 years of experience as a Platform Engineer or SRE.
- Proven experience in managing large-scale, mission-critical infrastructure.
- Deep understanding of Linux/Unix systems and networking.
- Proficiency in at least one programming language (e.g., Python, Go, Java).
- Intermediate to expert-level skill in Bash scripting.
- Experience with cloud platforms (AWS, Azure, GCP) and container orchestration (Docker, Kubernetes).
- Strong knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
- Familiarity with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
- Excellent problem-solving skills and a proactive attitude.
- Strong communication and collaboration skills.
- Ability to work independently and as part of a team.
- Demonstrated leadership and mentoring abilities.
Recruitment Process
- English Proficiency Assessment (25 mins)
- Technical Assessment (45 mins)
- Recruiter screen (30 mins)
- Technical Interview (30-45 mins)
Benefits - We don’t call them perks, they’re just part of what makes working at Gigster great.
- World-class network. Be part of a network with the most talented people in the world.
- Amazing cutting-edge projects. Pick the projects from F500 companies that you’re interested in.
- 100% remote and global. Live your best life, wherever that may be, and never lose out on career opportunities because of it.
- Flexible work hours. There is a time to overlap with the customer’s timezone, but most of the time, we work asynchronously and don’t care when you’re online; you just deliver great results.
- Flexible offerings. Choose how many hours you want to work and how much you want to earn.
- Swag! Because who doesn’t love swag?