Remote SRE/DevOps Engineer

Remote
Roles:
DevOps
Must-have skills:
GoKubernetes
Considering candidates from:
Baltics, Europe, Armenia, Austria, Croatia, Czech Republic, Georgia, Hungary, Kazakhstan, Poland, Romania, Serbia, Slovakia, Slovenia and Uzbekistan
Work arrangement: Remote
Industry: Travel Arrangements
Language: English, Russian
Level: Middle or senior
Required experience: 4+ years
Size: 51 - 200 employees
Logo of Hellotickets

Remote SRE/DevOps Engineer

Remote
Hellotickets is building the largest global marketplace for travel experiences, from a Broadway musical show to a helicopter tour in Rio de Janeiro. Although they are a young startup, their platform is already presented in 15 countries — all with their local currencies and payment methods. The company is financially backed by key players in the entertainment and travel industry, like LCD SoundSystem’s James Murphy, Sony Music’s Managing Director and the Founder of TripAdvisor.
HelloTickets is building the largest global marketplace for travel experiences,  for now they are looking for a DevOps/SRE to join the team remotely

Tasks:
  • Assume responsibility for the stability and issue during core business hours
  • Perform routine maintenance on file, data store, and job control systems
  • Participate in on-call rotations to address any emerging issues promptly
  • Identify and manage risks, promptly flag major issues, and escalate incidents when required
  • Collaborate with software development teams to design and build reliable, scalable, and efficient systems
  • Implement and maintain monitoring and alerting systems to proactively identify and address issues
  • Troubleshoot and resolve complex technical issues related to infrastructure and applications
  • Design, implement, and manage automated deployment and configuration management processes
  • Perform capacity planning and ensure the scalability of our systems
  • Lead incident response and post-incident analysis to prevent future occurrences
  • Suggest innovative solutions leveraging technology to enhance differentiation, efficiency, and user experiences
Must-have:
  • 4+ years of commercial experience as a DevOps Engineer and Site Reliability Engineer
  • Hands-on experience with at least one of the leading cloud platforms (AWS, Azure, Google Cloud)
  • Technology skills in a broad selection of Terraform, ELK, Grafana, Kubernetes, Docker, Istio, Helm, Git, Bash, CI/CD
  • Experience in cloud-native development and microservice architectures
  • Experience with Golang programming language
  • Strong problem-solving skills and the ability to work well under pressure
  • Strong knowledge of PostgreSQL for data store interrogation
  • Automation-first mindset with a focus on process and system efficiency
  • Previous exposure to technologies like Bitbucket, Grafana, Jira, ArgoCD, TeamCity, or similar
  • Commitment to meeting timelines and resolving issues promptly
  • Strong teamwork skills
  • Fluent knowledge of English
  • Fluent knowledge of Russian 
Benefits and conditions:
  • Full-time, remote job
  • 4 days working week (Monday - Thursday, Friday day off)
  • Paid vacation (20 days) and sick leaves
  • Flexible working schedule
  • Friendly professional staff and warm atmosphere
Interview process:
  1. Intro call with Toughbyte
  2. Call with HR
  3. Deeper technical interview with CTO
  4. Last interview with Head of Product