Talent Search Solutions

Search missions

Strong Mid/Senior DevOps Engineer

Our client was founded in 2000 on the idea that all people deserve value-priced domains delivered through stellar service. Today, it is a leading ICANN-accredited domain name registrar and web hosting company with over two million customers and nearly five million domains under management, and it is just getting started.

Reporting to the Head of Product, this position is responsible for system & application service uptime in a high-availability, customer-facing, business-critical 24×7 SaaS environment that requires immediate response to service-impacting issues. You must have a strong command of installation, configuration, and diagnostics, with extensive hands-on expertise in open source Linux systems in a scalable environment.

The right candidate will have excellent communication skills and a passion for implementing open source technology tools & application diagnostics in at least two of the Dovecot / Postfix / MySQL / Java / Perl environments for a SaaS enterprise, with a structured approach to achieving high-quality, sustainable production operations.

Extensive experience and production working knowledge of as many of the following technologies and areas as possible:
- Large-scale production applications on modern platforms (Apache, Java, Dovecot/Postfix/MySQL, etc.)
- Large scale deployments with modern configuration and deployment management systems (Puppet, cfengine, Chef, Capistrano, Fabric, etc)
- Mission-critical Linux (Debian, Ubuntu, Red Hat, etc) production servers
- Scalable systems (load balancers, memcached, master/slave architectures, sharding, Nginx, RabbitMQ)
- Working knowledge of production database servers (MySQL, Hadoop, Hive, HBase)
- Monitoring and performance metrics analysis using common monitoring tools (Nagios, Cacti, Munin, Ganglia, Hyperic, etc.)
- Best practices relating to security, performance, and monitoring of systems – Linux, Java & open source software
- Command of popular scripting languages to enable automation of release processes, monitoring, trending, and alerting techniques – ideally a working knowledge of Ruby & Shell
- Automation using Ansible, Chef, or other tools in a cloud environment
- Good networking fundamentals: protocols, load balancers, VPN, switches/routers/firewalls, LDAP, SNMP, IMAP/POP3/SMTP
- Good understanding of filesystem technologies, to build filesystems and/or troubleshoot filesystem issues
- Virtualization/Cloud technologies – Strong working knowledge of AWS with a good understanding of other technologies like OpenStack, OpenShift, Google Cloud
- Web servers/reverse proxies such as Apache, Nginx and HAProxy
- Monitoring, trending & diagnostics tools including Nagios, Cacti, Zenoss, Graphite, etc
- Logging tools such as Splunk, ELK stack, etc
- Using source code control systems such as SVN and Git (or Mercurial)
- Work/defect tracking systems such as JIRA/Trello
- Wiki tools such as Confluence
- Knowledge of the use and maintenance of continuous integration and continuous deployment systems
- Linux environment experience is a MUST; SaaS environment experience is a strong plus
- Strong technical systems & application operations/release management experience with a passion for troubleshooting and triage of incidents, bringing issues to rapid resolution
- Excellent communication skills: good spoken & written English skills
- Ability to take on-call rotation & coordinate work under production critical situations is essential
- Understanding of modern software engineering
- Understanding that security and incident response are continuous efforts

Position Duties & Responsibilities:
- Application release management & configuration, upgrades/patches & support of Unix/Linux systems in a SaaS environment
- Identify, diagnose, and quickly resolve complex technical issues in a live production environment, and leverage those events to improve current technology & processes to prevent such issues
- Work closely with the Development Teams to escalate issues for triage and resolution
- Routinely review tickets and diagnostics with post-mortems to identify trends/chronic issues
- Hands-on implementation & upgrade of tools for monitoring, trending & diagnostics
- Audit the proactive monitoring of all systems to detect and resolve problems and ensure uninterrupted operation of all infrastructure systems
- Update corresponding documentation on installation processes & configurations

Responsibilities & Project Work:
- Working with the team to design and build scalable systems that support high-traffic email services and backend tools
- Planning and executing projects to improve production infrastructure
- Performing and automating production deployments
- Monitoring, maintaining and optimizing production systems
- Working closely with developers and other staff to solve operational issues with our services, tools, and apps
- Scheduling, coordinating and communicating maintenance windows
- Ensuring the security of our critical systems

Client offers:
- High & competitive salary
- Challenging work in an international professional environment
- Opportunity to influence the software development process and to own the product in your field of expertise
- Opportunity to apply SAFe methodology
- Individual development plans for employees
- Flexible management
- Relocation bonus when moving from a different city/country
- Full benefits package: paid vacation and sick leave
- Continuous professional development (free internal and external professional training)
- Free English classes in the company office
- Free use of the services provided by the client
- Quarterly team building activities
- Coffee, tea, fruit, office lunch delivery