We are looking for a Site Reliability Engineer who will design, build and monitor our applications and systems infrastructure that can handle millions of monthly page views. The Site reliability engineer will handle deployment details of capacity provisioning, load balancing, auto-scaling, and application health monitoring. Skills Required - 4+ years of experience with Linux/Unix/BSD
- Demonstrable knowledge of TCP/IP, HTTP, security, replication, sharding, storage, and memcache
- Experience running and scaling infrastructure with Amazon Web Services (EC2, EBS, S3, ELB, Cloud Watch)
- Experience implementing high availability and scalability of database systems (MySQL(preferred), MongoDB, Redis) using different techniques
- Experience of using best practices related to security, performance and disaster recovery
- Automation experience with tools such as Puppet(preferred), Chef or Capistrano
- Experience implementing version control systems like git and svn
- Strong scripting skill in Bash(highly preferred), PHP or Python
|