As a Senior Site Reliability Engineer we want you to use your software and system engineering expertise to build, scale & improve our cloud based SaaS systems and products.
You will be working with the world’s top 1% talent and cutting edge cloud platforms and technologies while you balance availability, customer experience and the need to constantly enhance the systems.
There’s a breadth of opportunities for SREs in our organization. Starting with the due-diligence & import teams that handle our constant stream of acquisitions, going through our infrastructure teams that manage and constantly improve our Kubernetes, Docker & VmWare clusters, going all the way to our SaaS operations which will ensure great up-time and customer experience from our myriad of more than 100 products.
Candidate Responsibilities
Ensure that our multi-tenant infrastructure running more than 100 different products yields four nines and more of availability
Use IaaC to automate and enable scaling of environments and systems
Eliminate complexity from both architecture and processes
Optimize our public cloud computing costs
Manage the uptime error budget of your product
Be proactive and work closely with the engineering teams to enhance our design and improve our platforms offering
Perform capacity planning and pre-launch reviews
Employ modern instrumentation to enable production applications and infrastructure observability and then act upon the results
Practice sustainable incident response and blameless postmortems
Candidate Requirements
Bachelor's degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience.
3+ years of demonstrated experience managing and maintaining large-scale SaaS applications in one of the major platforms (Azure, GPC, AWS, IBM Cloud) and cloud orchestration tools (Kubernetes, Marathon, VMware, etc.).
2+ years of experience with Linux operating system (strong understanding)
3+ years of experience in at least one programming language: Java, C, C++, Python, Go, Perl or Ruby
Ability to debug and optimize code and automate routine tasks
(Desired) Experienced with declarative configuration management and provisioning tools like Ansible, Puppet or Chef
Crossover is redefining the way people work. Brick and mortar offices are history. The future of our global workforce will be built from teams collaborating from every corner of the world. We have embarked on an expedition to find and engage with that talent. Crossover has developed a unique method of finding, curating, and managing remote contractors. Our platform connects customers to the worlds best talent for both technical and non-technical employment. But we don’t just find the best, we also provide the tools, training, and relationship building support to ensure success for long term growth.