Senior Site Reliability Engineer (Chief Architect) ($100K/year) - Remote Work

Employer: Crossover Romania
  • Internet - eCommerce
  • IT Hardware
  • IT Software
  • Job type: full-time
    Job level: 1 - 5 years of experience
  • Cluj-Napoca
  • Timisoara
  • nationwide
    Updated at: 16.02.2019

    As a Senior Site Reliability Engineer, you will use your software and systems engineering expertise to build, scale, and improve our cloud-based SaaS systems and products.

    You will work with the world’s top 1% of talent and with cutting-edge cloud platforms and technologies, balancing availability and customer experience against the need to constantly enhance the systems.

    There is a breadth of opportunities for SREs in our organization: from the due-diligence and import teams that handle our constant stream of acquisitions, through the infrastructure teams that manage and constantly improve our Kubernetes, Docker, and VMware clusters, to our SaaS operations, which ensure great uptime and customer experience across our more than 100 products.


    Candidate Responsibilities

    • Ensure that our multi-tenant infrastructure, which runs more than 100 different products, delivers four nines (99.99%) of availability or better
    • Use infrastructure as code (IaC) to automate and enable scaling of environments and systems
    • Eliminate complexity from both architecture and processes
    • Optimize our public cloud computing costs
    • Manage the uptime error budget of your product
    • Be proactive and work closely with the engineering teams to enhance our design and improve our platform offering
    • Perform capacity planning and pre-launch reviews
    • Employ modern instrumentation to make production applications and infrastructure observable, then act on the results
    • Practice sustainable incident response and blameless postmortems

    Candidate Requirements

    • Bachelor's degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience
    • 3+ years of demonstrated experience managing and maintaining large-scale SaaS applications on one of the major cloud platforms (Azure, GCP, AWS, IBM Cloud) and with cloud orchestration tools (Kubernetes, Marathon, VMware, etc.)
    • 2+ years of experience with, and a strong understanding of, the Linux operating system
    • 3+ years of experience in at least one programming language: Java, C, C++, Python, Go, Perl or Ruby
    • Ability to debug and optimize code and automate routine tasks
    • (Desired) Experience with declarative configuration management and provisioning tools such as Ansible, Puppet, or Chef