Cloud Site Reliability Engineering Expert (SRE)

Employer: Euro-Testing Software Solutions
Domain:
  • Telecommunication
  • Job type: full-time
    Job level: peste 5 years of experience
    Location:
  • BUCHAREST
  • nationwide
    Updated at: 05.12.2021
    Short company description

    Euro-Testing Software Solutions is a privately-owned software company specialized in Full-Service Software Testing, Penetration Testing, Vulnerability Identification & Management, Application and Data Security, Static & Dynamic Code Analysis as well as, DevOps/DevSecOps, Robotic Process Automation, Implementation and Customization for Atlassian and Micro Focus (HPE) products.

    Requirements

    • Completed technical or industrial engineering degree or long-time experience in the field of IT infrastructure and data center, IT operation, IT engineering
    • Practical experience in the field of public cloud solutions, setting up and managing infrastructure as code (e.g., at AWS Services, Kubernetes, container solutions (Docker))
    • Safe handling in the area of automation and configuration management of infrastructure and software deployments (Jenkins, groovy, Nexus, Bitbucket, Ansible, etc.)
    • Several years of professional experience in IT, preferably in IT operations, IT consulting or IT architecture
    • Good technical knowledge of the technologies and products used in large, heterogeneous IT infrastructures and the services they provide
    • Good knowledge of server and client hardware, operating systems, databases and standard applications (ERP, CRM, billing)
    • Pronounced analytical, solution-oriented and entrepreneurial thinking and acting as well as pronounced technical know-how in the field of application
    • Good technical knowledge of the following products in large heterogeneous IT infrastructures (Microsoft product range, Linux, HP, Oracle, VMWare, Citrix, networks, firewalls, etc.) and central IT infrastructures such as Storage, backup, SAN, Antivir, software distribution, monitoring

    • Fluent in spoken and written English
    • Experience in negotiations and in dealing with customers and suppliers
    • Knowledge of IT architectures and IT service management processes (ITIL) and business processes (PPM, internal ordering and communication processes, etc.)
    • High resilience and experience in crisis management for the subject-related topics
    • Knowledge of dealing with complex technical issues and for solving problems as well as preparing information for the addressees

    Responsibilities

    • Buildup, development and further operation of infrastructure as code components (e.g., for AWS services, Kubernetes, container solutions) including taking on-call responsibilities
    • Development of automated solutions for operational aspects such as on-call monitoring, performance and capacity planning, and disaster response
    • Creating fault tolerant and self-healing infrastructure components that improves the reliability of systems, fixing issues and responding to incidents
    • Ensuring and implementing security and connectivity requirements
    • Definition of standards for the cloud and on-premises platforms, components and systems
    • Advice and support for departments in the introduction and use of cloud services
    • Definition and conception of standards in the areas of cloud, containers and container orchestration systems, including an associated shared responsibility model
    • Consulting for strategic inquiries from departments (e.g., for security, VCI)
    • Controlling cloud costs and developing measures for continuous cost reduction