top of page

SRE Consultant

Bengaluru, Karnataka, India

Job Type

Full Time

About the Role

Mandatory Skills
Site Reliability Engineer, AWS, Devops, automation, Prometheus, monitoring, framework, design review

Skill to Evaluate
Site Reliability Engineer, AWS, Devops, automation, Prometheus, monitoring, framework, design review

Experience
8 to 10 Years
Design and Architect SRE element into all the existing and new apps and services along with defining several controls/processes that ensures SLAs/KPIs are met.
Define SLAs/SLIs/SLOs metrics at a technical level and ensure 100% adherence.
Proactively maintain services once they are live by measuring and monitoring availability, latency and overall system health.
Respond quickly to issues and mobilise responsible individuals quickly to achieve the fasted possible resolution.
Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews
Scale system and service sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and speed of service resolution.
Continually analyse service to end customers with a view to enhancing customer experience, eradicating issues, fixing root causes and driving quality into everything we do.
Educating support operations and customer help desks to adapt to new ways of working by increasing skills and knowledge.
Perform RCAs, publish reports and take it to the next level by inventing short/long term fixes and further Runbooks.
Be part of the Agile Mode of delivering Work Products by performing Backlog planning, Sprint Planning, Design Reviews, Peer Reviews and Retrospectives
Education Qualificaiton
Bachelor Degree
Experience in one or more of the following: C, C++, Java, Python, Go, Ruby or shell scripting
Experience with Windows and Unix/Linux operating systems internals and administration (e.g. filesystems, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware)
Experience with containers and containers orchestration (e.g. Kubernetes, Docker) Extensive knowledge of AWS
Hands-on experience with IAC tools such as Cloudformation and Terraform
Experience with Configuration Management tools such as Ansible, Chef.
Experience with cloud hosted application-monitoring tools such as Kibana, ELK stack etc
Experience with Observability tools such as Dynatrace or Datadog
Excellent communication skills with the ability to present complex technical information in a clear and concise manner to a variety of audiences, both technical and non-technical
Comfortable working in a fast-paced, multi-tasking, dynamic environment
Experience with deployment automation, working with platforms for configuration management, provisioning and artifact repositories.
Preferred to have expertise with Make, Maven, Groovy, Gitlab, Gitlab pipelines, ArgoCD, AWS Codebuild/Codepipeline/CodeDeploy
Experience in improving internal processes and good understanding of security engineering
Capable of grasping, modifying and maintaining systems and code developed by others.
Ability to debug and optimise code and automate routine tasks
Systematic problem-solving approach, coupled with a strong sense of ownership, drive and determination.
Ability to think outside the box and find innovative solutions to complex problems.

Requirements

Mandatory Skills

Site Reliability Engineer, AWS, Devops, automation, Prometheus, monitoring, framework, design review


Skill to Evaluate

Site Reliability Engineer, AWS, Devops, automation, Prometheus, monitoring, framework, design review


Experience

8 to 10 Years

Design and Architect SRE element into all the existing and new apps and services along with defining several controls/processes that ensures SLAs/KPIs are met.

Define SLAs/SLIs/SLOs metrics at a technical level and ensure 100% adherence.

Proactively maintain services once they are live by measuring and monitoring availability, latency and overall system health.

Respond quickly to issues and mobilise responsible individuals quickly to achieve the fasted possible resolution.

Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews

Scale system and service sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and speed of service resolution.

Continually analyse service to end customers with a view to enhancing customer experience, eradicating issues, fixing root causes and driving quality into everything we do.

Educating support operations and customer help desks to adapt to new ways of working by increasing skills and knowledge.

Perform RCAs, publish reports and take it to the next level by inventing short/long term fixes and further Runbooks.

Be part of the Agile Mode of delivering Work Products by performing Backlog planning, Sprint Planning, Design Reviews, Peer Reviews and Retrospectives

Education Qualificaiton

Bachelor Degree

Experience in one or more of the following: C, C++, Java, Python, Go, Ruby or shell scripting

Experience with Windows and Unix/Linux operating systems internals and administration (e.g. filesystems, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware)

Experience with containers and containers orchestration (e.g. Kubernetes, Docker) Extensive knowledge of AWS

Hands-on experience with IAC tools such as Cloudformation and Terraform

Experience with Configuration Management tools such as Ansible, Chef.

Experience with cloud hosted application-monitoring tools such as Kibana, ELK stack etc

Experience with Observability tools such as Dynatrace or Datadog

Excellent communication skills with the ability to present complex technical information in a clear and concise manner to a variety of audiences, both technical and non-technical

Comfortable working in a fast-paced, multi-tasking, dynamic environment

Experience with deployment automation, working with platforms for configuration management, provisioning and artifact repositories.

Preferred to have expertise with Make, Maven, Groovy, Gitlab, Gitlab pipelines, ArgoCD, AWS Codebuild/Codepipeline/CodeDeploy

Experience in improving internal processes and good understanding of security engineering

Capable of grasping, modifying and maintaining systems and code developed by others.

Ability to debug and optimise code and automate routine tasks

Systematic problem-solving approach, coupled with a strong sense of ownership, drive and determination.

Ability to think outside the box and find innovative solutions to complex problems.

About the Company

Cigres Technologies Private Limited is a technology consulting and services company that focuses on helping clients resolve their significant digital problems and enabling radical digital transformation using multiple technologies on premise or in the cloud. The company was founded with the goal of leveraging cutting-edge technology to deliver innovative solutions to clients across various industries.

Cigres Technologies Private Limited - Bangalore

#46/4, Novel Tech Park, Kudlu Gate,

Garvebhavipalya, Bangalore-560068, Karnataka

Cigres Technologies Private Limited - Pune

123,A wing, Sohrab Hall, 21, Sassoon Road,Opp-Jahangir Hospital,Sangamwadi, Pune-411001.

Cigres Technologies Private Limited - Mumbai

203,The Summit,Western Express Highway,

Vile Parle East, Mumbai-400057.

​

Cigres Inc.

8 The Green STE R

Dover, Delaware 19901

USA

Cigres Technologies Pte Ltd

60 Paya Lebar Road, #09-43 Paya Lebar Square

Singapore – 409051

bottom of page