Collaborate with development and operations teams to implement and manage observability solutions across cloud environments.
Set up and maintain monitoring, logging, and alerting systems using tools such as Prometheus and the LGTM stack (Loki, Grafana, Tempo, Mimir).
Develop and optimize CI/CD pipelines to ensure seamless integration of observability practices into the software development lifecycle.
Automate the collection and visualization of metrics, logs, and traces to provide insights into application performance and system health.
Work with infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation) to provision and manage cloud resources with observability in mind.
Collaborate with cross-functional teams to define and implement best practices for observability, including metrics collection and alerting thresholds.
Maintain documentation related to observability practices, tools, and processes.
Stay up to date with the latest trends and best practices in observability and cloud technologies.
Automate deployment processes using CI/CD pipelines and tools such as GitHub Actions, GitLab CI, and Azure DevOps.
Requirements:
Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
Experience with OpenTelemetry, AWS X-Ray, Amazon CloudWatch, AWS Distro for OpenTelemetry (ADOT), and Grafana.
3-5 years of experience in DevOps, Cloud Engineering, or a related role, with a focus on Observability.
Proficiency in AWS.
Familiarity with Docker and Kubernetes.
Excellent problem-solving skills and the ability to work collaboratively in a team environment.