To be discussed
Contract, 6 months to start
Experience: 2+ Years | Focus: Reliability, Automation & Support
We are looking for a TechOps Engineer to ensure the stability and reliability of our production environment. You will bridge the gap between development and operations, automating manual tasks and monitoring system health.
In this role, the nature of the work is dynamic and requires a collaborative attitude. While you will have specific duties, it's important to understand that the entire team is responsible for the final delivery, and this may occasionally involve taking on additional tasks outside your primary responsibilities. The ability to adapt and contribute wherever needed is key to succeeding in this environment.
● Monitor system health and performance using Grafana and PromQL.
● Write and maintain scripts (Python, Bash/Shell) to automate operational tasks.
● Execute complex MySQL queries to support customer issues and generate data reports.
● Troubleshoot production incidents within an AWS Cloud environment.
● Read and debug Java logs/stack traces to identify root causes of errors.
● 2+ years of experience in Operations, SRE, or DevOps roles.
● Shift Availability: Must be willing to work in rotating shifts (Day/Night) to ensure 24/7 coverage.
● Strong problem-solving skills for rapid incident resolution.
● AI Tooling: Experience utilizing GitHub Copilot for scripting and automation logic.
● Strong proficiency in scripting languages (Python or Shell).
● Solid experience with AWS services (EC2, RDS, CloudWatch).
● Familiarity with Java (ability to read code).
● Create and maintain scripts for faster troubleshooting and analysing recurring issues.
● Ability to write complex SQL queries and set up Grafana dashboards. Experience with HyperDX and Clickhouse is a strong plus.
To apply for this job email your details to resumes@biblioso.com