Site Reliability Engineer at Kudimoney

Job Overview

Location
Lagos, Lagos
Job Type
Full Time
Date Posted
3 years ago

Additional Details

Job ID
6213
Job Views
116

Job Description



Location: Lekki Phase 1, Lagos


About the Position



  • Ensure your team is immediately aware of production errors and prioritizes their repair.

  • Provide architectural input to the teams’ development process from an operations and infrastructure POV, including but not limited to monitoring, alerting, persistence, tradeoffs given the state the available hardware, etc.

  • Provision cluster resources, repositories, CI/CD pipelines, and credentials for your responsible team and systems to consume.

  • Providing updates to the entire company during outages and downtime, scheduled maintenance and more in a professional, respectful, and timely manner.

  • Strive to work at the highest standards possible along with the rest of your team.


About You



  • Bachelors or higher in STEM course.

  • 3 years working as a software engineer/site reliability engineer professionally.

  • 3 years developing Python + Linux/Mac/Unix environments + git professionally.

  • 3 years working with Linux/Unix user environments, e.g. bash, grep, awk, sed, etc.

  • 2 years of experience working with cloud infrastructure, e.g GCP, AWS.

  • 2 years working with CI/CD tools, e.g. Jenkins, CircleCI, TravisCI, Semaphore.

  • 2 years working with SQL and NoSQL databases, e.g. PostgreSQL, Cassandra, MongoDB.

  • 2 years working with code as infrastructure tools such as Terraform, Ansible, Saltstack, Chef, Puppet.

  • Solid knowledge and experience in networking, e.g. HTTP, TCP, UDP, DNS, VPN ( IPSec, Wireguard), routing, firewalls, etc.

  • Solid knowledge and experience in encryption and security, e.g. AES, ECC, PKCS, PKI, OpenSSL, JWT.

  • Experience with Linux system administration, e.g. systemd, iptables, top, stat commands, kernel tuning, user management.

  • Experience working with containers & container orchestration, e.g. Docker, Kubernetes.

  • Experience with logging, monitoring, and incident management tools, e.g. Prometheus, Grafana, Cloud Logging, Opsgenie, Pagerduty.

  • Experience working with Web Servers/Load Balancers, e.g. Nginx, Apache, HAProxy.

  • Love for automation.

  • Ability and willingness to pick up new technologies quickly and be productive.


Nice to Have



  • Multilingual (programming) skills, in particular Python, Java, Javascript / Typescript, Golang.

  • Experience with Bazel.

  • Experience with identity and access management solutions eg. Keycloak.

  • Experience implementing PCI DSS, ISO 27001, ISO 22301 policies/standards.  

  • Experience with Google BigTable.

  • Experience managing Github organizations and repositories.


Similar Jobs

Cookies

This website uses cookies to ensure you get the best experience on our website. Cookie Policy

Accept