Job Description
Location: Lekki Phase 1, Lagos
About the Position
- Ensure your team is immediately aware of production errors and prioritizes their repair.
- Provide architectural input to the teams’ development process from an operations and infrastructure POV, including but not limited to monitoring, alerting, persistence, tradeoffs given the state the available hardware, etc.
- Provision cluster resources, repositories, CI/CD pipelines, and credentials for your responsible team and systems to consume.
- Providing updates to the entire company during outages and downtime, scheduled maintenance and more in a professional, respectful, and timely manner.
- Strive to work at the highest standards possible along with the rest of your team.
About You
- Bachelors or higher in STEM course.
- 3 years working as a software engineer/site reliability engineer professionally.
- 3 years developing Python + Linux/Mac/Unix environments + git professionally.
- 3 years working with Linux/Unix user environments, e.g. bash, grep, awk, sed, etc.
- 2 years of experience working with cloud infrastructure, e.g GCP, AWS.
- 2 years working with CI/CD tools, e.g. Jenkins, CircleCI, TravisCI, Semaphore.
- 2 years working with SQL and NoSQL databases, e.g. PostgreSQL, Cassandra, MongoDB.
- 2 years working with code as infrastructure tools such as Terraform, Ansible, Saltstack, Chef, Puppet.
- Solid knowledge and experience in networking, e.g. HTTP, TCP, UDP, DNS, VPN ( IPSec, Wireguard), routing, firewalls, etc.
- Solid knowledge and experience in encryption and security, e.g. AES, ECC, PKCS, PKI, OpenSSL, JWT.
- Experience with Linux system administration, e.g. systemd, iptables, top, stat commands, kernel tuning, user management.
- Experience working with containers & container orchestration, e.g. Docker, Kubernetes.
- Experience with logging, monitoring, and incident management tools, e.g. Prometheus, Grafana, Cloud Logging, Opsgenie, Pagerduty.
- Experience working with Web Servers/Load Balancers, e.g. Nginx, Apache, HAProxy.
- Love for automation.
- Ability and willingness to pick up new technologies quickly and be productive.
Nice to Have
- Multilingual (programming) skills, in particular Python, Java, Javascript / Typescript, Golang.
- Experience with Bazel.
- Experience with identity and access management solutions eg. Keycloak.
- Experience implementing PCI DSS, ISO 27001, ISO 22301 policies/standards.
- Experience with Google BigTable.
- Experience managing Github organizations and repositories.