Lead Infrastructure Engineer
We are seeking a skilled Lead Infrastructure Engineer to join our dynamicteam. This role in DigiValet combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. We are looking for a brightand young mind with strong system engineering skills and about 5-8 years of experience in the domain.
What you'll do in this role:
- Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation, and refinement.
- Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
- Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.
- To deploy new modules and upgrades and complete fixes within the production environment.
- To create requirements and procedures for implementing routine maintenance.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
- Practice sustainable incident response and blameless postmortems.
Required Skills:
- CentOS / RHEL / Ubutnu hands-on Administration -YUM, Log Rotation, Cron, LVM, User Creation, LDAP,Key based Auth, Enterprise backup strategy, Bash Scripting, Systemd.
- Hands-on with managing SuperMicro/HP/Dell Servers-RAID Setup, IPMI, BIOS/UEFI Setup, CPU, RAM, PCIE, SATA, SSD, M2, etc.
- Strong networking knowledgein -TCP/UDP, IP Routing, Bonding, VLANs, Bridging, DHCP, DNS, FTP, SSH, IP Routing Troubleshooting, Packet Loss/Jitter.
- Basic Hands-on knowledge of Networking Hardware -Switches, Routers, AP & Firewalls.
- Strong knowledge of Linux Firewalld & Iptables.
- VirtualizationTechnologies -VMware, Hyper-V, OpenStack, KVM, etc.
- Apache, Tomcat, Nginx, Jenkins, Artifactory, MariaDB software configuration & installation
- Production Server patching and updates using tools like-Ansible, Puppet or Rundeck.
- Server Health monitoring using enterprise tools like Grafana, Prometheus, Nagios
- Server Performance Optimization knowledge will be beneficial.
- Knowledge of Containerization using Docker will be beneficial.
Personality Attributes:
- Self-managed and proactive.
- Manages time well, punctual and completes tasks on time.
- Embraces challenges, adapts to culture & technology and can work extra hours when needed.
- Focused on execution and growth, takes initiative, and understands KRAs/KPIs.
- Prioritizes vision above all and aligns with it.
- Who takes responsibility for tasks, role, workplace, and ethics.