Job Description :
We are seeking engineers who are deeply committed to ensuring the reliability, performance, and efficiency of our production services. If you have a passion for building tools, services, and automation to enhance system management and optimization, then we want to hear from you.
Role and Responsibilities:
- Proficiency in systems internals, security protocols, Linux administration, networking fundamentals, and monitoring techniques is essential.
- Hands-on experience with Azure cloud services is required.
- Collaborate on initiatives to enhance the reliability and performance of our next-generation distributed systems and containerized deployments.
- Diagnose and resolve complex issues in distributed systems processing millions of queries per second.
- Familiarity with Linux cloud services such as kvm, qemu, and lvm is highly desirable.
- Advanced scripting skills in Perl, GoLang, or Python are necessary for automating tasks with minimal manual intervention.
- Day-to-day operations involve extensive command-line usage, necessitating a deep understanding of Linux environments.
- Troubleshoot issues spanning hardware, software, applications, and network layers.
- Knowledge of database technologies, particularly MySQL and NoSQL, is advantageous.
- Willingness to participate in 24×7 on-call rotations to address critical system issues.
- Design, deploy, and maintain core infrastructure components to support PhonePe’s scalability for hundreds of thousands of concurrent users.
- Contribute actively to system analysis and improvement initiatives.
- Drive performance testing, capacity planning, and high availability strategies.
- Take ownership of implementing new technologies, ensuring thorough testing and comprehensive documentation.
- Proactively monitor and address issues that could impact our infrastructure’s stability.
- Possess strong teamwork skills and demonstrate a resourceful attitude.
- Mentor and support new team members, facilitating their integration into production environments.
This role offers an exciting opportunity to work on cutting-edge technologies and play a crucial role in ensuring the reliability and scalability of our systems. If you are passionate about solving complex problems and thrive in a dynamic, collaborative environment, then we encourage you to apply.
More Information
- Experience 5-10 Years