Job Description :

We are seeking engineers who are deeply committed to ensuring the reliability, performance, and efficiency of our production services. If you have a passion for building tools, services, and automation to enhance system management and optimization, then we want to hear from you.

Role and Responsibilities:

  • Proficiency in systems internals, security protocols, Linux administration, networking fundamentals, and monitoring techniques is essential.
  • Hands-on experience with Azure cloud services is required.
  • Collaborate on initiatives to enhance the reliability and performance of our next-generation distributed systems and containerized deployments.
  • Diagnose and resolve complex issues in distributed systems processing millions of queries per second.
  • Familiarity with Linux cloud services such as kvm, qemu, and lvm is highly desirable.
  • Advanced scripting skills in Perl, GoLang, or Python are necessary for automating tasks with minimal manual intervention.
  • Day-to-day operations involve extensive command-line usage, necessitating a deep understanding of Linux environments.
  • Troubleshoot issues spanning hardware, software, applications, and network layers.
  • Knowledge of database technologies, particularly MySQL and NoSQL, is advantageous.
  • Willingness to participate in 24×7 on-call rotations to address critical system issues.
  • Design, deploy, and maintain core infrastructure components to support PhonePe’s scalability for hundreds of thousands of concurrent users.
  • Contribute actively to system analysis and improvement initiatives.
  • Drive performance testing, capacity planning, and high availability strategies.
  • Take ownership of implementing new technologies, ensuring thorough testing and comprehensive documentation.
  • Proactively monitor and address issues that could impact our infrastructure’s stability.
  • Possess strong teamwork skills and demonstrate a resourceful attitude.
  • Mentor and support new team members, facilitating their integration into production environments.

This role offers an exciting opportunity to work on cutting-edge technologies and play a crucial role in ensuring the reliability and scalability of our systems. If you are passionate about solving complex problems and thrive in a dynamic, collaborative environment, then we encourage you to apply.

More Information

Apply for this job
Share this job