Post a job

Job has expired

This job post is expired and is no longer taking new applicants.

Return home Find similar jobs

Senior SRE (Site Reliability Engineer)

K

Location
United States
k-ID

Job Description

Job Summary

As a Senior SRE at k-ID, you will be instrumental in enhancing the reliability and scalability of our Global Compliance Engine. This role combines software engineering with systems engineering to build and run large-scale, massively distributed, fault-tolerant systems.

Responsibilities

  • API Reliability and Performance Optimization: Be a key contributor to the design, implementation, testing, and documentation of our public APIs. Develop, scale, and maintain the infrastructure necessary to deliver seamless service to tens of millions of worldwide players.
  • Systems Automation and Orchestration: Utilize Kubernetes and AWS to automate deployment, scaling, and management of containerized applications. Enhance our CI/CD pipeline integrating GitOps for streamlined operations across development, testing, and production environments.
  • Monitoring and Telemetry: Implement comprehensive monitoring solutions using Prometheus and AlertManager.
  • Cross-Functional Collaboration: Work closely with development teams to ensure architectural and operational requirements are incorporated during design and development. Promote a culture of excellence in code health and quality.
  • Security: Champion the integration of security best practices within backend architectures to protect sensitive user data against emerging threats.

Requirements

Qualifications

  • Professional Experience: At least 5 years of experience in software engineering with a focus on reliability, performance optimization, and infrastructure management.
  • Education: Bachelor’s or master’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
  • Expertise in Cloud and Systems Engineering: Extensive experience with AWS, Kubernetes, and modern observability stacks (e.g., Prometheus). Familiarity with CI/CD tools, GitOps practices, and infrastructure as code (e.g., Terraform).
  • Performance Monitoring: Proficiency in setting up and managing telemetry and alerting systems, with a strong understanding of best practices in monitoring distributed systems.
  • Adaptability: Willingness to adapt to changing project demands. Experience working in a startup environment is a plus.
  • Communication: Communicate effectively with remote team members, both written and verbally, providing progress updates, flagging potential roadblocks. and fostering positive and productive working relationships.
  • Passion for Automation: Keen interest in automating repetitive tasks and finding innovative solutions to complex technical challenges.

Benefits

Competitive Salary

  • A competitive startup salary commensurate with experience and skills.

Benefits Package

  • Comprehensive benefits including health, dental, and vision insurance.
  • Employee Stock Ownership Plan (ESOP)

Professional Development

  • Opportunities for ongoing learning and development.
  • Exposure to multifaceted projects in a fast-growing industry.

Innovative Culture

  • A collaborative and inclusive work environment.
  • The opportunity to be a part of a company that’s making a positive impact on the online experiences of kids and teens.

Advice from our career coach

As a Senior SRE at k-ID, the successful applicant should have a strong background in software engineering, systems engineering, and cloud infrastructure management. Here are some tips to stand out as an applicant:

  • Highlight your experience in API reliability and performance optimization, showcasing your ability to design, implement, and maintain public APIs for large-scale systems.
  • Emphasize your expertise in systems automation and orchestration, particularly with Kubernetes, AWS, and CI/CD tools, to demonstrate your proficiency in automating deployment processes.
  • Showcase your experience in monitoring and telemetry, including your knowledge of Prometheus and best practices in setting up monitoring solutions for distributed systems.
  • Demonstrate your ability to collaborate cross-functionally with development teams, ensuring operational requirements are met during the design and development stages.
  • Illustrate your commitment to security best practices and your experience integrating security measures into backend architectures to protect user data.
  • Highlight your adaptability and willingness to work in a fast-paced startup environment, showcasing your ability to handle changing project demands.

Apply for this job

Expired?

Please let k-ID know you found this job with RemoteJobs.org. This helps us grow!

About the job

Jun 4, 2024

Full-time

  1. US United States
RemoteJobs.org mascot