Post a job

Sr. Site Reliability Engineer

M

Location
United States
Macrometa

Job Description

Our imagination is fueled by a vision of enabling developers to build apps and APIs without any limitations of time, space and cloud architectures. A world where ideas can be expressed instantly on a smart and reliable edge cloud platform that does all the heavy lifting of delivering their apps and data across the cloud and edge anywhere in the world.

Our mission is to make every developer a hero by making globally distributed application development and deployment simple and instant. This for us means taking responsibility for the entire experience of building and running cloud and edge apps. To do this we must provide the most powerful globally distributed stateful edge runtime, deep capillary networks, and a developer experience second to none.

Macrometa's culture is built on mutual respect and honest interactions. We value humble people who are curious to learn from and help each other. We prioritize our people first, customers second, and everything else third.

The Role:

Are you excited to work with a talented & experienced team on groundbreaking new ideas in building a planetary scale, distributed, decentralized, real-time data platform?

Are you interested in delivering, cutting-edge geo-distributed cloud infrastructure software, maintaining it, securing it and scaling it to meet users' needs while keeping an ever-watchful eye on capacity and performance? If yes - we may have your dream job at Macrometa.

What You Will Do:

  • You will be responsible for building, maintaining, and scaling across multiple clouds our complex and data-intensive kubernetes based digital edge fabric.
  • You will be writing / extending Kubernetes operators, Serverless like knative etc.
  • You will automate the continuous deployment processes.
  • You will also act as a consultant to development on infrastructure, networking, scalability, monitoring, operational process, infrastructure efficiency and release process.
  • You will scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.

Who You Are:

  • You have extensive background in provisioning, scaling and operating cloud-based geo-distributed applications by monitoring availability and taking a holistic view of system health.
  • Deep understanding of Docker, Kubernetes, Helm and ways to orchestrate container systems. You have good understanding & experience operating on either AWS/GCE/Azure using container native technologies.
  • Fluent in Linux systems. You like to automate your job, using Go, Python, Bash or the likes.
  • You enjoy troubleshooting in a distributed Kubernetes environment and are comfortable in tracing problems through applications, systems and networks.
  • You enjoy talking about stability, scalability and performance limits of web-service
  • 5+ experience within the DevOps, CI-CD and similar.
  • Experience with Python, Go or Shell scripting.
  • Experience in designing, automating securing and supporting big, fast data stacks on AWS/GCE in containers/kubernetes.
  • Experience operating critical production systems at scale.
  • Have implemented logging systems and dashboards with tools like Graylog and Grafana
  • You are willing to work on-call rotation with your team colleagues

Note to recruitment agencies: Macrometa will not accept unsolicited resumes/CV's and will not pay fees of any kind for unsolicited resumes/CV's sent to us by third parties.

Apply for this job

Expired?

Please let Macrometa know you found this job with RemoteJobs.org. This helps us grow!

About the job

Apr 14, 2024

Full-time

  1. US United States
RemoteJobs.org mascot