Post a job

Senior Analytics Data Engineer

T

Location
United States
Base Salary
170k-210k USD
Trunk

Job Description

At Trunk, our mission is to help teams create high-quality software quickly. Merge conflicts, poor code quality or consistency, flaky tests, and dozens of other distractions quickly drain the productivity and morale of those teams. Engineering teams that can stay focused on designing, implementing, and delivering software will build magical, high-quality projects - and they will be happier doing it. We're building the tools that empower teams to land code faster and develop happier.
We are building the foundation for a modern software engineering team. Our founders started this journey in 2021 and have designed, delivered, and scaled software at some of the world's largest and fastest-growing tech companies - Uber, Google, YouTube, and Microsoft. We're building a game-changing company, and we hope you are excited to be a part of that audacious goal.
Software has eaten the world; almost every company produces software in some form or fashion, so our addressable market is virtually every company on earth. We're going after every engineering team on the planet - we're starting with smaller teams, but there are literally hundreds of thousands of companies out there for us to empower and maybe only a handful (Google, Facebook, Amazon), that are outside our scope. We are building the DevEx platform to empower the world.
In 2022, we raised a $25M Series A led by Initialized Capital (Garry Tan) and a16z (Peter Levine), with investments from Haystack Ventures, Garage VC, Tom Preston Warner (Founder/CEO of GitHub), Geoff Schmidt (Founder/CEO Apollo GraphQL), Nicolas Dessaigne (Founder/CEO Algolia), and Oleg Rognysky (Founder/CEO Peopl.ai).

What you'll do 🧑‍💻

  • Build data pipelines, text analysis algorithms, query engines, and decision making engines
  • Apply robust and fault-tolerant approaches to create scalable ingestion and data-processing systems
  • Debug, profile and optimize distributed data-intensive applicating, improving their latency, accuracy, resource consumption, and throughput
  • Work with existing applications built with Spark, S3, Timescale, Python and Rust
  • Directly implement services and features that leverage the results of your data pipelineImplement and improve machine learning and data pipelines

We're looking for 🔎

  • 5+ years of experience as an engineer with a strong understanding of key concepts in distributed systems
  • 3+ years of extensive experience in building and deploying data applications
  • Fluency in at least one, and ideally more than one, of these languages: Java/Scala/Kolin, Python, Go, Rust, or C++
  • Good understanding of following concepts: partitioning, replication, map-reduce, indexing, and CAP
  • Experience with distributed storage systems (S3, HDFS, Hive, ClickHouse, Elastic, etc), distributed processing engines (Spark, etc), and message queues (Kafka, SQS, etc)
  • Passion for building large-scale ML applications and improving software engineers' productivity
  • Some understanding of key concepts in natural language processing, machine learning, or statistical analysis
  • Some experience with machine learning stack (pandas, PyTorch, numpy, sci-kit, transformers, etc)

What we offer 🎁

  • Unlimited PTO
  • Competitive salary and equity
  • Work-life balance
  • Flexibility to be fully or partly remote
  • Few meetings, so you can ship fast and focus on building
  • One Medical membership on us!
  • Top-notch medical, dental, vision, short-term disability, long-term disability, and life insurance
  • All insurance is 100% company-paid ($0 premiums) for employees and highly subsidized for dependants
  • FSA, HSA with company contributions, and pre-tax commuter benefits
  • 401(k) plan
  • Paid parental leave ( up to 12 weeks)
The salary and equity range for this role are: $170K - $210K and .15% - .35%.
Don’t meet every single requirement? At Trunk, we are dedicated to building a diverse and inclusive workplace, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyways. You may be just the right candidate for this or other roles.
If you need assistance or an accommodation due to a disability, we're happy to help accommodate. Please contact us at [email protected].

Advice from our career coach

A successful applicant for this role at Trunk should be well-versed in building data pipelines, text analysis algorithms, and query engines, with a strong background in distributed systems. Here are some tips to stand out as an applicant:

  • Highlight your experience in building and deploying data applications, showcasing your expertise in Java/Scala/Kotlin, Python, Go, Rust, or C++.
  • Demonstrate your knowledge of key concepts such as partitioning, replication, map-reduce, indexing, and CAP.
  • Showcase your experience with distributed storage systems, distributed processing engines, and message queues.
  • Emphasize your passion for building large-scale ML applications and improving software engineers' productivity.
  • Discuss any experience with machine learning stack (pandas, PyTorch, numpy, sci-kit, transformers, etc) to showcase your versatility.

Apply for this job

Expired?

Please let Trunk know you found this job with RemoteJobs.org. This helps us grow!

About the job

Apr 25, 2024

Full-time

170k-210k USD

  1. US United States
RemoteJobs.org mascot