Senior Data Engineer



Data Science
San Francisco, CA, USA
Posted on Thursday, July 11, 2024

About Collective:

Collective is on a mission to redefine the way businesses-of-one work. Collective’s technology and team of trusted advisors enables our members to achieve financial independence by taking care of everything from business incorporation to accounting, bookkeeping, tax services and access to a thriving community, all in one integrated platform. We believe in empowering self-employed people to enjoy the same tax savings that big companies get.

Featured in Forbes, Business Insider, Yahoo, Bloomberg, Financial Times, Techcrunch and more. We’re backed by General Catalyst, Sound Ventures (Ashton Kutcher and Guy Oseary), QED Investors, Google’s Gradient Ventures, Expa and prominent investors who have financed and built iconic companies like YouTube, Substack, Twitch, Box, Hims, Instacart, Lyft, and more.

About the role:

We are building Collective’s data analytics infrastructure and looking for an experienced Senior Data Engineer to join our data team. This role will suit someone who enjoys designing data architectures and has a strong technical background in data platforms and pipelines. The ability to communicate and translate business needs into technical data solutions is essential.

We are looking for an experienced Senior Data Engineer as the first engineer to join the data team. In this role, you will expand and optimize our data infrastructure, pipelines, and analytics platforms. You will also support business stakeholders by building and maintaining the right sources and aggregations. This role offers an exciting opportunity to iterate and test rapidly, lead, grow and scale our data analysis capabilities.

🛠 Responsibilities

  • Build a comprehensive strategic data roadmap with an emphasis on the long-term needs of the company and best practices for data development.
  • Develop and maintain our BigQuery data warehouse, connected to sources like Salesforce, Marketing Cloud, Facebook Ads, Segment, etc.
  • Create, maintain, and optimize our data pipeline architecture, ensuring all data systems meet company requirements for security, reliability, and efficiency. We use DBT as our data aggregation layer and Fivetran as our ETL tool.
  • Lead design of data architecture and create database views, schemas, tables, indexes, and other constructs to optimize stakeholders’ access to data..
  • Document and communicate data consumption processes, tools, and methodologies to stakeholders.
  • Work with data analysts, product managers, and other stakeholders to understand analytics and modeling needs and provide data to power dashboards, reports, and statistical models.
  • Proactively monitor and control data quality and establish procedures to ensure accuracy and reliability.
  • Stay up-to-date on the latest advancements in data engineering technologies and best practices to continually evaluate and improve existing systems
  • Implement data quality checks, data validation, and data cleaning processes to ensure the accuracy and high quality of ingested and processed data.

🙌 What we want you to bring:

  • Master's/bachelor's degree in computer science, business analytics, information technology, statistics, or a related field
  • 5+ years professional experience as a data engineer or similar role within a production environment.
  • Expert knowledge of SQL and Python
  • Experience with BigQuery or other data warehouses, aggregation technologies such as DBT or other ETL technologies.
  • Familiarity with data analysis techniques and tools
  • Experience building and maintaining data pipelines and ETL processes in the cloud (ideally GCP)
  • Experience in DDM (dimensional data modeling) and schema design
  • Experience developing data with CDP software such as Segment.
  • Working knowledge of BI tool such as Metabase, Tableau, or Looker.
  • Experience supporting business intelligence initiatives and working with analytical stakeholders to translate business requirements into technical specifications
  • Excellent communication and collaboration skills.
  • Knowledge of cloud infrastructure best practices and experience working with continuous integration and continuous deployment processes and tools
  • Ability to communicate effectively with stakeholders to define requirements and timelines
  • Passion and excitement to serve as a technical mentor and thought leader and interest in evangelizing data engineering best practices

Why work with us:

  • Collective is the first online back office platform designed for Businesses-of-One.
  • Tax is a very challenging space and you will grow fast.
  • We were backed by top tier VCs.
  • We promote open and transparent culture within the company.
  • We’re fast-paced and you learn every day.

What we offer in return

  • Compensation Range: $185,000 - $210,000 USD annual base salary
  • Hybrid work environment
  • A diverse and collaborative team culture
  • Stock options package
  • 14 company holidays + unlimited PTO
  • 401K
  • Employer paid Health, Vision and Dental Insurance
    • 100% coverage for employees
    • 75% coverage for dependents
    • up to $5,000 for out of state travel expenses, if needed
  • PC or Mac laptop + $750 Home office stipend
  • Paid parental leave
  • Team events and virtual gatherings
  • $600 wellness bonus

Equal Employment Opportunity

At Collective, we don’t just accept differences — we celebrate them, we support them, and we thrive on them for the benefit of our employees, our customers, and our community. We are committed to building a team that represents a variety of backgrounds, perspectives, and skills. If you’re good at what you do, come as you are. The more inclusive we are, the better our work will be. Collective is proud to be an equal opportunity workplace.