class: title, center, middle # Who is an analytics engineer? --- # Who are you? - [Intros!](https://docs.google.com/presentation/d/1-IIuQjG9KYL5QYPaoSc1njFnMjxrIpF5PLlHDsBVqbE/present) - Where are you calling from? (city + room) - What does your day-to-day work look like? --- # The traditional data team .left-column-33[ #### Data Engineers - Build data warehouse infrastructure - Extract & load data into your warehouse - Transform data to make it easier to work with ] .right-column-66[ .left-column[ #### Data Analyst - Build dashboards & reports - Insights work ] .right-column[ #### Stakeholder - Politely wait for their analyst to give them numbers ] ] --- # Modern data stack
??? - MPP warehouses replaced the need for engineers to maintain warehouse infrastructures - Stitch and Fivetran got data into your warehouse, but the data was then in a source conformed format - BI tools (Looker) allowed consumers to do their own deep dives into data - Problem: people only had access to the raw data Teaching notes: - flip back to traditional data team and annotate to show how it doesn't make sense anymore (cross parts out, arrow other parts around, particularly stakeholders not wanting to wait for the analysts) --- # Modern data stack: consequences .dense-text[ Have you ever: - Had two stakeholders have different values for a KPI, and not been sure what was right? - Rewritten the same snippet of SQL? Or forgotten to write it? ```sql select * from orders where status not in ('cancelled', 'false') and deleted_at is null and user_id != 12 ``` - Broken all your dashboards because of a bad join? - Had data go stale? And only found out when someone else noticed? - Not really been sure where your data came from? ] ??? audience: Share your story of the silliest data problem you've had --- # Enter: dbt
.center[_Read the [Viewpoint](https://docs.getdbt.com/docs/about/viewpoint/)_] ??? * The solution? Transform your data in a shape that's ready for analytics. But don't just do that: - test it - document it * Anyone who knows SQL can do this! * This role is neither data engineering nor data analysis... it's analytics engineering --- ## dbt & Analytics Engineering
??? What do these software engineering best practices look like? All that and more... after the break! --- class: subtitle # Questions so far? --- class: subtitle # Zoom Out
Who is an analytics engineer?
Distributed dbt Learn norms
dbt Project Design
Testing & CI