Data Engineer
Software Architect
Algolia
dbt
Python
Posted Aug 23, 2024
[…] dbt for a little while, chances are your project has more than 50 models. Chances are more than 10 people are building dashboards based on those models.
[…] dbt) - was writing:
[…] Amundsen. As Paco Nathan writes on p. 115 of the book Data Teams by Jesse Anderson (you can find my review of the book here):
[…] dbt Docs?
[…] dbt project is tracked in version control, so by parsing git's metadata, we can for example know each model's owner.
[…] stl_insert, svv_table_info, stl_query, predicate columns).
[…] .yml file. Assuming there is a conscious curation effort from the code maintainers when they place a model in a folder in the dbt codebase, we can use folder names as a faceting attribute too. Lastly, we can use the dbt graph to propagate, from left to right, the sources that each model depends on; this will serve as a useful faceting attribute.
[…] dbt-metadata-utils. I will walk through a couple of key parts here, but feel free to check out the full code there, and if you want to use it on your own project, hit me up.
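To make the three ideas above concrete (owner from git, folder as a facet, sources propagated through the graph), here is a minimal sketch. It is not the actual dbt-metadata-utils code. It assumes a dictionary shaped like dbt's manifest.json, where each node carries name, resource_type, original_file_path and depends_on.nodes, and it assumes you have already collected commit author emails per file, for example with git log --format=%ae -- <path>. The function names, the "most frequent committer is the owner" heuristic, and the sample project are all hypothetical.

```python
from collections import Counter
from pathlib import PurePosixPath


def model_owner(author_emails):
    """Pick the most frequent commit author as owner (an assumed heuristic)."""
    if not author_emails:
        return None
    return Counter(author_emails).most_common(1)[0][0]


def folder_facet(original_file_path):
    """Return the folder directly under models/, or None if there is none."""
    parts = PurePosixPath(original_file_path).parts
    return parts[1] if len(parts) > 2 and parts[0] == "models" else None


def upstream_sources(unique_id, nodes, cache):
    """Collect every source a node depends on, directly or transitively."""
    if unique_id in cache:
        return cache[unique_id]
    found = set()
    for parent in nodes.get(unique_id, {}).get("depends_on", {}).get("nodes", []):
        if parent.startswith("source."):
            found.add(parent)
        else:
            # Walk further left in the graph; dbt graphs are acyclic.
            found |= upstream_sources(parent, nodes, cache)
    cache[unique_id] = found
    return found


def build_records(manifest, authors_by_path):
    """Build one search record per model, with facets ready for indexing."""
    nodes = manifest["nodes"]
    cache = {}
    records = []
    for unique_id, node in nodes.items():
        if node.get("resource_type") != "model":
            continue
        path = node["original_file_path"]
        records.append({
            "objectID": unique_id,  # Algolia uses objectID as the record key
            "name": node["name"],
            "owner": model_owner(authors_by_path.get(path, [])),
            "folder": folder_facet(path),
            "sources": sorted(upstream_sources(unique_id, nodes, cache)),
        })
    return records


# Hypothetical two-model project: a staging model feeding a mart.
manifest = {
    "nodes": {
        "model.shop.stg_orders": {
            "name": "stg_orders",
            "resource_type": "model",
            "original_file_path": "models/staging/stg_orders.sql",
            "depends_on": {"nodes": ["source.shop.raw.orders"]},
        },
        "model.shop.orders": {
            "name": "orders",
            "resource_type": "model",
            "original_file_path": "models/marts/core/orders.sql",
            "depends_on": {"nodes": ["model.shop.stg_orders"]},
        },
    }
}
authors = {
    "models/marts/core/orders.sql": [
        "ana@example.com", "bo@example.com", "ana@example.com",
    ],
}
records = build_records(manifest, authors)
```

The orders model never references the raw source directly, yet its record still lists it, because the source is carried along the graph from stg_orders. Records shaped like this could then be pushed to a search index, with owner, folder and sources declared as faceting attributes.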
In this post, I'll show you how to build a lightweight data catalog on top of dbt artifacts using Algolia.