


index.
data model and dimnensions in a layman termdatetime_dim dimension tableGranularity of data.The level of detail at which the attributes and characteristics (columns) of data are defined so the more we dive deeper the granularity of data becomes (individual level info) and the more we aggregate data the granularity becomes higher (department level info). High Granularity can be related to as seeing a whole picture (bigger view) and low level granularity means seeing individual elements of the whole picture. e.g seeing an individual salary would be lower granular and seeing the department's average salary would be high granular level of detail.

passenger_count_dim dimension table

trip_distance_dim dimension table

rate_code_dim dimension table


pickup_location_dim dimension tablePULocationID and DOLocationID column which doesn't mean anything to us unless we have a location name. Therefore, we will be using a lookup table which has mappings for each of location_id and their zone/location. If we refer to the above diagram, the lookup table which i am referring to is as follows


dropoff_location_dim dimension table
payment_type_dim dimension table

fact_table dimension table

Posted Jan 12, 2024
A Complete Data Engineering Workflow, Data Modelling and Advanced Analytics using Python, DuckDB - GitHub - DanielOX/Data-Engineering-Workflow-DUCKDB: A Comple…