
(GitHub)(Powered by open source)(Pixi + GHA)(Pixi)Python Packaging framework design principles β
GitHub actions configured β
Vale.sh configured at PR level β
Pre-commit hooks configured for code linting/formatting β
Hello world LLM design example based on LangChain β
Environment management via pixi β
Reading data from online sources using intake β
Sample pipeline built using Dagster β
Building Dashboard using holoviews + panel β
Exploratory data analysis (EDA) using mito β
Web UI build on Flask β
Web UI re-done and expanded with FastHTML β
Leverage AI models to analyse data GitHub AI models BetaGITHUB_TOKEN, instructions herecd DataJourneypixi shellpixi run DJ_packagepixi run DJ_listpixi run <TASK_NAME>pixi run -v <TASK_NAME>tasks under DJGIT_TOKEN_CHECK Verifies the availability and validity of the Git authentication token. DJ_package Prepares and builds the Python package for the DataJourney project. DJ_pre_commit Runs pre-commit hooks to ensure code quality and adherence to standards. DJ_dagster Sets up and runs a Dagster workflow for orchestration in the project. DJ_fasthtml_app Executes a FastAPI-based HTML application. DJ_flask_app Configures and runs a Flask-based application for data services. DJ_mito_app Launches the Mito application for interactive data analysis in notebooks. DJ_panel_app Executes a Panel dashboard app for data visualization and analytics. DJ_llm_analysis Performs analysis using large language models (LLMs) on project data. DJ_hello_world_langchain Sets up a basic LangChain app as a "Hello World" example for LLMs. DJ_spanish_eng_translation Performs Spanish to English translation with Deepseek-R1 (NOTE: Takes about ~30 secs to execute this task) DJ_sync_dataset_trees Downloads and synchronizes the trees.csv dataset into the project structure.Posted Apr 27, 2025
Designed an open-source data management toolkit for modular workflows.
0
1