Dish Decode- Flask Based API for Recipe Extraction from video

Aniket Panchal

0

ML Engineer

Software Engineer

AI Developer

GitHub

Google Gemini

Postman

Flask API: Built an API to process audio files and YouTube transcripts. High-Accuracy Transcription: Used Deepgram & Whisper for precise audio-to-text conversion.
Recipe Extraction: Leveraged Gemini API to structure transcriptions into detailed recipes.
Multimedia Processing: Enabled both audio URL & YouTube video handling cues extraction.
Tech Stack: Demonstrated skills in Python, API integrations, Gemini AI, Whisper AI.

🛠️ Key Features

Deepgram API for accurate audio transcription.
Tesseract OCR for extracting text from video frames.
Gemini API for generating structured recipe information.
FFmpeg for seamless MP4-to-WAV conversion.
Supports both audio and video analysis for enhanced accuracy. 🎯
Like this project
0

Posted Feb 13, 2025

This project is a Flask-based API that extracts structured recipe information from cooking tutorial videos, using OCR, Whisper, and Gemini AI!

Likes

0

Views

0

Tags

ML Engineer

Software Engineer

AI Developer

GitHub

Google Gemini

Postman

Comment Feel - YouTube Comments Sentiment Analyzer Tool
Comment Feel - YouTube Comments Sentiment Analyzer Tool
🤖 MediBot AI – AI-Powered Medical Chatbot
🤖 MediBot AI – AI-Powered Medical Chatbot