AI-Powered Digital Archive for Cultural Heritage by Jossaafad Herrera AlfaroAI-Powered Digital Archive for Cultural Heritage by Jossaafad Herrera Alfaro
A cultural heritage organization needed a way to preserve and catalog thousands of historical artifacts, documents, and media files. Their existing process was entirely manual, with metadata scattered across spreadsheets and physical records. Finding a specific item could take hours.
The Solution
I built an AI-driven digital archive that automates cataloging and makes the entire collection searchable in seconds.
How it works:
Staff uploads media (photos, videos, documents) through a simple web interface
The system auto-generates tags and cross-references related items
Everything is stored in a searchable database with full-text and semantic search
Key features:
Automated metadata extraction powered by Gemini
Semantic search across the entire collection
Multi-format support (images, video, PDFs, audio)
Role-based access for researchers, curators, and public visitors
Export tools for academic citations and reports
Tech Stack
AI: Gemini
Backend: Python
Database: Firebase
Search: Custom semantic search pipeline
Results
Cataloging time reduced from 45 minutes per item to under 3 minutes
Entire collection searchable for the first time in the organization's history
Public access portal launched, increasing community engagement by 5x
Like this project
Posted May 25, 2026
AI-driven digital archive preserving cultural heritage through automated cataloging, Gemini-powered metadata extraction, and searchable media collections.