Multimodal Recipe Recommender using Qdrant, LlamaIndex, and Google Gemini 🚀

This repository contains a notebook with a multimodal system using images as frames from YouTube videos, LlamaIndex framework, Qdrant as a vector database, and Gemini as embedding and llm model.

Main Steps

Data Ingestion: Load videos and metadata from a YouTube playlist
Indexing: MultiModalVectorStoreIndex from LlamaIndex
Embedding and Model: Gemini
Vector Store: Qdrant with 2 collections (text and images)
Query Retrieval: Top recipe and frame images

Feel free to ⭐ and clone this repo 😉

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
youtube_recipes_multimodal.ipynb		youtube_recipes_multimodal.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multimodal Recipe Recommender using Qdrant, LlamaIndex, and Google Gemini 🚀

Tech Stack

About

Releases

Packages

Languages

T-AIMaven/Multimodal-RAG-with-Video-Frames

Folders and files

Latest commit

History

Repository files navigation

Multimodal Recipe Recommender using Qdrant, LlamaIndex, and Google Gemini 🚀

Tech Stack

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages