Skip to content

Multimodal Recipe Recommender using Qdrant, LlamaIndex, and Google Gemini

Notifications You must be signed in to change notification settings

T-AIMaven/Multimodal-RAG-with-Video-Frames

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Multimodal Recipe Recommender using Qdrant, LlamaIndex, and Google Gemini 🚀

youtube-multimodal3

This repository contains a notebook with a multimodal system using images as frames from YouTube videos, LlamaIndex framework, Qdrant as a vector database, and Gemini as embedding and llm model.

Main Steps

  • Data Ingestion: Load videos and metadata from a YouTube playlist
  • Indexing: MultiModalVectorStoreIndex from LlamaIndex
  • Embedding and Model: Gemini
  • Vector Store: Qdrant with 2 collections (text and images)
  • Query Retrieval: Top recipe and frame images

Feel free to ⭐ and clone this repo 😉

Tech Stack

Python Google Cloud Jupyter Notebook

About

Multimodal Recipe Recommender using Qdrant, LlamaIndex, and Google Gemini

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published