VectorDB-recipes


Dive into building GenAI applications! This repository contains examples, applications, starter code, and tutorials to help you kickstart your GenAI projects.
  • These are built using LanceDB, a free, open-source, serverless vectorDB that requires no setup.
  • It integrates with the Python data ecosystem, so you can use it in your existing data pipelines built on pandas, Arrow, Pydantic, etc.
  • LanceDB has a native TypeScript SDK, which lets you run vector search in serverless functions!

Join our community for support: Discord, Twitter

This repository is divided into 2 sections:

  • Examples - Get right into the code with minimal introduction, aimed at getting you from an idea to PoC within minutes!
  • Applications - Ready-to-use Python and web apps built with applied LLMs, VectorDB, and GenAI tools

Examples

Applied examples that get right into the code with minimal introduction, aimed at getting you from an idea to PoC within minutes! Examples are available as:

  • Colab notebooks - these build the application in stages, allowing you to investigate results at every intermediate stage.
  • Python scripts - for cases where you'd like to use the file directly or integrate snippets into your application.
  • JS/TS scripts - some examples are written using LanceDB's native JS library! These scripts/snippets can also be integrated directly into your web applications.

The following examples are organized into different tables to make similar types of examples easily accessible.

Build from Scratch

Build applications/examples using LanceDB for efficient vector-based document retrieval.

Build from Scratch | Interactive Notebook & Scripts
Build RAG from Scratch | Open In Colab, LLM, beginner
Local RAG from Scratch with Llama3 | Python, local LLM, beginner
Multi-Head RAG from Scratch | Python, LLM, local LLM, beginner

MultiModal

Create a multimodal search application using LanceDB for efficient vector-based retrieval of text and image data. Input text or image queries to find the most relevant documents and images from your corpus.

Multimodal | Interactive Notebook & Scripts | Blog
Multimodal CLIP: DiffusionDB | Open In Colab, Python, LLM, beginner | Ghost
Multimodal CLIP: Youtube videos | Open In Colab, Python, LLM, beginner | Ghost
Multimodal Image + Text Search | Open In Colab, Python, LLM, intermediate | Ghost

RAG

Develop a Retrieval-Augmented Generation (RAG) application using LanceDB for efficient vector-based information retrieval. Input text queries to retrieve relevant documents and generate comprehensive answers by combining retrieved information.

RAG | Interactive Notebook & Scripts | Blog
RAG with Matryoshka Embeddings and LlamaIndex | Open In Colab, LLM, beginner | -
Improve RAG with Re-ranking | Open In Colab, LLM, beginner | Ghost
Instruct-Multitask | Open In Colab, Python, LLM, beginner | Ghost
Improve RAG with HyDE | Open In Colab, LLM, intermediate | Ghost
Improve RAG with LOTR | Open In Colab, LLM, intermediate | Ghost
Advanced RAG: Parent Document Retriever | Open In Colab, LLM, intermediate | Ghost
Corrective RAG with Langgraph | Open In Colab, LLM, intermediate | Ghost
Contextual-Compression-with-RAG | Open In Colab, local LLM, intermediate | Ghost
Improve RAG with FLARE | Open In Colab, local LLM, LLM, advanced | Ghost
Query Expansion and Reranker | Open In Colab, LLM, advanced | -
RAG Fusion | Open In Colab, LLM, advanced | -
Agentic RAG | Open In Colab, LLM, advanced | -
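
The RAG examples above share one step: combining retrieved text with the user's question before calling an LLM. Here is a minimal sketch of that step in plain Python. The `build_rag_prompt` helper, the prompt wording, and the character budget are all assumptions made for illustration; they are not taken from any notebook in this repo, and the hard-coded chunks stand in for real vector search results.

```python
def build_rag_prompt(question, retrieved_chunks, max_context_chars=500):
    """Combine retrieved text chunks and a question into a single LLM prompt.

    Chunks are assumed to be ordered from most to least relevant. A chunk
    that would push the context over the character budget is skipped.
    """
    context_parts = []
    used = 0
    for chunk in retrieved_chunks:
        if used + len(chunk) > max_context_chars:
            continue  # skip chunks that do not fit in the remaining budget
        context_parts.append(chunk)
        used += len(chunk)
    context = "\n---\n".join(context_parts)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


# Hypothetical chunks, standing in for the results of a vector search.
chunks = [
    "LanceDB is an open-source, serverless vector database.",
    "It integrates with pandas, Arrow, and Pydantic.",
]
prompt = build_rag_prompt("What is LanceDB?", chunks)
print(prompt)
```

The resulting string is what you would send to the LLM of your choice. The techniques listed in the table (re-ranking, HyDE, query expansion, and so on) mostly change which chunks reach this step and in what order.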

Vector Search

Build a vector search application using LanceDB for efficient vector-based document retrieval. Input text queries to find the most relevant documents from your corpus.

Vector Search | Interactive Notebook & Scripts | Blog
Inbuilt Hybrid Search | Open In Colab, LLM, beginner | -
Hybrid search BM25 & lancedb | Open In Colab, LLM, beginner | Ghost
NER powered Semantic Search | Open In Colab, local LLM, beginner | Ghost
Audio Search | Open In Colab, Python, LLM, beginner | -
Multi-lingual search | Open In Colab, Python, LLM, beginner | -
Facial Recognition | Open In Colab, beginner | -
Sentiment Analysis : Analysing Hotel Reviews | Open In Colab, local LLM, beginner | Ghost
Vector Arithmetic with LanceDB | Open In Colab, LLM, beginner | Ghost
Imagebind demo app | hf spaces, intermediate | -
Search Within Images | Open In Colab, local LLM, intermediate | Ghost
Vector Search with TransformersJS | JS, LLM, advanced | -
Accelerate Vector Search Applications Using OpenVINO | Open In Colab, local LLM, advanced | Ghost
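
All of the vector search examples above rely on nearest-neighbor search over embedding vectors. As a rough illustration only, this pure-Python sketch ranks a tiny hand-written corpus by cosine similarity to a query vector. The vectors, document names, and the `top_k` helper are hypothetical, and this is not LanceDB's API; LanceDB handles storage and indexing for you.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def top_k(query, corpus, k=2):
    """Return the k (name, score) pairs most similar to the query."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in corpus.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical 3-dimensional "embeddings"; real models produce far more dimensions.
corpus = {
    "doc_cats": [0.9, 0.1, 0.0],
    "doc_dogs": [0.8, 0.3, 0.1],
    "doc_cars": [0.0, 0.2, 0.9],
}

results = top_k([1.0, 0.0, 0.0], corpus, k=2)
print(results)
```

A brute-force scan like this is fine for a handful of vectors. For larger corpora, a vector database replaces the linear scan with an index.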

Chatbot

Create a chatbot application using LanceDB for efficient vector-based response generation. Input user queries to retrieve relevant context and generate coherent, context-aware replies.

Chatbot | Interactive Notebook & Scripts | Blog
Databricks DBRX Website Bot | Python, Databricks LLM, beginner | -
CLI-based SDK Manual Chatbot with Phidata | Python, local LLM, beginner | -
Youtube transcript search bot | Open In Colab, Python, JS, LLM, intermediate | -
Langchain: Code Docs QA bot | Open In Colab, Python, JS, LLM, intermediate | -
Context-Aware Chatbot using Llama 2 & LanceDB | Open In Colab, local LLM, advanced | Ghost
Chat with csv using Hybrid Search | Open In Colab, LLM, intermediate | Ghost

Evaluation

Develop an evaluation application. Input reference and candidate texts to measure their performance on various metrics.

Evaluation | Interactive Notebook & Scripts | Blog
Evaluating Prompts with Prompttools | Open In Colab, LLM, local LLM, advanced | -
Evaluating RAG with RAGAs | Open In Colab, LLM, intermediate | -
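
To make "measuring a candidate text against a reference" concrete, here is a small sketch of one simple metric, token-level F1. It is a stand-in chosen for illustration and is not what Prompttools or RAGAs compute; the `token_f1` name and the whitespace tokenization are assumptions.

```python
from collections import Counter


def token_f1(reference, candidate):
    """Token-overlap F1 between a reference and a candidate text.

    Both texts are lowercased and split on whitespace. Repeated tokens
    count only as often as they appear in both texts.
    """
    ref_tokens = reference.lower().split()
    cand_tokens = candidate.lower().split()
    if not ref_tokens or not cand_tokens:
        return 0.0
    overlap = sum((Counter(ref_tokens) & Counter(cand_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(cand_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


score = token_f1("the cat sat on the mat", "the cat lay on the mat")
print(round(score, 3))
```

Overlap metrics like this only reward shared wording. The tools in the table above go further, but the input shape is the same: a reference, a candidate, and a score.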

AI Agents

Design an AI agent coordination application with LanceDB for efficient vector-based communication and collaboration. Input queries to let AI agents exchange information, coordinate tasks, and achieve shared goals.

AI Agents | Interactive Notebook & Scripts | Blog
AI Agents: Reducing Hallucination | Open In Colab, Python, JS, LLM, advanced | Ghost
AI Trends Searcher with CrewAI | Open In Colab, LLM, beginner | Ghost
SuperAgent Autogen | Open In Colab, LLM, intermediate | -

Recommender Systems

Create a recommender system application with LanceDB for efficient vector-based item recommendation. Input user preferences or item features to generate personalized recommendations and enhance user experience.

Recommender Systems | Interactive Notebook & Scripts | Blog
Movie Recommender | Open In Colab, Python, beginner | -
Movie Recommender with Genre | Open In Colab, beginner | Ghost
Product Recommender | Open In Colab, Python, intermediate | -
Arxiv paper recommender | Open In Colab, Python, LLM, beginner | -

Concepts

Check out key concepts in the LLM application pipeline that help ensure accurate information retrieval.

Concepts | Interactive Notebook | Blog
A Primer on Text Chunking and its Types | Open In Colab, beginner | Ghost
Langchain LlamaIndex Chunking | Open In Colab, beginner | Ghost
Create structured dataset using Instructor | Python, beginner | -
Comparing Cohere Rerankers with LanceDB | beginner | Ghost
Product Quantization: Compress High Dimensional Vectors | intermediate | Ghost
LLMs, RAG, & the missing storage layer for AI | intermediate | Ghost
Fine-Tuning LLM using PEFT & QLoRA | Open In Colab, local LLM, advanced | Ghost
Extracting Complex tables-text from PDFs using LlamaParse | Open In Colab, LLM LlamaCloud, beginner | -
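
Text chunking, the first concept in the table above, can be illustrated with a short sketch. This shows only fixed-size character chunking with overlap, which is one of several chunking strategies. The `chunk_text` helper and its default sizes are assumptions for illustration, not code from the linked notebooks.

```python
def chunk_text(text, chunk_size=40, overlap=10):
    """Split text into fixed-size character chunks that overlap.

    Each chunk starts (chunk_size - overlap) characters after the previous
    one, so neighboring chunks share `overlap` characters of context.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last chunk already reaches the end of the text
    return chunks


text = "LanceDB is a free, open-source, serverless vectorDB that requires no setup."
for chunk in chunk_text(text):
    print(repr(chunk))
```

The overlap keeps a sentence that straddles a chunk boundary partly visible in both neighbors, at the cost of storing some text twice.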

Projects & Applications

These are ready-to-use applications built using the LanceDB serverless vector database. You can explore these open-source projects, use parts of them in your own projects, or build your applications on top of them.

Project Name | Description
YOLOExplorer | Iterate on your YOLO / CV datasets using SQL, vector semantic search, and more within seconds
Website Chatbot (Deployable Vercel Template) | Create a chatbot from the sitemap of any website/docs of your choice. Built using the vectorDB serverless native JavaScript package.
Chat with multiple URL/website | Conversational AI for any website with Mistral, BGE embeddings & LanceDB
Talk with Youtube Video using GPT4 Vision API | Talk with a Youtube video using the GPT4 Vision API and Langchain
Talk with Podcast | Talk with a Youtube podcast using Ollama and insanely-fast-whisper
Talk with Wikipedia | Talk with Wikipedia pages
Talk with Github | Talk with Github Codespaces using Qwen1.5
Document Chat with Langroid | Talk with your documents using Langroid
Hr chatbot | HR chatbot: ask your personal query using a zero-shot ReAct agent & tools
Advanced Chatbot with Parler TTS | This chatbot app uses LanceDB hybrid search, FTS & a reranker method with the Parler TTS library.
Multi-Modal Search Engine | Create a multimodal search engine app to search images using either images or text
Multimodal Myntra Fashion Search Engine | This app uses OpenAI's CLIP to build a search engine that understands both text and images.
Multilingual-RAG | Multilingual RAG with Cohere embeddings and support for 100+ languages
Fastapi RAG template | FastAPI-based RAG template with WebSocket support
GTE MLX RAG | MLX-based RAG model using LanceDB API support
Healthcare Chatbot | Healthcare chatbot using a domain-specific LLM & embedding model

🌟 New! 🌟 Applied GenAI and VectorDB course on Udacity: learn about GenAI and vectorDBs using LanceDB in the recently launched Udacity course.

Contributing Examples

If you're working on some cool applications that you'd like to add to this repo, please open a PR!
