/plushcap/analysis/zilliz/choosing-the-right-embedding-model-for-your-data

Choosing the Right Embedding Model for Your Data

What's this blog post about?

Retrieval Augmented Generation (RAG) is an approach in Generative AI that utilizes data to enhance the knowledge of Language Learning Model (LLM) generators, such as ChatGPT. RAG consists of two LLMs: embedding and generator models, both used in inference mode. The HuggingFace MTEB leaderboard provides a comprehensive list of text embedding models, where users can filter by language or specialty domain like law. Users should be cautious when selecting models as some may be overfitted, resulting in deceptively high rankings. ResNet50 is a popular Convolutional Neural Network (CNN) model for image data and PANNs are commonly used embedding models for audio data. Multimodal embedding models like SigLIP or Unum can handle text, image, audio, or video data simultaneously. For multimodal applications involving sound or video, a generative LLM is often employed to convert the input into text before using RAG techniques.

Company
Zilliz

Date published
May 22, 2024

Author(s)
By Christy Bergman

Word count
1051

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.