/plushcap/analysis/zilliz/use-milvus-and-airbyte-for-similarity-search-on-all-your-data

Use Milvus and Airbyte for Similarity Search on All Your Data

What's this blog post about?

Milvus is an open-source vector database used to store, index, and efficiently search high-dimensional vector data. It's particularly useful in applications involving similarity searches across unstructured data, such as Generative Chat responses, product recommendations, and more. By using Airbyte, it's straightforward to transfer data from many different sources into Milvus, calculating vector embeddings of texts along the way. The power of embeddings is to be able to search for relevant pieces of information, even if similar concepts are phrased differently. This article demonstrates how to use Zilliz Cloud as a vector store, Airbyte to extract and load the data, OpenAI embedding API to calculate embeddings, and Streamlit to build a smart submission form showing relevant data.

Company
Zilliz

Date published
Oct. 10, 2023

Author(s)
Joe Reuter

Word count
1909

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.