How to Create Vector Embeddings in Python

Company

DataStax

Date Published

April 8, 2025

Author

Phil Nash

Word count

1409

Language

English

Hacker News points

None

URL

www.datastax.com/blog/how-to-create-vector-embeddings-in-python

Summary

When building a retrieval-augmented generation (RAG) app, you need to prepare your data by creating vector embeddings in various ways such as locally, via API, via a framework or with Astra DB's Vectorize. Pre-trained embedding models like Sentence Transformers and all-MiniLM-L6-v2 can be used to generate vector embeddings. Local embedding models are useful for experimentation on laptops or hardware acceleration, while APIs provided by services like OpenAI, Google and Cohere offer an alternative option. Frameworks like LangChain and LlamaIndex provide standardized interfaces that abstract the complexities of embedding models and APIs. Astra Vectorize enables Astra DB to automatically generate vector embeddings as documents are inserted or queries are performed, simplifying code maintenance, improving performance and efficiency.