
Support for Hugging Face Inference API in Weaviate

What's this blog post about?

Vector databases like Weaviate use machine learning models for tasks such as summarizing text, auto-labeling data tokens, and converting data into vectors for context-based queries. Running model inference in production is challenging, however: it requires expensive specialized hardware, and that hardware is often underutilized. Companies like Hugging Face, OpenAI, and Cohere therefore offer model inference as a service. Weaviate v1.15 introduces support for the Hugging Face Inference API, allowing users to work with pre-trained sentence similarity models or upload their own private models. Weaviate handles the communication with the Inference API automatically, and users can choose between the text2vec-huggingface and text2vec-openai modules.
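As a rough illustration of what selecting the text2vec-huggingface module can look like, the sketch below builds a Weaviate class definition and the request header that carries a Hugging Face API key. It is a minimal sketch, not taken from the blog post: the class name `Document`, the `content` property, the model `sentence-transformers/all-MiniLM-L6-v2`, and the helper function names are all illustrative assumptions, and the code only constructs the payloads without contacting Weaviate or Hugging Face.

```python
import json


def build_class_definition(class_name: str, model: str) -> dict:
    """Build a Weaviate class definition that uses the text2vec-huggingface module.

    The class name, property name, and model are illustrative assumptions.
    """
    return {
        "class": class_name,
        # Tell Weaviate which vectorizer module should embed objects of this class.
        "vectorizer": "text2vec-huggingface",
        "moduleConfig": {
            "text2vec-huggingface": {
                # Any sentence similarity model hosted on Hugging Face (assumed here).
                "model": model,
                # Ask the Inference API to wait until the model is loaded.
                "options": {"waitForModel": True},
            }
        },
        "properties": [{"name": "content", "dataType": ["text"]}],
    }


def build_headers(hf_api_key: str) -> dict:
    """Header used to pass the Hugging Face API key along with Weaviate requests."""
    return {"X-HuggingFace-Api-Key": hf_api_key}


if __name__ == "__main__":
    definition = build_class_definition(
        "Document", "sentence-transformers/all-MiniLM-L6-v2"
    )
    print(json.dumps(definition, indent=2))
    print(build_headers("YOUR_HUGGINGFACE_API_KEY"))
```

With a running Weaviate instance that has the module enabled, a definition like this would be submitted as the class schema, and the header would accompany import and query requests so that Weaviate can call the Inference API on the user's behalf.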

Company
Weaviate

Date published
Sept. 27, 2022

Author(s)
Sebastian Witalec

Word count
1048

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.