/plushcap/analysis/weaviate/weaviate-weaviate-non-english-languages

Using Weaviate with Non-English Languages

What's this blog post about?

The use of Weaviate with non-English languages is possible if the embedding models and large language models (LLMs) used support the chosen language. However, there are challenges when working with non-English languages in search or generative AI applications, such as the necessity of capable language models and character encoding issues for languages that cannot be ASCII encoded, like Chinese, Hindi, or Japanese. Weaviate can be used for semantic and generative searches with non-English languages by ensuring support from embedding models and converting escaped Unicode characters to human-readable ones. Currently, there are limitations in using Weaviate with non-English languages, mainly affecting keyword-based search functionality and hybrid search functionality.

Company
Weaviate

Date published
Jan. 30, 2024

Author(s)
Leonie Monigatti

Word count
1349

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.