/plushcap/analysis/activeloop/activeloop-how-we-finetuned-a-large-language-model-to-search-patents-generate-new-patents

How We Finetuned a Large Language Model to Search Patents & Generate New Patents

What's this blog post about?

The blogpost discusses the technical journey behind building a fully custom LLM-based retrieval augmented generation and search app, PatentPT. It highlights the features of PatentPT, its technical architecture, dataset creation, domain training, finetuning large language models for patent generation, creating custom featurizers, standing up search indices, deploying search APIs, deploying LLM inference APIs, and the final application. The post emphasizes that while the stack for training and deploying fine-tuned LLMs is not yet solidified, PatentPT showcases an efficient approach using cutting-edge technologies like Deep Lake from Activeloop, Hugging Face Optimum by Intel, and Habana Gaudi HPU hardware.

Company
Activeloop

Date published
Jan. 15, 2024

Author(s)
Jacob Solawetz

Word count
1616

Language
English

Hacker News points
4


By Matt Makai. 2021-2024.