How We Finetuned a Large Language Model to Search Patents & Generate New Patents

Company

Activeloop

Date Published

Jan. 15, 2024

Author

Jacob Solawetz

Word count

1616

Language

English

Hacker News points

URL

www.activeloop.ai/resources/how-we-finetuned-a-large-language-model-to-search-patents-generate-new-patents

Summary

The blogpost discusses the technical journey behind building a fully custom LLM-based retrieval augmented generation and search app, PatentPT. It highlights the features of PatentPT, its technical architecture, dataset creation, domain training, finetuning large language models for patent generation, creating custom featurizers, standing up search indices, deploying search APIs, deploying LLM inference APIs, and the final application. The post emphasizes that while the stack for training and deploying fine-tuned LLMs is not yet solidified, PatentPT showcases an efficient approach using cutting-edge technologies like Deep Lake from Activeloop, Hugging Face Optimum by Intel, and Habana Gaudi HPU hardware.