How We Finetuned a Large Language Model to Search Patents & Generate New Patents
This blog post describes the technical journey behind building PatentPT, a fully custom LLM-based retrieval-augmented generation and search app. It covers PatentPT's features, its technical architecture, dataset creation, domain training, finetuning large language models for patent generation, creating custom featurizers, standing up search indices, deploying search APIs, deploying LLM inference APIs, and the final application. The post emphasizes that although the stack for training and deploying finetuned LLMs has not yet solidified, PatentPT showcases an efficient approach using cutting-edge technologies like Deep Lake from Activeloop, Hugging Face Optimum by Intel, and Habana Gaudi HPU hardware.
Company
Activeloop
Date published
Jan. 15, 2024
Author(s)
Jacob Solawetz
Word count
1616
Language
English
Hacker News points
4