Some enterprises have trained small specialized models based on proprietary data...

Some enterprises have trained small specialized models based on proprietary data.

https://www.maginative.com/article/nvidia-leverages-ai-to-as...

> NVIDIA researchers customized LLaMA by training it on 24 billion tokens derived from internal documents, code, and other textual data related to chip design. This advanced “pretraining” tuned the model to understand the nuances of hardware engineering. The team then “fine-tuned” ChipNeMo on over 1,000 real-world examples of potential assistance applications collected from NVIDIA’s designers.

2023 paper, https://research.nvidia.com/publication/2023-10_chipnemo-dom...

> Our results show that these domain adaptation techniques enable significant LLM performance improvements over general-purpose base models across the three evaluated applications, enabling up to 5x model size reduction with similar or better performance on a range of design tasks.

2024 paper, https://developer.nvidia.com/blog/streamlining-data-processi...

> Domain-adaptive pretraining (DAPT) of large language models (LLMs) is an important step towards building domain-specific models. These models demonstrate greater capabilities in domain-specific tasks compared to their off-the-shelf open or commercial counterparts.