Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models [pdf] (storage.googleapis.com)
6 points by alekandreev on April 10, 2024 | hide | past | favorite | 1 comment


Code here: https://github.com/google-deepmind/recurrentgemma

Checkpoints here for both base pre-trained model and an IT version for dialogue: https://www.kaggle.com/models/google/recurrentgemma




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: