[2307.08621] Retentive Network: A Successor to Transformer for Large Language Models