An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
-
Updated
Nov 2, 2023 - Python
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
This is the repo for the MixKABRN Neural Network (Mixture of Kolmogorov-Arnold Bit Retentive Networks), and an attempt at first adapting it for training on text, and later adjust it for other modalities.
3D LiDAR Semantic Segmentation with range images and Retentive Networks
Add a description, image, and links to the retentive-network topic page so that developers can more easily learn about it.
To associate your repository with the retentive-network topic, visit your repo's landing page and select "manage topics."