Skip to content
View dukeraphaelng's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report dukeraphaelng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[AISTATS 2023] Error Estimation for Random Fourier Features

Python 3 Updated Apr 19, 2023
Python 12 Updated Aug 31, 2023

Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012

Python 50 7 Updated Apr 6, 2022

An implementation of masked language modeling for Pytorch, made as concise and simple as possible

Python 174 24 Updated Aug 9, 2023

Implementation of the transformer proposed in "Building Blocks for a Complex-Valued Transformer Architecture"

Python 54 3 Updated Oct 13, 2023

Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch

Python 847 106 Updated Oct 30, 2023

Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch

Python 477 39 Updated Jul 2, 2024

Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways

Python 818 82 Updated Nov 9, 2022

To eventually become an unofficial Pytorch implementation / replication of Alphafold2, as details of the architecture get released

Python 1,534 254 Updated Oct 29, 2022

A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch

Python 220 43 Updated Jun 12, 2023

Reformer, the efficient Transformer, in Pytorch

Python 2,082 255 Updated Jun 21, 2023

Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts

Python 100 3 Updated Jul 16, 2023

Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk

Python 45 1 Updated Jul 16, 2023

Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch

Python 621 46 Updated Jul 17, 2023

Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"

Python 347 34 Updated Jul 18, 2023

Implementation of Linformer for Pytorch

Python 237 23 Updated Jan 5, 2024

Implementation of Nyström Self-attention, from the paper Nyströmformer

Python 119 15 Updated Jan 20, 2024

Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deformable Attention"

Python 266 29 Updated Apr 23, 2024

An implementation of local windowed attention for language modeling

Python 353 37 Updated Jul 8, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 18,967 2,880 Updated Jul 20, 2024

Implementation of the convolutional module from the Conformer paper, for use in Transformers

Python 346 54 Updated May 17, 2023

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Python 4,407 374 Updated Jul 20, 2024

The Electronic World Atlas of Varieties of English

TeX 6 1 Updated Apr 21, 2021

Chaospy - Toolbox for performing uncertainty quantification.

Python 433 86 Updated Jul 17, 2024

A Python wrapper around the Tasmanian sparse grid library

C++ 3 2 Updated Feb 26, 2014
Jupyter Notebook 2 1 Updated Feb 14, 2022

Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Python 53 9 Updated Apr 19, 2022

Long Range Arena for Benchmarking Efficient Transformers

Python 1 2 Updated Jan 5, 2023

[DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations

Python 745 165 Updated Aug 3, 2021
Next