Overview Repositories 94 Projects 0 Packages 0 Stars 1.5k

Songlin Yang sustcsonglin

ML & NLP Research. PhD student @ MIT CSAIL

637 followers · 206 following

MIT
Cambridge
17:39 (UTC -04:00)
https://sustcsonglin.github.io/
@SonglinYang4

Highlights

Pro

flash-linear-attention Public

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

natural-language-processing machine-learning-systems large-language-models

Python 1,262 66 MIT License Updated Oct 14, 2024
sustcsonglin.github.io Public

HTML MIT License Updated Sep 26, 2024
TN-PCFG Public

Source code for the NAACL 2021 paper "PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols" and the ACL 2021 main conference paper "Neural Bilexicalized PCFG Induction"

Python 44 6 Updated Mar 18, 2024
transformers_ssm_copy Public
Forked from sjelassi/transformers_ssm_copy

Python 1 MIT License Updated Feb 26, 2024
zoology Public
Forked from HazyResearch/zoology

Understand and test language model architectures on synthetic tasks.

Python 1 Apache License 2.0 Updated Feb 23, 2024
mamba-triton Public

Python 42 2 Updated Jan 28, 2024
mamba.py Public
Forked from alxndrTL/mamba.py

An efficient Mamba implementation in PyTorch and MLX.

Python 1 Updated Jan 25, 2024
Academic-project-page-template Public template
Forked from eliahuhorwitz/Academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript Updated Jan 22, 2024
hyena-dna Public
Forked from HazyResearch/hyena-dna

Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena

Assembly Apache License 2.0 Updated Jan 20, 2024
lit-gpt Public
Forked from Lightning-AI/litgpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-l…

Python 2 Apache License 2.0 Updated Jan 16, 2024
stk Public
Forked from stanford-futuredata/stk

Python 1 Apache License 2.0 Updated Jan 11, 2024
TinyLlama Public
Forked from jzhang38/TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 1 Apache License 2.0 Updated Jan 10, 2024
gated_linear_attention_layer Public

Python 30 1 Updated Jan 7, 2024
nanokitchen Public
Forked from proger/nanokitchen

Parallel Associative Scan for Language Models

Python 1 Apache License 2.0 Updated Jan 2, 2024
cutlass-kernels Public
Forked from ColfaxResearch/cutlass-kernels

Cuda MIT License Updated Dec 20, 2023
mamba Public
Forked from state-spaces/mamba

Python Apache License 2.0 Updated Dec 4, 2023
cuda-playground Public

Cuda 1 Updated Oct 17, 2023
FlagAttention Public
Forked from FlagOpen/FlagAttention

A collection of memory efficient attention operators implemented in the Triton language.

Python 2 Other Updated Oct 13, 2023
streaming-llm Public
Forked from mit-han-lab/streaming-llm

Efficient Streaming Language Models with Attention Sinks

Python 1 MIT License Updated Oct 5, 2023
stack-attention Public
Forked from bdusell/stack-attention

Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"

Python Updated Oct 4, 2023
safari Public
Forked from HazyResearch/safari

Convolutions for Sequence Modeling

Assembly 1 Apache License 2.0 Updated Sep 29, 2023
disco-pointer Public

Official implementation of the ACL 2023 paper "Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span Selection"

Python 13 Updated Aug 25, 2023
flash-linear-rnn Public

Implementations of various linear RNN layers using PyTorch and Triton

Python 44 1 Updated Aug 4, 2023
m2 Public
Forked from HazyResearch/m2

Monarch Mixer

Assembly Updated Jul 25, 2023
s5-pytorch Public
Forked from i404788/s5-pytorch

PyTorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)

Python Mozilla Public License 2.0 Updated Jun 25, 2023
SGEMM_CUDA Public
Forked from siboehm/SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Cuda Updated Jun 13, 2023
sustcsonglin_old.github.io Public
Forked from imfing/vuepress-homepage

📄 Elegant & friendly homepage (bio, tech portfolio, resume, doc...) template with Markdown and VuePress

HTML 1 Updated Jun 4, 2023
S5 Public
Forked from lindermanlab/S5

Python MIT License Updated May 28, 2023
BeamTreeRecursiveCells Public
Forked from JRC1995/BeamTreeRecursiveCells

Python MIT License Updated May 27, 2023
state-spaces Public
Forked from state-spaces/s4

Sequence Modeling with Structured State Spaces

Jupyter Notebook Apache License 2.0 Updated May 25, 2023
