Skip to content
@whyNLP

whyNLP

NLP research projects for Haoyi Wu.

Popular repositories Loading

  1. LCKV LCKV Public

    Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance. Accepted to ACL 2024.

    Python 110 6

  2. Conic10K Conic10K Public

    Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.

    Python 22 2

  3. Probabilistic-Transformer Probabilistic-Transformer Public

    A probabilitic model for contextual word representation. Accepted to ACL2023 Findings.

    Python 14 2

  4. tinyllama tinyllama Public

    A side project that follows all the acceleration tricks in tinyllama, with the minimal modification to the huggingface transformers code.

    Python 10

  5. nni-slurm nni-slurm Public

    Forked from microsoft/nni

    A patch for NNI with slurm and W&B.

    Python 6

  6. tinyllama-zh tinyllama-zh Public

    A side project that pretrains a tinyllama on Chinese corpora, with the minimal modification to the huggingface transformers code.

    Python 6 1

Repositories

Showing 6 of 6 repositories
  • LCKV Public

    Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance. Accepted to ACL 2024.

    whyNLP/LCKV’s past year of commit activity
    Python 110 6 0 1 Updated Jun 28, 2024
  • tinyllama Public

    A side project that follows all the acceleration tricks in tinyllama, with the minimal modification to the huggingface transformers code.

    whyNLP/tinyllama’s past year of commit activity
    Python 10 0 2 0 Updated May 20, 2024
  • tinyllama-zh Public

    A side project that pretrains a tinyllama on Chinese corpora, with the minimal modification to the huggingface transformers code.

    whyNLP/tinyllama-zh’s past year of commit activity
    Python 6 MIT 1 0 0 Updated Mar 11, 2024
  • Conic10K Public

    Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.

    whyNLP/Conic10K’s past year of commit activity
    Python 22 MIT 2 0 0 Updated Dec 6, 2023
  • Probabilistic-Transformer Public

    A probabilitic model for contextual word representation. Accepted to ACL2023 Findings.

    whyNLP/Probabilistic-Transformer’s past year of commit activity
    Python 14 MIT 2 0 0 Updated Oct 22, 2023
  • nni-slurm Public Forked from microsoft/nni

    A patch for NNI with slurm and W&B.

    whyNLP/nni-slurm’s past year of commit activity
    Python 6 MIT 1,841 1 0 Updated Apr 16, 2023

Top languages

Loading…

Most used topics

Loading…