Skip to content
View catid's full-sized avatar

Highlights

  • Pro

Block or report catid

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • quicsend Public

    quicsend :: Super-fast Internet-ready file transfer right from Python

    C++ 2 BSD 3-Clause "New" or "Revised" License Updated Oct 2, 2024
  • Cuda Mozilla Public License 2.0 Updated Sep 15, 2024
  • lllm Public

    Latent Large Language Models

    Python 16 Updated Aug 24, 2024
  • Implementation of Google's SELF-DISCOVER

    Python 275 31 BSD 3-Clause "New" or "Revised" License Updated Aug 9, 2024
  • TextWorld LLM Benchmark

    Python 3 BSD 3-Clause "New" or "Revised" License Updated Aug 9, 2024
  • longhair Public

    Longhair : O(N^2) Cauchy Reed-Solomon Block Erasure Code for Small Data

    C++ 158 35 Updated Aug 9, 2024
  • quiche Public

    Forked from cloudflare/quiche

    🥧 Savoury implementation of the QUIC transport protocol and HTTP/3

    Rust BSD 2-Clause "Simplified" License Updated Aug 5, 2024
  • Python package for compressing floating-point PyTorch tensors

    Cuda 10 1 BSD 3-Clause "New" or "Revised" License Updated Jul 22, 2024
  • dataloader Public

    High-performance tokenized language data-loader for Python C++ extension

    C++ 12 BSD 3-Clause "New" or "Revised" License Updated Jul 22, 2024
  • cuSZp Public

    Forked from szcompressor/cuSZp
    Cuda Other Updated Jul 3, 2024
  • Accessible large language models via k-bit quantization for PyTorch.

    Python MIT License Updated Jun 26, 2024
  • dora Public

    Implementation of DoRA

    Python 281 18 MIT License Updated Jun 7, 2024
  • Fix for CUDA out of memory

    Python 1 Apache License 2.0 Updated Jun 2, 2024
  • mirage Public

    Forked from mirage-project/mirage

    A multi-level tensor algebra superoptimizer

    C++ Apache License 2.0 Updated May 10, 2024
  • exllamav2 Public

    Forked from turboderp/exllamav2

    Add repeat-layer feature to exllamav2

    Python MIT License Updated May 6, 2024
  • tonk Public

    Tonk : Reliable UDP (rUDP) Network Library and Infinite Window Erasure Code

    C++ 104 10 BSD 3-Clause "New" or "Revised" License Updated Apr 25, 2024
  • oaillama3 Public

    Simple setup to self-host LLaMA3-70B model with an OpenAI API

    17 1 Updated Apr 24, 2024
  • PruneMe Public

    Forked from arcee-ai/PruneMe

    Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models

    Python Updated Apr 23, 2024
  • Fixes to get it working for LLaMA3

    Python GNU General Public License v3.0 Updated Apr 22, 2024
  • AQLM Public

    Fixes for AQLM

    Python 6 Apache License 2.0 Updated Apr 21, 2024
  • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

    Python Apache License 2.0 Updated Apr 20, 2024
  • Using DeepSpeed and Nvidia DALI to train various models to solve CIFAR-10

    Python 3 Updated Apr 16, 2024
  • gpt-neox Public

    Forked from EleutherAI/gpt-neox

    An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

    Python Apache License 2.0 Updated Apr 15, 2024
  • pytorch Public

    Forked from pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python Other Updated Apr 15, 2024
  • Python Updated Apr 14, 2024
  • swe_agent_playground

    1 Updated Apr 2, 2024
  • bitnet_cpu Public

    Experiments with BitNet inference on CPU

    C++ 46 2 Updated Apr 1, 2024
  • Chainlit AI UI with Anthropic Backend

    Python 3 Updated Mar 31, 2024
  • ansible Public

    Ansible scripts to set up my GPU cluster at home

    Shell Updated Mar 25, 2024
  • leopard Public

    Leopard-RS : O(N Log N) MDS Reed-Solomon Block Erasure Code for Large Data

    C++ 140 24 BSD 3-Clause "New" or "Revised" License Updated Mar 23, 2024