ce107: 133 results for forked starred repositories

Slurm: A Highly Scalable Workload Manager

C 2 3 Updated Sep 29, 2024

Library for specialized dense and sparse matrix operations, and deep learning primitives.

C 1 Updated Sep 25, 2024

The examples for the courses on advanced programming for scientific computing (AMSC and APSC), Politecnico di Milano

Shell 10 7 Updated Sep 27, 2024

Utilities for Dask and CUDA interactions

Python 4 Updated Apr 25, 2024

Using well-known CNN models in PyTorch, we run benchmarks on various GPUs.

Python 1 Updated Aug 11, 2024

A fast and scalable x86-64 multicore simulator

C++ 3 Updated Sep 5, 2022

The simplest, fastest repository for training/finetuning medium-sized GPTs. Now, with kittens!

Makefile 45 3 Updated Aug 4, 2024

Slurm: A Highly Scalable Workload Manager

C 1 Updated May 20, 2024

Extends the original llama.cpp repo to support the RedPajama model.

C 117 16 Updated Sep 3, 2024
Python 1 Updated Nov 3, 2015
Cython 2 1 Updated Jun 26, 2024

Slurm HPC workload manager web JS dashboard and JSON REST API

JavaScript 1 1 Updated Feb 16, 2016

Custom Spawner for Jupyterhub to start slurm jobs when users log in

Python 1 Updated Aug 9, 2017

Add GPU utilization stats to the Slurm batch scheduler accounting DB

Python 2 1 Updated Sep 17, 2020
C++ 1 Updated Jan 12, 2023

A Python package for exporting the weights and biases of neural networks.

Python 3 1 Updated Aug 13, 2024
Python 23 1 Updated Mar 23, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 36 41 Updated Oct 8, 2024

A simple memory (or cache) bandwidth benchmark that isn't stream

C++ 1 Updated Oct 27, 2020

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 68 6 Updated Jul 20, 2023

Best practice for training LLaMA models in Megatron-LM

Python 614 50 Updated Jan 2, 2024

Ongoing research training transformer models at scale

Python 1 2 Updated Aug 10, 2024

vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs

Python 87 5 Updated Sep 24, 2024
2 1 Updated Jun 18, 2024

Fast and memory-efficient exact attention

Python 1 Updated Jul 31, 2023

Embedded Scalable Platforms: Heterogeneous SoC architecture and IP integration made easy

C 1 Updated Sep 19, 2024

pocl - Portable Computing Language

C 1 Updated Jun 20, 2023

Open source code for AlphaFold.

Python 5 4 Updated Feb 20, 2024