Skip to content
View Hiroki11x's full-sized avatar
🦙
🦙

Organizations

@jphacks @rioyokotalab @crest-deep @TITAMAS @RotaPlusPlus @Agents-NY @ArtHackDay-Plus1 @MLHPC

Block or report Hiroki11x

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Amos optimizer with JEstimator lib.

Python 81 6 Updated May 15, 2024

Exploring differentiation with respect to hyperparameters

Python 297 71 Updated Jan 15, 2016

For optimization algorithm research and development. Submodule & Py38 ready

Python 4 Updated Oct 23, 2024

Medical Graph RAG: Graph RAG for the Medical Data

Python 217 34 Updated Oct 25, 2024

Prov-GigaPath: A whole-slide foundation model for digital pathology from real-world data

Python 415 54 Updated Sep 30, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,196 216 Updated Nov 13, 2024

Exploring the GLUE benchmark and fine tuning tasks on pre-trained BERT model using Hugging Face on PyTorch..

Jupyter Notebook 2 1 Updated May 17, 2021

This helps you to submit job with multinode & multgpu in Slurm in Torchrun

Shell 8 Updated Nov 18, 2023

Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"

Jupyter Notebook 13 1 Updated Sep 13, 2024

Simple CIFAR10 ResNet example with JAX.

Python 21 2 Updated Jun 1, 2021

Codes for theoretical study of learning rate scheduling in non-convex problems.

C++ 2 Updated Feb 4, 2022

94% on CIFAR-10 in 2.6 seconds 💨 96% in 27 seconds

Python 173 9 Updated Nov 10, 2024

Granite Time Series Cookbook

Jupyter Notebook 12 4 Updated Nov 11, 2024

A tiny neural network for CIFAR-10 dataset

Python 3 Updated Apr 3, 2023

General tips to drive your research at Mila

Python 16 1 Updated May 14, 2024

A curated list of awesome Distributed Deep Learning resources.

405 84 Updated Jul 28, 2024

Efficient Triton Kernels for LLM Training

Python 3,419 199 Updated Nov 13, 2024

Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"

Python 784 109 Updated Jun 30, 2021

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 937 54 Updated Jan 30, 2024

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open sourced AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SL…

Jupyter Notebook 2,466 251 Updated Nov 11, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 8,151 1,139 Updated Nov 8, 2024

[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang

Python 165 31 Updated Dec 30, 2021

Towards Understanding Variants of Invariant Risk Minimization through the Lens of Calibration (TMLR 2024)

Python 1 Updated Jun 18, 2024

Implementation of Faithfulness measurable masked language models

Python 5 Updated Oct 16, 2023

Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)

Jupyter Notebook 116 6 Updated Jan 13, 2022

A native PyTorch Library for large model training

Python 2,597 204 Updated Nov 5, 2024

Model Stock: All we need is just a few fine-tuned models

Jupyter Notebook 92 1 Updated Sep 23, 2024

🌱🚿🔆↗️ 🌳 ENTROPY-SGD: A deep learning optimizer for biasing gradients towards wide valleys

Python 2 2 Updated Mar 26, 2020
Next