- Mila, Université de Montréal
- Montreal, QC, Canada
- https://hiroki11x.github.io/
- @_hiroki11x
- in/hiroki11x
Stars
Exploring differentiation with respect to hyperparameters
For optimization algorithm research and development. Submodule & Python 3.8 ready
Medical Graph RAG: Graph RAG for Medical Data
Prov-GigaPath: A whole-slide foundation model for digital pathology from real-world data
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Exploring the GLUE benchmark and fine-tuning tasks on a pre-trained BERT model using Hugging Face on PyTorch.
This helps you submit multi-node, multi-GPU jobs on Slurm with torchrun (a minimal launch sketch follows this list)
Code for the paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"
Code for a theoretical study of learning rate scheduling in non-convex problems.
94% on CIFAR-10 in 2.6 seconds 💨 96% in 27 seconds
Granite Time Series Cookbook
A tiny neural network for the CIFAR-10 dataset
General tips to drive your research at Mila
A curated list of awesome Distributed Deep Learning resources.
Efficient Triton Kernels for LLM Training
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
This is a book for getting started with Phi-3, a family of open-source AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs).
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang
Towards Understanding Variants of Invariant Risk Minimization through the Lens of Calibration (TMLR 2024)
Implementation of faithfulness-measurable masked language models
Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)
A native PyTorch library for large model training
Model Stock: All we need is just a few fine-tuned models
🌱🚿🔆
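For the Slurm + torchrun entry above, here is a minimal sketch of the kind of per-rank script such a launcher runs; the sbatch and torchrun command lines in the comments and the file name train.py are illustrative assumptions based on standard torchrun/Slurm conventions, not that repository's actual contents:

```python
# Minimal PyTorch DDP script of the kind torchrun starts on every rank.
# A typical (illustrative) Slurm submission might look like:
#   sbatch --nodes=2 --gpus-per-node=4 job.sh
# where job.sh launches:
#   torchrun --nnodes=2 --nproc-per-node=4 \
#            --rdzv-backend=c10d --rdzv-endpoint=$MASTER_ADDR:29500 train.py
import os

import torch
import torch.distributed as dist


def main():
    # torchrun sets RANK, LOCAL_RANK, and WORLD_SIZE for each process.
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)
    dist.init_process_group(backend="nccl")  # NCCL backend for multi-GPU

    # Sanity check: each rank contributes its rank index, and an all-reduce
    # sums the values across every GPU on every node.
    t = torch.tensor([dist.get_rank()], dtype=torch.float32, device="cuda")
    dist.all_reduce(t, op=dist.ReduceOp.SUM)
    if dist.get_rank() == 0:
        print(f"world_size={dist.get_world_size()}, sum of ranks={t.item()}")

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```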