Skip to content
View yiminglin-ai's full-sized avatar

Organizations

@ibug-group @iBUG-HCI2
Block or Report

Block or report yiminglin-ai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

StyleGAN2-ADA - Official PyTorch implementation

Python 4,028 1,154 Updated May 10, 2024

PhotoMaker [CVPR 2024]

Jupyter Notebook 9,077 719 Updated Jul 31, 2024

Make bilingual epub books Using AI translate

Python 7,192 1,031 Updated Jul 24, 2024

一本 GPT4 生成的单词书📚,超过 8000 个单词分析,涵盖了词义、例句、词根词缀、变形、文化背景、记忆技巧和小故事

HTML 3,010 197 Updated Jul 7, 2024
Python 640 86 Updated Jul 2, 2024

人人都能用英语

TypeScript 23,360 3,608 Updated Aug 6, 2024

LLM101n: Let's build a Storyteller

26,575 1,436 Updated Aug 1, 2024

Omnivore is a complete, open source read-it-later solution for people who like reading.

TypeScript 11,957 608 Updated Aug 6, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,642 102 Updated Jul 29, 2024

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Python 1,013 174 Updated Oct 27, 2023

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Python 1,148 128 Updated Jul 27, 2024

4M: Massively Multimodal Masked Modeling

Python 1,478 87 Updated Jul 17, 2024

ML-powered speech recognition directly in your browser

TypeScript 1,401 159 Updated Jun 10, 2024

A Survey on Deepfake Generation and Detection

194 3 Updated Aug 3, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型

Python 4,658 360 Updated Aug 2, 2024

Your image is almost there!

Python 7,038 411 Updated Jul 26, 2024

The official Meta Llama 3 GitHub site

Python 25,261 2,789 Updated Jul 31, 2024

Code repository for T2V-Turbo

Python 150 13 Updated Jun 25, 2024

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,242 219 Updated Jun 14, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 8,396 588 Updated Aug 6, 2024

Reading list for research topics in multimodal machine learning

5,747 831 Updated Jun 19, 2024

Comprehensive benchmarks and evaluations of Large Language Models (LLMs) with a focus on hardware usage, generation speed, and memory requirements.

Python 12 1 Updated Aug 31, 2023

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) avai…

Jupyter Notebook 1,492 142 Updated Aug 6, 2024

A self-organizing file system with llama 3

Jupyter Notebook 4,683 279 Updated Jun 18, 2024

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 177 49 Updated Apr 20, 2024

A curated list of awesome papers on dataset distillation and related applications.

HTML 1,290 121 Updated Aug 4, 2024

Approaching (Almost) Any Machine Learning Problem

7,034 1,023 Updated Mar 25, 2023

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 12,216 1,224 Updated Aug 5, 2024

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 266,198 45,092 Updated Jul 30, 2024
Next