Skip to content
View flazerain's full-sized avatar

Block or report flazerain

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 1 Updated Oct 15, 2024

NeurIPS23 "Flow Factorized Representation Learning"

Python 32 1 Updated Oct 7, 2024

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 1,868 158 Updated Oct 17, 2024

Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://huggingface.co/spaces/pseudotensor/open-strawberry

Python 121 11 Updated Oct 15, 2024

AIDE: the Machine Learning CodeGen Agent

Python 459 42 Updated May 31, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 13,532 1,172 Updated Oct 15, 2024

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 427 40 Updated Oct 17, 2024

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 3,301 205 Updated Oct 5, 2024

The First Multimodal Seach Engine Pipeline and Benchmark for LMMs

Python 377 27 Updated Oct 6, 2024
Jupyter Notebook 147 37 Updated Jul 19, 2024

Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts

Python 178 13 Updated Oct 8, 2024

Empowering RAG with a memory-based data interface for all-purpose applications!

Python 1,093 66 Updated Sep 29, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,670 153 Updated Oct 4, 2024

Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)

163 11 Updated Sep 22, 2024

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 12,172 1,112 Updated Oct 19, 2024

LLM based autonomous agent that conducts in-depth web research on any given topic

Python 14,537 1,919 Updated Oct 20, 2024

Kolmogorov-Arnold Transformer: A PyTorch Implementation with CUDA kernel

Python 550 30 Updated Oct 8, 2024

StoryMaker: Towards consistent characters in text-to-image generation

Python 517 43 Updated Sep 26, 2024

OpenMusic: SOTA Text-to-music (TTM) Generation

Python 454 46 Updated Oct 18, 2024

Writing AI Conference Papers: A Handbook for Beginners

1,172 39 Updated Sep 26, 2024

[EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner

Python 105 7 Updated Oct 14, 2024

Tracking any thing based on text prompt

Python 2 Updated Jun 19, 2023

A benchmark for cross-domain few-shot object detection (ECCV24 paper: Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector)

Python 50 6 Updated Oct 18, 2024

[ICML 2024] "MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts"

Python 47 5 Updated Aug 5, 2024

OpenMMLab's next-generation platform for general 3D object detection.

Python 5,250 1,536 Updated Jul 10, 2024

State-of-the-Art zero-shot voice conversion & singing voice conversion with in context learning

Python 418 43 Updated Oct 18, 2024

Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need (IJCV 2024)

Python 113 20 Updated Aug 25, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,421 151 Updated Sep 24, 2024
Next