Starred repositories of xiaodangao137

TypeScript 211 14 Updated Jun 21, 2024

Data annotation toolbox supports image, audio and video data.

Python 279 31 Updated Jul 19, 2024

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Python 970 90 Updated Jul 22, 2024

FastAPI Backend for a Conversational Agent using Aleph Alpha, (Azure) OpenAI, GPT4ALL, Langchain and a VectorDB

Python 90 15 Updated Jul 9, 2024

Simple Chainlit UI for running LLMs locally using Ollama and LangChain

Python 81 24 Updated Mar 21, 2024

Detect-anything (zero-shot detection + recognition) demo for SG2300X [Recognize Anything + GroundingDINO]

Python 8 2 Updated May 9, 2024

Deploys GroundingDINO open-world object detection with onnxruntime; includes both C++ and Python versions of the program

Python 27 4 Updated Feb 2, 2024

A third-party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Python 323 47 Updated Jun 25, 2024

[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing

Python 375 26 Updated Mar 24, 2024

[NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"

Jupyter Notebook 99 7 Updated Nov 9, 2023

A curated list of foundation models for vision and language tasks

694 30 Updated Jun 25, 2024

Open Source framework for voice and multimodal conversational AI

Python 2,407 144 Updated Jul 23, 2024

Vision agent

Python 961 97 Updated Jul 22, 2024

A list of AI autonomous agents

8,644 609 Updated Jul 20, 2024

Python 1,314 118 Updated Jul 21, 2024

🤖 Awesome list of AI Agents

192 18 Updated Jul 1, 2024

This repository contains hand-curated resources for prompt engineering, with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.

Python 3,542 316 Updated Jul 5, 2024

official code for "Large Language Models as Optimizers"

Python 329 32 Updated Mar 30, 2024

LLM interview notes and answers: this repository mainly records interview questions and reference answers for large language model (LLM) algorithm engineers

1,093 216 Updated Dec 14, 2023

This repository mainly records interview questions for large language model (LLM) algorithm engineers

1,196 94 Updated Mar 31, 2024

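Interview questions for LLM algorithm positions (with answers): explanations of common questions and concepts. "LLM interview questions", "algorithm position interviews", "common interview questions", "LLM algorithm interviews", "LLM application fundamentals"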

Jupyter Notebook 105 4 Updated Jun 10, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,054 39 Updated Jul 14, 2024

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 434 23 Updated Jul 5, 2024

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

475 28 Updated Jul 22, 2024

A framework for prompt tuning using Intent-based Prompt Calibration

Python 1,911 156 Updated Jul 21, 2024

Build Conversational AI in minutes ⚡️

TypeScript 6,276 802 Updated Jul 22, 2024

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Python 8,038 565 Updated Jul 19, 2024

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 4,629 396 Updated May 9, 2024

[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Python 980 79 Updated Jul 2, 2024