Block or Report
Block or report xiaodangao137
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
Data annotation toolbox supports image, audio and video data.
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
FastAPI Backend for a Conversational Agent using Aleph Alpha, (Azure) OpenAI, GPT4ALL, Langchain and a VectorDB
Simple Chainlit UI for running llms locally using Ollama and LangChain
万物检测(零样本检测+识别) demo for SG2300X 【Recognize Anything + GroundingDINO】
使用onnxruntime部署GroundingDINO开放世界目标检测,包含C++和Python两个版本的程序
This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
[NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"
A curated list of foundation models for vision and language tasks
Open Source framework for voice and multimodal conversational AI
This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
official code for "Large Language Models as Optimizers"
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…
Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.
A framework for prompt tuning using Intent-based Prompt Calibration
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale