- Seoul, Republic of Korea
- in/jaeyoon-jung-935300268
Highlights
- Pro
Block or Report
Block or report lastdefiance20
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (10)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
An open-source implementation of LLaVA-NeXT.
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
Large Action Model framework to develop AI Web Agents
Process Common Crawl data with Python and Spark
Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese data processing and cleaning methods in MassiveText.
TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It's …
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…
GPT4V-level open-source multi-modal model based on Llama3-8B
This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
A Generalizable World Model for Autonomous Driving
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) avai…
Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING 2024)
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
[ICML2024] Unified Training of Universal Time Series Forecasting Transformers
[Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Controlled Text Generation via Language Model Arithmetic
a state-of-the-art-level open visual language model | 多模态预训练模型
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型
The human face subset of LAION-400M for large-scale face pretraining.
🔥 [PR 2023] Multi-scale Attention Guided Pose Transfer (official code).
🥤🧑🏻🚀Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization"