Skip to content
View lastdefiance20's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report lastdefiance20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

tiny vision language model

Jupyter Notebook 4,574 406 Updated Jul 22, 2024

An open-source implementation of LLaVA-NeXT.

Python 141 4 Updated Jun 12, 2024

RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.

Python 230 17 Updated Jul 22, 2024

LLM101n: Let's build a Storyteller

24,758 1,297 Updated Jul 21, 2024

Large Action Model framework to develop AI Web Agents

Python 5,080 438 Updated Jul 22, 2024

Process Common Crawl data with Python and Spark

Python 398 85 Updated Apr 8, 2024
Python 87 3 Updated Jul 17, 2024

Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese data processing and cleaning methods in MassiveText.

Python 113 12 Updated Jun 7, 2023

TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It's …

Jupyter Notebook 2,028 159 Updated Jul 19, 2024

A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…

Python 46 5 Updated Jul 21, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 1,622 86 Updated Jul 16, 2024

This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models

Python 667 42 Updated May 2, 2024

A Generalizable World Model for Autonomous Driving

Python 405 20 Updated Jun 17, 2024
Jupyter Notebook 63 4 Updated May 26, 2024

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) avai…

Jupyter Notebook 1,363 123 Updated Jul 20, 2024

Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING 2024)

Python 14 Updated Mar 2, 2024

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

Jupyter Notebook 842 69 Updated Nov 7, 2023

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Python 8,036 565 Updated Jul 19, 2024

[ICML2024] Unified Training of Universal Time Series Forecasting Transformers

Jupyter Notebook 657 55 Updated Jul 9, 2024

[Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition

Python 77 7 Updated May 20, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 4,895 533 Updated Jul 22, 2024

Controlled Text Generation via Language Model Arithmetic

Python 193 12 Updated Jul 3, 2024

Converts text to speech in realtime

Python 1,460 129 Updated Jul 22, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 5,685 390 Updated May 29, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型

Python 4,300 327 Updated Jul 22, 2024

The human face subset of LAION-400M for large-scale face pretraining.

Python 263 17 Updated Feb 1, 2023

🔥 [PR 2023] Multi-scale Attention Guided Pose Transfer (official code).

Python 62 5 Updated Apr 30, 2024

🥤🧑🏻‍🚀Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization"

Python 214 11 Updated Jan 5, 2024
Next