- Singapore Management University
- Singapore
- www.pxzhang.cn
Stars
Framework-agnostic sliced/tiled inference + interactive UI + error analysis plots
S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models
Contains all assets to run with Moonshot Library (Connectors, Datasets and Metrics)
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
A comprehensive benchmark of deepfake detection
Benchmarking State-of-the-Art Deep Learning Software Tools
Reference implementations of MLPerf™ training benchmarks
Benchmarking Deep Learning operations on different hardware
A curation of awesome tools, documents and projects about LLM Security.
Large-Scale Chinese Data Benchmark for Face Video Anti-Forgery Identification
Universal and Transferable Attacks on Aligned Language Models
Papers and resources related to the security and privacy of LLMs 🤖
Project HashClash - MD5 & SHA-1 cryptanalysis
SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
An Open-Source Framework for Prompt-Learning.
A Paperlist of Adversarial Attack on Object Detection
Open deep learning compiler stack for CPU, GPU and specialized accelerators