GitHub - minuva/ph-prompt-attack-detect-plugin: Posthog plugin for prompt attack detection

Prompt Attacks Plugin

This plugin allows you to scan attacks from user prompts and agent responses in PostHog-LLM and PostHog-LLM-Lite. To use this plugin in PostHog-LLM, simply install the plugin's backend. The plugin links the PostHog data ingestion process with the hosted detection algorithm.

How to install in PostHog-LLM

Open PostHog-LLM.
Open the Data pipeline page from the sidebar.
Head to the Manage apps tab.
Install app advanced button
"Install from GitHub, GitLab or npm" using this repository's URL.
Click on the Apps tab, and it will prompt you for the API_SERVER_URL (e.g., http:https://localhost:9612, or http:https://prompt-attack:9612 if running in a Docker container in the postlang network).

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.eslintrc.js		.eslintrc.js
.gitignore		.gitignore
.prettierrc		.prettierrc
README.md		README.md
index.js		index.js
logo.png		logo.png
package-lock.json		package-lock.json
package.json		package.json
plugin.json		plugin.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Prompt Attacks Plugin

How to install in PostHog-LLM

About

Releases

Packages

Languages

minuva/ph-prompt-attack-detect-plugin

Folders and files

Latest commit

History

Repository files navigation

Prompt Attacks Plugin

How to install in PostHog-LLM

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages