minuva/ph-prompt-attack-detect-plugin

Prompt Attacks Plugin

This plugin scans user prompts and agent responses for prompt attacks in PostHog-LLM and PostHog-LLM-Lite. To use it in PostHog-LLM, install the plugin's backend. The plugin connects the PostHog data ingestion process to the hosted detection algorithm.
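
As a rough illustration of that link between ingestion and detection, the sketch below annotates an ingested event with a detection verdict. It is not the plugin's actual code: the property names `$llm_input` and `$llm_output`, the `_attack_detected` suffix, and the `Detector` signature are all assumptions made for this example, and the detector is passed in so that any client for the detection server can be plugged in.

```typescript
// Minimal event shape for this sketch; real PostHog events carry more fields.
interface LlmEvent {
  event: string;
  properties: Record<string, unknown>;
}

// Hypothetical detector signature: takes text, resolves to true when flagged.
// In practice this would wrap a request to the server at API_SERVER_URL.
type Detector = (text: string) => Promise<boolean>;

// Assumed property names; the names the plugin really reads may differ.
const INPUT_KEY = "$llm_input";
const OUTPUT_KEY = "$llm_output";

// Returns a copy of the event with one verdict per text field that is present.
async function annotateEvent(event: LlmEvent, detect: Detector): Promise<LlmEvent> {
  const annotated: Record<string, unknown> = { ...event.properties };
  for (const key of [INPUT_KEY, OUTPUT_KEY]) {
    const text = event.properties[key];
    // Only scan non-empty string fields; everything else passes through untouched.
    if (typeof text === "string" && text.length > 0) {
      annotated[`${key}_attack_detected`] = await detect(text);
    }
  }
  return { ...event, properties: annotated };
}
```

Injecting the detector keeps the annotation logic independent of how the detection server is called, which also makes it easy to exercise with a stub.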

How to install in PostHog-LLM

  1. Open PostHog-LLM.
  2. Open the Data pipeline page from the sidebar.
  3. Head to the Manage apps tab.
  4. Click the Install app advanced button.
  5. Choose "Install from GitHub, GitLab or npm" and enter this repository's URL.
  6. Open the Apps tab; the plugin will prompt you for the API_SERVER_URL (e.g., http://localhost:9612, or http://prompt-attack:9612 if running in a Docker container in the postlang network).
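
Before entering the API_SERVER_URL in step 6, it can help to check that the value is a well-formed HTTP(S) URL. The helper below is a minimal sketch for that purpose; `normalizeApiServerUrl` is a hypothetical name and is not part of the plugin.

```typescript
// Hypothetical helper, not part of the plugin: validates and normalizes
// a candidate API_SERVER_URL value.
function normalizeApiServerUrl(raw: string): string {
  // The URL constructor throws a TypeError on input it cannot parse.
  const url = new URL(raw.trim());
  if (url.protocol !== "http:" && url.protocol !== "https:") {
    throw new Error(`Unsupported protocol: ${url.protocol}`);
  }
  // Drop trailing slashes so request paths can be appended consistently.
  return `${url.origin}${url.pathname.replace(/\/+$/, "")}`;
}

console.log(normalizeApiServerUrl("http://localhost:9612/"));
console.log(normalizeApiServerUrl("http://prompt-attack:9612"));
```

Both example values from step 6 pass this check; a value with another scheme, such as ftp, is rejected with an error.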
