ERAG

Overview

ERAG is an advanced system that combines lexical, semantic, text, and knowledge graph searches with conversation context to provide accurate and contextually relevant responses. This tool processes various document types, creates embeddings, builds knowledge graphs, and uses this information to answer user queries intelligently. It also includes modules for interacting with web content, GitHub repositories, and various language models.

Key Features

Multi-modal Document Processing: Handles DOCX, PDF, TXT, and JSON files with intelligent chunking and table of contents extraction.
Advanced Embedding Generation: Creates and manages embeddings for efficient semantic search using sentence transformers, with support for batch processing and caching.
Knowledge Graph Creation: Builds and utilizes a knowledge graph for enhanced information retrieval using spaCy and NetworkX.
Multi-API Support: Integrates with Ollama, LLaMA, and Groq APIs for flexible language model deployment.
Retrieval-Augmented Generation (RAG): Combines retrieved context with language model capabilities for improved responses.
Web Content Processing: Implements real-time web crawling, content extraction, and summarization.
Query Routing: Intelligently routes queries to the most appropriate subsystem based on content relevance and query complexity.
Server Management: Provides a GUI for managing local LLaMA.cpp servers, including model selection and server configuration.
Customizable Settings: Offers a wide range of configurable parameters through a graphical user interface and a centralized settings management system.
Advanced Search Utilities: Implements lexical, semantic, graph-based, and text search methods with configurable weights and thresholds.
Conversation Context Management: Maintains and utilizes conversation history for more coherent and contextually relevant responses.
GitHub Repository Analysis: Provides tools for analyzing and summarizing GitHub repositories, including code analysis, dependency checking, and code smell detection.
Web Summarization: Offers capabilities to summarize web content based on user queries.
Interactive Model Chat: Allows direct interaction with various language models for general conversation and task completion.
Debug and Logging Capabilities: Provides comprehensive logging and debug information for system operations and search results.
Color-coded Console Output: Enhances user experience with color-coded console messages for different types of information.

System Architecture

ERAG is composed of several interconnected components:

File Processing: Handles document upload and processing, including table of contents extraction.
Embedding Utilities: Manages the creation and retrieval of document embeddings.
Knowledge Graph: Creates and maintains a graph representation of document content and entity relationships.
RAG System: Implements the core retrieval-augmented generation functionality.
Query Router: Analyzes queries and routes them to the appropriate subsystem.
Server Manager: Handles the configuration and management of local LLaMA.cpp servers.
Settings Manager: Centralizes system configuration and provides easy customization options.
Search Utilities: Implements various search methods to retrieve relevant context for queries.
API Integration: Provides a unified interface for interacting with different language model APIs.
Talk2Model: Enables direct interaction with language models for general queries and tasks.
Talk2URL: Allows interaction with web content, including crawling and question-answering based on web pages.
WebRAG: Implements a web-based retrieval-augmented generation system for answering queries using internet content.
WebSum: Provides tools for summarizing web content based on user queries.
Talk2Git: Offers capabilities for analyzing and summarizing GitHub repositories.

Installation

Clone the repository:

git clone https://github.com/EdwardDali/erag.git && cd erag

Install required Python dependencies:
```
pip install -r requirements.txt
```

Download required spaCy and NLTK models:

python -m spacy download en_core_web_sm
python -m nltk.downloader punkt

Install Ollama (optional, for using Ollama API):
- Linux/macOS: curl https://ollama.ai/install.sh | sh
- Windows: Visit https://ollama.ai/download and follow installation instructions
Set up environment variables:
- Create a .env file in the project root
- Add the following variables (if applicable):
```
GROQ_API_KEY=your_groq_api_key
GITHUB_TOKEN=your_github_token
```

Usage

Start the ERAG GUI:
```
python main.py
```
Use the GUI to:
- Upload and process documents
- Generate embeddings
- Create knowledge graphs
- Configure system settings
- Manage local LLaMA.cpp servers
- Run various RAG operations (Talk2Doc, WebRAG, etc.)

Configuration

Customize ERAG's behavior through the Settings tab in the GUI or by modifying settings.py. Key configurable options include:

Chunk sizes and overlap for document processing
Embedding model selection and batch size
Knowledge graph parameters (similarity threshold, minimum entity occurrence)
API selection (Ollama, LLaMA, Groq) and model choices
Search method weights and thresholds
RAG system parameters (conversation context size, update threshold)
Server configuration for local LLaMA.cpp instances
Web crawling and summarization settings
GitHub analysis parameters

Advanced Features

Query Routing: Automatically determines the best subsystem to handle a query based on its content and complexity.
Hybrid Search: Combines lexical, semantic, graph-based, and text search methods for comprehensive context retrieval.
Dynamic Embedding Updates: Automatically updates embeddings as new content is added to the system.
Conversation Context Management: Maintains a sliding window of recent conversation history for improved contextual understanding.
Web Content Analysis: Crawls and analyzes web pages to answer queries and generate summaries.
GitHub Repository Analysis: Provides static code analysis, dependency checking, project summarization, and code smell detection for GitHub repositories.
Multi-model Support: Allows interaction with various language models through a unified interface.

Troubleshooting

Ensure all dependencies are correctly installed.
Check console output for detailed error messages (color-coded for easier identification).
Verify API keys and tokens are correctly set in the .env file.
For performance issues, adjust chunk sizes, batch processing parameters, or consider using a GPU.
If using local LLaMA.cpp servers, ensure the correct model files are available and properly configured.

Contact

For support or queries, please open an issue on the GitHub repository or contact the project maintainers.

Name		Name	Last commit message	Last commit date
Latest commit History 234 Commits
docs		docs
src		src
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ERAG

Overview

Key Features

System Architecture

Installation

Usage

Configuration

Advanced Features

Troubleshooting

Contact

About

Releases

Packages

Languages

jjhw/erag

Folders and files

Latest commit

History

Repository files navigation

ERAG

Overview

Key Features

System Architecture

Installation

Usage

Configuration

Advanced Features

Troubleshooting

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages