Intel Generative AI Examples (e.g., ChatQnA with RAG) on Xeon and Gaudi2

ckhened/GenAIExamples


Intel Generative AI Examples

This project provides a collection of Generative AI (GenAI) examples, such as a chatbot for question answering (ChatQnA), code generation (CodeGen), and document summarization (DocSum).


GenAI Examples

All the examples are validated on Intel platforms. In addition, these examples are:

  • Easy to use. Build end-to-end GenAI examples with ecosystem-compliant APIs.

  • Easy to customize. Swap in a different framework, LLM, embedding model, serving backend, etc.

  • Easy to deploy. Deploy the GenAI examples with high performance on Intel platforms.

Note: The support matrices below list the validated configurations. Feel free to customize them to your needs.

ChatQnA

ChatQnA is an example of a chatbot for question answering through retrieval-augmented generation (RAG).

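| Framework | LLM           | Embedding | Vector Database | Serving    | HW     | Description |
|-----------|---------------|-----------|-----------------|------------|--------|-------------|
| LangChain | NeuralChat-7B | BGE-Base  | Redis           | TGI-Habana | Gaudi2 | Chatbot     |
| LangChain | NeuralChat-7B | BGE-Base  | Chroma          | TGI-Habana | Gaudi2 | Chatbot     |
| LangChain | Mistral-7B    | BGE-Base  | Redis           | TGI-Habana | Gaudi2 | Chatbot     |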
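
To make the RAG flow concrete, here is a minimal, dependency-free sketch of the retrieve-then-prompt step. It is not the code used by ChatQnA: the bag-of-words `embed` function is a toy stand-in for an embedding model such as BGE-Base, the in-memory list stands in for a vector database such as Redis or Chroma, and all function names and the prompt template are hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy bag-of-words embedding (assumption: stands in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[term] * b[term] for term in a if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, documents, k=1):
    """Return the k documents most similar to the question."""
    query = embed(question)
    ranked = sorted(documents, key=lambda d: cosine(query, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(question, documents, k=1):
    """Augment the question with retrieved context before it is sent to an LLM."""
    context = "\n".join(retrieve(question, documents, k))
    return f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"


# Hypothetical mini knowledge base.
DOCS = [
    "Gaudi2 is an AI accelerator from Habana Labs.",
    "Redis can be used as a vector database.",
]

if __name__ == "__main__":
    print(build_prompt("What is Gaudi2?", DOCS))
```

In a real deployment the prompt produced by `build_prompt` would be passed to the served LLM; the sketch stops at prompt construction so that it runs without any model or service.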

CodeGen

CodeGen is an example of a copilot designed for code generation in Visual Studio Code.

| Framework | LLM                | Serving    | HW     | Description |
|-----------|--------------------|------------|--------|-------------|
| LangChain | Deepseek-Coder-33B | TGI-Habana | Gaudi2 | Copilot     |

DocSum

DocSum is an example of a chatbot for summarizing the content of documents or reports.

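| Framework | LLM           | Serving    | HW     | Description |
|-----------|---------------|------------|--------|-------------|
| LangChain | NeuralChat-7B | TGI-Habana | Gaudi2 | Chatbot     |
| LangChain | Mistral-7B    | TGI-Habana | Gaudi2 | Chatbot     |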
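
As a rough illustration of how a summarization request could reach the serving layer, the sketch below builds a request for the `/generate` route of Text Generation Inference (TGI). It assumes the TGI-Habana server exposes that standard route; the base URL, the prompt wording, and the function names are hypothetical, and the actual example goes through LangChain rather than calling the endpoint directly.

```python
import json
import urllib.request


def build_summary_request(document, base_url="http://localhost:8080", max_new_tokens=128):
    """Build a POST request for a TGI /generate endpoint.

    Assumptions: base_url points at a running TGI server, and the prompt
    template below is illustrative only.
    """
    payload = {
        "inputs": f"Summarize the following document:\n\n{document}\n\nSummary:",
        "parameters": {"max_new_tokens": max_new_tokens},
    }
    return urllib.request.Request(
        url=f"{base_url}/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def summarize(document, **kwargs):
    """Send the request and return the generated text (needs a reachable server)."""
    with urllib.request.urlopen(build_summary_request(document, **kwargs)) as response:
        return json.loads(response.read())["generated_text"]
```

Separating request construction from the network call keeps the payload easy to inspect and adjust, for example when changing `max_new_tokens`, before pointing it at a live server.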

SearchQnA

SearchQnA is an example of a chatbot that uses a search engine to improve question-answering quality.

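| Framework | LLM           | Serving    | HW     | Description |
|-----------|---------------|------------|--------|-------------|
| LangChain | NeuralChat-7B | TGI-Habana | Gaudi2 | Chatbot     |
| LangChain | Mistral-7B    | TGI-Habana | Gaudi2 | Chatbot     |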

VisualQnA

VisualQnA is an example of a chatbot for answering questions about images.

| Framework | LLM          | Serving    | HW     | Description |
|-----------|--------------|------------|--------|-------------|
| LangChain | LLaVA-1.5-7B | TGI-Habana | Gaudi2 | Chatbot     |
