GitHub - AKSW/LLMDatasetGenerator: LLM-based dataset generator for KGQA on user-defined knowledge graphs

Queryfy - Generating datasets for LLM finetuning from Knowledge Graphs

This tool takes a knowledge graph in Turtle (ttl) format as input and generates KGQA (knowledge graph question answering) datasets from it. It uses multiple LLMs (see below) to generate questions suited to that specific knowledge graph, together with the expected answers for reference and the corresponding SPARQL queries.
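
To make the description above concrete, here is a minimal sketch of what one dataset entry could look like. The class name `KGQAExample`, its field names, the JSON Lines layout, and the `http://example.org/` vocabulary are all assumptions made for illustration; the actual output format of the pipeline may differ.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class KGQAExample:
    """One hypothetical dataset entry; field names are illustrative assumptions."""
    question: str          # natural-language question generated by an LLM
    sparql: str            # SPARQL query expected to answer the question
    expected_answer: list  # reference answer(s) for the question


def to_jsonl(examples):
    """Serialize examples as JSON Lines, one record per line."""
    return "\n".join(json.dumps(asdict(e), ensure_ascii=False) for e in examples)


# Made-up example data; the example.org IRIs are placeholders, not taken from org.ttl.
example = KGQAExample(
    question="Who is the head of the research department?",
    sparql=(
        "PREFIX ex: <http://example.org/> "
        "SELECT ?person WHERE { ex:ResearchDepartment ex:headedBy ?person }"
    ),
    expected_answer=["http://example.org/alice"],
)

print(to_jsonl([example]))
```

Keeping the question, the query, and the reference answer in a single record means each entry can be checked by running the query against the graph and comparing the result with the stored answer.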

This work was done for research purposes because, at the time of writing, there was no way to automatically generate training or finetuning datasets from arbitrary knowledge graphs.

We hope this prototype opens up new areas of research, and we look forward to contributions.

Execution

Requirements:

- Python
- transformers
- a capable GPU

Steps:

1. Edit config.yaml to your liking.
2. Run python pipeline.py.

The script creates one folder per pipeline step, each containing that step's results.
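
After a run, the result folders can be inspected with a short helper like the one below. This is only a sketch: the function `list_step_outputs` is hypothetical and not part of the repository, and it simply counts files in every non-hidden subfolder of a directory, so it will also report unrelated folders such as img and paper.

```python
from pathlib import Path


def list_step_outputs(root="."):
    """Return {folder_name: number_of_files} for each non-hidden subfolder of root."""
    summary = {}
    for folder in sorted(Path(root).iterdir()):
        if folder.is_dir() and not folder.name.startswith("."):
            # Count files at any depth below the folder.
            summary[folder.name] = sum(1 for p in folder.rglob("*") if p.is_file())
    return summary


if __name__ == "__main__":
    for name, count in list_step_outputs().items():
        print(f"{name}: {count} file(s)")
```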

