A simple key-value-artifact store

KVA is a simple key-value-artifact store designed to log and retrieve data. It is like wandb, but not so shitty. At its heart, it is an append-only JSON store with some helpers to easily retrieve data and handle files.

Examples

from kva import kva

kva.init(run_id="some-run")
kva.log(config={'foo': 'bar'})
# Oops, something was missing in the config
kva.log(config={'hello': 'world'})
kva.log(step=1, loss=42)
kva.log(step=2)
kva.log(loss=4.2)
print(kva.get(run_id="some-run").latest('config'))
# {'foo': 'bar', 'hello': 'world'}
print(kva.get(run_id="some-run").latest('config', deep_merge=False))
# {'hello': 'world'}
print(kva.get(run_id="some-run").latest('loss'))
# 4.2
print(kva.get(run_id="some-run").latest(['loss', 'step']))
# {'loss': 4.2, 'step': 1}
print(kva.get(run_id="some-run").latest('loss', index='step')) # Identical to: .latest(['loss'], index=['step'])
#    step  loss
# 0   1.0  42.0
# 1   2.0   4.2
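
The indexed table above places the loss of 4.2 under step 2, even though it was logged without a step. One reading consistent with that output is that a row without the index key is attributed to the most recent index value. The following is a minimal sketch of that reading in plain Python, not kva's implementation: `latest_by_index` is a hypothetical helper, and it returns a dict where kva returns a dataframe.

```python
def latest_by_index(rows, column, index):
    """Map each index value to the latest column value logged under it.

    Assumption: a row without the index key belongs to the most recent
    index value seen so far; rows logged before any index value are skipped.
    """
    current = None
    table = {}
    for row in rows:
        if index in row:
            current = row[index]
        if column in row and current is not None:
            table[current] = row[column]
    return table


rows = [
    {"config": {"foo": "bar"}},
    {"step": 1, "loss": 42},
    {"step": 2},
    {"loss": 4.2},
]
table = latest_by_index(rows, "loss", "step")
print(table)
# {1: 42, 2: 4.2}
```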

Setup

Install

pip install git+https://github.com/nielsrolf/kva

Configure backend

For local storage, set:

export KVA_STORAGE='~/.kva' # Default
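
Because the value is single-quoted, the shell does not expand the `~`, so the variable literally contains `~/.kva`. A minimal sketch of how a program could resolve such a setting, assuming the tilde is expanded on the Python side (`resolve_storage` is a hypothetical helper, not part of kva):

```python
import os
from pathlib import Path


def resolve_storage(environ=os.environ) -> Path:
    """Read KVA_STORAGE, fall back to ~/.kva, and expand a leading ~."""
    return Path(environ.get("KVA_STORAGE", "~/.kva")).expanduser()


# Passing a dict instead of os.environ keeps the example reproducible.
default_dir = resolve_storage({})
custom_dir = resolve_storage({"KVA_STORAGE": "/tmp/kva-data"})
```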

Using with git or git-lfs

When configured to store data locally, kva stores data in a git-friendly way:

data.jsonl
artifacts/{filehash}/filename.extension
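
The layout suggests that artifacts are addressed by a hash of the file. A minimal sketch of how such a path could be derived, assuming SHA-256 over the file contents (the hash kva actually uses is not stated here, and `artifact_path` is a hypothetical helper):

```python
import hashlib
import tempfile
from pathlib import Path


def artifact_path(storage_root: Path, source: Path) -> Path:
    """Return <storage_root>/artifacts/<filehash>/<filename> for a file.

    Assumption: the hash is SHA-256 over the file contents.
    """
    filehash = hashlib.sha256(source.read_bytes()).hexdigest()
    return storage_root / "artifacts" / filehash / source.name


with tempfile.TemporaryDirectory() as tmp:
    src = Path(tmp) / "image.png"
    src.write_bytes(b"not really a png")
    example_path = artifact_path(Path(tmp) / "store", src)
    example_parts = example_path.parts[-3:]
```

Under this assumption, identical files always map to the same directory, so logging the same artifact twice adds nothing new to the repository.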

Docs

Core methods

`kva.log(data)`

Appends dict(**data, **init_data) to the append-only database. Every value that is a kva.File (or a subclass thereof) is additionally saved.
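
To make the append-only behavior concrete, here is a minimal sketch that writes merged rows to a JSON Lines file. It is not kva's code: `append_row` is a hypothetical helper, the file name `data.jsonl` is taken from the layout described earlier, and file handling for kva.File values is left out.

```python
import json
import tempfile
from pathlib import Path


def append_row(db_file: Path, init_data: dict, **data) -> dict:
    """Append one merged row as a JSON line; earlier lines are never rewritten."""
    # dict(**a, **b) raises TypeError if a key appears in both mappings.
    row = dict(**data, **init_data)
    with db_file.open("a") as f:
        f.write(json.dumps(row) + "\n")
    return row


with tempfile.TemporaryDirectory() as tmp:
    db_file = Path(tmp) / "data.jsonl"
    append_row(db_file, {"run_id": "some-run"}, step=1, loss=42)
    append_row(db_file, {"run_id": "some-run"}, step=2)
    rows = [json.loads(line) for line in db_file.read_text().splitlines()]
```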

`kva.filter(accept_row)`

Filters the rows of the database, keeping those accepted by accept_row, and returns a kva.DB object.

`db.latest()`

Returns a view of the data in the db:

kva.DB().latest(
    columns, # Which values to get
    index=None, # If set, returns a dataframe of latest values in the db for each value of the index
    deep_merge=True # Whether or not to merge data of different rows or only select the latest row
)
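
The effect of deep_merge can be sketched in plain Python. This is an illustration that reproduces the outputs shown in the examples, not kva's implementation: `_merge` and `latest` here are stand-ins, and the index option is left out.

```python
def _merge(old, new):
    """Recursively merge new into old; for non-dict values, new wins."""
    if isinstance(old, dict) and isinstance(new, dict):
        merged = dict(old)
        for key, value in new.items():
            merged[key] = _merge(old[key], value) if key in old else value
        return merged
    return new


def latest(rows, column, deep_merge=True):
    """Return the latest value of column, optionally merging dicts across rows."""
    values = [row[column] for row in rows if column in row]
    if not values:
        return None
    if not deep_merge:
        return values[-1]
    result = values[0]
    for value in values[1:]:
        result = _merge(result, value)
    return result


rows = [
    {"config": {"foo": "bar"}},
    {"config": {"hello": "world"}},
    {"step": 1, "loss": 42},
    {"loss": 4.2},
]
print(latest(rows, "config"))                    # {'foo': 'bar', 'hello': 'world'}
print(latest(rows, "config", deep_merge=False))  # {'hello': 'world'}
print(latest(rows, "loss"))                      # 4.2
```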

`with kva.context(**data)`

Adds data to subsequent calls of kva.log.

Convenience

`kva.get(**keys)`

A wrapper for kva.filter(f), where f checks whether every value in keys is identical to the corresponding value in the row.
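
The relationship between the two calls can be sketched on a plain list of dicts. `filter_rows` and `get_rows` are hypothetical stand-ins for kva.filter and kva.get, and they return lists rather than kva.DB objects.

```python
def filter_rows(rows, accept_row):
    """Keep the rows for which accept_row(row) is truthy."""
    return [row for row in rows if accept_row(row)]


def get_rows(rows, **keys):
    """Keep rows whose values equal the requested value for every given key."""
    missing = object()  # sentinel, so an absent key never equals a requested value
    return filter_rows(
        rows, lambda row: all(row.get(k, missing) == v for k, v in keys.items())
    )


rows = [
    {"run_id": "a", "step": 1, "loss": 42},
    {"run_id": "b", "step": 1, "loss": 7},
    {"run_id": "a", "step": 2, "loss": 4.2},
]
run_a = get_rows(rows, run_id="a")                          # exact-match lookup
low_loss = filter_rows(rows, lambda row: row["loss"] < 10)  # arbitrary predicate
```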

`kva.init(**data)`

Basically another way of calling with kva.context(**data). It starts a run that remains active until kva.finish() is called, and subsequent calls to kva.log(**other_data) also log **data.
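
The run lifecycle described here can be illustrated with a small stand-in. This is a sketch of the described behavior only: `init`, `finish`, and `log` below are local functions written for the example, and the `_run_data` dict and `logged` list are assumptions about how such state might be held.

```python
_run_data = {}  # data attached to every row while a run is active (assumed)
logged = []     # stand-in for the append-only store


def init(**data):
    """Start a run: data is attached to every row until finish() is called."""
    _run_data.update(data)


def finish():
    """End the run: later rows no longer carry the run data."""
    _run_data.clear()


def log(**data):
    """Record one row made of the run data plus the given data."""
    logged.append({**_run_data, **data})


init(run_id="some-run")
log(step=1, loss=42)  # carries run_id
finish()
log(note="outside any run")  # no run_id attached
```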

UI

Start the UI via:

python server.py --view path/to/view/config.yaml

A view config looks like this:

index:
    - project # This is just an example to show that index may consist of multiple columns
    - run_id # This has the effect that for each unique index (i.e. for each run_id), we see one link on the main UI

# Once we click on a link, we see a details page on <url>/{project}/{run_id} with multiple panels
panels: 
    - name: summary # Title of the panel
      columns: '*' # The data of each panel corresponds to: kva.get(project=..., run_id=...).latest(columns=<specified in the panel>, index=<specified in the panel>)
      type: data # This means: we simply see a foldable yaml or table, depending on whether an index is selected or not

    - name: Loss # Title
      columns: ['loss', 'square']
      index: step
      type: lineplot # Plot the data: use index as x-axis and the columns (in this case 'loss' and 'square') on the y-axis. This only works when the datatype of all columns is numerical
    
    - name: samples # View for examples/llm_sampling.py
      columns: ['output']
      index: ['input']
      type: data # Display the data as a table
    
    - name: image-example
      columns: ['output'] # We assume that an image was logged as kva.log(output=File('image.png'))
      type: data # Data displays images / audios / videos directly when a value is of type File
    
    - name: images-over-training
      columns: ['image'] # We assume that an image was logged as kva.log(image=File('image.png'))
      type: data 
      slider: 'step' # Slider selects the step, at each step we display with the standard data displayer
