Unification of AI for collaborative experimentation and optimization using Collective Knowledge workflow framework with common JSON API

Introduction

After spending most of our "research" time not on AI innovation but on dealing with numerous and ever changing AI engines, their API, and the whole software and hardware stack, we decided to take an alternative approach.

We are developing CK-TensorFlow which is an open-source suite of convenient wrappers and workflows powered by Collective Knowledge with unified JSON API for simple and customized installation/recompilation, usage, evaluation and multi-objective optimisation of various TensorFlow implementations (CPU,CUDA,OpenCL) across diverse platforms from mobile devices and IoT to supercomputers and TPU cloud.

See cKnowledge.org/ai, shared optimization statistics and online demo of CK AI API with self-optimizing DNN for more details.

We need to have easily customizable TensorFlow builds via JSON API to be able to plug it to our framework for collaborative benchmarking and optimization of realistic workloads and models (such as deep learning) across diverse inputs and hardware provided by volunteers (see cKnowledge.org/ai, live repo and papers 1, 2, 3 for more details).

We can now build TensorFlow library and run classification via CK for various Android and Linux platforms. You can even participate in collaborative evaluation and optimization of TF using your Android device (mobile phone, tablet, etc) via this engaging Android app. You can see and use all public results in the Collective Knowledge repository.

Coordination of development

License

BSD (3 clause)

Prerequisites

Python 2.7+ or 3.3+
Collective Knowledge Framework
Java 8 JDK (though can be automatically installed via CK)
CUDA/cuDNN if you have CUDA-enabled GPU
Android NDK if you want to compile and run TF for Android devices

Prerequisites for Ubuntu

Python

Python 2.x:

$ sudo apt-get install python-dev python-pip python-setuptools python-opencv git
$ sudo pip install --upgrade pip
$ sudo pip install protobuf easydict joblib image numpy scipy

Python 3.x:

$ sudo apt-get install python3-dev python3-pip python3-setuptools
$ sudo pip3 install --upgrade pip
$ sudo pip3 install protobuf easydict joblib image wheel numpy scipy

Prerequisites for Windows

$ pip install --upgrade pip
$ pip install protobuf easydict joblib image numpy scipy

You can find more details about customized TensorFlow builds via CK for Android, Linux, Windows, Raspberry Pi, odroid, etc here.

Installing CK and this repository

You can install all basic dependencies and CK as following

$ sudo pip install ck
$ ck pull repo:ck-tensorflow

Example of TensorFlow unified installation on Ubuntu or Windows via CK (pre-build binaries)

$ ck install package:lib-tensorflow-1.1.0-cpu
$ ck install package:lib-tensorflow-1.1.0-cuda

Benchmarking

 $ ck run program:tensorflow (--env.BATCH_SIZE=10) (--env.NUM_BATCHES=5)

Select one of the test_cpu and test_cuda commands; select an available version of TensorFlow, if prompted (more than one choice); select an available benchmark, if prompted (more than one choice), and select TensorFlow model.

Example of TensorFlow unified classification on Ubuntu via CK

$ ck run program:tensorflow --cmd_key=classify

Note, that you will be asked to select a jpeg image from available CK data sets. We added standard demo images (cat.jpg, catgrey.jpg, fish-bike.jpg, computer_mouse.jpg) to the 'ctuning-datasets-min' repository. You can list them via

 $ ck pull repo:ctuning-datasets-min
 $ ck search dataset --tags=dnn

Crowd-benchmarking

It is now possible to participate in crowd-benchmarking of TensorFlow (early prototype):

$ ck crowdbench tensorflow --user={your email or ID to acknowledge contributions} (--env.BATCH_SIZE=128 --env.NUM_BATCHES=100)

You can see continuously aggregated results in the public Collective Knowledge repository under 'crowd-benchmark TensorFlow library' scenario.

Note, that this is an on-going, heavily evolving and long-term project to enable collaborative and systematic benchmarking and tuning of realistic workloads across diverse hardware (ARM TechCon'16 talk and demo, DATE'16, CPC'15). We also plan to add crowd-benchmarking and crowd-tuning of Caffe, TensorFlow and other DNN frameworks to our Android application soon - please, stay tuned!

Unified, multi-dimensional and multi-objective autotuning

It is now possible to take advantage of our universal multi-objective CK autotuner to optimize TensorFlow. As a first simple example, we added batch size tuning via CK. You can invoke it as following:

$ ck autotune tensorflow

All results will be recorded in the local CK repository and you will be given command lines to plot graphs or replay experiments such as:

$ ck plot graph:{experiment UID}
$ ck replay experiment:{experiment UID} --point={specific optimization point}

Collaborative and unified optimization of DNN

We are now working to extend above autotuner and crowdsource optimization of the whole SW/HW/model/data set stack (paper 1, paper 2).

We would like to thank the community for their interest and feedback about this collaborative AI optimization approach powered by CK at ARM TechCon'16 and the Embedded Vision Summit'17 - so please stay tuned ;) !

Other DNN with unified CK API

CK allows us to unify AI interfaces while collaboratively optimizing underneath engines. For example, we added similar support to install, use and evaluate Caffe/Caffe2 via CK:

$ ck pull repo:ck-caffe2
$ ck pull repo --url=https://github.com/dividiti/ck-caffe

$ ck install package:lib-caffe-bvlc-master-cpu-universal --env.CAFFE_BUILD_PYTHON=ON
$ ck install package:lib-caffe2-master-eigen-cpu-universal --env.CAFFE_BUILD_PYTHON=ON

$ ck run program:caffe --cmd_key=classify
$ ck run program:caffe2 --cmd_key=classify

$ ck crowdbench caffe --env.BATCH_SIZE=5
$ ck crowdbench caffe2 --env.BATCH_SIZE=5 --user=i_want_to_ack_my_contribution

$ ck autotune caffe
$ ck autotune caffe2

Realistic/representative training sets

We provided an option in all our AI crowd-tuning tools to let the community report and share mispredictions (images, correct label and wrong misprediction) to gradually and collaboratively build realistic data/training sets:

Online demo of a unified CK-AI API

Simple demo to classify images with continuous optimization of DNN engines underneath, sharing of mispredictions and creation of a community training set; and to predict compiler optimizations based on program features.

Long term vision

CK-Caffe, CK-Caffe2, CK-Tensorflow are part of an ambitious long-term and community-driven project to enable collaborative and systematic optimization of realistic workloads across diverse hardware in terms of performance, energy usage, accuracy, reliability, hardware price and other costs (ARM TechCon'16 talk and demo, DATE'16, CPC'15).

We are working with the community to unify and crowdsource performance analysis and tuning of various DNN frameworks (or any realistic workload) using Collective Knowledge Technology:

We continue gradually exposing various design and optimization choices including full parameterization of existing models.

Open R&D challenges

We use crowd-benchmarking and crowd-tuning of such realistic workloads across diverse hardware for open academic and industrial R&D challenges - join this community effort!

Related Publications with long term vision

@inproceedings{ck-date16,
    title = {{Collective Knowledge}: towards {R\&D} sustainability},
    author = {Fursin, Grigori and Lokhmotov, Anton and Plowman, Ed},
    booktitle = {Proceedings of the Conference on Design, Automation and Test in Europe (DATE'16)},
    year = {2016},
    month = {March},
    url = {https://www.researchgate.net/publication/304010295_Collective_Knowledge_Towards_RD_Sustainability}
}

@inproceedings{cm:29db2248aba45e59:cd11e3a188574d80,
    url = {https://arxiv.org/abs/1506.06256},
    title = {{Collective Mind, Part II: Towards Performance- and Cost-Aware Software Engineering as a Natural Science.}},
    author = {Fursin, Grigori and Memon, Abdul and Guillon, Christophe and Lokhmotov, Anton},
    booktitle = {{18th International Workshop on Compilers for Parallel Computing (CPC'15)}},
    publisher = {ArXiv},
    year = {2015},
    month = January,
    pdf = {https://arxiv.org/pdf/1506.06256v1}
}

All references with BibTex related to CK concept

Testimonials and awards

2017: We received CGO test of time award for our CGO'07 paper which later motivated creation of Collective Knowledge
2015: ARM and the cTuning foundation use CK to accelerate computer engineering: HiPEAC Info'45 page 17, ARM TechCon'16 presentation and demo, public CK repo

Feedback

Get in touch with CK-AI developers here. Also feel free to engage with our community via this mailing list:

https://groups.google.com/group/collective-knowledge

Name		Name	Last commit message	Last commit date
Latest commit History 335 Commits
.cm		.cm
dataset		dataset
module		module
package		package
program		program
script		script
soft		soft
.ckr.json		.ckr.json
.gitignore		.gitignore
AUTHORS		AUTHORS
CHANGES		CHANGES
CONTRIBUTIONS		CONTRIBUTIONS
COPYRIGHT		COPYRIGHT
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unification of AI for collaborative experimentation and optimization using Collective Knowledge workflow framework with common JSON API

Introduction

Coordination of development

License

Prerequisites

Prerequisites for Ubuntu

Python

Prerequisites for Windows

Installing CK and this repository

Example of TensorFlow unified installation on Ubuntu or Windows via CK (pre-build binaries)

Benchmarking

Example of TensorFlow unified classification on Ubuntu via CK

Crowd-benchmarking

Unified, multi-dimensional and multi-objective autotuning

Collaborative and unified optimization of DNN

Other DNN with unified CK API

Realistic/representative training sets

Online demo of a unified CK-AI API

Long term vision

Open R&D challenges

Related Publications with long term vision

Testimonials and awards

Feedback

About

Releases

Packages

Languages

License

dsavenko/ck-tensorflow

Folders and files

Latest commit

History

Repository files navigation

Unification of AI for collaborative experimentation and optimization using Collective Knowledge workflow framework with common JSON API

Introduction

Coordination of development

License

Prerequisites

Prerequisites for Ubuntu

Python

Prerequisites for Windows

Installing CK and this repository

Example of TensorFlow unified installation on Ubuntu or Windows via CK (pre-build binaries)

Benchmarking

Example of TensorFlow unified classification on Ubuntu via CK

Crowd-benchmarking

Unified, multi-dimensional and multi-objective autotuning

Collaborative and unified optimization of DNN

Other DNN with unified CK API

Realistic/representative training sets

Online demo of a unified CK-AI API

Long term vision

Open R&D challenges

Related Publications with long term vision

Testimonials and awards

Feedback

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages