DataSwissKnife

A Handy Little Tool to aid your Data Science projects

About

A productivity tool for data science operations to make doing data science simpler and faster, especially for the domain-erudite audience. Created by Ramshankar Yadhunath and Srivenkata Srikanth, with the help of Arvind Sudheer.

Find more details here

Description

DSK is primarily software that has been built with the purpose of aiding anybody who is familiar with necessary domain expertise to do preliminary data science. It works as a handy productivity tool, making preliminary data science operations both simpler as well as faster.

DSK lets users load a raw block of tabular data onto it and asks relevant questions about the kind of operations the user wants to do with the data. DSK then performs these operations of data cleaning, pre-processing, auto-generating visualizations and even some preliminary baseline modelling, all based on the user's response to the questions asked. DSK only makes use of these question-response interactions with the user and thus helps users perform preliminary data science without having to write any code to do so.

The diagram below represents the overall high-level functionality of DSK.

What kind of questions does DSK ask the user?

All questions asked by DSK are ones that require single-character or single-word responses. These include Multiple Choice Questions, Yes/No Questions and Single-Word Answer Questions.

What kind of a tool is DSK?

DSK currently runs only on the command line.

How does DSK help its audience?

Speed and Convenience are two very important considerations for any data science project. Often, the preliminary part of it i.e organizing the project, data cleaning and pre-processing takes up a considerable amount of time, especially if code has to be written for every minor operation that needs to be performed. If data science has to be made simpler and easier for quickly generating essential results(without having to write code), full or partial control has to be transferred from the hands of the user to the system itself. In other words, the system has to be automated.

DSK is an attempt at laying the foundations for a system that will work in automated fashion to help users perform preliminary data science operations without writing code. Currently, DSK is prototypical and will be scaled to a full product in the future iterations of this project.
Read more about the Vision of DSK here.

DSK has also been created by keeping in the best interests of the research community in mind. Students and researchers from fund-crunched institutes usually cannot afford proprietary tools to help with their research. The non-data technicals, i.e people who are running small businesses are usually discouraged from tapping into data-driven techniques because most tools require an early investment of time and money. The free blogs and articles that talk about data science in general can't convince these businesses. The owners do not know how effectively they could use the data they have. DSK helps bridge this gap. Moreover, DSK is free to use and hence can be of help to anybody who wishes to utilize it.

How does DSK work?

The explanation of data flow through DSK has been moved here. Head over to this link in order to understand how DSK helps an end user. Screenshots of the working have been included.

If you would prefer seeing a video over reading about the working of DSK, this video will help!

Installation and Usage

It is recommended you download and run this project within a virtual environment, in order to ensure that the package installs do not tamper with the versions present in your system. The following links will help you learn why and how to use virtual environments in python.

Install using pip

Install the latest version of the dataswissknife package from PyPI
```
pip install dataswissknife
```
Run the following command to start the tool in your terminal within the environment into which the package has been installed
```
dsk
```

Install from Source

Download or clone this repository onto your local system
Extract the repository's contents
Navigate to the repository via the command line
Run the following command to install all necessary dependencies
```
pip install -r requirements.txt
```
Run the following command to start the tool
```
python dataswissknife/main_code.py
```
To avoid warnings being displayed, run with
```
python -W ignore dataswissknife/main_code.py
```
The tool should start in your command line. Follow the prompts.

Contributing

Contributions in the form of feedback or bug reports are most welcome. Currently, we are not accepting code or documentation contributions to the project. We assure you that this is because we have a few more features to add to DSK before we look at the future. As soon as this changes, we will update our guidelines.

How to contribute:

If you have tried the tool, please do consider leaving your feedback here
Contact the maintainer at [email protected]

If you are in need of assistance to use the tool, please contact the maintainer.

Credits

Development Leads

Ramshankar Yadhunath
Srivenkata Srikanth

Contributors

Arvind Sudheer

License

This project is licensed under the GNU General Public License version 3.

Name		Name	Last commit message	Last commit date
Latest commit History 78 Commits
dataswissknife		dataswissknife
documents		documents
img		img
.gitignore		.gitignore
AUTHORS.md		AUTHORS.md
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
MILESTONES.md		MILESTONES.md
README.md		README.md
VISION.md		VISION.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DataSwissKnife

About

Description

What kind of questions does DSK ask the user?

What kind of a tool is DSK?

How does DSK help its audience?

How does DSK work?

Installation and Usage

Install using pip

Install from Source

Contributing

Credits

Development Leads

Contributors

License

About

Contributors 3

Languages

License

ry05/dataswissknife

Folders and files

Latest commit

History

Repository files navigation

DataSwissKnife

About

Description

What kind of questions does DSK ask the user?

What kind of a tool is DSK?

How does DSK help its audience?

How does DSK work?

Installation and Usage

Install using pip

Install from Source

Contributing

Credits

Development Leads

Contributors

License

About

Resources

License

Stars

Watchers

Forks

Contributors 3

Languages