Skip to content

BBoxDB is a scalable, highly available and distributed data store for multi-dimensional big data. The software supports operations like hyperrectangle queries or spatial joins.

License

Notifications You must be signed in to change notification settings

xiaming9880/bboxdb

 
 

Repository files navigation


Build Status Coverity Scan Build Status Codecov Join the chat at https://gitter.im/bboxdb/Lobby Maven Central Version

Please Note: The master branch may be in an unstable state during development. Please use our releases for productive environments.

What is BBoxDB?

BBoxDB is a highly available distributed storage manager (NoSQL database), designed to handle multi-dimensional big data. Primarily, the software is a research project to explore new ways to handle multi-dimensional data in a distributed environment.

In contrast to existing key-value stores, BBoxDB can handle multi-dimensional efficiently. Existing key-value stores are using one-dimensional keys to address the values. Finding a proper key for multi-dimensional data is hard and often impossible; this is especially true when the data has an extent (non-point data / regions). To retrieve multi-dimensional data from a key-value store, a full data scan is often required. BBoxDB was developed to avoid the expensive full data scan and to make the work with multi-dimensional data more convenient.

Key features

  • The well-known key-value data model is enhanced by a bounding box to describe the location of n-dimensional data in space.

  • Data (point and non-point) of any dimension is supported.

  • The data is indexed, which enables efficient range query processing.

  • BigData is supported by spreading the data across a cluster of systems. Each node stores only a small part of the whole dataset.

  • Multi-dimensional shards are created dynamically on the actual distribution of the data (automatically scale-up/scale-down).

  • Data of multiple tables is stored co-partitioned, and spatial-joins can be executed efficiently without data shuffling between nodes.

  • Data are re-distributed in the background without any service interruption.

  • Continuous queries with bounding box query predicates are supported.

  • User defined filters for query processing on custom data types.

Documentation

The documentation of the project is located at https://jnidzwetzki.github.io/bboxdb/. The documentation also contains the changelog of the project.

Getting started

For a guided tour through the features of BBoxDB, see the getting started chapter in the documentation. We also recommend reading the creating client code section. The install guide explains the needed steps to deploy an own BBoxDB cluster. The guide also describes how you can setup a virtualized cluster with 5 BBoxDB nodes in under two minutes, by using Docker and Docker Compose.

Screenshots

BBoxDB ships with a GUI that allows observing the global index structure. Below you find two screenshots of the GUI. The screenshots show how the space is partitioned. In addition, some details about the discovered nodes are shown.




(The screenshots contain content from OpenStreetMap - CC-BY-SA 2.0)

When 2-dimensional bounding boxes with WGS 84 coordinates are used, a map overlay visualization is supported by the GUI. On the top right picture, some spatial data about Germany was imported and the Figure shows, how Germany in partitioned after the data was imported. In addition, the GUI provides operations to explore 2 dimensional GeoJSON encoded data.

Contact / Stay informed

License

BBoxDB is licensed under the Apache 2.0 license. See the LICENSE file for details.

About

BBoxDB is a scalable, highly available and distributed data store for multi-dimensional big data. The software supports operations like hyperrectangle queries or spatial joins.

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Java 98.3%
  • Shell 1.5%
  • Other 0.2%