GitHub - yueyub/nebullvm at fc968b2b9a6bc6e10c8837eaecef675a41d7fb40

Plug-and-play modules to optimize the performance of your AI systems

Documentation: docs.nebuly.com/

Nebullvm is an ecosystem of plug-and-play modules for optimizing the performance of your AI systems. The optimization modules are stack-agnostic and work with any library. They are designed to integrate easily into your system, so you can plug them in and start benefiting from the improved performance right away.

If you like the idea, give us a star to show your support for the project ⭐

What can this help with?

We currently provide several modules to boost the performance of your AI systems:

✅ Speedster: Automatically apply the best set of SOTA optimization techniques to achieve the maximum inference speed-up on your hardware.

✅ Nos: Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas - Effortless optimization at its finest!

✅ ChatLLaMA: Build faster and cheaper ChatGPT-like assistants based on LLaMA architectures.

✅ OpenAlphaTensor: Increase the computational performance of an AI model with a custom-generated matrix multiplication algorithm tuned for your specific hardware.

✅ Forward-Forward: The Forward-Forward algorithm is a method for training deep neural networks that replaces backpropagation's forward and backward passes with two forward passes, one on positive (real) data and one on negative data.
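
To make the Forward-Forward idea concrete, here is a minimal NumPy sketch of a single layer trained with two forward passes and a purely local update. It is an illustration under stated assumptions, not the code of the Forward-Forward module: the names `goodness` and `ff_layer_step`, the threshold `theta`, the learning rate, and the toy data are all hypothetical. It assumes one common formulation in which a layer's "goodness" is the sum of its squared activations, pushed above a threshold for positive data and below it for negative data.

```python
import numpy as np

def goodness(h):
    # Assumed goodness measure: sum of squared activations per sample.
    return (h ** 2).sum(axis=1)

def ff_layer_step(W, x_pos, x_neg, theta=2.0, lr=0.01):
    """One local update of a single ReLU layer (hypothetical helper).

    Runs two forward passes (positive and negative data) and updates W
    from this layer's own loss; no gradient flows from other layers.
    """
    grad = np.zeros_like(W)
    loss = 0.0
    for x, sign in ((x_pos, 1.0), (x_neg, -1.0)):
        h = np.maximum(x @ W, 0.0)         # forward pass through the layer
        z = sign * (goodness(h) - theta)   # margin: should be large and positive
        loss += np.logaddexp(0.0, -z).mean()   # softplus(-z)
        dg = -sign / (1.0 + np.exp(z))     # d loss / d goodness
        dh = dg[:, None] * 2.0 * h         # d goodness / d h = 2h (zero where ReLU is inactive)
        grad += x.T @ dh / x.shape[0]
    return W - lr * grad, loss

# Toy data (an assumption for the demo): positive and negative samples
# drawn around opposite means.
rng = np.random.default_rng(0)
x_pos = rng.normal(loc=1.0, size=(64, 10))
x_neg = rng.normal(loc=-1.0, size=(64, 10))
W = rng.normal(scale=0.1, size=(10, 8))

losses = []
for _ in range(300):
    W, loss = ff_layer_step(W, x_pos, x_neg)
    losses.append(loss)

g_pos = goodness(np.maximum(x_pos @ W, 0.0)).mean()
g_neg = goodness(np.maximum(x_neg @ W, 0.0)).mean()
print(f"loss {losses[0]:.3f} -> {losses[-1]:.3f}, goodness pos {g_pos:.2f} vs neg {g_neg:.2f}")
```

In a multi-layer network each layer would run the same kind of local step on the (typically normalized) output of the layer below, which is what removes the need for a backward pass through the whole network.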

Next modules and roadmap

We are actively working on incorporating the following modules, as requested by members of our community, in upcoming releases:

GPToptimizer: Effortlessly optimize large generative models served through APIs from OpenAI, Cohere, and Hugging Face.
CloudSurfer: Automatically discover the optimal cloud configuration and hardware on AWS, GCP and Azure to run your AI models.
OptiMate: Interactive tool guiding savvy users in achieving the best inference performance out of a given model / hardware setup.
TrainingSim: Easily simulate the training of large AI models on a distributed infrastructure to predict training behaviours without actual implementation.

Contributing

As an open source project in a rapidly evolving field, we welcome contributions of all kinds, including new features, improved infrastructure, and better documentation. If you're interested in contributing, please see the linked page for more information on how to get involved.

Join the community | Contribute to the library
