Releases: coreylowman/cudarc
Releases · coreylowman/cudarc
v0.11.7 - Cuda 11.4, CUDNN path fixes, new driver functions
What's Changed
- Add documentation on how to synchronize non-default streams by @sebhtml in #261
- Cuda1104 by @elmattic in #266
- Add more search paths/lib names for cudnn dynamic linking & loading by @coreylowman in #264
- Handling self.error_string() failing in Debug for DriverError by @coreylowman in #267
- Adding various functions to driver result api for managed memory by @coreylowman in #268
New Contributors
Full Changelog: v0.11.6...v0.11.7
v0.11.6 - More lib_names for dynamic loading, better error messages
What's Changed
- add another windows lib_name workaround by @brandonros in #250
- Add
CudaDevice::name()
andresult::device::get_name()
by @coreylowman in #252 - Improve panic message if dynamic loading failed. by @coreylowman in #257
New Contributors
- @brandonros made their first contribution in #250
Full Changelog: v0.11.5...v0.11.6
v0.11.5 - Support cuda 11.5/11.6
What's Changed
- Add
CudaSlice::split_at_mut
andCudaViewMut::split_at_mut
by @dkales in #235 - Add support for 11.5 and 11.6 toolkit versions. by @coreylowman in #249
New Contributors
Full Changelog: v0.11.4...v0.11.5
v0.11.4 - Using nvcc to find cuda toolkit version & curand dynamic loading fix
What's Changed
- Using nvcc to get cuda toolkit version instead of cuda.h by @coreylowman in #241
- Adding fix for finding curand64_10.dll by @coreylowman in #243
Full Changelog: v0.11.3...v0.11.4
v0.11.3 - Dynamic loading improvements on windows
What's Changed
- Adding fallback lib name options for dynamic loading by @coreylowman in #240
Full Changelog: v0.11.2...v0.11.3
v0.11.2 - Cuda toolkit 12.5
v0.11.1
What's Changed
- CUDA 12.4 by @bitemyapp in #227
- Moving cublasHgemm link to dynamic loading by @coreylowman in #229
New Contributors
- @bitemyapp made their first contribution in #227
Full Changelog: v0.11.0...v0.11.1
v0.11.0
What's Changed
- Added result and safe functionality to support the cuFuncSetAttribute function. by @GaryMcD in #199
- Implemented LaunchAsync for raw ptr arrays by @jafioti in #204
- Safe ConvolutionND by @bloodre in #202
- Fixed Test Example 07 by @Fiend-Star-666 in #205
- Fix pipelines by @coreylowman in #207
- Adding cuda toolkit versions 12.0/12.1/12.2 of sys & feature flags by @coreylowman in #210
- support cuda toolkit version: 11.7 by @wenhaozhao in #214
- Adding cuda 12.3 bindings by @coreylowman in #215
- [Breaking] Remove Static linking. Make dynamic loading default. Add dynamic-linking feature flag by @coreylowman in #211
- Rework cuda versioning features - Users must always specify a version feature instead of having default behavior. by @coreylowman in #216
- Replace unwrap with ? in CudaDevice::new() by @coreylowman in #223
- Update README.md by @chaserileyroberts in #218
- Add CudaDevice::new_with_stream by @coreylowman in #224
- Wrapping comm_split in cfg based on cuda version by @coreylowman in #225
New Contributors
- @GaryMcD made their first contribution in #199
- @jafioti made their first contribution in #204
- @bloodre made their first contribution in #202
- @Fiend-Star-666 made their first contribution in #205
- @wenhaozhao made their first contribution in #214
- @chaserileyroberts made their first contribution in #218
Full Changelog: v0.10.0...v0.11.0
v0.10.0
What's Changed
- feat: add support for batch matrix-matrix product in cuBLASLt by @OlivierDehaene in #186
- [Breaking] Update dtoh_sync_copy by @zjsec in #183
- [Breaking]
cublaslt::result::get_matmul_algo_heuristic
is now unsafe by @coreylowman in #189 - fix: fix cublaslt objects memory leak by @OlivierDehaene in #192
New Contributors
Full Changelog: v0.9.15...v0.10.0
v0.9.15
What's Changed
- Fixing 07-build-workflow. by @Narsil in #175
- Making the tests pass on single GPU platforms. by @Narsil in #176
- Fix typo in docs about for_num_elems of LaunchConfig by @mert-kurttutan in #177
- Minimum changes for occupancy API by @Jark5455 in #180
- feat: Add support for cublas Lt by @OlivierDehaene in #182
- Add set_offset so that curandSetGeneratorOffset() can be called. by @mneilly in #185
New Contributors
- @mert-kurttutan made their first contribution in #177
- @Jark5455 made their first contribution in #180
- @OlivierDehaene made their first contribution in #182
- @mneilly made their first contribution in #185
Full Changelog: v0.9.14...v0.9.15