Develop #125

Merged
merged 26 commits into from
Aug 1, 2023
4d5de57
Major update (VkFFT 1.3.0)
DTolm Apr 2, 2023
8ec6867
-Improved read/write stage for continuous adressing
DTolm Apr 10, 2023
9e8f11d
Bugfix OpenCL save compiled binary seems to be broken #110
DTolm Apr 11, 2023
8d818d3
Handle VKFFT_ERROR_MATH_FAILED in getVkFFTErrorString
al42and Apr 25, 2023
da1acad
Merge pull request #113 from al42and/getVkFFTErrorString-fix
DTolm May 2, 2023
ca263b4
Bugfix (#112)
DTolm May 2, 2023
fce65a2
Bugfixes (https://github.com/vincefn/pyvkfft/issues/25#issuecomment-1…
DTolm May 23, 2023
8ca0b4f
Merge branch 'develop' of https://github.com/DTolm/VkFFT into develop
DTolm May 23, 2023
3d34179
Fix build with Metal
al42and May 24, 2023
ff0d894
Merge pull request #117 from al42and/fix-metal-warning
DTolm May 24, 2023
fdfdc30
Bugfix (https://github.com/vincefn/pyvkfft/issues/25#issuecomment-156…
DTolm May 25, 2023
cc410b1
Fixed https://github.com/vincefn/pyvkfft/issues/25#issuecomment-15642…
DTolm May 31, 2023
7e4d159
Fix clang compilation errors
expenses Jun 10, 2023
fba81bc
Use a ZERO_INIT define
expenses Jun 13, 2023
5de1579
Merge pull request #118 from expenses/clang-errors
DTolm Jul 2, 2023
d772ebd
New directory path configuration of VkFFT
DTolm Jul 2, 2023
e6f8abe
Rename vkFFT_Plan_FFT.H to vkFFT_Plan_FFT.h
DTolm Jul 3, 2023
039646e
Bugfix (https://github.com/vincefn/pyvkfft/issues/25)
DTolm Jul 3, 2023
74cabda
Bugfix (https://github.com/vincefn/pyvkfft/issues/25)
DTolm Jul 5, 2023
9ede6c1
Bugfix (https://github.com/vincefn/pyvkfft/issues/25)
DTolm Jul 6, 2023
cc2b427
Support of arbitrary number of dimensions
DTolm Jul 21, 2023
ddb8c8f
Bugfix (https://github.com/vincefn/pyvkfft/issues/25)
DTolm Jul 28, 2023
41aa1a5
Final snapshot of develop branch 1.3.0 before merge (unless some issu…
DTolm Jul 31, 2023
3abc8bc
LevelZero: Fix the order of kernel/module destruction
al42and Aug 1, 2023
dd42598
Merge pull request #124 from al42and/fix-level_zero-destruction
DTolm Aug 1, 2023
dbed90a
Pre-merge version increment
DTolm Aug 1, 2023
DTolm committed Jul 6, 2023
commit 9ede6c17f7148faebb13fd4af4d470540b8e73b4
@@ -234,12 +234,16 @@ static inline void appendC2R_read(VkFFTSpecializationConstantsLayout* sc, int ty
 VkIf_end(sc);
 }
 VkContainer logicalStoragePerThread;
-logicalStoragePerThread.type = 31;
-logicalStoragePerThread.data.i = sc->registers_per_thread_per_radix[sc->stageRadix[0]] * sc->registerBoost;// (sc->registers_per_thread % stageRadix->data.i == 0) ? sc->registers_per_thread * sc->registerBoost : sc->min_registers_per_thread * sc->registerBoost;
 VkContainer logicalGroupSize;
+logicalStoragePerThread.type = 31;
 logicalGroupSize.type = 31;
-VkDivCeil(sc, &logicalGroupSize, &sc->fftDim, &logicalStoragePerThread);
-
+if (sc->rader_generator[0] == 0) {
+    logicalStoragePerThread.data.i = sc->registers_per_thread_per_radix[sc->stageRadix[0]] * sc->registerBoost;// (sc->registers_per_thread % stageRadix->data.i == 0) ? sc->registers_per_thread * sc->registerBoost : sc->min_registers_per_thread * sc->registerBoost;
+    VkDivCeil(sc, &logicalGroupSize, &sc->fftDim, &logicalStoragePerThread);
+}
+else {
+    logicalGroupSize.data.i = localSize.data.i;
+}
 if ((sc->rader_generator[0] > 0) || ((sc->fftDim.data.i % sc->localSize[0].data.i) && (!sc->stridedSharedLayout)) || ((sc->fftDim.data.i % sc->localSize[1].data.i) && (sc->stridedSharedLayout)) || (logicalGroupSize.data.i != localSize.data.i))
     sc->readToRegisters = 0;
 else
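Reading note on the hunk above: as far as the visible lines show, the commit computes the logical group size by ceiling division only when no Rader generator is set for the first stage, and otherwise takes it from the local size. It then forces `readToRegisters` to 0 if any of four conditions holds. The following is a minimal standalone sketch of that decision, not VkFFT code. The struct `C2RReadParams`, its field names, and the helpers `div_ceil` and `disables_read_to_registers` are hypothetical plain-integer stand-ins for the `VkContainer` values and `VkDivCeil` call in the diff. The `else` branch is cut off in the hunk, so the sketch only reports whether the shown condition fires and makes no claim about what happens otherwise.

```c
#include <assert.h>

/* Hypothetical plain-integer stand-ins for the values used in the hunk. */
typedef struct {
    long fftDim;              /* sc->fftDim.data.i */
    long localSize0;          /* sc->localSize[0].data.i */
    long localSize1;          /* sc->localSize[1].data.i */
    long localSize;           /* the local localSize.data.i used in the comparison */
    long registersPerThread;  /* sc->registers_per_thread_per_radix[sc->stageRadix[0]] */
    long registerBoost;       /* sc->registerBoost */
    int  raderGenerator0;     /* sc->rader_generator[0] */
    int  stridedSharedLayout; /* sc->stridedSharedLayout */
} C2RReadParams;

/* Ceiling division for positive operands (assumed semantics of VkDivCeil). */
static long div_ceil(long a, long b) {
    assert(b > 0);
    return (a + b - 1) / b;
}

/* Returns 1 when the condition in the hunk would set readToRegisters to 0,
 * and 0 when it would fall through to the (truncated) else branch. */
static int disables_read_to_registers(const C2RReadParams* p) {
    long logicalGroupSize;
    if (p->raderGenerator0 == 0) {
        /* No Rader stage: derive the group size from per-thread storage. */
        long logicalStoragePerThread = p->registersPerThread * p->registerBoost;
        logicalGroupSize = div_ceil(p->fftDim, logicalStoragePerThread);
    } else {
        /* Rader stage: the group size is taken from the local size. */
        logicalGroupSize = p->localSize;
    }
    return (p->raderGenerator0 > 0)
        || ((p->fftDim % p->localSize0) && !p->stridedSharedLayout)
        || ((p->fftDim % p->localSize1) && p->stridedSharedLayout)
        || (logicalGroupSize != p->localSize);
}
```

With these stand-ins, a 64-point dimension with 8 registers per thread and a local size of 8 does not trigger the condition, while any nonzero Rader generator always does, because the first term of the condition is true regardless of the group size.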