Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: DTolm/VkFFT Loading
base: master
Choose a base ref
...
head repository: DTolm/VkFFT Loading
compare: develop
Choose a head ref
Checking mergeability… Don’t worry, you can still create the pull request.
  • 15 commits
  • 68 files changed
  • 2 contributors

Commits on Jan 24, 2024

  1. Add NULL pointer check to deleteVkFFT

    Copy behaviour of e. g. `free` and `fftw_plan_destroy`, where deleting null pointer does nothing instead of segmentation fault.
    DejvBayer committed Jan 24, 2024
    Configuration menu
    Copy the full SHA
    d84351f View commit details
    Browse the repository at this point in the history

Commits on Feb 6, 2024

  1. Merge pull request #152 from DejvBayer/patch-1

    Add NULL pointer check to deleteVkFFT
    DTolm committed Feb 6, 2024
    Configuration menu
    Copy the full SHA
    646e350 View commit details
    Browse the repository at this point in the history
  2. Added performR2R[VKFFT_MAX_FFT_DIMENSIONS] option to specify type of …

    …real transform per axis (as uint)
    
    -Solves #151
    DTolm committed Feb 6, 2024
    Configuration menu
    Copy the full SHA
    d5c9c04 View commit details
    Browse the repository at this point in the history

Commits on Feb 8, 2024

  1. Configuration menu
    Copy the full SHA
    ef4db97 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    135a478 View commit details
    Browse the repository at this point in the history

Commits on Feb 22, 2024

  1. Enable push descriptors for Vulkan backend (#155)

    -Allows to change descriptors inside one command buffer
    -Needs to be enabled by the user before  vkCreateDevice and set with usePushDescriptors flag in configuration of VkFFT
    -there is a warning in validation layer that needs to be investigated "vkCreateDevice: pCreateInfo->pNext chain includes a structure with unexpected VkStructureType VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_PUSH_DESCRIPTOR_PROPERTIES_KHR"
    DTolm committed Feb 22, 2024
    Configuration menu
    Copy the full SHA
    ea42797 View commit details
    Browse the repository at this point in the history

Commits on Feb 23, 2024

  1. Addressing #153 and #154

    -improved glslang management in CMake
    -replaced the relative path to the glslang_c_interface with the full path
    DTolm committed Feb 23, 2024
    Configuration menu
    Copy the full SHA
    8259a42 View commit details
    Browse the repository at this point in the history

Commits on Feb 28, 2024

  1. Bugfix (#156)

    -fixed missing line that was lost somewhere during v1.3.0 reorganization of read/write module
    DTolm committed Feb 28, 2024
    Configuration menu
    Copy the full SHA
    d753451 View commit details
    Browse the repository at this point in the history

Commits on Mar 2, 2024

  1. single kernel - multiple batches convolution support (#159)

    -enabled through singleKernelMultipleBatches parameter
    -kernel batching is controlled through coordinateFeatures
    -number of input/output systems is controlled through numberBatches
    -sample 53 shows the usage of this option
    DTolm committed Mar 2, 2024
    Configuration menu
    Copy the full SHA
    e2d3d57 View commit details
    Browse the repository at this point in the history

Commits on Mar 5, 2024

  1. Configuration menu
    Copy the full SHA
    9dd4480 View commit details
    Browse the repository at this point in the history

Commits on Jun 6, 2024

  1. Separate complex components read/write support (#149)

    -Real and imaginary parts of a number can now be supplied from two buffers with their own respective offsets (so both buffers can actually be just one with an offset for I).
    -Any buffer array pointer can be provided in such form (as two buffers) - input, output, buffer, kernel, temp. Need to set the respective bufferNum and bufferSeparateComplexComponents value.
    -Updated samples 0, 50 and 11 (for now) to demonstrate how this feature works.
    -Bugfixes.
    DTolm committed Jun 6, 2024
    Configuration menu
    Copy the full SHA
    5f7df3f View commit details
    Browse the repository at this point in the history

Commits on Jun 13, 2024

  1. Two new reorderFourStep transposition schemes

    -Improves performance for big sequences
    -Optimized radix kernels to use fewer instructions
    -Use __ldg in CUDA and HIP
    DTolm committed Jun 13, 2024
    Configuration menu
    Copy the full SHA
    daf09d3 View commit details
    Browse the repository at this point in the history
  2. Added const qualifier to user buffers (#177)

    -Fixed some warnings
    DTolm committed Jun 13, 2024
    Configuration menu
    Copy the full SHA
    79cf718 View commit details
    Browse the repository at this point in the history

Commits on Jun 15, 2024

  1. Optimization for AMD power of 2 memory accesses causing memory pin se…

    …rialization
    
    -Option to use temp buffer to automatically pad every intermediate memory access to break power of 2 strides. Can boost performance up to 2x in some cases. Enabled in all APIs for AMD (can be force-overwritten by user).
    DTolm committed Jun 15, 2024
    Configuration menu
    Copy the full SHA
    1377057 View commit details
    Browse the repository at this point in the history

Commits on Jul 5, 2024

  1. Bugfix (#180)

    -Force the callback version of R2C if zeropadding is enabled. The even decomposition version is hard to make work for proper zeropadding, as it is read as complex numbers and won't support odd zeropadding lengths trivially.
    DTolm committed Jul 5, 2024
    Configuration menu
    Copy the full SHA
    d0b68bb View commit details
    Browse the repository at this point in the history
Loading