Skip to content

Releases: clMathLibraries/clFFT

Version 2.12.2

31 Aug 22:06
Compare
Choose a tag to compare

This is a patch update release to v2.12.1.
It has the following:

  • Fix for thread safety issues
  • Fix for failures in certain 2D transforms
  • Fix for failures on POCL runtime
  • Other minor improvements

For details on issues, please see issue trackers.

Version 2.12.1

03 Jun 18:25
Compare
Choose a tag to compare

clFFT - Release Notes - version 2.12.1:

  • Fixes for bugs in decomposition logic of very large problem sizes
    with environment flag CLFFT_REQUEST_LIB_NOMEMALLOC=1
  • Validation of sizes that are powers of 2,3,5 and combinations
    thereof upto maximum size close to 25GB on S9170

Driver notes:

  • This library version has been tested with Catalyst
    Pro driver version 15.302 on Firepro W9100 & S9170.

Version 2.12.0

20 Apr 19:29
Compare
Choose a tag to compare

clFFT - Release Notes - version 2.12.0:

  • Addition of radix 11 & 13 enables support for powers of 11 & 13 size transforms
  • Support for 1D large size inplace transforms with no extra memory allocation
    requirement with environment flag CLFFT_REQUEST_LIB_NOMEMALLOC=1
    for complex FFTs of powers of 3,5,10 sizes. Note that support for
    power-of-2 size has been available since version 2.10.

Driver notes:

  • This library version has been tested with Catalyst
    Pro driver version 15.201 on Firepro W9100.

Version 2.10.2

28 Mar 21:12
Compare
Choose a tag to compare

This is a patch update release to v2.10.1.
It has the following:

  • Fixes for accuracy/stability issues noted in large size real FFTs
  • Directed tests to target testing of specific list of problem sizes

Version 2.10.1

16 Mar 00:10
Compare
Choose a tag to compare

This is a patch update release to v2.10.0.
It has the following:

  • Performance fixes for slowdown observed vs v2.8. Specifically,
    2D transforms for certain sizes showed significant slowdown.
    This issue has been fixed in this release. The fix also makes the
    performance in general better than v2.8.
  • Code reorganization in transposes and performance improvements.
    There is a modest performance improvement with inplace transforms
    for large power-of-2 sizes.
  • Several minor enhancements and bug fixes

Version 2.10.0

12 Jan 23:13
Compare
Choose a tag to compare

This clFFT release tagged as v2.10.0 is part of AMD Compute Libraries (ACL) 1.0 GA.

clFFT - Release Notes - version 2.10.0:

  • Post-callback feature that enables custom post-processing
    of output data directly by the library with user callback function
  • Support for in-place transposes for power-of-2 sizes enables really
    large 1D transforms as well as supporting no additional memory
    allocation, by library, for a range of problem sizes

Version 2.8.0

20 Oct 00:57
Compare
Choose a tag to compare

This clFFT release tagged as v2.8 is part of AMD Compute Libraries (ACL) 1.0 beta 2.

clFFT - Release Notes - version 2.8.0:

  • Support for power-of-7 size transforms
  • Pre-callback feature that enables custom pre-processing
    of input data directly by the library with user callback function
  • Support for 1D large size transforms with no extra memory allocation
    requirement for certain sizes

Driver notes:

  • This library version has been tested with Catalyst
    Pro driver version 14.502 on Firepro W9100.

Version 2.6.1

07 Aug 21:01
Compare
Choose a tag to compare

clFFT - Release Notes - version 2.6.1:

This is a patch update release to v2.6. It has the following.

  • Fix for client application crash with Linux 14.502 driver
  • Performance fixes for real forward and backward transform when enabling ECC
  • Performance fixes for complex transforms over a small range in power-of-2 sizes
  • Improvement in API timing in client
  • Several minor enhancements and bug fixes

Driver notes:

  • This library version has been tested with Catalyst
    Pro driver version 14.502 on Firepro W9100.

Version 2.6.0

07 Aug 20:45
Compare
Choose a tag to compare

This clFFT release tagged as v2.6 is part of AMD Compute Libraries (ACL) 1.0 beta 1.

clFFT - Release Notes - version 2.6.0:

  • Significant uplift of 1D real forward transform
    performance for power-of-2 sizes
  • Significant uplift of large 2D/3D real transform
    performance for power-of-2 sizes

Known issues:

  • Size limit on 1D real transforms remain at 2^24
    for single precision and 2^22 for double
    precision

Driver notes:

  • This library version has been tested with Catalyst
    Pro driver version 14.301 on Firepro W9100.

Version 2.4.0

22 Jul 11:50
Compare
Choose a tag to compare

clFFT - Release Notes - version 2.4.0:

New features of this release:

  • Significant uplift of 1D complex transform performance
  • 1D large size limit relaxation for complex transforms
  • 2D/3D size limit relaxation on real and complex transforms
  • Binary caching feature
  • Several minor fixes and improvements