forked from clMathLibraries/clFFT
-
Notifications
You must be signed in to change notification settings - Fork 0
/
ReleaseNotes.txt
174 lines (119 loc) · 5.1 KB
/
ReleaseNotes.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
clFFT Contents
--------------
The clFFT library is an open source OpenCL library
implementation of discrete Fast Fourier Transforms. It:
* Provides a fast and accurate platform for calculating
discrete FFTs
* Supports 1D, 2D, and 3D transforms with a batch size
that can be greater than 1
* Supports complex and real transforms
* Supports planar (real and complex components in
separate arrays) and interleaved (real and complex
components as a pair contiguous in memory) formats
* Supports dimension lengths that can be any mix of
powers of 2, 3, 5, 7, 11 and 13
* Supports single and double precision floating-point
formats
* Supports in-place or out-of-place transforms
clFFT - Release Notes - version 2.12.2
--------------------------------------
This is a patch update release to v2.12.1.
It has the following:
* Fix for thread safety issues
* Fix for failures in certain 2D transforms
* Fix for failures on POCL runtime
* Other minor improvements
Details on issues available on GitHub.
clFFT - Release Notes - version 2.12.1
--------------------------------------
New features of this release:
* Fixes for bugs in decomposition logic of very large problem sizes
with environment flag CLFFT_REQUEST_LIB_NOMEMALLOC=1
* Validation of sizes that are powers of 2,3,5 and combinations
thereof upto maximum size close to 25GB on S9170
Driver notes:
* This library version has been tested with Catalyst
Pro driver version 15.302 on Firepro W9100 & S9170.
clFFT - Release Notes - version 2.12.0
--------------------------------------
New features of this release:
* Addition of radix 11 & 13 enables support for powers of 11 & 13 size
transforms
* Support for 1D large size transforms with no extra memory allocation
requirement with environment flag CLFFT_REQUEST_LIB_NOMEMALLOC=1
for complex FFTs of powers of 3,5,10 sizes. Note that support for
power-of-2 size has been available since version 2.10.
Driver notes:
* This library version has been tested with Catalyst
Pro driver version 15.201 on Firepro W9100.
clFFT - Release Notes - version 2.10.2
--------------------------------------
This is a patch update release to v2.10.1.
It has the following:
* Fixes for accuracy/stability issues noted in large size real FFTs
clFFT - Release Notes - version 2.10.1
--------------------------------------
This is a patch update release to v2.10.0.
It has the following:
* Performance fixes for slowdown observed vs v2.8. Specifically,
2D transforms for certain sizes showed significant slowdown.
This issue has been fixed in this release. The fix also makes the
performance in general better than v2.8.
* Code reorganization in transposes and performance improvements.
There is a modest performance improvement with inplace transforms
for large power-of-2 sizes.
* Several minor enhancements and bug fixes
clFFT - Release Notes - version 2.10.0
--------------------------------------
New features of this release:
* Post-callback feature that enables custom post-processing
of output data directly by the library with user callback function
* Support for in-place transposes for power-of-2 sizes enables really
large 1D transforms as well as supporting no additional memory
allocation, by library, for a range of problem sizes
clFFT - Release Notes - version 2.8.0
--------------------------------------
New features of this release:
* Support for power-of-7 size transforms
* Pre-callback feature that enables custom pre-processing
of input data directly by the library with user callback function
* Support for 1D large size transforms with no extra memory allocation
requirement for certain sizes
Driver notes:
* This library version has been tested with Catalyst
Pro driver version 14.502 on Firepro W9100.
clFFT - Release Notes - version 2.6.1
--------------------------------------
This is a patch update release to v2.6.0. It has the following.
* Fix for client application crash with Linux 14.502 driver
* Performance fixes for real forward and backward transform when
enabling ECC
* Performance fixes for complex transforms over a small range in
power-of-2 sizes
* Improvement in API timing in client
* Several minor enhancements and bug fixes
Driver notes:
* This library version has been tested with Catalyst
Pro driver version 14.502 on Firepro W9100.
clFFT - Release Notes - version 2.6.0
--------------------------------------
New features of this release:
* Significant uplift of 1D real forward transform
performance for power-of-2 sizes
* Significant uplift of large 2D/3D real transform
performance for power-of-2 sizes
Known issues:
* Size limit on 1D real transforms remain at 2^24
for single precision and 2^22 for double
precision
Driver notes:
* This library version has been tested with Catalyst
Pro driver version 14.301 on Firepro W9100.
clFFT - Release Notes - version 2.4.0
--------------------------------------
New features of this release:
* Significant uplift of 1D complex transform performance
* 1D large size limit relaxation for complex transforms
* 2D/3D size limit relaxation on real and complex transforms
* Binary caching feature
* Several minor fixes and improvements