Age Commit message Author
2021-01-20Merge tag '1.5.2' into debian/sidGard Spreemann
2021-01-19Updated to version 1.5.2Cedric Nugteren
2021-01-12Drop erroneous suggests for libclblast-devGard Spreemann
These were introduced through hamfisted copy-pasting.
2021-01-10Changelog updated for releasedebian/1.5.1-2Gard Spreemann
2021-01-10Watch fileGard Spreemann
2021-01-10Git to SalsaGard Spreemann
2020-12-22Changelog for initial upload.gspr/new-queue-20201222debian/1.5.1-1Gard Spreemann
2020-12-22Alphabetically order entries.Gard Spreemann
2020-12-22VCS URLsGard Spreemann
2020-12-22Close ITP.Gard Spreemann
2020-12-22Initial packaging.Gard Spreemann
2020-12-22Merge tag '1.5.1' into debian/sidGard Spreemann
2020-12-22Initial packaging.Gard Spreemann
2020-10-10Add tuning results for TITAN RTXCedric Nugteren
2020-10-10Add tuning results for Radeon RX VegaCedric Nugteren
2020-10-05Merge pull request #400 from baryluk/patch-6Cedric Nugteren
Allow single graph / subplot on plot
2020-10-05Allow single graph / subplot on plotWitold Baryluk
`plt.subplots` tries to be special and returns either an array or a single object depending on the number of subplots. This is not actually helpful, and IMHO bad design. Make it always an `ndarray`. The `and not type(axes) is np.ndarray` check is just in case matplotlib decides to make its behavior more uniform; for now, work around it. Also, there is really no need for `ndarray.flat`. Confirmed to work with existing benchmarks (i.e. rows=2, cols=3) and with single graphs (rows=1, cols=1).
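The normalisation this commit message describes can be sketched as below. This is only an illustration, not the code from the pull request: `as_axes_array` is a hypothetical helper name, and it is written against plain NumPy so that it runs without matplotlib.

```python
import numpy as np


def as_axes_array(axes):
    """Return `axes` as a flat 1-D object array.

    Hypothetical helper: accepts either a single object (what a 1x1
    subplot grid yields) or an ndarray of objects (larger grids).
    """
    if not isinstance(axes, np.ndarray):
        # Wrap the single object explicitly so NumPy does not try to
        # interpret it as a sequence.
        wrapped = np.empty(1, dtype=object)
        wrapped[0] = axes
        axes = wrapped
    # Flatten so callers can always index with a single integer.
    return axes.reshape(-1)
```

Callers can then loop over the result the same way for a 1x1 and a 2x3 layout. As an alternative, `plt.subplots` accepts `squeeze=False`, which makes it always return a 2-D array of axes.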
2020-10-04Merge pull request #399 from baryluk/patch-3Cedric Nugteren
Fix a typo in benchmark when running fp 16 vs 32
2020-10-04Fix a typo in benchmark when running fp 16 vs 32Witold Baryluk
The intention here was to limit the iteration range to common indexes only. Fix that.
2020-10-04Merge pull request #397 from baryluk/patch-1Cedric Nugteren
Fix Python SyntaxWarning
2020-10-04Merge pull request #398 from baryluk/patch-2Cedric Nugteren
Fix --load_from_disk argument help message
2020-10-04Fix --load_from_disk argument help messageWitold Baryluk
2020-10-04Fix Python SyntaxWarningWitold Baryluk
There is no guarantee that all empty string objects are the same object as the `""` literal.
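A minimal sketch of the kind of change this implies, assuming the warning came from an identity comparison against a string literal (Python 3.8 and later warn about `is` with a literal). The function name `has_label` is hypothetical and does not come from the benchmark script.

```python
def has_label(label):
    """Return True if `label` is a non-empty string.

    Hypothetical example: compares by value. Writing `label is not ""`
    would compare object identity instead, which the language does not
    guarantee for equal strings.
    """
    return label != ""
```

Comparing by value gives the same answer however the empty string was produced, for example by `"".join([])` or `str()`.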
2020-10-03Merge pull request #396 from CNugteren/CLBlast-395-fix-benchmark-scriptCedric Nugteren
Fix a Python 3 bug in the benchmark script
2020-10-02Fix a Python 3 bug in the benchmark scriptCedric Nugteren
2020-08-16Added FUNDING.yml fileCedric Nugteren
2020-06-07Merge pull request #392 from 9prady9/fix_Program_getIRCedric Nugteren
Fix Program::GetIR to handle programs with multiple devices
2020-06-07Add a cautionary note in Program::GetIR and mention the fix in CHANGELOGPradeep Garigipati
2020-06-05Fix Program::GetIR to handle programs with multiple devicesPradeep Garigipati
2020-05-13Merge pull request #389 from CNugteren/CLBlast-385-version-definesCedric Nugteren
Added version number defines
2020-05-12Added CLBLAST_VERSION_MAJOR/MINOR/PATCH defines in headers to store version numberingCedric Nugteren
2020-05-11Merge pull request #388 from CNugteren/CLBlast-381-gemm-direct-tuner-failureCedric Nugteren
Fixed tuners global workgroup size
2020-05-11Increase display width of the local/global sizesCedric Nugteren
2020-05-10Made sure that the global workgroup size is a multiple of the local size in the tunersCedric Nugteren
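One common way to satisfy the constraint named in this commit is to round the global size up to the next multiple of the local size. The sketch below assumes that approach. The log does not say how the tuners actually do it, and `round_up_to_multiple` is a hypothetical name.

```python
def round_up_to_multiple(global_size, local_size):
    """Round `global_size` up to the nearest multiple of `local_size`.

    Hypothetical helper illustrating the constraint only; it is not
    taken from the CLBlast tuner sources.
    """
    if local_size <= 0:
        raise ValueError("local_size must be positive")
    # Integer ceiling division, then scale back up.
    return ((global_size + local_size - 1) // local_size) * local_size
```

With a local size of 32, a requested global size of 100 becomes 128, while 64 is left unchanged.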
2020-05-10Added logging of local/global workgroup sizes when running the tunersCedric Nugteren
2020-05-10Merge pull request #386 from CNugteren/CLBlast-384-pyclblast-missing-routinesCedric Nugteren
PyCLBlast: add missing batched routines
2020-05-10Updated PyCLBlast version numberCedric Nugteren
2020-05-10Added a sample to demonstrate a batched routineCedric Nugteren
2020-05-10Added pyclblast bindings for the 3 batched routinesCedric Nugteren
2020-05-04Merge pull request #383 from CNugteren/CLBlast-382-improve-tunerCedric Nugteren
Move queue creation out of the tuner loop
2020-05-03Move queue creation out of the tuner loopCedric Nugteren
2020-03-15Merge pull request #378 from CNugteren/CLBlast-377-fix-amax-aminCedric Nugteren
Change amax/amin behaviour
2020-03-08Update API documentationCedric Nugteren
2020-03-08Made it more likely (but no guarantees) for amax/amin to return the first indexCedric Nugteren
2020-03-08Added sample to play around with XAMAX routineCedric Nugteren
2020-03-08Silenced a new OpenCL warning messageCedric Nugteren
2020-02-18Updated to version 1.5.1Cedric Nugteren
2020-02-18Merge pull request #376 from CNugteren/fix_tuner_exception_catchingCedric Nugteren
Catches all exceptions of the tuners
2020-02-17Catches all exceptions of the tunersCedric Nugteren
2019-12-15Merge pull request #372 from trantila/masterCedric Nugteren
Reduced number of TestMatrix calls for the batched xgemm routines.