path: root/src/utilities/compile.cpp
Age | Commit message | Author
2023-01-17 | Updated according to feedback from CNugteren | Angus, Alexander
2023-01-03 | implemented changes to boost Adreno performance according to https://jira-dc.... | Angus, Alexander
2019-05-11 | Added a function to set the OpenCL kernel standard, either 1.1 or 1.2 | Cedric Nugteren
2019-05-08 | Changed back to cl_intel_subgroups as suggested | Cedric Nugteren
2019-05-07 | Added a host-code check to make sure the avc_motion_estimation is available | Cedric Nugteren
2018-09-15 | Disabled Intel subgroup shuffling for double-precision | Cedric Nugteren
2018-07-14 | Applied feedback from Cedric from first pull request | Tyler Sorensen
2018-07-11 | added inline ptx to support shuffle on Nvidia GPUs | Tyler Sorensen
2018-05-01 | Now stores a shared_ptr to the Program class in the cache | Cedric Nugteren
2018-04-24 | Added a define to enable subgroup shuffling if supported by the device | Cedric Nugteren
2017-12-24 | Fixes for the CUDA backend of CLBlast | Cedric Nugteren
2017-12-17 | Removed all ARM Mali tuning results; re-added Mali-T760 and Mali-T628 results... | Cedric Nugteren
2017-12-09 | Made the pre-processor run by default for ARM and Qualcomm GPUs | Cedric Nugteren
2017-11-30 | Integrated pre-processor in compilation flow, default is still disabled | Cedric Nugteren
2017-11-19 | Added compilation timing and better compilation error reporting | Cedric Nugteren
2017-11-17 | Moved compilation function to separate file; removed dependency of tuners of ... | Cedric Nugteren