path: root/examples/chat-vicuna.sh
author: LostRuins <39025047+LostRuins@users.noreply.github.com> 2023-06-29 11:56:43 +0800
committer: GitHub <noreply@github.com> 2023-06-29 05:56:43 +0200
commit: 96a712ca1b7f427e3bd7ffc0c70b2105cfc7fbf1
tree: 448ac4c00677b54d68272bc4f5310bc5ebe85f02 /examples/chat-vicuna.sh
parent: d3494bb86bf7ad5b0b60aae0220ea576f273b5c0
Porting the improved K-Quant CUDA kernels to OpenCL (#1966)
* Added broken new q4k quant
* xx + ib0
* Fix q2_k fast kernel
* Use preprocessor for QK_K
* Add q6_k fast matmul kernel
* ported q3k speedup successfully
* ported q2k and q5k speedups
* remove old dot kernels and template
* fixed global const struct types
* fixing address spaces
* fixed string too long CI issue

---------

Co-authored-by: 0cc4m <picard12@live.de>
Diffstat (limited to 'examples/chat-vicuna.sh')
0 files changed, 0 insertions, 0 deletions