field | value | date |
---|---|---|
author | Johannes Gäßler <johannesg@5d6.de> | 2023-06-19 10:23:56 +0200 |
committer | GitHub <noreply@github.com> | 2023-06-19 10:23:56 +0200 |
commit | 16b9cd193965769089881bb8ec012fccca7b37b6 (patch) | |
tree | 2ee329793e782f253966fd81f89ea05f5a1a2495 /Makefile | |
parent | b24c3049d96557c24782e4d32feaae65f47277af (diff) | |
Convert vector to f16 for dequantize mul mat vec (#1913)
* Convert vector to f16 for dmmv
* compile option
* Added compilation option description to README
* Changed cmake CUDA_ARCHITECTURES from "OFF" to "native"
Diffstat (limited to 'Makefile')
-rw-r--r-- | Makefile | 3 |
1 files changed, 3 insertions, 0 deletions
@@ -169,6 +169,9 @@ ifdef LLAMA_CUDA_DMMV_Y
 else
 	NVCCFLAGS += -DGGML_CUDA_DMMV_Y=1
 endif # LLAMA_CUDA_DMMV_Y
+ifdef LLAMA_CUDA_DMMV_F16
+	NVCCFLAGS += -DGGML_CUDA_DMMV_F16
+endif # LLAMA_CUDA_DMMV_F16
 ifdef LLAMA_CUDA_KQUANTS_ITER
 	NVCCFLAGS += -DK_QUANTS_PER_ITERATION=$(LLAMA_CUDA_KQUANTS_ITER)
 else
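
The hunk above only passes a bare `-DGGML_CUDA_DMMV_F16` to the compiler. The code that consumes the macro is outside this Makefile-limited view. As a rough illustration of how a valueless define like this is usually consumed, the sketch below selects an element type with `#ifdef`. It is an assumption-laden sketch, not the commit's CUDA code: `half_standin`, `dfloat`, `dmmv_vector_bytes`, and `dmmv_uses_f16` are hypothetical names, and `half_standin` merely stands in for CUDA's 16-bit `half` so that a host C++ compiler accepts the file.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for CUDA's 16-bit `half` type (assumption: used
// only so this sketch compiles without the CUDA toolkit).
struct half_standin { std::uint16_t bits; };

// A bare -DGGML_CUDA_DMMV_F16 defines the macro without a useful value,
// so it is tested with #ifdef rather than compared against a number.
#ifdef GGML_CUDA_DMMV_F16
typedef half_standin dfloat;  // vector elements held in 16 bits
#else
typedef float dfloat;         // default: 32-bit float elements
#endif

// Bytes occupied by a vector of n elements in the selected element type.
constexpr std::size_t dmmv_vector_bytes(std::size_t n) {
    return n * sizeof(dfloat);
}

// Reports which branch of the #ifdef was compiled in.
constexpr bool dmmv_uses_f16() {
#ifdef GGML_CUDA_DMMV_F16
    return true;
#else
    return false;
#endif
}

// Compile-time sanity check: the flag and the element type must agree.
static_assert(dmmv_uses_f16() == (sizeof(dfloat) == sizeof(half_standin)),
              "element type does not match GGML_CUDA_DMMV_F16");
```

Compiling this file with and without `-DGGML_CUDA_DMMV_F16` switches `dfloat` between the two types. In the Makefile, setting `LLAMA_CUDA_DMMV_F16` to any non-empty value on the `make` command line is what causes that define to be appended to `NVCCFLAGS`.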