Skip to content

Conversation

@innermost47
Copy link

This pull request aims to introduce a new feature to the project: the implementation of a web interface that utilizes a REST API for communication with the backend, secured through JWT (JSON Web Token) authentication. This feature will greatly enhance the overall user experience and improve the security of the application by providing a streamlined and protected method for data exchange.

schlenkibus pushed a commit to schlenkibus/alpaca.cpp that referenced this pull request Apr 3, 2023
* Add AVX2 version of ggml_vec_dot_q4_1

* Small optimisations to q4_1 dot product (@Const-me)

* Rearrange Q4_1 quantization to work for multipart models. (Fix antimatter15#152)

* Fix ggml_vec_mad_q4_1 too

* Fix non-vectorised q4_1 vec mul
@innermost47 innermost47 closed this by deleting the head repository Aug 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.