Skip to content

v1.4.2 - more metrics!

Choose a tag to compare

@BBC-Esq BBC-Esq released this 16 Sep 21:12
· 1484 commits to main since this release
ac6dd16

Added cpu and ram usage and percentage metrics.

Refactored code to prepare for expansion.

Roadmap:
--Introduce quantized embedding models for even faster increase and lower resource requirement.
--Add options when creating the database and interacting with the LLM; for example, controlling the chunk size or the number of results or the length of the results...to make sure it fits within the LLM's context window.
--Add a calculator that displays the total tokens of a user's prompt + the context returned to a user can see if it fits within the LLM's context window.
--Add "poor man's vector database" search based on my other repo. Make this an option within this program.
--Add some color and other stuff to improve ease on eyes and appearance of the GUI.
--Remove the table on the left side that shows embedding models. Add a "help" or user's manual with various tables this information (only when requested), among other information like primers on how a vector database works, how to get the most out of it, nuances of the various models, and other helpful stuff.