Skip to content

Improve README.md#1260

Merged
ikawrakow merged 1 commit intoikawrakow:mainfrom
mcm007:fix_readme_typos
Feb 14, 2026
Merged

Improve README.md#1260
ikawrakow merged 1 commit intoikawrakow:mainfrom
mcm007:fix_readme_typos

Conversation

@mcm007
Copy link
Copy Markdown
Contributor

@mcm007 mcm007 commented Feb 9, 2026

@saood06, if there are any improvements, glad to adapt.

@Ph0rk0z
Copy link
Copy Markdown

Ph0rk0z commented Feb 10, 2026

Would an improvement be listing all of the new parameters? There's a lot to keep track of and your other option is to dump helps and sift through the regular mainline params too.

@mcm007
Copy link
Copy Markdown
Contributor Author

mcm007 commented Feb 10, 2026

Would an improvement be listing all of the new parameters? There's a lot to keep track of and your other option is to dump helps and sift through the regular mainline params too.

Hey @Ph0rk0z, the changes I’m proposing here are in the README file that explains how to build and run the project in Docker/Podman.

I’ll gather the most frequently used parameters and draft a PR that adds them to the main README. Could you review the draft and share your thoughts, especially on how to use splitting across GPUs?

@Ph0rk0z
Copy link
Copy Markdown

Ph0rk0z commented Feb 11, 2026

Sure if you have one like that, I can look through. I don't know if we need a separate parameters.md or something since those will change again I'm sure.

@mcm007
Copy link
Copy Markdown
Contributor Author

mcm007 commented Feb 11, 2026

Sure if you have one like that, I can look through. I don't know if we need a separate parameters.md or something since those will change again I'm sure.

I've just created parameters.md covering most commonly used parameters.

It’s currently in a raw format, probably adding a table with a "Tips" column would make it more organized and actionable.

Feel free to suggest any tweaks, additions, or structure you think would help!

@saood06
Copy link
Copy Markdown
Collaborator

saood06 commented Feb 11, 2026

Feel free to suggest any tweaks, additions, or structure you think would help!

Build arguments could be expanded (or maybe it's own document). For example the build arguments I often use are

-DGGML_CUDA=ON -DGGML_RPC=ON -DGGML_IQK_FA_ALL_QUANTS=1 -DLLAMA_SERVER_SQLITE3=ON -DCMAKE_TOOLCHAIN_FILE=[...]

@Ph0rk0z
Copy link
Copy Markdown

Ph0rk0z commented Feb 12, 2026

build arguments can be set one and done with ccmake. You fill it out graphically and save it/generate and then never have to paste long strings to build again.

I've just created parameters.md covering most commonly used parameters.

I don't think we need to reinvent the wheel and document already normal mainline parameters. Just the NEW ones that ik has added which people might not know about. Everyone should be able to understand stuff like threads or model. Someone coming over from mainline might not know about -gr or the whole -cuda deal and they'd have to run through a bunch of PR comments to know.

@saood06
Copy link
Copy Markdown
Collaborator

saood06 commented Feb 12, 2026

Everyone should be able to understand stuff like threads or model. Someone coming over from mainline might not know about

I don't agree at all. The whole point of documentation is to support any new user. Not all new users will have used mainline (I personally have helped guide people who used local LLM for the first time with ik_llama.cpp). It also would be much harder for a document to go over the differences and similarities with mainline since then you'd have to track both changes here and there and things change in mainline a lot and even if you went through the effort it still would be more confusing than to just document what exists here.

Also @mcm007 -DCMAKE_TOOLCHAIN_FILE is just used to tell cmake where sqlite3 is on Windows systems, and on another note for the build arguments might want to mention that by default it builds to native CUDA if CUDA is turned on.

A lot of the other sections (eg. server, sweep-bench, imatrix etc.) should probably be removed and just reference the existing documentation as that is more thorough. I know a lot of the existing documentation is not updated with the things you reference (if you want to touch those up, that would be also be nice).

The quantization section is a weird one, not sure where exactly it should point (maybe a discussion, maybe just bullet points like your docker documentation) but as it stands now it would just probably confuse a new user.

Overall, thank you for putting in effort. I definitely have been slacking from my previous goal of trying to keep the documentation here high quality and up to date, so it's really nice to have someone help.

@mcm007
Copy link
Copy Markdown
Contributor Author

mcm007 commented Feb 12, 2026

I understand that most users already have their own workflows and cheat sheets for building and running on their systems.

My intent was to create an easy entry-point for new users (simple copy-paste commands to build and run) while still providing a place for experienced users to find what's possible to customize for their needs.

I've thinking to something like this:

  • Quick Start section in README.md with copy-paste commands
  • Detailed parameters and build options in a separate file (rebranded from parameters.md)
  • Reference to the detailed documentation

Unfortunately, I cannot express the notes as I really want, due to my limited understanding.
I'm genuinely seeking your feedback to make this more useful for everyone.

@mcm007
Copy link
Copy Markdown
Contributor Author

mcm007 commented Feb 12, 2026

You can check the current status here.

@Ph0rk0z
Copy link
Copy Markdown

Ph0rk0z commented Feb 12, 2026

It also would be much harder for a document to go over the differences and similarities with mainline since then you'd have to track both changes here and there

There's existing docs here, right? They cover things like what threads does. Nothing besides outputing the help message covers what -ger does. If it's just a rehash of the help message, what's the point? Extra mile would be adding a flag and the PR it came from so people can read that quickly too. They could quickly get up to speed why they should do --cuda fusion 1 or enabling P2P etc. Dumping 2 pages of parameters on someone.. they're not going to read.

@mcm007
Copy link
Copy Markdown
Contributor Author

mcm007 commented Feb 13, 2026

Extra mile would be adding a flag and the PR it came from so people can read that quickly too.

Good idea, WIP.

@ikawrakow ikawrakow mentioned this pull request Feb 14, 2026
4 tasks
@mcm007
Copy link
Copy Markdown
Contributor Author

mcm007 commented Feb 14, 2026

@saood06, @Ph0rk0z the current status is on #1269

It contains links to the PR for main points.

@mcm007 mcm007 marked this pull request as ready for review February 14, 2026 08:01
@ikawrakow ikawrakow merged commit f805059 into ikawrakow:main Feb 14, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants