Improve README.md by mcm007 · Pull Request #1260 · ikawrakow/ik_llama.cpp

mcm007 · 2026-02-09T18:45:47Z

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

@saood06, if there are any improvements, glad to adapt.

Ph0rk0z · 2026-02-10T13:21:05Z

Would an improvement be listing all of the new parameters? There's a lot to keep track of and your other option is to dump helps and sift through the regular mainline params too.

mcm007 · 2026-02-10T16:43:01Z

Would an improvement be listing all of the new parameters? There's a lot to keep track of and your other option is to dump helps and sift through the regular mainline params too.

Hey @Ph0rk0z, the changes I’m proposing here are in the README file that explains how to build and run the project in Docker/Podman.

I’ll gather the most frequently used parameters and draft a PR that adds them to the main README. Could you review the draft and share your thoughts, especially on how to use splitting across GPUs?

Ph0rk0z · 2026-02-11T13:35:07Z

Sure if you have one like that, I can look through. I don't know if we need a separate parameters.md or something since those will change again I'm sure.

mcm007 · 2026-02-11T19:23:28Z

Sure if you have one like that, I can look through. I don't know if we need a separate parameters.md or something since those will change again I'm sure.

I've just created parameters.md covering most commonly used parameters.

It’s currently in a raw format, probably adding a table with a "Tips" column would make it more organized and actionable.

Feel free to suggest any tweaks, additions, or structure you think would help!

saood06 · 2026-02-11T19:46:25Z

Feel free to suggest any tweaks, additions, or structure you think would help!

Build arguments could be expanded (or maybe it's own document). For example the build arguments I often use are

-DGGML_CUDA=ON -DGGML_RPC=ON -DGGML_IQK_FA_ALL_QUANTS=1 -DLLAMA_SERVER_SQLITE3=ON -DCMAKE_TOOLCHAIN_FILE=[...]

Ph0rk0z · 2026-02-12T14:50:01Z

build arguments can be set one and done with ccmake. You fill it out graphically and save it/generate and then never have to paste long strings to build again.

I've just created parameters.md covering most commonly used parameters.

I don't think we need to reinvent the wheel and document already normal mainline parameters. Just the NEW ones that ik has added which people might not know about. Everyone should be able to understand stuff like threads or model. Someone coming over from mainline might not know about -gr or the whole -cuda deal and they'd have to run through a bunch of PR comments to know.

saood06 · 2026-02-12T15:40:27Z

Everyone should be able to understand stuff like threads or model. Someone coming over from mainline might not know about

I don't agree at all. The whole point of documentation is to support any new user. Not all new users will have used mainline (I personally have helped guide people who used local LLM for the first time with ik_llama.cpp). It also would be much harder for a document to go over the differences and similarities with mainline since then you'd have to track both changes here and there and things change in mainline a lot and even if you went through the effort it still would be more confusing than to just document what exists here.

Also @mcm007 -DCMAKE_TOOLCHAIN_FILE is just used to tell cmake where sqlite3 is on Windows systems, and on another note for the build arguments might want to mention that by default it builds to native CUDA if CUDA is turned on.

A lot of the other sections (eg. server, sweep-bench, imatrix etc.) should probably be removed and just reference the existing documentation as that is more thorough. I know a lot of the existing documentation is not updated with the things you reference (if you want to touch those up, that would be also be nice).

The quantization section is a weird one, not sure where exactly it should point (maybe a discussion, maybe just bullet points like your docker documentation) but as it stands now it would just probably confuse a new user.

Overall, thank you for putting in effort. I definitely have been slacking from my previous goal of trying to keep the documentation here high quality and up to date, so it's really nice to have someone help.

mcm007 · 2026-02-12T17:30:21Z

I understand that most users already have their own workflows and cheat sheets for building and running on their systems.

My intent was to create an easy entry-point for new users (simple copy-paste commands to build and run) while still providing a place for experienced users to find what's possible to customize for their needs.

I've thinking to something like this:

Quick Start section in README.md with copy-paste commands
Detailed parameters and build options in a separate file (rebranded from parameters.md)
Reference to the detailed documentation

Unfortunately, I cannot express the notes as I really want, due to my limited understanding.
I'm genuinely seeking your feedback to make this more useful for everyone.

mcm007 · 2026-02-12T18:20:47Z

You can check the current status here.

Ph0rk0z · 2026-02-12T20:35:14Z

It also would be much harder for a document to go over the differences and similarities with mainline since then you'd have to track both changes here and there

There's existing docs here, right? They cover things like what threads does. Nothing besides outputing the help message covers what -ger does. If it's just a rehash of the help message, what's the point? Extra mile would be adding a flag and the PR it came from so people can read that quickly too. They could quickly get up to speed why they should do --cuda fusion 1 or enabling P2P etc. Dumping 2 pages of parameters on someone.. they're not going to read.

mcm007 · 2026-02-13T15:38:21Z

Extra mile would be adding a flag and the PR it came from so people can read that quickly too.

Good idea, WIP.

mcm007 · 2026-02-14T08:01:20Z

@saood06, @Ph0rk0z the current status is on #1269

It contains links to the PR for main points.

Improve README.md

624e9cf

ikawrakow mentioned this pull request Feb 14, 2026

Update README.md #1263

Merged

4 tasks

mcm007 marked this pull request as ready for review February 14, 2026 08:01

ikawrakow approved these changes Feb 14, 2026

View reviewed changes

ikawrakow merged commit f805059 into ikawrakow:main Feb 14, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve README.md#1260

Improve README.md#1260
ikawrakow merged 1 commit intoikawrakow:mainfrom
mcm007:fix_readme_typos

mcm007 commented Feb 9, 2026

Uh oh!

Ph0rk0z commented Feb 10, 2026

Uh oh!

mcm007 commented Feb 10, 2026

Uh oh!

Ph0rk0z commented Feb 11, 2026

Uh oh!

mcm007 commented Feb 11, 2026

Uh oh!

saood06 commented Feb 11, 2026

Uh oh!

Ph0rk0z commented Feb 12, 2026 •

edited

Loading

Uh oh!

saood06 commented Feb 12, 2026 •

edited

Loading

Uh oh!

mcm007 commented Feb 12, 2026

Uh oh!

mcm007 commented Feb 12, 2026

Uh oh!

Ph0rk0z commented Feb 12, 2026

Uh oh!

mcm007 commented Feb 13, 2026

Uh oh!

mcm007 commented Feb 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

mcm007 commented Feb 9, 2026

Uh oh!

Ph0rk0z commented Feb 10, 2026

Uh oh!

mcm007 commented Feb 10, 2026

Uh oh!

Ph0rk0z commented Feb 11, 2026

Uh oh!

mcm007 commented Feb 11, 2026

Uh oh!

saood06 commented Feb 11, 2026

Uh oh!

Ph0rk0z commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

saood06 commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mcm007 commented Feb 12, 2026

Uh oh!

mcm007 commented Feb 12, 2026

Uh oh!

Ph0rk0z commented Feb 12, 2026

Uh oh!

mcm007 commented Feb 13, 2026

Uh oh!

mcm007 commented Feb 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Ph0rk0z commented Feb 12, 2026 •

edited

Loading

saood06 commented Feb 12, 2026 •

edited

Loading