This repository contains the code for the paper "p-less Sampling: A Robust Hyperparameter-Free Approach for LLM Decoding". The paper is available here.
TL;DR: We introduce p-less Sampling: a hyperparameter-free, information-theoretic approach to sampling that dynamically sets a truncation threshold at each LLM decoding step based on the entire token probability distribution. We further introduce p-lessnorm, a variant of p-less that effectively relaxes the threshold while retaining the same desirable properties as p-less, for tasks where diversity is favored over coherence.
Refer to the notebook for working examples of p-less and p-lessnorm decoding.
Important
The p-less and p-lessnorm samplers can be used as a direct drop-in for your LLM; see the implementation and the notebook for how it is done (a minimal sketch follows the installation notes below)! 🚀
pip install torch transformers
Tested with Python 3.10.12, torch 2.6.0 and transformers 4.55.2.
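Below is a minimal, illustrative sketch of how a distribution-adaptive truncation sampler can be plugged into Hugging Face `transformers` generation as a `LogitsProcessor`. The entropy-scaled threshold used here is a hypothetical placeholder, not the paper's p-less formula, and the `gpt2` checkpoint is only for illustration; refer to the repository implementation and notebook for the actual p-less and p-lessnorm rules.

```python
# Illustrative drop-in sketch: a custom LogitsProcessor that truncates the vocabulary
# using a threshold computed from the whole token distribution. The entropy-scaled rule
# below is a placeholder, NOT the p-less formula from the paper.
import math

import torch
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          LogitsProcessor, LogitsProcessorList)


class AdaptiveTruncationProcessor(LogitsProcessor):
    """Masks low-probability tokens using a distribution-dependent threshold."""

    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> torch.FloatTensor:
        probs = torch.softmax(scores, dim=-1)
        # Placeholder rule: scale the modal probability by (1 - normalized entropy),
        # so flatter distributions keep more candidates and peaked ones keep fewer.
        entropy = -(probs * probs.clamp_min(1e-12).log()).sum(dim=-1, keepdim=True)
        threshold = probs.amax(dim=-1, keepdim=True) * (1.0 - entropy / math.log(probs.shape[-1]))
        # Always keep the modal token so the candidate set is never empty.
        keep = (probs >= threshold) | (probs == probs.amax(dim=-1, keepdim=True))
        return scores.masked_fill(~keep, float("-inf"))


tokenizer = AutoTokenizer.from_pretrained("gpt2")  # any causal LM works here
model = AutoModelForCausalLM.from_pretrained("gpt2")
inputs = tokenizer("The capital of France is", return_tensors="pt")
outputs = model.generate(
    **inputs,
    do_sample=True,
    max_new_tokens=20,
    logits_processor=LogitsProcessorList([AdaptiveTruncationProcessor()]),
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```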
- The truncation threshold used in p-less sampling dynamically adapts to the entire token probability distribution at each time step. In contrast, existing sampling methods either use a fixed threshold that ignores the current token probability distribution (e.g. top-p, top-k, ϵ-sampling), set the threshold based on the probability of a single token in the current distribution (e.g. min-p), or only consider the token distribution when certain conditions are met (e.g. η-sampling).
- p-less produces a bounded and valid truncation threshold that guarantees a non-empty candidate set for sampling, unlike other sampling methods where bounds are not guaranteed and edge cases are resolved with defaults, such as falling back to the modal token (or top few tokens) when no token meets the threshold (e.g. ϵ-sampling, η-sampling, mirostat).
- The truncation threshold of p-less sampling dynamically adjusts with temperature, unlike other methods (e.g. top-p, top-k, min-p, ϵ-sampling) whose hyperparameters are not meaningful when temperature approaches zero or infinity.
Thus, p-less uniquely possesses all three of the aforementioned desirable properties of a sampling approach, combining the benefits of existing sampling strategies into a single method (a toy sketch of the thresholding behavior is shown below).
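As a toy illustration of the contrast between fixed and distribution-adaptive thresholds (again using a hypothetical entropy-scaled stand-in rather than the actual p-less rule), the snippet below counts how many candidate tokens survive truncation as the same logits are temperature-scaled:

```python
# Toy comparison (illustrative only): candidate-set sizes under a fixed top-p cutoff
# versus a distribution-adaptive placeholder threshold, as temperature varies.
import math

import torch

logits = torch.tensor([4.0, 2.0, 1.0, 0.5, 0.1])

for temperature in (0.1, 1.0, 10.0):
    probs = torch.softmax(logits / temperature, dim=-1)
    sorted_probs, _ = probs.sort(descending=True)

    # Fixed-threshold top-p: keep the smallest prefix whose cumulative mass reaches p=0.9,
    # regardless of how peaked or flat the temperature-scaled distribution is.
    top_p_keep = int((sorted_probs.cumsum(dim=-1) < 0.9).sum().item()) + 1

    # Placeholder adaptive threshold (a stand-in for p-less, not its actual formula):
    # loosens as the distribution flattens and tightens as it peaks.
    entropy = -(probs * probs.clamp_min(1e-12).log()).sum()
    threshold = probs.max() * (1.0 - entropy / math.log(len(probs)))
    adaptive_keep = int((probs >= threshold).sum().item())

    print(f"T={temperature:>4}: top-p keeps {top_p_keep} token(s), adaptive keeps {adaptive_keep}")
```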
- Generation efficiency: p-less is more efficient than other methods, both in terms of token sampling speed and overall generation length, without sacrificing task-specific performance.
We validated the effectiveness of p-less sampling through extensive experiments using three LLMs and five datasets spanning math, logical reasoning, and creative writing tasks.
We are working towards contributing the p-less samplers to common LLM inference APIs.
@article{RunyanSP2025,
  title={p-less Sampling: A Robust Hyperparameter-Free Approach for LLM Decoding},
  author={Runyan Tan and Shuang Wu and Phillip Howard},
  journal={arXiv preprint arXiv:2509.23234},
  year={2025},
  note={cs.AI, cs.CL},
  url={https://arxiv.org/abs/2509.23234}
}