Code for the EMNLP 2025 paper "Demystifying optimized prompts in language models"

The word-stories model is available on Hugging Face.

Citation

@inproceedings{melamed-etal-2025-demystifying,
    title = "Demystifying optimized prompts in language models",
    author = "Melamed, Rimon  and
      McCabe, Lucas Hurley  and
      Huang, H Howie",
    editor = "Christodoulopoulos, Christos  and
      Chakraborty, Tanmoy  and
      Rose, Carolyn  and
      Peng, Violet",
    booktitle = "Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2025",
    address = "Suzhou, China",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.emnlp-main.147/",
    doi = "10.18653/v1/2025.emnlp-main.147",
    pages = "2983--2999",
    ISBN = "979-8-89176-332-6",
    abstract = "Modern language models (LMs) are not robust to out-of-distribution inputs. Machine generated ({``}optimized'') prompts can be used to modulate LM outputs and induce specific behaviors while appearing completely uninterpretable. In this work, we investigate the composition of optimized prompts, as well as the mechanisms by which LMs parse and build predictions from optimized prompts. We find that optimized prompts primarily consist of punctuation and noun tokens which are more rare in the training data. Internally, optimized prompts are clearly distinguishable from natural language counterparts based on sparse subsets of the model{'}s activations. Across various families of instruction-tuned models, optimized prompts follow a similar path in how their representations form through the network."
}

Folders and files

cpp
infinigram
ood_prompts
results_data
.gitignore
.pre-commit-config.yaml
README.md
pyproject.toml
requirements.txt
setup.py
