Add Llama 3 instruction template by Touch-Night · Pull Request #5891 · oobabooga/text-generation-webui

Touch-Night · 2024-04-20T12:14:09Z

A simple template

Checklist:

I have read the Contributing guidelines.

berkut1 · 2024-04-28T02:50:14Z

@Touch-Night sorry, it was my mistake :) Yeah, it includes everything. Everything is fine, sorry.

P.S. I deleted my message, but you still managed to reply :)

Touch-Night · 2024-04-28T04:29:11Z

Using that template with chat-instruct seems to result in an assistant being at the end of all the messages it sends.

Sorry, I lost a "-"... This issue should be resolved now.

AtanasValkov · 2024-04-29T09:46:17Z

Your template is pretty solid, but it doesn't work for multi-turn chats. This is the template we need to follow according to Meta:

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

Sadly, I'm not sure how to implement this in ooba.
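
For readers who want to see the layout above as code, here is a minimal Python sketch that renders a list of role/content messages into that multi-turn structure. The function name `build_llama3_prompt` and its `add_generation_prompt` flag are hypothetical and are not part of text-generation-webui; the newline placement simply follows the structure quoted above.

```python
from typing import Dict, List


def build_llama3_prompt(messages: List[Dict[str, str]],
                        add_generation_prompt: bool = True) -> str:
    """Render messages in the multi-turn layout quoted above (illustrative only)."""
    prompt = "<|begin_of_text|>"
    for message in messages:
        # Every turn, including history, gets its own header and <|eot_id|>.
        prompt += (
            f"<|start_header_id|>{message['role']}<|end_header_id|>\n\n"
            f"{message['content']}<|eot_id|>"
        )
    if add_generation_prompt:
        # Open an assistant turn so the model writes the next reply.
        prompt += "<|start_header_id|>assistant<|end_header_id|>\n\n"
    return prompt


example = build_llama3_prompt([
    {"role": "system", "content": "You are helpful."},
    {"role": "user", "content": "Hello!"},
    {"role": "assistant", "content": "Hi!"},
    {"role": "user", "content": "How are you?"},
])
print(example)
```

Each turn in the history is wrapped separately here, which is the property being asked for in this comment.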

Touch-Night · 2024-04-29T11:32:05Z

but it doesn't work for multi turn chats.

I don't see how. It works great for me. I don't think the problem is that this template doesn't work for multi-turn chats. Is your context window too small?
What prompt structure do you think this template will finally build?

AtanasValkov · 2024-04-29T15:13:25Z

I load models at 4k context, but usually around the 2k mark (depending on the base prompt) the outputs become identical, even with varied settings.
According to Meta, we need to surround each turn (even in the history) with the start and end header tokens, as in the structure I sent. I just don't know whether ooba supports an implementation where those tokens are stored with each new input/output.

currently it outputs like this:

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

instead it should be:

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

Touch-Night · 2024-04-29T15:38:43Z

This is an intended prompt design by @oobabooga.
The full prompt looks like this:

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{{Custom_system_prompt}}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{ChatTab.Command for chat-instruct mode}}(continue the dialogue below)

{{CharacterContext&Description}}

{{ParametersTab.Chat.User.Description}}

{{character_name}}: {{character_message_1}}
{{user_name}}: {{user_message_1}}
{{character_name}}: {{character_message_2}}
{{user_name}}: {{user_message_2}}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{{character_name}}:

That can't be done with an instruction template. Try editing the chat template instead.
If you want it to look like this:

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{{ChatTab.Command for chat-instruct mode}}(act as {{character}}/you are {{character}}...)

{{CharacterContext&Description}}

{{ParametersTab.Chat.User.Description}}

<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{{character_message_1}}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{user_message_1}}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{{character_message_2}}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{user_message_2}}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

You can try this Chat template (redundant, not tested, may be wrong):

{%- if messages[0]['role'] == 'system' -%}
    {%- if messages[0]['content'] -%}
        {{- messages[0]['content'] + '\n\n' -}}
    {%- endif -%}
    {%- if user_bio -%}
        {{- user_bio + '\n\n' -}}
    {%- endif -%}
{%- else -%}
    {%- if messages[0]['role'] == 'user' -%}
        {{- messages[0]['content'] + '<|eot_id|>'-}}
    {%- endif -%}
{%- endif -%}
{%- for message in messages[1:-1] %}
    {%- if message['role'] == 'system' -%}
        {%- if message['content'] -%}
            {{- message['content'] + '\n\n' -}}
        {%- endif -%}
        {%- if user_bio -%}
            {{- user_bio + '\n\n' -}}
        {%- endif -%}
    {%- else -%}
        {%- if message['role'] == 'user' -%}
            {{- '<|start_header_id|>user<|end_header_id|>\n\n' + message['content'] + '<|eot_id|>'-}}
        {%- else -%}
            {{- '<|start_header_id|>assistant<|end_header_id|>\n\n' + message['content'] + '<|eot_id|>' -}}
        {%- endif -%}
    {%- endif -%}
{%- endfor -%}
{%- if messages[-1]['role'] == 'system' -%}
    {%- if messages[-1]['content'] -%}
        {{- messages[-1]['content'] + '\n\n' -}}
    {%- endif -%}
    {%- if user_bio -%}
        {{- user_bio + '\n\n' -}}
    {%- endif -%}
{%- else -%}
    {%- if messages[-1]['role'] == 'user' -%}
        {{- '<|start_header_id|>user<|end_header_id|>\n\n' + messages[-1]['content'] -}}
    {%- endif -%}
{%- endif -%}
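
The chat-instruct layout described at the start of this comment (the whole dialogue packed into a single user turn, followed by an open assistant turn) can be sketched in Python as follows. This is only an illustration of the prompt shape, not the web UI's actual code; the function `build_chat_instruct_prompt` and all of its parameter names are hypothetical.

```python
from typing import List, Tuple


def build_chat_instruct_prompt(system_prompt: str,
                               command: str,
                               context: str,
                               user_bio: str,
                               history: List[Tuple[str, str]],
                               character_name: str) -> str:
    """Pack the dialogue into one user turn, as in the chat-instruct layout above."""
    # History is flattened to "Name: text" lines instead of separate turns.
    dialogue = "\n".join(f"{speaker}: {text}" for speaker, text in history)
    # Empty sections are skipped so no stray blank blocks appear.
    user_turn = "\n\n".join(
        part for part in (command, context, user_bio, dialogue) if part
    )
    return (
        "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n"
        f"{system_prompt}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user_turn}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
        f"{character_name}:"
    )


chat_instruct_example = build_chat_instruct_prompt(
    system_prompt="Custom system prompt.",
    command="Continue the dialogue below.",
    context="Bot is a friendly assistant.",
    user_bio="",
    history=[("Bot", "Hi!"), ("User", "How are you?")],
    character_name="Bot",
)
print(chat_instruct_example)
```

However long the history is, this shape contains exactly three headers (system, user, assistant), which is why per-turn headers cannot be obtained in this mode by changing the instruction template alone.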

AtanasValkov · 2024-04-29T15:48:54Z

This is an intended prompt design by @oobabooga . The full prompt does looks like:
<|begin_of_text|><|start_header_id|>system<|end_header_id|>
{{ChatTab.Command for chat-instruct mode}}(continue the dialogue below)

{{ParametersTab.Chat.User.Description}}

<|eot_id|><|start_header_id|>user<|end_header_id|>
{{user_name}}: {{user_message_1}}
{{character_name}}: {{character_message_1}}
{{user_name}}: {{user_message_2}}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
It's not a thing that can be done with an instruction template. Try editing chat template instead.

You're right. I'll do it ASAP with chat template. Will edit this comment with results.

AtanasValkov · 2024-04-29T18:09:19Z

You can try this Chat template (redundant, not tested, may be wrong):

This is the template I use to achieve the Meta recommendation. Ooba complained a lot about yours, and it was a bit too long for me to work out how to fix it. Mine works like a charm in chat mode. I still need to figure out which generation settings to use, since I still run into repetition problems, but I hope this helps anyone looking for what I was looking for.

{%- for message in messages %}
    {%- if message['role'] == 'system' -%}
        {%- if message['content'] -%}
            {{- '<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n' + message['content'] + '\n\n' -}}
        {%- endif -%}
        {%- if user_bio -%}
            {{- user_bio + '\n\n' -}}
        {%- endif -%}
    {%- else -%}
        {%- if message['role'] == 'user' -%}
            {{- name1 + ': ' + message['content'] + '<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n' -}}
        {%- else -%}
            {{- name2 + ': ' + message['content'] + '<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n' -}}
        {%- endif -%}
    {%- endif -%}
{%- endfor -%}

Touch-Night · 2024-04-29T19:06:51Z

{%- for message in messages %}
    {%- if message['role'] == 'system' -%}
        {%- if message['content'] -%}
            {{- '<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n' + message['content'] + '\n\n' -}}
        {%- endif -%}
        {%- if user_bio -%}
            {{- user_bio + '\n\n' -}}
        {%- endif -%}
    {%- else -%}
        {%- if message['role'] == 'user' -%}
            {{- name1 + ': ' + message['content'] + '<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n' -}}
        {%- else -%}
            {{- name2 + ': ' + message['content'] + '<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n' -}}
        {%- endif -%}
    {%- endif -%}
{%- endfor -%}

Won't it end up like this?

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{{CharacterContext&Description}}
{{ChatTab.Command for chat-instruct mode}}(act as {{character}}/you are {{character}}...)

{{ParametersTab.Chat.User.Description}}

<|eot_id|><|start_header_id|>user<|end_header_id|>

{{user_name}}: {{user_message_1}}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{{character_name}}: {{character_message_1}}<|eot_id|><|start_header_id|>user<|end_header_id|>

{{user_name}}: {{user_message_2}}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

<|eot_id|><|start_header_id|>assistant<|end_header_id|>

This chat template will cause:

1. Redundant <|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n at the beginning of the prompt.
2. Redundant <|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n at the end of the prompt.
3. It is possible to have two or more consecutive messages from the same role, so you can't simply put <|start_header_id|>assistant<|end_header_id|> at the end of the user's turn.
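
The last point about consecutive same-role messages can be illustrated with a small Python sketch. It compares a renderer that appends the other role's header after every message (a simplified stand-in for the quoted chat template, with the names and system block left out) against one that writes each message's own header before it. Both function names are hypothetical.

```python
from typing import Dict, List

HEADER = "<|start_header_id|>{role}<|end_header_id|>\n\n"


def render_header_after(messages: List[Dict[str, str]]) -> str:
    """Each message is followed by the *other* role's header (simplified)."""
    out = ""
    for message in messages:
        next_role = "assistant" if message["role"] == "user" else "user"
        out += message["content"] + "<|eot_id|>" + HEADER.format(role=next_role)
    return out


def render_header_before(messages: List[Dict[str, str]]) -> str:
    """Each message is preceded by a header for its own role."""
    out = ""
    for message in messages:
        out += HEADER.format(role=message["role"]) + message["content"] + "<|eot_id|>"
    return out


two_user_turns = [
    {"role": "user", "content": "Hello!"},
    {"role": "user", "content": "Are you there?"},
]

# With header-after rendering, the second user message lands under an
# assistant header; header-before rendering keeps it attributed to the user.
print(render_header_after(two_user_turns))
print(render_header_before(two_user_turns))
```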

AtanasValkov · 2024-04-29T20:25:06Z

Won't it end up like this?

1. I did say I'm using it in chat mode, not instruct or chat-instruct, so no, I'm not getting the double system header in my prompt.
2. Same as point 1.
3. If you send consecutive messages from the same role, aren't you already breaking the strict turn order baked into the model's training? (I'm not sure on this point, since I avoid more than one message per role and I'm not familiar with model training.)

Like I said, it's already working as intended for me. This isn't that clean a solution anyway, since it should be achievable through instruct mode, which is how I assume most people use models these days.

Touch-Night · 2024-04-30T02:48:10Z

Sorry, you said

currently it outputs like this:

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

Persona of your AI assistant who does stuff and speaks in a certain way.<|eot_id|><|start_header_id|>user<|end_header_id|>

User: Hello! Bot: Hi! User: How are you?<|eot_id|><|start_header_id|>assistant<|end_header_id|>

So I thought you were using chat-instruct mode.

Anyway, one chat template cannot achieve the Meta recommendation in chat mode and chat-instruct mode at the same time. Mine is for chat-instruct mode and yours is for chat mode.

I'm aware that users should not mess with the turn order, but since it's possible, some users will do it. I want to make sure Llama 3 doesn't wrongly treat a user message as coming from the assistant.

kalle07 · 2024-05-05T18:25:24Z

Any progress?

If I use Llama 3 without any modifications in notepad (as Alpaca),
it talks to itself until the moon comes up ^^

The TTS based on Coqui_tts talks to itself in the chat until ... ^^

Touch-Night · 2024-05-05T18:45:53Z

I think this template needs no more changes; it's done. So there will be no progress unless @oobabooga weighs in on something, such as whether <|begin_of_text|> should be added to the template.
I've been using this template these days with no other setting tweaks, and it's working fine.

kalle07 · 2024-05-05T18:58:54Z

I don't know where I should copy that in oobabooga.

I can only use the Llama 3 template from gpt4all, and there I know where to copy/paste it ^^

berkut1 · 2024-05-06T16:27:57Z

What about <|begin_of_text|>? There is another template: https://github.com/mamei16/LLM_Web_search/blob/main/instruction_templates/Llama-3.yaml

nickpotafiy · 2024-05-06T23:26:40Z

What about <|begin_of_text|>? There is another template: https://github.com/mamei16/LLM_Web_search/blob/main/instruction_templates/Llama-3.yaml

I'm not so sure that bos_token is added automatically. I'm using this template and it seems to be working well. If I omit <|begin_of_text|>, it's not present in the debug text when using the API, so I'm convinced it needs to be added.

oobabooga · 2024-05-19T23:15:50Z

I don't like adding the BOS token to instruct templates, as every tokenizer already adds it automatically by default. In fact, I go out of my way to remove the BOS token from instruction templates somewhere in this repository to prevent a double BOS from happening.

Note that Llama-3 chat models already come with a template in the metadata, which gets detected and applied automatically, so it's not necessary to have a dedicated yaml.
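
As a rough illustration of the double-BOS concern described above, the sketch below strips a leading BOS token from rendered template text so that a tokenizer which prepends BOS on its own does not end up with two. This is not the repository's actual code; `strip_leading_bos` is a hypothetical helper, and the default token is the Llama 3 one discussed in this thread.

```python
def strip_leading_bos(template_text: str,
                      bos_token: str = "<|begin_of_text|>") -> str:
    """Remove any leading copies of bos_token (hypothetical helper)."""
    # Loop so that an already doubled BOS is cleaned up as well.
    while bos_token and template_text.startswith(bos_token):
        template_text = template_text[len(bos_token):]
    return template_text


rendered = "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\nHi<|eot_id|>"
cleaned = strip_leading_bos(rendered)
print(cleaned)
```

Whether the token then appears exactly once in the final input still depends on the loader and tokenizer settings actually adding it, which is the point raised in the earlier comment about the API debug text.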

oobabooga added 30 commits December 14, 2023 22:39

Merge pull request oobabooga#4927 from oobabooga/dev

c3e0fcf

Merge dev branch

Merge pull request oobabooga#4937 from oobabooga/dev

443be39

Merge dev branch

Merge pull request oobabooga#4961 from oobabooga/dev

7be0983

Merge dev branch

Merge pull request oobabooga#4980 from oobabooga/dev

b28020a

Merge dev branch

Merge pull request oobabooga#4988 from oobabooga/dev

781367b

Merge dev branch

Merge pull request oobabooga#5002 from oobabooga/dev

71eb744

Merge dev branch

Merge pull request oobabooga#5005 from oobabooga/dev

5b791ca

Merge dev branch

Merge pull request oobabooga#5011 from oobabooga/dev

c1f78db

Merge dev branch

Merge pull request oobabooga#5012 from oobabooga/dev

489f4a2

Merge dev branch

Merge pull request oobabooga#5022 from oobabooga/dev

11288d1

Merge dev branch

Merge pull request oobabooga#5039 from oobabooga/dev

4b25acf

Merge dev branch

Merge pull request oobabooga#5073 from oobabooga/dev

af87609

Merge dev branch

Merge pull request oobabooga#5078 from oobabooga/dev

19d1374

Merge dev branch

Merge pull request oobabooga#5100 from oobabooga/dev

3fd7073

Merge dev branch

Merge pull request oobabooga#5132 from oobabooga/dev

3e3a66e

Merge dev branch

Merge pull request oobabooga#5152 from oobabooga/dev

3f28925

Merge dev branch

Merge pull request oobabooga#5163 from oobabooga/dev

c54d1da

Merge dev branch

Merge pull request oobabooga#5181 from oobabooga/dev

8ea3f31

Merge dev branch

Merge pull request oobabooga#5195 from oobabooga/dev

e169993

Merge dev branch

Merge pull request oobabooga#5199 from oobabooga/dev

ad1ff53

Merge dev branch

Merge pull request oobabooga#5220 from oobabooga/dev

2dc8db8

Merge dev branch

Merge pull request oobabooga#5253 from oobabooga/dev

61e4bfe

Merge dev branch

Merge pull request oobabooga#5266 from oobabooga/dev

d8c3a5b

Merge dev branch (oobabooga#5257)

Merge pull request oobabooga#5347 from oobabooga/dev

1343aa3

Merge dev branch

Merge pull request oobabooga#5348 from oobabooga/dev

837bd88

Merge dev branch

Merge pull request oobabooga#5379 from oobabooga/dev

e7a760e

Merge dev branch

Merge pull request oobabooga#5404 from oobabooga/dev

4f3fdf1

Merge dev branch

Merge pull request oobabooga#5452 from oobabooga/dev

a329db0

Merge dev branch

Merge pull request oobabooga#5453 from oobabooga/dev

0f134bf

Merge dev branch

Merge pull request oobabooga#5496 from oobabooga/dev

dc6adef

Merge dev branch

Add the missing "-"

08878cd

Merge pull request oobabooga#5959 from oobabooga/dev

81f603d

Merge dev branch

Touch-Night and others added 3 commits April 30, 2024 10:49

Merge branch 'oobabooga:main' into main

d48c519

Merge pull request oobabooga#5970 from oobabooga/dev

8f12fb0

Merge dev branch

Merge branch 'oobabooga:main' into main

9db81f7

oobabooga and others added 3 commits May 8, 2024 16:37

Merge pull request oobabooga#5996 from oobabooga/dev

9ac5287

Merge dev branch

Merge branch 'oobabooga:main' into main

3967533

Add Llama-v3 template to config.yaml

fd8038d

Prevent llama-3 derivatives from having wrong template

255dfb2

oobabooga merged commit d7bd3da into oobabooga:dev May 19, 2024

Touch-Night mentioned this pull request May 21, 2024

In chat mode, character's greeting is part of system prompt #6034

Closed

1 task

anon-contributor-0 pushed a commit to anon-contributor-0/text-generation-webui that referenced this pull request May 30, 2024

Add Llama 3 instruction template (oobabooga#5891)

848655b

PoetOnTheRun pushed a commit to PoetOnTheRun/text-generation-webui that referenced this pull request Oct 22, 2024

Add Llama 3 instruction template (oobabooga#5891)

2e48c82

9 participants
