In medusa_model_legacy.py, each Medusa head is only responsible for producing new hidden states; the medusa logits are still computed by reusing the base_model's lm_head.
Here is the code (medusa_model_legacy.py, lines 203 to 206 at commit e2a5d20):

```python
for i in range(self.medusa):
    mhidden_states = self.medusa_head[i](hidden_states)
    mlogits = self.base_model.lm_head(mhidden_states)
    medusa_logits.append(mlogits)
```
However, in the newer medusa_model.py (and medusa_model_new.py), this has changed: each Medusa head now has its own "lm_head" (a Linear layer with in_features = hidden_size, out_features = vocab_size), as shown in the code below (medusa_model.py, lines 111 to 119 at e2a5d20):
```python
self.medusa_head = nn.ModuleList(
    [
        nn.Sequential(
            *([ResBlock(self.hidden_size)] * medusa_num_layers),
            nn.Linear(self.hidden_size, self.vocab_size, bias=False),
        )
        for _ in range(medusa_num_heads)
    ]
)
```
The inference code is (medusa_model.py, lines 215 to 218 at e2a5d20):
```python
medusa_logits = []
# TODO: Consider parallelizing this loop for efficiency?
for i in range(self.medusa):
    medusa_logits.append(self.medusa_head[i](hidden_states))
```
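To make the architectural difference concrete, here is a minimal standalone sketch of both variants with toy dimensions. The `ResBlock` below is a stand-in I wrote for illustration (the repo's real `ResBlock` differs); the point is only where the hidden-to-vocab projection lives in each design:

```python
import torch
import torch.nn as nn

hidden_size, vocab_size = 16, 32
medusa_num_heads, medusa_num_layers = 2, 1

class ResBlock(nn.Module):
    # Toy residual block, standing in for the repo's ResBlock.
    def __init__(self, hidden_size):
        super().__init__()
        self.linear = nn.Linear(hidden_size, hidden_size)

    def forward(self, x):
        return x + torch.relu(self.linear(x))

# Shared projection, playing the role of base_model.lm_head.
shared_lm_head = nn.Linear(hidden_size, vocab_size, bias=False)

# Legacy variant: each head emits hidden states only;
# logits come from the shared lm_head.
legacy_heads = nn.ModuleList(
    [nn.Sequential(*([ResBlock(hidden_size)] * medusa_num_layers))
     for _ in range(medusa_num_heads)]
)

# New variant: each head ends in its own hidden_size -> vocab_size Linear.
new_heads = nn.ModuleList(
    [nn.Sequential(
        *([ResBlock(hidden_size)] * medusa_num_layers),
        nn.Linear(hidden_size, vocab_size, bias=False),
     )
     for _ in range(medusa_num_heads)]
)

hidden_states = torch.randn(1, 4, hidden_size)  # (batch, seq_len, hidden)
legacy_logits = [shared_lm_head(h(hidden_states)) for h in legacy_heads]
new_logits = [h(hidden_states) for h in new_heads]
# Both yield one (1, 4, vocab_size) logits tensor per head.
```

In both cases the per-head output has the same shape; the difference is whether the final vocab projection is shared with the base model (legacy) or trained separately per head (new), which changes the parameter count and what the training objective can adjust.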
This is very confusing, especially since the README.md documents both the legacy and the new training methods. Which of the two actually reflects the performance reported in the paper?
Thank you very much for your work; I look forward to your reply, or to discussion from anyone.