Bidirectional model? #99
Open
Description
Hello,
Thanks for the great paper! If I understand correctly, the Mamba model is similar to a unidirectional LSTM. Is there a way to implement it in a non-causal, bidirectional way, so that the model can see information from both ends of the sequence? In that sense, I guess it would be similar to the BERT encoder architecture.
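To make the question concrete, here is a minimal sketch of the pattern used by bidirectional RNNs: run a causal layer over the sequence, run it again over the reversed sequence, flip that result back, and combine the two. This is not Mamba's API. `causal_scan` is a hypothetical toy linear recurrence standing in for any causal layer, and summing the two directions is just one possible way to merge them.

```python
from typing import Callable, List

Sequence = List[float]


def causal_scan(x: Sequence, decay: float = 0.5) -> Sequence:
    """Toy causal layer (a stand-in, not Mamba): h_t = decay * h_{t-1} + x_t."""
    out, h = [], 0.0
    for value in x:
        h = decay * h + value
        out.append(h)
    return out


def bidirectional(layer: Callable[[Sequence], Sequence], x: Sequence) -> Sequence:
    """Apply `layer` left-to-right and right-to-left, then sum the outputs."""
    forward = layer(x)
    backward = layer(x[::-1])[::-1]  # flip input, scan, flip output back
    return [f + b for f, b in zip(forward, backward)]


# An impulse at the last position: only the bidirectional version
# lets earlier positions see it.
x = [0.0, 0.0, 0.0, 1.0]
causal_out = causal_scan(x)
bidir_out = bidirectional(causal_scan, x)

print(causal_out)  # [0.0, 0.0, 0.0, 1.0]
print(bidir_out)   # [0.125, 0.25, 0.5, 2.0]
```

With a plain sum, the current position contributes to both directions, so it is counted twice. Concatenating the two outputs or using separate parameters per direction are alternative choices.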