Commit fff2665
authored
Audio input support (Phi 4 multimodal) (#1448)
* Deps
* Add conformer
* Nemo loading
* Position embeds
* Load t5 attn bias
* Attn and feed forward
* Add conv module and glu pointwise
* Implement relative attn bias
* Add the forward methods
* Add encoder embedding
* Fix oproj
* Some loading
* Conformer loads!
* Fully loading speech stack
* Merger
* Dont need that
* First pass at audio processing
* Read samples
* Optional
* Small loading fix
* Runs but not correct yet
* Improved audio processing?
* Works with this
* Fix t5 attn bias
* It works!
* Comment
* Use some other crates
* Clippy
* Allow bf16 on metal
* Add prefix_audio
* Remove unused
* Typo
* User specified
* Add audio url parsing
* AudioProjectionMode -> InputMode
* Audio prefix caching
* Fix bug in audio prefix caching
* Support both at the same time!
* Tweak logging
* Support stereo
* Add mistralrs-audio
* Support batching
* Add server and rust api example
* Add python api
* Fix add_multimodal_message
* Fix unfold for conformer
* Streaming example
* Add web chat support
* Add modalities registry1 parent 2cb0a3e commit fff2665
File tree
57 files changed
+3850
-157
lines changed- docs
- examples
- python
- server
- mistralrs-audio
- src
- mistralrs-core
- src
- engine
- pipeline
- loaders
- utils
- vision_models
- conformer
- phi4
- mistralrs-pyo3
- src
- mistralrs-server-core/src
- mistralrs-server/src
- mistralrs-web-chat
- src
- handlers
- static
- js
- mistralrs
- examples/phi4mm_audio
- src
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
57 files changed
+3850
-157
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
13 | | - | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
14 | 16 | | |
15 | 17 | | |
16 | 18 | | |
| |||
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
| 13 | + | |
13 | 14 | | |
14 | 15 | | |
15 | 16 | | |
| |||
140 | 141 | | |
141 | 142 | | |
142 | 143 | | |
| 144 | + | |
143 | 145 | | |
144 | 146 | | |
145 | 147 | | |
| |||
159 | 161 | | |
160 | 162 | | |
161 | 163 | | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
162 | 169 | | |
163 | 170 | | |
164 | 171 | | |
165 | 172 | | |
166 | 173 | | |
167 | 174 | | |
| 175 | + | |
168 | 176 | | |
169 | 177 | | |
170 | 178 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
11 | 11 | | |
12 | 12 | | |
13 | 13 | | |
14 | | - | |
| 14 | + | |
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
| |||
0 commit comments