-
Notifications
You must be signed in to change notification settings - Fork 9
Open
Description
hi,
can you please document and explain the output format? Because @pe-trik 's OmniSTEval uses it as one supported output format.
I want to use OmniSTEval for evaluating output of my project without merging my code with simulstream (unless I'll be convinced that it is most reasonable way).
I saw one example here: https://raw.githubusercontent.com/pe-trik/OmniSTEval/refs/heads/main/examples/simulstream_example/simulstream_log.jsonl
{"model_loading_time": 22.951433420181274}
{"id": 0, "metadata": {"wav_name": "//storage/MT/sara/MUSTC/en-de/data/tst-COMMON_wav/ted_1096.wav"}}
{"id": 0, "total_audio_processed": 1.0, "computation_time": 0.4919137954711914, "generated_tokens": [], "deleted_tokens": []}
{"id": 0, "total_audio_processed": 2.0, "computation_time": 0.1842632293701172, "generated_tokens": [], "deleted_tokens": []}
{"id": 0, "total_audio_processed": 3.0, "computation_time": 0.18722200393676758, "generated_tokens": [], "deleted_tokens": []}
{"id": 0, "total_audio_processed": 4.0, "computation_time": 0.19884991645812988, "generated_tokens": [], "deleted_tokens": []}
...
{"id": 0, "total_audio_processed": 14.0, "computation_time": 0.2456223964691162, "generated_tokens": ["\u2581Zur", "\u00fcck"], "deleted_tokens": []}
{"id": 0, "total_audio_processed": 15.0, "computation_time": 0.24962759017944336, "generated_tokens": ["\u2581in", "\u2581New", "\u2581York", ",", "\u2581ich", "\u2581bin"], "deleted_tokens": []}
- is model_loading time really used?
- what is total_audio_processed? is it necessary to have it when the output is empty?
- what if it is a cascade, and we don't know how many audio ASR processed at the time when LLM translated a piece?
- do the generated tokens need to be as BPE tokens? What if a model does not use BPE tokens, but full text?
I would suggest this format for STEval tools. What would you think, @pe-trik and @mgaido91 ?
{ document: "doc1.wav", updates: [{ "emission_time": 1.34343, "text": "prefix", "unconfirmed_text": suffix"}, ...] }
{ document: "doc2.wav", updates: [{ "emission_time": 1.3223213, "text": "prefix", "unconfirmed_text": suffix"}, ...] }
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels