-
Notifications
You must be signed in to change notification settings - Fork 3.2k
Description
Describe the bug
NeMo training and decoding scripts do not support multi-channel audio data.
Steps/Code to reproduce bug
It does not support specifying which channel to use for each audio file in each line in train.manifest.json or test.manifest.json file.
I was able to run ./examples/asr/speech_to_text_eval.py with "channel_selector=" to specify the channel for all the audio in a manifest.json file, but I can't find a way to specifying them for each audio file inside the manifest.json file.
Expected behavior
Can NeMo team add this useful feature to work with a diverse set of multi-channel training and testing audio data so data from different channel can be mixed within a manifest.json file?
Environment overview (please complete the following information)
NeMo was installed by pip in a conda environment. It works for single channel audio data.