Update: add using pcm bytes (#4323) #4409

YooSungHyun · 2022-05-26T04:26:36Z

first of all, please look #4323

why i can not use {"path","array","sampling_rate"}
because sf.write(format="wav") and sf.read(BytesIO) is changed my pcm data value
maybe, i think wav got header but, pcm is not.
and variable naming, pcm data is "byte" type. so, "array" name is not fair i think

so, i use scipy lib and numpy (that is huggingface dependency)
and refer to @lhoestq answered,

encode -> using sampling_rate and pcm byte -> wav style byte (scipy.wavfile.write to byte)
byte converting using fairseq style pcm audio read FileAudioDataset
decode -> read wavfile.read

that way is not screw up my pcm byte to float data, and another audio type(wav) safety

please check!

lhoestq

Awesome thanks ! Could you also add tests in tests/features/test_audio.py ?

Maybe add a small pcm file in tests/features/data and check that everything works as expected in tests cases like test_audio_encode_example_pcm and test_audio_decode_example_pcm for example.

src/datasets/features/audio.py

mariosasko · 2022-06-01T13:57:17Z

@lhoestq Maybe I'm missing something, but what's the reason to read and encode PCM files to WAV in Audio.encode_example. Isn't the whole purpose of the decodable types to operate on raw files whenever possible? IMO this PR should only modify Audio.decode_example to support PCM files/bytes decoding.

lhoestq · 2022-06-01T14:03:09Z

Because the PCM file is not enough, we also need the sampling_rate associated to it. Therefore the two alternatives are either:

convert to WAV
add a sampling_rate field to the Audio arrow storage (not sure how it would behave for backward compatibility though)

mariosasko · 2022-06-01T14:16:56Z

But scipy.io.wavfile.read, which is used for reading such files, returns a file's sampling rate. The only tricky part is resampling to a different sampling rate than the default one.

lhoestq · 2022-06-01T14:25:57Z

How does it get the sampling rate of a PCM file then ? According to SO it's not possible to infer it from the file alone

Co-authored-by: Quentin Lhoest <[email protected]>

YooSungHyun · 2022-06-02T08:49:20Z

Awesome thanks ! Could you also add tests in tests/features/test_audio.py ?

Maybe add a small pcm file in tests/features/data and check that everything works as expected in tests cases like test_audio_encode_example_pcm and test_audio_decode_example_pcm for example.

@lhoestq how can i test test_audio.py? where is "main" func?
do you have some example or guideline?

YooSungHyun · 2022-06-02T09:35:15Z

But scipy.io.wavfile.read, which is used for reading such files, returns a file's sampling rate. The only tricky part is resampling to a different sampling rate than the default one.

@mariosasko @lhoestq
thanks for comment!

First of all, "PCM file" can not read alone to any audio library.
"PCM file" has not any audio META information header. (it just purely audio byte data. therefore, we don't have to encoding and decoding)
but, "PCM file" is audio extension, so we can use datasets.Audio

if you want to read "PCM file" to audio file likely, it have to needs additional parameter. (channel, sampling_rate, else....)
but, in many situation, we only know sampling_rate for PCM

and, if we want to use datasets.Audio for "PCM file", we must process encode_example.
therefore, i have to use sampling_rate for encoding for making wav-style byte. (we only know sampling_rate)

In my source code, I don't compare sampling rate(datasets.Audio's self.sampling_rate and read pcm sampling_rate(value["sampling_rate"])) and checking mono
@mariosasko ! do you want to process resampling and making mono? then i can modify my source

lhoestq · 2022-06-09T09:44:27Z

There is no "main" function in test scripts :) To run a test script you must use the pytest command:

pytest tests/features/test_audio.py

to run only one function you can also do

pytest tests/features/test_audio.py::test_audio_feature_type_to_arrow

for example

YooSungHyun · 2022-06-13T09:36:21Z

@lhoestq
maybe, if i write test code, i have to commit test_audio.py and send pr?
because, we need to keep test_audio_encode_example_pcm and test_audio_decode_example_pcm method after my pr merged?

lhoestq · 2022-06-13T09:41:32Z

You can add your tests in this PR with the other changes you did

YooSungHyun · 2022-06-14T10:09:07Z

@lhoestq
test complete & commit my test_audio.py

AND, some change in my code.

audio.py
i think "sampling_rate" is already Audio object initial variable. so, we don`t have to use input parameter.

test_audio.py
we can check "PCM" file to path (exactly, extenstion)
so, test case has to know path. if only have bytes, we don`t know that is "PCM" or not

YooSungHyun · 2022-06-14T10:19:31Z

@lhoestq
and, why circleci raised exception?
maybe, repo url is not found!
PLZ, CHK!

YooSungHyun · 2022-06-23T11:03:43Z

@lhoestq
hello????

lhoestq

Thanks for the tests ! This is awesome :)

I left two comments. Also to fix the CI feel free need to merge the master branch into yours

src/datasets/features/audio.py

we can open up soundfile lib Co-authored-by: Quentin Lhoest <[email protected]>

…e lib

YooSungHyun · 2022-06-24T03:53:51Z

@lhoestq
test_audio.py
if we don`t use path in pcm, test-case need to be changed
so, we check path just None

… YooSungHyun/features/audio

…ngHyun/datasets into YooSungHyun/features/audio

YooSungHyun · 2022-06-24T04:15:07Z

i'm merge branch already and multiprocess in setup.py but circleci error only win version

how can i fixed it?

lhoestq

You can ignore the multiprocess error in the CI, this is unrelated to your PR and we're working on it ;)

Thanks a lot for the changes, I spotted one thing that requires a small change: when writing the WAV data, we should use the sampling rate from the input example dictionary (the same that contains "path") - and not the sampling rate from the Audio attribute.

Indeed the sampling rate of the Audio attribute is the expected sampling rate when decoding the example. When we encode it we must specify the sampling rate of the PCM file

src/datasets/features/audio.py

tests/features/test_audio.py

src/datasets/features/audio.py

self.sampling_rate is for decode. so, we have to get value`s sampling_rate Co-authored-by: Quentin Lhoest <[email protected]>

we have to know sampling_rate in input values variable Co-authored-by: Quentin Lhoest <[email protected]>

Co-authored-by: Quentin Lhoest <[email protected]>

…ngHyun/datasets into YooSungHyun/features/audio

YooSungHyun · 2022-06-29T06:07:57Z

@lhoestq thx for comment!
test_audio.py test complete. it runs sucessfully
and, self.get("sampling_rate") -> value.get("sampling_rate") changed

and, some comment is not agreed to me, plz check my sub comment!

lhoestq

Thank you ! It looks all good now.

I just removed the scipy import, since we already import soundfile:

src/datasets/features/audio.py

HuggingFaceDocBuilderDev · 2022-07-07T12:58:07Z

The documentation is not available anymore as the PR was closed or merged.

lhoestq

Thanks for adding support for PCM :)

Let's merge this one. The CI fails are unrelated to this PR and fixed on main

YooSungHyun added 2 commits May 26, 2022 11:41

Update: add using pcm bytes

af32b1e

re make style

c86c1b9

YooSungHyun mentioned this pull request May 30, 2022

Audio can not find value["bytes"] #4323

Closed

lhoestq reviewed May 30, 2022

View reviewed changes

src/datasets/features/audio.py Outdated Show resolved Hide resolved

src/datasets/features/audio.py Outdated Show resolved Hide resolved

src/datasets/features/audio.py Outdated Show resolved Hide resolved

src/datasets/features/audio.py Outdated Show resolved Hide resolved

mariosasko linked an issue Jun 1, 2022 that may be closed by this pull request

Audio can not find value["bytes"] #4323

Closed

YooSungHyun and others added 4 commits June 2, 2022 17:12

Update src/datasets/features/audio.py

40cc82c

Co-authored-by: Quentin Lhoest <[email protected]>

Update src/datasets/features/audio.py

d4d25ee

Co-authored-by: Quentin Lhoest <[email protected]>

Update src/datasets/features/audio.py

4e70447

Co-authored-by: Quentin Lhoest <[email protected]>

delete: wrong comment

e2299db

YooSungHyun requested a review from lhoestq June 2, 2022 09:40

Update: sampling_rate usage & test source update

f0f9d1f

lhoestq reviewed Jun 23, 2022

View reviewed changes

src/datasets/features/audio.py Outdated Show resolved Hide resolved

src/datasets/features/audio.py Outdated Show resolved Hide resolved

YooSungHyun and others added 3 commits June 24, 2022 09:44

Update: pcm2wav bytes don`t need path

0f01966

we can open up soundfile lib Co-authored-by: Quentin Lhoest <[email protected]>

Update: we can get wav style bytes to pcm, so we can read to soundfil…

f7e8dc9

…e lib

Update: pcm doesn`t use path, so check 'None'

9c358bb

YooSungHyun added 2 commits June 24, 2022 13:00

Merge branch 'master' of https://github.com/YooSungHyun/datasets into…

ebb0bf8

… YooSungHyun/features/audio

Merge branch 'huggingface:master' into YooSungHyun/features/audio

c445127

Merge branch 'YooSungHyun/features/audio' of https://github.com/YooSu…

c04f334

…ngHyun/datasets into YooSungHyun/features/audio

lhoestq reviewed Jun 27, 2022

View reviewed changes

YooSungHyun and others added 8 commits June 29, 2022 14:27

Update: not used self`s sampling_rate

a19f7c0

self.sampling_rate is for decode. so, we have to get value`s sampling_rate Co-authored-by: Quentin Lhoest <[email protected]>

Update: add sampling_rate

3d45eca

we have to know sampling_rate in input values variable Co-authored-by: Quentin Lhoest <[email protected]>

Update: sampling_rate variable

93376f3

Co-authored-by: Quentin Lhoest <[email protected]>

Update tests/features/test_audio.py

6c0ede9

Co-authored-by: Quentin Lhoest <[email protected]>

Update tests/features/test_audio.py

b08489e

Co-authored-by: Quentin Lhoest <[email protected]>

Update tests/features/test_audio.py

f0a1c43

Co-authored-by: Quentin Lhoest <[email protected]>

Merge branch 'YooSungHyun/features/audio' of https://github.com/YooSu…

1d7803e

…ngHyun/datasets into YooSungHyun/features/audio

Update: replace get sampling_rate

28b26cc

lhoestq reviewed Jul 7, 2022

View reviewed changes

src/datasets/features/audio.py Outdated Show resolved Hide resolved

src/datasets/features/audio.py Outdated Show resolved Hide resolved

Apply suggestions from code review

2620c2f

lhoestq approved these changes Jul 7, 2022

View reviewed changes

lhoestq merged commit 693418a into huggingface:main Jul 7, 2022

Update: add using pcm bytes (#4323) #4409

Update: add using pcm bytes (#4323) #4409

Uh oh!

Conversation

YooSungHyun commented May 26, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lhoestq left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mariosasko commented Jun 1, 2022

Uh oh!

lhoestq commented Jun 1, 2022

Uh oh!

mariosasko commented Jun 1, 2022

Uh oh!

lhoestq commented Jun 1, 2022

Uh oh!

YooSungHyun commented Jun 2, 2022

Uh oh!

YooSungHyun commented Jun 2, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lhoestq commented Jun 9, 2022

Uh oh!

YooSungHyun commented Jun 13, 2022

Uh oh!

lhoestq commented Jun 13, 2022

Uh oh!

YooSungHyun commented Jun 14, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

YooSungHyun commented Jun 14, 2022

Uh oh!

YooSungHyun commented Jun 23, 2022

Uh oh!

lhoestq left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

YooSungHyun commented Jun 24, 2022

Uh oh!

YooSungHyun commented Jun 24, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lhoestq left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

YooSungHyun commented Jun 29, 2022

Uh oh!

lhoestq left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Jul 7, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lhoestq left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

YooSungHyun commented May 26, 2022 •

edited

Loading

YooSungHyun commented Jun 2, 2022 •

edited

Loading

YooSungHyun commented Jun 14, 2022 •

edited

Loading

YooSungHyun commented Jun 24, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Jul 7, 2022 •

edited

Loading