Add AudioEffector #3163

mthrok · 2023-03-10T03:44:29Z

This commit adds a new feature AudioEffector, which can be used to
apply various effects and codecs to waveforms in Tensor.

Under the hood it uses StreamWriter and StreamReader to apply
filters and encode/decode.

This is going to replace the deprecated apply_codec and
apply_sox_effect_tensor functions.

It can also perform online, chunk-by-chunk filtering.

Tutorial to follow.

closes #3161

#3192) Summary: OPUS encoder and VORBIS encoders require "strict=experimental" flags. This commit enables it automatically. The rational behind of it is typically we care if we can encode these formats at all and not how they are encoded. (This might be concern when these encoder becomes more mature on FFmpeg side and providing flags would result in weird behavior) Also when writing high-level functions that uses StreamWriter, if we do not set these flags, then these high-level functions have to add new options that should be passed down to StreamWriter, which turned out to be very painful in #3163 Pull Request resolved: #3192 Reviewed By: nateanl Differential Revision: D44275089 Pulled By: mthrok fbshipit-source-id: 74a757b4b7fc8467c8c88ffcb54fbaf89d6e4384

sorgfresser · 2023-03-26T18:05:04Z

You're using process_all_packets from time to time. While I get that collecting all packets is necessary for some of the filters, it poses a threat since one can not tell how large the underlying file will be. Do you plan on adding an option to limit the amount of packets processed at once - even though the filters would obviously become worse? I would be very glad.

mthrok · 2023-03-27T02:12:42Z

You're using process_all_packets from time to time. While I get that collecting all packets is necessary for some of the filters, it poses a threat since one can not tell how large the underlying file will be. Do you plan on adding an option to limit the amount of packets processed at once - even though the filters would obviously become worse? I would be very glad.

@sorgfresser

My implementations are aware of the potentially long audios, and they don't force the one-go operation. In this particular feature, I am providing two options. one-go or chunk-by-chunk process. It's up to users to pick which fashion they want to process their data.

facebook-github-bot · 2023-03-31T14:05:03Z

@mthrok has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

hwangjeff · 2023-03-31T14:43:16Z

torchaudio/io/_effector.py

+        (applied,) = reader.pop_chunks()
+        return Tensor(applied)
+
+    def stream(self, waveform: Tensor, sample_rate: int, frames_per_chunk: int) -> Iterator[Tensor]:


does every effect require the specification of sample rate?

Yes, without a user provided sample rate, we need to assume some default value, which is not universally applicable.

torchaudio/io/_effector.py

facebook-github-bot · 2023-03-31T15:12:05Z

@mthrok has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2023-03-31T16:15:16Z

This pull request was exported from Phabricator. Differential Revision: D44576660

Summary: This commit adds a new feature AudioEffector, which can be used to apply various effects and codecs to waveforms in Tensor. Under the hood it uses StreamWriter and StreamReader to apply filters and encode/decode. This is going to replace the deprecated `apply_codec` and `apply_sox_effect_tensor` functions. It can also perform online, chunk-by-chunk filtering. Tutorial to follow. closes pytorch#3161 Pull Request resolved: pytorch#3163 Differential Revision: D44576660 Pulled By: mthrok fbshipit-source-id: 27e2a2af626188934a25e66d33c693ddf5bc580e

Summary: This commit adds a new feature AudioEffector, which can be used to apply various effects and codecs to waveforms in Tensor. Under the hood it uses StreamWriter and StreamReader to apply filters and encode/decode. This is going to replace the deprecated `apply_codec` and `apply_sox_effect_tensor` functions. It can also perform online, chunk-by-chunk filtering. Tutorial to follow. closes pytorch#3161 Pull Request resolved: pytorch#3163 Differential Revision: D44576660 Pulled By: mthrok fbshipit-source-id: 42097e758598c098313ff5a6b9563183604d6842

facebook-github-bot · 2023-03-31T16:52:21Z

This pull request was exported from Phabricator. Differential Revision: D44576660

Summary: This commit adds a new feature AudioEffector, which can be used to apply various effects and codecs to waveforms in Tensor. Under the hood it uses StreamWriter and StreamReader to apply filters and encode/decode. This is going to replace the deprecated `apply_codec` and `apply_sox_effect_tensor` functions. It can also perform online, chunk-by-chunk filtering. Tutorial to follow. closes pytorch#3161 Pull Request resolved: pytorch#3163 Differential Revision: D44576660 Pulled By: mthrok fbshipit-source-id: 1ac9613b3e5e5fa51dcc19e54978f23d82f5fa96

facebook-github-bot · 2023-03-31T16:56:55Z

This pull request was exported from Phabricator. Differential Revision: D44576660

Summary: This commit adds a new feature AudioEffector, which can be used to apply various effects and codecs to waveforms in Tensor. Under the hood it uses StreamWriter and StreamReader to apply filters and encode/decode. This is going to replace the deprecated `apply_codec` and `apply_sox_effect_tensor` functions. It can also perform online, chunk-by-chunk filtering. Tutorial to follow. closes pytorch#3161 Pull Request resolved: pytorch#3163 Differential Revision: D44576660 Pulled By: mthrok fbshipit-source-id: e6794d1d434c95db5cd24b3bd11f5e5e2a9671da

facebook-github-bot · 2023-03-31T17:00:47Z

This pull request was exported from Phabricator. Differential Revision: D44576660

facebook-github-bot · 2023-04-01T01:40:13Z

@mthrok merged this pull request in a403624.

github-actions · 2023-04-01T01:40:16Z

Hey @mthrok.
You merged this PR, but labels were not properly added. Please add a primary and secondary label (See https://github.com/pytorch/audio/blob/main/.github/process_commit.py)

facebook-github-bot added the CLA Signed label Mar 10, 2023

mthrok mentioned this pull request Mar 10, 2023

Apply filter function #3161

Closed

mthrok force-pushed the effector branch from 20a7a5a to 45a4425 Compare March 21, 2023 21:53

mthrok mentioned this pull request Mar 21, 2023

Set "experimental" automatically when using FFmpeg native opus/vorbis #3192

Closed

mthrok force-pushed the effector branch 3 times, most recently from bd0d52f to 1f8dbd1 Compare March 24, 2023 17:14

mthrok force-pushed the effector branch 11 times, most recently from 6a43ad2 to 6e99aac Compare March 31, 2023 13:34

mthrok changed the title ~~wip: add effector~~ Add AudioEffector Mar 31, 2023

mthrok marked this pull request as ready for review March 31, 2023 14:02

mthrok requested a review from a team March 31, 2023 14:03

hwangjeff reviewed Mar 31, 2023

View reviewed changes

mthrok force-pushed the effector branch from 293e292 to 045bbf9 Compare March 31, 2023 16:15

mthrok force-pushed the effector branch from 045bbf9 to ea3b60a Compare March 31, 2023 16:52

mthrok force-pushed the effector branch from ea3b60a to a7aba7d Compare March 31, 2023 16:56

mthrok force-pushed the effector branch from a7aba7d to bcd1ec4 Compare March 31, 2023 17:00

mthrok added module: IO new feature labels Mar 31, 2023

facebook-github-bot closed this in a403624 Apr 1, 2023

facebook-github-bot added the Merged label Apr 1, 2023

mthrok deleted the effector branch April 1, 2023 01:43

mthrok mentioned this pull request Apr 7, 2023

Add Stereo to Mono Convertions #877

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add AudioEffector #3163

Add AudioEffector #3163

Uh oh!

mthrok commented Mar 10, 2023 •

edited

Loading

Uh oh!

sorgfresser commented Mar 26, 2023

Uh oh!

mthrok commented Mar 27, 2023

Uh oh!

facebook-github-bot commented Mar 31, 2023

Uh oh!

hwangjeff Mar 31, 2023

Uh oh!

mthrok Mar 31, 2023

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

facebook-github-bot commented Mar 31, 2023

Uh oh!

facebook-github-bot commented Mar 31, 2023

Uh oh!

facebook-github-bot commented Mar 31, 2023

Uh oh!

facebook-github-bot commented Mar 31, 2023

Uh oh!

facebook-github-bot commented Mar 31, 2023

Uh oh!

facebook-github-bot commented Apr 1, 2023

Uh oh!

github-actions bot commented Apr 1, 2023

Uh oh!

Uh oh!

Add AudioEffector #3163

Add AudioEffector #3163

Uh oh!

Conversation

mthrok commented Mar 10, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sorgfresser commented Mar 26, 2023

Uh oh!

mthrok commented Mar 27, 2023

Uh oh!

facebook-github-bot commented Mar 31, 2023

Uh oh!

hwangjeff Mar 31, 2023

Choose a reason for hiding this comment

Uh oh!

mthrok Mar 31, 2023

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

facebook-github-bot commented Mar 31, 2023

Uh oh!

facebook-github-bot commented Mar 31, 2023

Uh oh!

facebook-github-bot commented Mar 31, 2023

Uh oh!

facebook-github-bot commented Mar 31, 2023

Uh oh!

facebook-github-bot commented Mar 31, 2023

Uh oh!

facebook-github-bot commented Apr 1, 2023

Uh oh!

github-actions bot commented Apr 1, 2023

Uh oh!

Uh oh!

mthrok commented Mar 10, 2023 •

edited

Loading