Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Decoding speech from non-invasive brain recordings #1

Open
Serendipityzzz opened this issue Feb 5, 2023 · 3 comments
Open

Decoding speech from non-invasive brain recordings #1

Serendipityzzz opened this issue Feb 5, 2023 · 3 comments

Comments

@Serendipityzzz
Copy link
Owner

a convolutional neural network stacked onto a 'Subject Layer' and trained with a constractive objective to predict the deep representations of the audio waveform learnt by a dedicated module pretrained on 56k hours of speech.

f8ba39009024ad0fb9dac8c46bbf808

@Serendipityzzz
Copy link
Owner Author

C is channel/sensor, T is time, X $\in$ $\mathbb{R}^{C \times T}$ be a segment of a brain recording of a given subject while she listens to a speech segment of the same duraiton. Y $\in$ $\mathbb{R}^{F \times T}$ be the latent representation of speech, here the Mel spectrogram with F frequency bands. Thus supervised decoding consists of finding a decoding function: $f_{reg}: \mathbb{R}^{C \times T} \rightarrow \mathbb{R}^{F \times T}$ such that $f_{reg}$ predicts Y given X. We denote $\hat{Y} = f_{reg}(X)$ the representation of speech from the brain, $f_{reg}$ belongs to models like DNN, then a regression loss looks like
c78f9c81441ef817f4b1a1af3845d7e

@Serendipityzzz
Copy link
Owner Author

But this regression loss faces several challenges: decoding predictions appear to be dominated by a non-distinguishable broadband component when speech is present. So Meta AI made three main contributions: the introduction of a contrastive loss, a pre-trained deep speech representation, and a dedicated brain decoder.

@Serendipityzzz
Copy link
Owner Author

if we want to change the loss function, we can use NCE
https://zhuanlan.zhihu.com/p/334772391

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant