Ssbu Codec - Search News

APCodec: A Neural Audio Codec With Parallel Amplitude and Phase Spectrum Encoding and Decoding

Abstract: This paper introduces a novel neural audio codec targeting high waveform sampling rates and low bitrates named APCodec, which seamlessly integrates the strengths of parametric codecs and ...

GitHub

FuseCodec: Semantic-Contextual Fusion and Supervision for Neural Codecs

Overview of the FuseCodec speech tokenization framework. Input speech x is encoded into latent features Z, then quantized into discrete tokens Q(1:K) via residual vector quantization (RVQ). To enrich ...

GitHub

Moshi: a speech-text foundation model for real time dialogue

Finally, the code for the web UI client used in the Moshi demo is provided in the client/ directory. If you want to fine tune Moshi, head out to kyutai-labs/moshi ...

IEEE

Evaluation of Objective Quality Models on Neural Audio Codecs

Abstract: With the emergence of neural audio codecs and new objective quality models based on machine learning, there is a need to clarify which models predict accurately the perceptual quality of ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results