Ghasemzadeh, 2018 - Google Patents

Multi-layer architecture for efficient steganalysis of UnderMp3Cover in multi-encoder scenario

Ghasemzadeh, 2018

View PDF
Document ID
3601223762760634929
Author
Ghasemzadeh H
Publication year
Publication venue
IEEE Transactions on Information Forensics and Security

External Links

Snippet

Mp3 is a very popular audio format and hence it can be a good host for carrying hidden messages. Therefore, different steganography methods have been proposed for mp3 hosts, but current literature has only focused on steganalysis of mp3stego. In this paper, we …
Continue reading at arxiv.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation

Similar Documents

Publication Publication Date Title
AlSabhany et al. Digital audio steganography: Systematic review, classification, and analysis of the current state of the art
Hamza et al. Deepfake audio detection via MFCC features using machine learning
Liu et al. Temporal derivative-based spectrum and mel-cepstrum audio steganalysis
Liu et al. Derivative-based audio steganalysis
Yang et al. Defeating fake-quality MP3
Qiao et al. MP3 audio steganalysis
Nematollahi et al. An overview of digital speech watermarking
Ghasemzadeh Multi-layer architecture for efficient steganalysis of UnderMp3Cover in multi-encoder scenario
Ghasemzadeh et al. Comprehensive review of audio steganalysis methods
Yan et al. Speech authentication by semi-fragile speech watermarking utilizing analysis by synthesis and spectral distortion optimization
Ren et al. A universal audio steganalysis scheme based on multiscale spectrograms and DeepResNet
Chen et al. Derivative-based steganographic distortion and its non-additive extensions for audio
CN103985389A (en) Steganalysis method for AMR audio files
Yadav et al. ASSD: Synthetic Speech Detection in the AAC Compressed Domain
Wen et al. Robust audio anti-spoofing with fusion-reconstruction learning on multi-order spectrograms
Ghasemzadeh et al. Reversed-Mel cepstrum based audio steganalysis
Roman et al. Proactive Detection of Voice Cloning with Localized Watermarking
Yadav et al. PS3DT: Synthetic Speech Detection Using Patched Spectrogram Transformer
Doets et al. Distortion estimation in compressed music using only audio fingerprints
Chuchra et al. A deep learning approach for splicing detection in digital audios
Huu et al. Deep neural networks based invisible steganography for audio-into-image algorithm
Karnjana et al. Speech watermarking scheme based on singular-spectrum analysis for tampering detection and identification
Ren et al. Who is speaking actually? robust and versatile speaker traceability for voice conversion
Li et al. Parameterization of LSB in Self‐Recovery Speech Watermarking Framework in Big Data Mining
Büker et al. Deep convolutional neural networks for double compressed AMR audio detection