Ghasemzadeh, 2018 - Google Patents
Multi-layer architecture for efficient steganalysis of UnderMp3Cover in multi-encoder scenarioGhasemzadeh, 2018
View PDF- Document ID
- 3601223762760634929
- Author
- Ghasemzadeh H
- Publication year
- Publication venue
- IEEE Transactions on Information Forensics and Security
External Links
Snippet
Mp3 is a very popular audio format and hence it can be a good host for carrying hidden messages. Therefore, different steganography methods have been proposed for mp3 hosts, but current literature has only focused on steganalysis of mp3stego. In this paper, we …
- 239000010410 layer 0 abstract description 35
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AlSabhany et al. | Digital audio steganography: Systematic review, classification, and analysis of the current state of the art | |
Hamza et al. | Deepfake audio detection via MFCC features using machine learning | |
Liu et al. | Temporal derivative-based spectrum and mel-cepstrum audio steganalysis | |
Liu et al. | Derivative-based audio steganalysis | |
Yang et al. | Defeating fake-quality MP3 | |
Qiao et al. | MP3 audio steganalysis | |
Nematollahi et al. | An overview of digital speech watermarking | |
Ghasemzadeh | Multi-layer architecture for efficient steganalysis of UnderMp3Cover in multi-encoder scenario | |
Ghasemzadeh et al. | Comprehensive review of audio steganalysis methods | |
Yan et al. | Speech authentication by semi-fragile speech watermarking utilizing analysis by synthesis and spectral distortion optimization | |
Ren et al. | A universal audio steganalysis scheme based on multiscale spectrograms and DeepResNet | |
Chen et al. | Derivative-based steganographic distortion and its non-additive extensions for audio | |
CN103985389A (en) | Steganalysis method for AMR audio files | |
Yadav et al. | ASSD: Synthetic Speech Detection in the AAC Compressed Domain | |
Wen et al. | Robust audio anti-spoofing with fusion-reconstruction learning on multi-order spectrograms | |
Ghasemzadeh et al. | Reversed-Mel cepstrum based audio steganalysis | |
Roman et al. | Proactive Detection of Voice Cloning with Localized Watermarking | |
Yadav et al. | PS3DT: Synthetic Speech Detection Using Patched Spectrogram Transformer | |
Doets et al. | Distortion estimation in compressed music using only audio fingerprints | |
Chuchra et al. | A deep learning approach for splicing detection in digital audios | |
Huu et al. | Deep neural networks based invisible steganography for audio-into-image algorithm | |
Karnjana et al. | Speech watermarking scheme based on singular-spectrum analysis for tampering detection and identification | |
Ren et al. | Who is speaking actually? robust and versatile speaker traceability for voice conversion | |
Li et al. | Parameterization of LSB in Self‐Recovery Speech Watermarking Framework in Big Data Mining | |
Büker et al. | Deep convolutional neural networks for double compressed AMR audio detection |