Sound shredding moustafa


This document summarizes research on preserving privacy in audio sensing. It presents two approaches: sound shredding, which randomizes the order of audio frames, and sound subsampling, which discards some of the frames. The reported experiments show that both methods reduce context recognition accuracy only slightly while making speech recognition, gender identification, and people counting substantially harder. However, shredded audio can be partially reconstructed by matching the frequency content at frame edges, a limitation the work itself acknowledges.


Sound Shredding: Privacy Preserved Audio Sensing
Presenter: Moustafa Alzantot (UCLA)
Sumeet Kumar, et al.
Carnegie Mellon University

Introduction
• Sound sensing can be very useful for context awareness.
  • Identify user location and activities
• Potential risks to the user's privacy
  • Speech recognition
  • Speaker identification
• How can user privacy be preserved without compromising context-awareness accuracy?

Research Question
• This paper presents two approaches for preserving user privacy without significantly decreasing context recognition accuracy or spending much battery on encryption/decryption.
  • Sound shredding
  • Sound subsampling

Methodology
Activity context: the place where the activity takes place (e.g. a restaurant for dining)
Context identification process:
• Audio data collection:
  • 35 sounds collected at 8 kHz using a Nexus 4 phone.
• Feature extraction:
  • Sliding window frames (40 ms window, 50% overlap)
  • 12 MFCC features for every window.
• Context recognition:
  • Experiments using both simple KNN and SVM.
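
The framing step above can be sketched as follows. This is only an illustration, assuming NumPy and a mono signal held in a 1-D array; the function name `frame_signal` is hypothetical and not taken from the paper. The 12 MFCC features would then be computed per frame, which is not shown here.

```python
import numpy as np

SAMPLE_RATE = 8000   # 8 kHz, as stated on the slide
WINDOW_MS = 40       # 40 ms window, as stated on the slide
OVERLAP = 0.5        # 50% overlap, as stated on the slide


def frame_signal(signal, sample_rate=SAMPLE_RATE, window_ms=WINDOW_MS, overlap=OVERLAP):
    """Cut a 1-D signal into overlapping frames (hypothetical helper)."""
    frame_len = int(sample_rate * window_ms / 1000)   # 320 samples per frame
    hop = int(frame_len * (1 - overlap))              # 160 samples between frame starts
    n_frames = 1 + (len(signal) - frame_len) // hop   # only complete frames are kept
    # Row i holds samples [i * hop, i * hop + frame_len).
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return signal[idx]
```

With these settings, one second of audio (8000 samples) yields 49 frames of 320 samples each.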

Methodology
• Sound subsampling: collect only part of the raw data.
• 50% subsampling: discard one frame after every frame that is stored.
• Subsampling results in a slight drop in context recognition accuracy.
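
A minimal sketch of 50% subsampling as described above, assuming the audio has already been cut into a sequence of frames. The function name `subsample_frames` and the `keep_every` parameter are illustrative, not from the paper.

```python
import numpy as np


def subsample_frames(frames, keep_every=2):
    """Keep one frame out of every `keep_every`; the rest are discarded.

    keep_every=2 stores a frame, drops the next one, and so on,
    which corresponds to the 50% subsampling described on the slide.
    """
    return frames[::keep_every]
```

For ten frames numbered 0 to 9, this keeps frames 0, 2, 4, 6 and 8.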

Methodology
• Sound shredding: randomize the order of the audio frames within a sound snippet.
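
Shredding can be sketched as a random permutation of non-overlapping frames. This is an illustration under assumptions: the slide does not say which frame length is used at this step, so `frame_len` is left as a parameter, and the name `shred` is hypothetical.

```python
import numpy as np


def shred(signal, frame_len, seed=None):
    """Randomize the order of non-overlapping frames in a snippet.

    Returns the shredded signal and the permutation that was applied.
    Trailing samples that do not fill a whole frame are dropped
    (a simplification made for this sketch).
    """
    n_frames = len(signal) // frame_len
    frames = signal[: n_frames * frame_len].reshape(n_frames, frame_len)
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_frames)
    return frames[order].reshape(-1), order
```

Each frame stays intact internally, so per-frame features are unchanged; only the temporal order across frames is lost.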

Results: Context Recognition Accuracy
• Collected 35 sound samples in different contexts (faculty meeting, restaurant, walking, coffee shop)
• 80% of the data for training, 20% for testing.
• Context recognition accuracy drops slightly.
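
The evaluation setup (an 80/20 split followed by a nearest-neighbour classifier) might look roughly like the sketch below. It assumes NumPy only; the function names, the value of `k`, and the use of squared Euclidean distance are assumptions for illustration, since the slides do not give these details.

```python
import numpy as np


def split_80_20(n_items, seed=0):
    """Return shuffled index arrays for an 80% train / 20% test split."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_items)
    n_train = int(round(0.8 * n_items))
    return order[:n_train], order[n_train:]


def knn_predict(train_x, train_y, test_x, k=3):
    """Plain k-nearest-neighbour vote (k and the distance are assumed).

    train_y must hold non-negative integer class labels.
    """
    # Squared Euclidean distance between every test and training vector.
    dists = ((test_x[:, None, :] - train_x[None, :, :]) ** 2).sum(axis=2)
    nearest = np.argsort(dists, axis=1)[:, :k]
    votes = train_y[nearest]
    # Majority label among the k nearest neighbours of each test vector.
    return np.array([np.bincount(row).argmax() for row in votes])
```

With the 35 samples mentioned on the slide, `split_80_20(35)` gives 28 training indices and 7 test indices.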

Results: Privacy User Study
• The user study involved playing different sounds (shredded and subsampled).
• Users rated their ability to recognize speech, identify gender, and count people.
• Scale from 1 (Yes, I can) to 5 (Not at all).
• Gender identification improves the least, by 20%.

Results: Computer Based Recognition

Results: Reconstructing Based on Frequency Content
• Number of (10 ms) frames in a 10-second audio snippet = 667 frames.
• Number of possible orderings = 667! (breaking shredding by brute force is intractable).
• Reconstructing by frequency content
  • Greedily match the left and right edges of subsequent frames in the frequency domain.
  • Reconstruction is possible if the audio is broken into five or fewer segments.
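
To make the two points above concrete, the sketch below first measures how large 667! is and then shows one possible form of greedy edge matching. It is an illustration under assumptions, not the attack from the paper: the edge length, the use of FFT magnitudes with a Euclidean cost, and the choice to start from the first segment in the list are all hypothetical, as are the function names.

```python
import math

import numpy as np


def log10_orderings(n_frames):
    """log10 of n_frames!, i.e. roughly how many decimal digits n! has."""
    return math.lgamma(n_frames + 1) / math.log(10)


def edge_spectrum(samples):
    """Magnitude spectrum of a short run of samples at a segment edge."""
    return np.abs(np.fft.rfft(samples))


def greedy_reassemble(segments, edge_len=64):
    """Greedily chain segments whose adjoining edges look alike in frequency.

    Assumes every segment has at least `edge_len` samples and that the
    first segment in the list is the true start; a real attacker would
    have to try every possible starting segment.
    """
    remaining = list(range(len(segments)))
    order = [remaining.pop(0)]
    while remaining:
        tail = edge_spectrum(segments[order[-1]][-edge_len:])
        costs = [
            np.linalg.norm(tail - edge_spectrum(segments[i][:edge_len]))
            for i in remaining
        ]
        # Append the segment whose left edge best matches the current right edge.
        order.append(remaining.pop(int(np.argmin(costs))))
    return order
```

`log10_orderings(667)` shows that 667! has well over a thousand decimal digits, which is why brute force is ruled out, whereas the greedy search only compares pairs of edges and so needs on the order of n² comparisons for n segments.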

Critique of Work (1 slide)
• Sound subsampling alone is not sufficient for preserving privacy (at least for people counting and gender identification).
• Shredding can be attacked (as the authors mention at the end of the paper).
• The work should be compared against other methods (such as filtering or perturbing the speech frequency range in the collected audio).
