nicofarr/SoundNet_Pytorch

SoundNet_Pytorch

SoundNet model in PyTorch

Architecture figure from soundnet

Introduction

This code converts the pretrained TensorFlow SoundNet model into a PyTorch model; it contains no training code for SoundNet. The resulting pretrained PyTorch model is sound8.pth.

Prerequisites

  1. TensorFlow (only needed if sound8.pth does not exist yet)
  2. Python 3.6 with NumPy
  3. PyTorch 0.4+

How to use

  1. If the file sound8.pth has not been generated yet, follow the original instructions: model

  2. If audio preprocessing is required (e.g. the sample rate is not 22,050 Hz), utils.py has a method for converting all files in a given folder.

    To convert a single file: sox input.wav -r 22050 -c 1 output.wav

  3. To extract feature vectors, use:

    audio, sr = load_audio(filepath)
    features = ex.extract_pytorch_feature(audio, './soundnet/sound8.pth')
    print([x.shape for x in features])

    # extract the feature vector of a single layer
    conv = ex.extract_vector(features, idlayer)
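The resampling in step 2 can be batched by wrapping sox. A minimal sketch, assuming sox is on the PATH; the helper names and folder layout here are illustrative, not the actual utils.py API:

```python
import subprocess
from pathlib import Path

def sox_resample_cmd(src, dst, rate=22050, channels=1):
    """Build the sox command that resamples src to rate Hz, mono, into dst."""
    return ["sox", str(src), "-r", str(rate), "-c", str(channels), str(dst)]

def convert_folder(in_dir, out_dir, rate=22050):
    """Resample every .wav in in_dir into out_dir (requires sox installed)."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for wav in sorted(Path(in_dir).glob("*.wav")):
        subprocess.run(sox_resample_cmd(wav, out / wav.name, rate), check=True)
```

The command mirrors the one-file invocation above (`sox input.wav -r 22050 -c 1 output.wav`), just applied per file.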

High-level features:

  • conv5, idlayer = 4
  • conv7, idlayer = 6
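The layer-to-index mapping above can be kept in a small lookup table; a sketch covering only the two layers documented here (other indices would be assumptions):

```python
# Index into the list returned by extract_pytorch_feature for each
# documented high-level layer (only conv5 and conv7 are listed in this README).
LAYER_TO_IDLAYER = {"conv5": 4, "conv7": 6}

def vector_index(layer_name):
    """Return the idlayer value to pass to extract_vector for a named layer."""
    return LAYER_TO_IDLAYER[layer_name]
```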

The temporal resolution

To find the temporal resolution 1/m of each layer, a slope m and an intercept are fitted, describing the linear relationship between the duration of the input in seconds and the number of output frames (channels) returned by the extract_feature_vector method.
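That fit can be reproduced with NumPy; a minimal sketch assuming you have measured clip durations in seconds and the matching frame counts for one layer (the numbers below are synthetic, not measurements from the model):

```python
import numpy as np

def temporal_resolution(durations_s, n_frames):
    """Fit n_frames ~= m * duration + b and return (1/m, b).

    1/m is the temporal resolution: seconds of audio per output frame.
    """
    m, b = np.polyfit(durations_s, n_frames, deg=1)
    return 1.0 / m, b

# Synthetic example: a layer that emits exactly 4 frames per second.
res, intercept = temporal_resolution([1.0, 2.0, 4.0], [4, 8, 16])
```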

Acknowledgments

The code for the SoundNet TensorFlow model is ported from soundnet_tensorflow. Thanks for their work!

References

  1. Yusuf Aytar, Carl Vondrick, and Antonio Torralba. "Soundnet: Learning sound representations from unlabeled video." Advances in Neural Information Processing Systems. 2016.
