UrbanSound8K Classification using a Custom 3-Layer CNN

A PyTorch-based deep learning pipeline for classifying environmental sounds from the UrbanSound8K dataset. The architecture is a custom 3-layer Convolutional Neural Network (CNN).

📊 Project Performance Summary

Training Set: Folds 1–8 (~7,000 audio samples)
Testing Set: Folds 9 & 10 (~1,700 audio samples, strictly isolated for evaluation)
Training Epochs: 100
Final Evaluation Accuracy: 76.77%
Total Parameter Count: 2.8M

Note on Data Integrity: This project follows the official UrbanSound8K fold structure. Folds 9 and 10 were held out entirely during training, so no test clips leak into the training set and the reported accuracy measures generalization to unseen folds.
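The fold-based split described above can be sketched as follows. This is a minimal illustration, not the code from the notebooks: the helper name `split_by_fold` and the tiny in-memory CSV are hypothetical stand-ins, and the only thing taken from the dataset is the `fold` column of the UrbanSound8K metadata file.

```python
import csv
import io

# Fold assignment from the summary above: 1-8 for training, 9 and 10 for testing.
TRAIN_FOLDS = set(range(1, 9))
TEST_FOLDS = {9, 10}


def split_by_fold(rows):
    """Split metadata rows into train/test lists using only the `fold` column.

    Hypothetical helper for illustration; the repository's notebooks may
    organize this step differently.
    """
    train, test = [], []
    for row in rows:
        fold = int(row["fold"])
        if fold in TEST_FOLDS:
            test.append(row)
        elif fold in TRAIN_FOLDS:
            train.append(row)
        else:
            raise ValueError(f"unexpected fold value: {fold}")
    return train, test


# Tiny made-up stand-in for the metadata CSV (file names are placeholders).
SAMPLE_CSV = """slice_file_name,fold,classID
a.wav,1,3
b.wav,8,0
c.wav,9,3
d.wav,10,7
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
train_rows, test_rows = split_by_fold(rows)

# Sanity check: no fold may appear on both sides of the split.
train_fold_ids = {int(r["fold"]) for r in train_rows}
test_fold_ids = {int(r["fold"]) for r in test_rows}
assert train_fold_ids.isdisjoint(test_fold_ids)
```

Splitting on the predefined folds rather than shuffling clips matters because clips cut from the same source recording share a fold; a random split could place near-duplicates on both sides.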

🏗️ Model Architecture

The model uses a custom sequential CNN backbone designed to balance classification accuracy against a lightweight parameter footprint. Downsampling through 3 convolutional blocks keeps the total at roughly 2.8 million parameters.
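As a rough illustration of what a 3-block convolutional backbone looks like in PyTorch, here is a sketch under stated assumptions. The class name `ThreeBlockCNN`, the channel widths (32, 64, 128), the kernel sizes, the global-average-pool head, and the 1x64x128 spectrogram input shape are all assumptions made for this example; they are not taken from the repository and are not tuned to reproduce the 2.8M parameter figure.

```python
import torch
from torch import nn


def conv_block(in_ch, out_ch):
    """One block: 3x3 conv -> batch norm -> ReLU -> 2x2 max-pool (halves H and W)."""
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1),
        nn.BatchNorm2d(out_ch),
        nn.ReLU(inplace=True),
        nn.MaxPool2d(kernel_size=2),
    )


class ThreeBlockCNN(nn.Module):
    """Hypothetical 3-block CNN; layer sizes are illustrative assumptions."""

    def __init__(self, num_classes=10):  # UrbanSound8K has 10 classes
        super().__init__()
        self.features = nn.Sequential(
            conv_block(1, 32),
            conv_block(32, 64),
            conv_block(64, 128),
        )
        self.pool = nn.AdaptiveAvgPool2d(1)  # collapse the remaining H x W grid
        self.classifier = nn.Linear(128, num_classes)

    def forward(self, x):
        x = self.features(x)
        x = self.pool(x).flatten(1)  # shape: (batch, 128)
        return self.classifier(x)


def count_parameters(model):
    """Number of trainable parameters, the quantity a figure like "2.8M" refers to."""
    return sum(p.numel() for p in model.parameters() if p.requires_grad)


model = ThreeBlockCNN()
model.eval()

# Assumed input: a batch of 2 single-channel spectrograms, 64 bins x 128 frames.
dummy = torch.randn(2, 1, 64, 128)
with torch.no_grad():
    logits = model(dummy)

n_params = count_parameters(model)
print(tuple(logits.shape), n_params)
```

Each max-pool halves the spatial resolution, so three blocks shrink the feature map by a factor of 8 per axis before the classifier. In a design like this, most of the parameter budget tends to sit in whichever layer connects the final feature map to the classifier, which is why downsampling first keeps the total small.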

