Back to home

Birdsong Identification

A machine learning system that classifies audio recordings of bird calls and songs, built with fine-tuned Wav2Vec2 and a feature pipeline tailored for noisy field data.

Pileated woodpecker with waveform of its call

Project overview

Designed to help birdwatchers, researchers, and conservationists quickly identify species from audio captured in real-world environments.

  • Finetuned Wav2Vec2 for species classification
  • Audio augmentation and noise filtering for robustness
  • Custom feature extraction focused on bird vocalization patterns

Key contributions

A production-ready pipeline with insights into model performance, dataset curation, and deployment readiness.

Transformers Audio Processing Transfer Learning Robustness

Results

Achieved higher classification accuracy on field audio while maintaining low latency for inference.

Contextual analysis showed improved species separation across similar-sounding calls and reduced false positives on noisy backgrounds.