Using Machine Learning to Measure Biodiversity

Using Machine Learning to Measure Biodiversity from Sound Recordings

Presenter: Alex Dewey

Co-Presenter(s):
Alex Dewey, Vincent Valenzuela, Antone Silveria, and Jonathan Calderon Chavez, Colin Quinn

Presenter Status: Undergraduate student

Academic Year: 20-21

Semester: Spring

Faculty Mentor: Gurman Gill

Department: Computer Science

Funding Source/Sponsor: Koret Scholars Program

President's Strategic Plan Goal: Sustainability and Environmental Inquiry

Screenshot URL: https://drive.google.com/uc?id=1NYRPVWWXXXzQQJSRtii-sxByvV1nNeMt

Abstract:
Biodiversity is an incredibly challenging metric to measure. This project aims to classify a soundscape and use that knowledge to help classify 500,000 minutes of sound data to understand broad, landscape scale patterns of biodiversity, human impact through noise pollution, and areas of quiet. All of these are indicators of ecosystem and community quality - essential measures for conservation, monitoring, and land management decision making. The main classification categories are Anthrophony (e.g., cars, airplanes, human voices), Biophony (e.g., birds, insects, amphibians), Geophony (e.g., wind, rain, running water), and Other.
The main tools used to accomplish this task are mel spectrograms (e.g., visual representation of sound), convolutional neural networks (CNNs), transfer learning, ensemble learning, support vector machines (SVMs), and uniform manifold approximation and projection (UMAP). With these techniques we are able, to get braod category accuracies of 87%, and with confidence thresholding, we get accuracies of broad classification of 96%, and subcategory classification accuracies of 86%, 89%, and 100% for each subcategory classifier.