Using Machine Learning to Measure Biodiversity
Using Machine Learning to Measure Biodiversity from Sound Recordings
Presenter: Alex Dewey
Co-Presenter(s):
Alex Dewey, Vincent Valenzuela, Antone Silveria, and Jonathan Calderon Chavez, Colin Quinn
Presenter Status: Undergraduate student
Academic Year: 20-21
Semester: Spring
Faculty Mentor: Gurman Gill
Department: Computer Science
Funding Source/Sponsor: Koret Scholars Program
President's Strategic Plan Goal: Sustainability and Environmental Inquiry
Screenshot URL: https://drive.google.com/uc?id=1NYRPVWWXXXzQQJSRtii-sxByvV1nNeMt
Abstract:
Biodiversity is an incredibly challenging metric to measure. This project aims to classify a soundscape and use that knowledge to help classify 500,000 minutes of sound data to understand broad, landscape scale patterns of biodiversity, human impact through noise pollution, and areas of quiet. All of these are indicators of ecosystem and community quality - essential measures for conservation, monitoring, and land management decision making. The main classification categories are Anthrophony (e.g., cars, airplanes, human voices), Biophony (e.g., birds, insects, amphibians), Geophony (e.g., wind, rain, running water), and Other.
The main tools used to accomplish this task are mel spectrograms (e.g., visual representation of sound), convolutional neural networks (CNNs), transfer learning, ensemble learning, support vector machines (SVMs), and uniform manifold approximation and projection (UMAP). With these techniques we are able, to get braod category accuracies of 87%, and with confidence thresholding, we get accuracies of broad classification of 96%, and subcategory classification accuracies of 86%, 89%, and 100% for each subcategory classifier.