We clearly need something like the UC Davis Wine Aroma Wheel to describe audio. If you've never seen it, definitely check out some of the descriptors:
https://www.thewinecellarinsider.com/wine-topics/wine-educational-questions/davis-aroma-wheel/