I would like to recognize sounds within an audio stream. For instance, a doorbell.
Basically it would be the equivalent to using OpenCV to identify a face within a video stream.
So I am looking for the equivalent of an OpenCV library for audio (rather than images/video). Or perhaps some cloud-based solution -- although I have not see anything on AWS or Azure.