Looking for the equivalent to OpenCV for audio streams

I would like to recognize sounds within an audio stream.  For instance, a doorbell.
Basically it would be the equivalent to using OpenCV to identify a face within a video stream.
So I am looking for the equivalent of an OpenCV library for audio (rather than images/video).  Or perhaps some cloud-based solution -- although I have not seen anything on AWS or Azure.
Francois Koutchouk, CTO, Asked:

Kyle Santos, Quality Assurance, Commented:

I am following up on your question.  Do you still need help?

If you solved the problem on your own, would you please post the solution here in case others have the same problem?


Kyle Santos
Customer Relations
Francois Koutchouk, CTO (Author), Commented:
I haven't solved the problem.  I am surprised there is so much available on video/image analysis (OpenCV) and so little on the... audio portion of the video!
Kyle Santos, Quality Assurance, Commented:
Hi Francois,

Would you like me to send alerts to more experts to try and help you here?

Francois Koutchouk, CTO (Author), Commented:
Yes, I would love to see a solution to that.  I reached out to one of the experts on OpenCV, who told me he was not interested in the audio aspect...  I think there is a whole area to explore, what with all those AI discussions these days. Thank you.
Scott Fell, EE MVE, Developer & EE Moderator, Commented:
I have only just started playing with this, but I think https://github.com/BrainJS/brain.js or https://www.tensorflow.org/ is what you are after.
David Favor, Linux/LXD/WordPress/Hosting Savant, Commented:
If I were going to attempt this, I'd likely use a speech recognition system.

Try searching for "speech recognition ubuntu OR github", pick a system, then train it to recognize your doorbell or any other sounds you're targeting.
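
For a sound as simple as a single-tone doorbell, a much lighter technique than a trained recognizer may be enough as a first experiment. The sketch below is not the trained-system approach suggested above; it swaps in the Goertzel algorithm, which measures how much of a frame's energy sits at one known frequency. It assumes the doorbell has one dominant pitch that you already know. The function names (goertzel_power, tone_fraction, detect_tone), the frame size and the 0.5 threshold are all illustrative choices, not part of any library.

```python
import math

def goertzel_power(samples, sample_rate, target_hz):
    """Squared DFT magnitude of one frame at the bin nearest target_hz."""
    n = len(samples)
    k = round(n * target_hz / sample_rate)        # nearest DFT bin index
    coeff = 2.0 * math.cos(2.0 * math.pi * k / n)
    s_prev, s_prev2 = 0.0, 0.0
    for x in samples:
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    return s_prev * s_prev + s_prev2 * s_prev2 - coeff * s_prev * s_prev2

def tone_fraction(samples, sample_rate, target_hz):
    """Rough share of the frame's energy at target_hz (about 1.0 for a pure tone)."""
    energy = sum(x * x for x in samples)
    if energy == 0.0:
        return 0.0                                # silent frame
    power = goertzel_power(samples, sample_rate, target_hz)
    return 2.0 * power / (len(samples) * energy)

def detect_tone(stream, sample_rate, target_hz, frame_size=1024, threshold=0.5):
    """Yield the start time (seconds) of each full frame dominated by target_hz.

    stream is any iterable of float samples; frame_size and threshold are
    assumed values that would need tuning on real recordings.
    """
    frame = []
    index = 0
    for x in stream:
        frame.append(x)
        if len(frame) == frame_size:
            if tone_fraction(frame, sample_rate, target_hz) >= threshold:
                yield index * frame_size / sample_rate
            frame = []
            index += 1
```

In practice the samples would come from a microphone or a decoded file rather than a list, and a real doorbell usually has several tones plus background noise, so this is only a starting point before moving to a trained model.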