Forum: >>> Magnum BBS <<<

VOICE RECOGNITION BASED ON SPECTROGRAM

From [email protected]@21:1/5 to All on Sat Jan 7 13:14:32 2017

Can we find out the total number of speakers, and how long each one speaks, by analysing a spectrogram?
[image description] (https://drive.google.com/drive/folders/0B4rwzcsr5hevdEJlam9scTRodTg)

Just by looking at the image I can see some pattern, but I am looking for a proper solution in terms of OpenCV code (Python).

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Martin Leese@21:1/5 to [email protected] on Sun Jan 8 12:18:28 2017

[email protected] wrote:

Can we find out the total number of speakers, and how long each one speaks, by analysing a spectrogram?
[image description] (https://drive.google.com/drive/folders/0B4rwzcsr5hevdEJlam9scTRodTg)

In general, no. Different speakers can
use similar frequency ranges, so frequency
alone cannot tell them apart. The ear/brain
uses spatial processing instead (search for
"cocktail party effect").
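
As a rough illustration of what a spectrogram does and does not give you, here is a minimal sketch, not a speaker-counting solution. It assumes the spectrogram is already available as a 2D grid indexed as spec[frequency_bin][time_column]; the function name, the threshold and the seconds_per_column value are all hypothetical choices made for the example. It only finds when something is being said, by thresholding the summed intensity of each time column.

```python
def active_segments(spec, seconds_per_column, threshold):
    """Return (start, end) times in seconds for runs of time columns
    whose summed intensity exceeds `threshold`.

    `spec` is a 2D list indexed as spec[frequency_bin][time_column].
    The layout and the threshold are assumptions for this sketch.
    """
    n_cols = len(spec[0])
    # Total intensity in each time column, summed over all frequency bins.
    energy = [sum(row[t] for row in spec) for t in range(n_cols)]

    segments = []
    start = None  # column index where the current active run began
    for t, e in enumerate(energy):
        if e > threshold and start is None:
            start = t
        elif e <= threshold and start is not None:
            segments.append((start * seconds_per_column, t * seconds_per_column))
            start = None
    if start is not None:
        # The last run continues to the end of the spectrogram.
        segments.append((start * seconds_per_column, n_cols * seconds_per_column))
    return segments


# Toy spectrogram: 3 frequency bins x 8 time columns (made-up values).
# Both bursts occupy the same bins, as two different voices easily could.
toy = [
    [0, 5, 5, 0, 0, 4, 4, 4],
    [0, 6, 6, 0, 0, 5, 5, 5],
    [0, 1, 1, 0, 0, 1, 1, 1],
]
segments = active_segments(toy, seconds_per_column=0.5, threshold=3)
print(segments)
```

The two bursts in the toy data sit in the same frequency bins, so this kind of analysis can time them but cannot say whether they come from one speaker or two. If the spectrogram is only available as an image, a grayscale load such as cv2.imread(path, cv2.IMREAD_GRAYSCALE) returns a 2D array that could be fed in the same way, assuming rows are frequency and columns are time.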

Just by looking at the image I can see some pattern, but I am looking for a proper solution in terms of OpenCV code (Python).

I can't, because I do not have permission
to view the file.

--
Regards,
Martin Leese
E-mail: [email protected]
Web: http://members.tripod.com/martin_leese/

