(Mis)Uses of Technology

by Mike Masnick




Searching For Sound

from the limitations dept

Many people have pointed out that search engines (yes, mainly Google) are now the "front end to the internet." But how does that work when the internet is increasingly about more than text? As broadband catches on around the world, more and more content is audio and video. Both new and old search engines are now working on better ways to sort through that content, using metadata and speech recognition to understand what's being said. The article uses NPR as the main example, describing how it uses voice recognition technology to create immediate, fully searchable transcripts of its audio. NPR admits that these transcripts are later replaced by "more accurate" human-written ones, but says the automated versions work well enough in the meantime. The article also focuses on StreamSage, which seems to be one of the more advanced tools. It uses voice recognition to transcribe audio, but also tries to add some contextual analysis to create an automated "table of contents" for the file, so searching through it is much easier.
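
For anyone wondering what "searchable audio" might look like in practice, here is a rough sketch. It assumes a speech recognizer has already produced timestamped chunks of text; the sample segments and the function names (tokenize, build_index, search) are invented for illustration and have nothing to do with how NPR or StreamSage actually do it. The idea is simply to map each recognized word back to the points in the recording where it was spoken.

```python
import re
from collections import defaultdict

# Hypothetical recognizer output: (start time in seconds, recognized text).
# These segments are made up for illustration.
SEGMENTS = [
    (0.0, "Welcome to the program, today we look at broadband adoption"),
    (12.5, "Search engines are the front end to the internet"),
    (30.0, "Audio and video content is harder to search than text"),
]

def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())

def build_index(segments):
    """Map each word to the set of segment start times where it occurs."""
    index = defaultdict(set)
    for start, text in segments:
        for word in tokenize(text):
            index[word].add(start)
    return index

def search(index, query):
    """Return sorted start times of segments containing every query word."""
    words = tokenize(query)
    if not words:
        return []
    hits = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(hits)

if __name__ == "__main__":
    index = build_index(SEGMENTS)
    # Each hit is a point in the audio a player could jump to.
    print(search(index, "search engines"))
```

A real system would also have to cope with recognition errors, which is presumably part of why the automated transcripts eventually get swapped for human-written ones.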
