Saturday, September 27, 2008

Google Audio Indexing is GAUDI

Google Audio Indexing is a new technology from Google that allows users to better search and watch videos from various YouTube channels. It uses speech technology to find spoken words inside videos and lets the user jump to the right portion of the video where these words are spoken.

Political videos and election materials are a special case of broadcast news content, a domain that has received a lot of academic and industry attention and one in which speech recognition is known to perform well.

By making the technology available to a wide audience, Google hopes to both offer a useful service and learn what internet users think of this new technology.

Google Audio Indexing uses speech technology to transform spoken words into text and leverages the Google indexing technology to return the best results to the user.
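As a rough illustration of the idea (not Google's actual implementation), the sketch below assumes the recognizer emits each word with the time at which it is spoken, and builds a small inverted index from words to positions inside videos. The sample data, `build_index`, and `lookup` are all hypothetical names made up for this example.

```python
from collections import defaultdict

# Hypothetical recognizer output: (video_id, word, start time in seconds).
TRANSCRIPT = [
    ("video_a", "economy", 12.5),
    ("video_a", "taxes", 40.0),
    ("video_b", "economy", 3.0),
    ("video_a", "economy", 95.2),
]


def build_index(words):
    """Map each lowercased word to {video_id: [start times]}."""
    index = defaultdict(lambda: defaultdict(list))
    for video_id, word, start in words:
        index[word.lower()][video_id].append(start)
    return index


def lookup(index, query):
    """Return {video_id: sorted start times} for a single-word query."""
    hits = index.get(query.lower(), {})
    return {video_id: sorted(times) for video_id, times in hits.items()}


index = build_index(TRANSCRIPT)
result = lookup(index, "Economy")
```

Each start time returned by `lookup` is a point a player could seek to, which is how a search hit can take the user straight to the moment the word is spoken.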

The returned videos are ranked on signals such as the spoken content, the metadata, and the freshness of the video.
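How these signals are actually combined is not public. Purely as a sketch, one simple way to blend them is a weighted sum in which freshness decays with the age of the video. The `Video` fields, the weights, and the one-week half-life below are assumptions chosen for illustration only.

```python
from dataclasses import dataclass


@dataclass
class Video:
    video_id: str
    spoken_hits: int    # query matches in the recognized speech
    metadata_hits: int  # query matches in the title, description, etc.
    age_days: float     # days since upload


def score(video, w_spoken=1.0, w_meta=2.0, w_fresh=1.0, half_life_days=7.0):
    """Weighted sum of the three signals; all weights are made-up values."""
    # Freshness is 1.0 for a brand-new video and halves every half_life_days.
    freshness = 0.5 ** (video.age_days / half_life_days)
    return (w_spoken * video.spoken_hits
            + w_meta * video.metadata_hits
            + w_fresh * freshness)


def rank(videos):
    """Return videos ordered from highest to lowest score."""
    return sorted(videos, key=score, reverse=True)


candidates = [
    Video("older_speech", spoken_hits=3, metadata_hits=0, age_days=7.0),
    Video("title_only", spoken_hits=0, metadata_hits=1, age_days=0.0),
    Video("fresh_speech", spoken_hits=3, metadata_hits=0, age_days=0.0),
]
ranked_ids = [v.video_id for v in rank(candidates)]
```

With these weights, two videos that match equally in speech are separated by freshness, and a video that matches only in its metadata falls behind both.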

Google periodically crawls the YouTube political channels for new content. As soon as a new video is uploaded to YouTube, it is processed by the system and made available in the index for people to search. The speech research group at Google has developed its own speech recognition system called GAUDI, which powers both Google Audio Indexing and the Google Elections Video Search gadget.