Categories
Archives
- July 2010
- June 2010
- May 2010
- April 2010
- March 2010
- February 2010
- January 2010
- December 2009
- November 2009
- October 2009
- September 2009
- August 2009
- July 2009
- June 2009
- May 2009
- April 2009
- January 2009
- May 2008
- April 2008
- January 2008
- December 2007
- April 2007
- February 2007
- January 2007
- October 2006
- August 2006
- June 2006
- May 2006
Google adds automatic captions to YouTube
This week, Google announced the addition of automatic captioning to YouTube. Captions can now be machine-generated by Google’s built-in automatic speech recognition (ASR) technology. This is a huge step toward making video content more accessible and more discoverable.
When I first came to MPOW, our group was piloting search technology that paired a video in RealPlayer side-by-side with a text transcript and then highlighted the words as the video played. It worked, but there was a significant bottleneck at the transcription stage.
To prepare the transcripts, we had to ship our videos on DVD to the vendor (who shall go unnamed) for them to complete the transcription which then took weeks or even months. The long-awaited end result was an XML document with each word and paragraph timecode-tagged. They told us they were using sophisticated automatic voice recognition technology and that the results then had to be proofread by humans. My suspicion is that most of the transcribing was people-powered. In any case, it was expensive and time-consuming to the point where the video content might be dated by the time it was finally available to the user. It just didn’t seem to be scalable and I ultimately decided it wasn’t worth it.
In the past year, I’ve seen a couple of video-search platforms that do similar things but probably much better. They’re still expensive though.
I’m anxious to see how well Google’s voice-to-text extraction works. I don’t hold lofty expectations since much of our content will have lots of medical terminology. If it does work, the next step would be to embed the YouTube viewer into DSpace’s XMLUI and combine the searches somehow.