I'd like to see one of the first tangible results from the "OpenASR" project (an open source speech to text engine being built by ICSI using MPF as the internal framework) be a detector that would emit metadata when it detects human speech and then when it detects a "significant" period of non-speech thereafter. Ideally, the speech detected "events" could then be used to trigger storyboard keyframes.
