CHARACTERIZING CONTENT FOR AUDIO-VIDEO DUBBING AND OTHER TRANSFORMATIONS
|Owner||WARNER BROS. ENTERTAINMENT INC. (USA)|
AbstractA computer-implemented method for transforming audio-video data includes automatically detecting substantially all discrete human-perceivable messages encoded in the audio-video data, determining a semantic encoding for each of the detected messages, assigning a time code to each of the encodings correlated to specific frames of the audio-video data, and recording a data structure relating each time code to a corresponding one of the semantic encodings in a recording medium. The method may further include converting extracted recorded vocal instances from the audio-video data into a text data, generating a dubbing list comprising the text data and the time code, assigning a set of annotations corresponding to the one or more vocal instances specifying one or more creative intents, generating the scripting data comprising the dubbing list and the set of annotations, and other optional operations. An apparatus may be programmed to perform the method by executable instructions for the foregoing operations.
IPC Classes ?
- G10L 21/055 - Time compression or expansion for synchronising with other signals, e.g. video signals
- G10L 15/26 - Speech to text systems
- H04N 21/43 - Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronizing decoder's clock; Client middleware
- G11B 27/02 - Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers