Dolby International AB

Netherlands

1-100 of 1,094 for Dolby International AB
Aggregations
IP Type
        Patent 1,092
        Trademark 2
Jurisdiction
        United States 745
        World 320
        Canada 29
Date
New (last 4 weeks) 1
2024 March (MTD) 1
2024 January 1
2023 November 3
2024 (YTD) 2
IPC Class
G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing 362
G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders 218
G10L 19/16 - Vocoder architecture 177
H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control 177
G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis 166
NICE Class
09 - Scientific and electric apparatus and instruments 2
38 - Telecommunications services 2
42 - Scientific, technological and industrial services, research and design 2
Status
Pending 78
Registered / In Force 1,016

1.

METHOD FOR SIGNALING A STEP-WISE TEMPORAL SUB-LAYER ACCESS SAMPLE

      
Application Number 18515746
Status Pending
Filing Date 2023-11-21
First Publication Date 2024-03-21
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor Deshpande, Sachin G.

Abstract

An electronic device for encoding a picture is described. The electronic device includes a processor and instructions stored in memory that are in electronic communication with the processor. The instructions are executable to encode a step-wise temporal sub-layer access (STSA) sample grouping. The instructions are further executable to send and/or store the STSA sample grouping.

IPC Classes

  • H04N 19/31 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the temporal domain
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 21/2343 - Processing of video elementary streams, e.g. splicing of video streams or manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
  • H04N 21/235 - Processing of additional data, e.g. scrambling of additional data or processing content descriptors

2.

SIGNALING SCALABILITY INFORMATION IN A PARAMETER SET

      
Application Number 18196850
Status Pending
Filing Date 2023-05-12
First Publication Date 2024-01-04
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor Deshpande, Sachin G.

Abstract

A system for decoding a video bitstream includes receiving a frame of the video that includes at least one slice and at least one tile and where each of the at least one slice and the at least one tile are not all aligned with one another.

IPC Classes

  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
  • H04N 13/161 - Encoding, multiplexing or demultiplexing different image signal components
  • H04N 19/597 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
  • H04N 19/112 - Selection of coding mode or of prediction mode according to a given display mode, e.g. for interlaced or progressive display mode
  • H04N 19/103 - Selection of coding mode or of prediction mode

3.

SYSTEM FOR NESTED ENTROPY ENCODING

      
Application Number 18138892
Status Pending
Filing Date 2023-04-25
First Publication Date 2023-11-23
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Su, Yeping
  • Segall, Christopher A.

Abstract

Methods and systems for improving coding efficiency of video.

IPC Classes

  • H04N 19/513 - Processing of motion vectors
  • H04N 19/52 - Processing of motion vectors by encoding by predictive encoding
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder

4.

Method of Coding and Decoding Images, Coding and Decoding Device and Computer Programs Corresponding Thereto

      
Application Number 18218521
Status Pending
Filing Date 2023-07-05
First Publication Date 2023-11-02
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Henry, Felix
  • Pateux, Stephane
  • Clare, Gordon

Abstract

A method for coding includes: segmenting an image into blocks; grouping blocks into a number of subsets; coding, using an entropy coding module, each subset, by associating digital information with symbols of each block of a subset, including, for the first block of the image, initializing state variables of the coding module; and generating a data sub-stream representative of at least one of the coded subsets of blocks. Where a current block is the first block to be coded of a subset, symbol occurrence probabilities for the first current block are determined based on those for a coded and decoded predetermined block of at least one other subset. Where the current block is the last coded block of the subset: writing, in the sub-stream representative of the subset, the entire digital information associated with the symbols during coding of the blocks of the subset, and implementing the initializing sub-step.

IPC Classes

  • H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
  • H04N 19/196 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters
  • H04N 19/50 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
  • H04N 19/503 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
  • H04N 19/51 - Motion estimation or motion compensation
  • H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
  • H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
  • H04N 19/124 - Quantisation
  • H04N 19/137 - Motion inside a coding unit, e.g. average field, frame or block difference
  • H04N 19/174 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
  • H04N 19/436 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements
  • H04N 19/625 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
  • H04N 19/136 - Incoming video signal characteristics or properties
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks

5.

Method of Coding and Decoding Images, Coding and Decoding Device and Computer Programs Corresponding Thereto

      
Application Number 18219036
Status Pending
Filing Date 2023-07-06
First Publication Date 2023-11-02
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Henry, Felix
  • Pateux, Stephane

Abstract

A method of coding at least one image comprising the steps of splitting the image into a plurality of blocks, of grouping said blocks into a predetermined number of subsets of blocks, of coding each of said subsets of blocks in parallel, the blocks of a subset considered being coded according to a predetermined sequential order of traversal. The coding step comprises, for a current block of a subset considered, the sub-step of predictive coding of said current block with respect to at least one previously coded and decoded block, and the sub-step of entropy coding of said current block on the basis of at least one probability of appearance of a symbol.
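
The structure described in this abstract (split into blocks, group into subsets, code subsets in parallel, visit blocks within a subset in a fixed order) can be sketched roughly as follows. This is only an illustration: `group_into_subsets`, `code_subset` and `code_image` are hypothetical names, the contiguous grouping rule is an assumption, and the per-block "coding" is a stand-in string rather than real predictive or entropy coding.

```python
from concurrent.futures import ThreadPoolExecutor


def group_into_subsets(blocks, num_subsets):
    """Split the block list into at most num_subsets contiguous subsets.

    Contiguous grouping is an assumption; the abstract only says the
    number of subsets is predetermined.
    """
    size = -(-len(blocks) // num_subsets)  # ceiling division
    return [blocks[i:i + size] for i in range(0, len(blocks), size)]


def code_subset(subset):
    """Stand-in for coding one subset: blocks are visited in sequential order."""
    return [f"coded({block})" for block in subset]


def code_image(blocks, num_subsets):
    """Code every subset in its own worker and return results in subset order."""
    subsets = group_into_subsets(blocks, num_subsets)
    with ThreadPoolExecutor(max_workers=len(subsets)) as pool:
        return list(pool.map(code_subset, subsets))
```

Threads are used here only to show the shape of the parallelism; a real codec would more likely run subsets on separate cores or hardware units.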

IPC Classes

  • H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
  • H04N 19/50 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
  • H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
  • H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
  • H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
  • H04N 19/174 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
  • H04N 19/436 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements
  • H04N 19/25 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding with scene description coding, e.g. binary format for scenes [BIFS] compression
  • H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
  • H04N 19/51 - Motion estimation or motion compensation

6.

ACOUSTIC ECHO CANCELLATION CONTROL FOR DISTRIBUTED AUDIO DEVICES

      
Application Number 17628732
Status Pending
Filing Date 2020-07-29
First Publication Date 2023-10-05
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Netherlands)
Inventor
  • Dickins, Glenn N.
  • Hines, Christopher Graham
  • Gunawan, David
  • Cartwright, Richard J.
  • Seefeldt, Alan J.
  • Arteaga, Daniel
  • Thomas, Mark R.P.
  • Lando, Joshua B.

Abstract

An audio processing method may involve receiving output signals from each microphone of a plurality of microphones in an audio environment, the output signals corresponding to a current utterance of a person and determining, based on the output signals, one or more aspects of context information relating to the person, including an estimated current proximity of the person to one or more microphone locations. The method may involve selecting two or more loudspeaker-equipped audio devices based, at least in part, on the one or more aspects of the context information, determining one or more types of audio processing changes to apply to audio data being rendered to loudspeaker feed signals for the audio devices and causing one or more types of audio processing changes to be applied. In some examples, the audio processing changes have the effect of increasing a speech to echo ratio at one or more microphones.

IPC Classes

  • H04M 9/08 - Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
  • G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog

7.

GOLOMB-RICE/EG CODING TECHNIQUE FOR CABAC IN HEVC

      
Application Number 18204326
Status Pending
Filing Date 2023-05-31
First Publication Date 2023-09-28
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Kim, Seung-Hwan
  • Kerofsky, Louis J.
  • Segall, Christopher A.

Abstract

A system utilizing a high throughput coding mode for CABAC in HEVC is described. The system may include an electronic device configured to obtain a block of data to be encoded using an arithmetic based encoder; to generate a sequence of syntax elements using the obtained block; to compare an Absolute-3 value of the sequence or a parameter associated with the Absolute-3 value to a preset value; and to convert the Absolute-3 value to a codeword using a first code or a second code that is different than the first code, according to a result of the comparison.
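
The comparison-then-switch step in this abstract can be illustrated with the two code families named in the title. The sketch below is not the claimed method or HEVC's actual binarization: the function names, the `threshold` of 8 and the Rice parameter `rice_k` are assumptions chosen for illustration, and how a decoder learns which code was used is left out.

```python
def golomb_rice(value: int, k: int) -> str:
    """Golomb-Rice code: unary quotient, a '0' terminator, then k remainder bits."""
    bits = "1" * (value >> k) + "0"
    if k > 0:
        bits += format(value & ((1 << k) - 1), f"0{k}b")
    return bits


def exp_golomb(value: int, k: int = 0) -> str:
    """Order-k Exp-Golomb code: leading zeros, then value + 2**k in binary."""
    shifted = value + (1 << k)
    return "0" * (shifted.bit_length() - k - 1) + format(shifted, "b")


def convert_to_codeword(value: int, threshold: int = 8, rice_k: int = 1):
    """Pick the first or second code by comparing value to a preset threshold.

    threshold and rice_k are hypothetical defaults, not values from the source.
    """
    if value < threshold:
        return "golomb_rice", golomb_rice(value, rice_k)
    return "exp_golomb", exp_golomb(value)
```

For small values the Golomb-Rice codeword stays short, while Exp-Golomb grows only logarithmically for large values, which is the usual motivation for switching between the two.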

IPC Classes

  • H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H03M 7/40 - Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
  • H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder

8.

BASS ENHANCEMENT FOR LOUDSPEAKERS

      
Application Number 17913156
Status Pending
Filing Date 2021-03-19
First Publication Date 2023-07-06
Owner
  • Dolby International AB (Netherlands)
  • Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Ekstrand, Per
  • Hao, Yuxing
  • Yu, Xuemei

Abstract

A method of audio processing includes generating harmonics in a hybrid complex quadrature mirror filter domain. Generating the harmonics may include multiplication, using a feedback delay loop, and dynamic compression. The harmonics may be generated based on one or more hybrid sub-bands of the complex transform domain signal.

IPC Classes

  • H04R 3/04 - Circuits for transducers for correcting frequency response

9.

COORDINATION OF AUDIO DEVICES

      
Application Number 17630779
Status Pending
Filing Date 2020-07-28
First Publication Date 2023-06-29
Owner
  • DOLBY INTERNATIONAL AB (Netherlands)
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Dickins, Glenn N.
  • Thomas, Mark Richard Paul
  • Seefeldt, Alan J.
  • Lando, Joshua B.
  • Arteaga, Daniel
  • Dyonisio, Carlos Medaglia
  • Gunawan, David
  • Cartwright, Richard J.
  • Hines, Christopher Graham

Abstract

An audio session management method may involve: determining, by an audio session manager, one or more first media engine capabilities of a first media engine of a first smart audio device, the first media engine being configured for managing one or more audio media streams received by the first smart audio device and for performing first smart audio device signal processing for the one or more audio media streams according to a first media engine sample clock; receiving, by the audio session manager and via a first application communication link, first application control signals from the first application; and controlling the first smart audio device according to the first media engine capabilities, by the audio session manager, via first audio session management control signals transmitted to the first smart audio device via a first smart audio device communication link and without reference to the first media engine sample clock.

IPC Classes

  • H04L 67/141 - Setup of application sessions
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04R 1/32 - Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
  • H04R 1/40 - Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
  • H04R 3/00 - Circuits for transducers
  • H04R 3/12 - Circuits for transducers for distributing signals to two or more loudspeakers

10.

METHODS, APPARATUS AND SYSTEMS FOR DIFFRACTION MODELLING BASED ON GRID PATHFINDING

      
Application Number 17914736
Status Pending
Filing Date 2021-03-29
First Publication Date 2023-06-15
Owner Dolby International AB (Netherlands)
Inventor
  • Terentiv, Leon
  • Fischer, Daniel
  • Fersch, Christof
  • Setiawan, Panji

Abstract

Described herein is a method of processing audio content for rendering in a three-dimensional audio scene, wherein the audio content comprises a sound source at a source position, the method comprising: obtaining a voxelized representation of the three-dimensional audio scene, wherein the voxelized representation indicates volume elements in which sound can propagate and volume elements by which sound is occluded; generating a two-dimensional projection map for the audio scene based on the voxelized representation by applying a projection operation to the voxelized representation that projects onto a horizontal plane; and determining parameters indicating a virtual source position of a virtual sound source based on the source position, a listener position, and the projection map, to simulate, by rendering a virtual source signal from the virtual source position, an impact of acoustic diffraction by the three-dimensional audio scene on a source signal of the sound source at the source position. A corresponding apparatus and corresponding computer program products are also described.
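
The first two steps of this abstract (a voxel grid projected onto a horizontal plane) and the grid pathfinding named in the title can be sketched as below. Several things are assumptions: the `voxels[z][y][x]` layout, the rule that a map cell is blocked if any voxel in its column is occluded, and the use of breadth-first search with 4-connected moves. Deriving the virtual source position from the path is not specified in the abstract and is not attempted here.

```python
from collections import deque


def project_to_plane(voxels):
    """Collapse voxels[z][y][x] (True = occluded) to a 2D map[y][x].

    Assumed rule: a cell is blocked if any voxel in its vertical column is occluded.
    """
    depth, rows, cols = len(voxels), len(voxels[0]), len(voxels[0][0])
    return [[any(voxels[z][y][x] for z in range(depth)) for x in range(cols)]
            for y in range(rows)]


def shortest_path(blocked, start, goal):
    """Breadth-first search over free cells; returns a list of (y, x) or None."""
    rows, cols = len(blocked), len(blocked[0])
    previous = {start: None}
    queue = deque([start])
    while queue:
        cell = queue.popleft()
        if cell == goal:
            path = []
            while cell is not None:  # walk back to the start
                path.append(cell)
                cell = previous[cell]
            return path[::-1]
        y, x = cell
        for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
            inside = 0 <= ny < rows and 0 <= nx < cols
            if inside and not blocked[ny][nx] and (ny, nx) not in previous:
                previous[(ny, nx)] = cell
                queue.append((ny, nx))
    return None
```

In this sketch the listener and source positions would be passed as `start` and `goal`; the length of the returned path relative to the straight-line distance gives a rough sense of how far sound has to bend around the occluder.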

IPC Classes

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

11.

Automatic Leveling of Speech Content

      
Application Number 17915074
Status Pending
Filing Date 2021-03-25
First Publication Date 2023-05-25
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Netherlands)
Inventor
  • Yeh, Chunghsin
  • Cengarle, Giulio
  • De Burgh, Mark David

Abstract

Embodiments are disclosed for automatic leveling of speech content. In an embodiment, a method comprises: receiving, using one or more processors, frames of an audio recording including speech and non-speech content; for each frame: determining, using the one or more processors, a speech probability; analyzing, using the one or more processors, a perceptual loudness of the frame; obtaining, using the one or more processors, a target loudness range for the frame; computing, using the one or more processors, gains to apply to the frame based on the target loudness range and the perceptual loudness analysis, where the gains include dynamic gains that change frame-by-frame and that are scaled based on the speech probability; and applying the gains to the frame so that a resulting loudness range of the speech content in the audio recording fits within the target loudness range.
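
The per-frame gain computation in this abstract can be sketched as follows, assuming the speech probability and the perceptual loudness of each frame have already been measured by components not shown here. The function names are hypothetical, and the linear scaling of the gain by the speech probability is an assumption; the abstract only says the dynamic gains are scaled based on it.

```python
def leveling_gain_db(loudness_db, speech_prob, target_lo_db, target_hi_db):
    """Gain in dB that moves a frame toward the target loudness range.

    The full correction is the distance to the nearest range edge; it is
    scaled linearly by speech_prob (an assumed rule) so that non-speech
    frames are left mostly untouched.
    """
    if loudness_db < target_lo_db:
        full_gain = target_lo_db - loudness_db
    elif loudness_db > target_hi_db:
        full_gain = target_hi_db - loudness_db
    else:
        full_gain = 0.0
    return speech_prob * full_gain


def apply_gain(frame, gain_db):
    """Apply a dB gain to a frame of linear-amplitude samples."""
    scale = 10 ** (gain_db / 20)
    return [sample * scale for sample in frame]
```

For example, a frame measured at -30 dB with a target range of -24 to -18 dB would receive +6 dB if it is certainly speech and +3 dB if the speech probability is 0.5.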

IPC Classes

  • G10L 21/0364 - Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
  • G10L 25/21 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being power information
  • G10L 25/84 - Detection of presence or absence of voice signals for discriminating voice from noise
  • G10L 17/20 - Pattern transformations or operations aimed at increasing system robustness, e.g. against channel noise or different working conditions
  • G10L 21/028 - Voice signal separating using properties of sound source
  • G10L 21/034 - Automatic adjustment

12.

Golomb-Rice/EG coding technique for CABAC in HEVC

      
Application Number 18047391
Grant Number 11706451
Status In Force
Filing Date 2022-10-18
First Publication Date 2023-05-04
Grant Date 2023-07-18
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Kim, Seung-Hwan
  • Kerofsky, Louis J.
  • Segall, Christopher A.

Abstract

A system utilizing a high throughput coding mode for CABAC in HEVC is described. The system may include an electronic device configured to obtain a block of data to be encoded using an arithmetic based encoder; to generate a sequence of syntax elements using the obtained block; to compare an Absolute-3 value of the sequence or a parameter associated with the Absolute-3 value to a preset value; and to convert the Absolute-3 value to a codeword using a first code or a second code that is different than the first code, according to a result of the comparison.

IPC Classes

  • H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H03M 7/40 - Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
  • H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder

13.

NESTED ENTROPY ENCODING

      
Application Number 17952725
Status Pending
Filing Date 2022-09-26
First Publication Date 2023-04-06
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Su, Yeping
  • Segall, Christopher A.

Abstract

Methods and systems for improving coding/decoding efficiency of video by providing a syntax modeler, a buffer, and a decoder. The syntax modeler may associate a first sequence of symbols with syntax elements. The buffer may store tables, each represented by a symbol in the first sequence, and each used to associate a respective symbol in a second sequence of symbols with encoded data. The decoder decodes the data into a bitstream using the second sequence retrieved from a table.

IPC Classes

  • H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
  • H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
  • H03M 7/40 - Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
  • H03M 7/42 - Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code using table look-up for the coding or decoding process, e.g. using read-only memory
  • H03M 7/30 - Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
  • H04N 19/52 - Processing of motion vectors by encoding by predictive encoding
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
  • H04N 19/15 - Data rate or code amount at the encoder output by monitoring actual compressed data size at the memory before deciding storage at the transmission buffer
  • H04L 45/745 - Address table lookup; Address filtering

14.

NOISE FLOOR ESTIMATION AND NOISE REDUCTION

      
Application Number 17793539
Status Pending
Filing Date 2021-01-18
First Publication Date 2023-03-16
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Cengarle, Giulio
  • Mateos Sole, Antonio
  • Scaini, Davide

Abstract

Embodiments are disclosed for noise floor estimation and noise reduction. In an embodiment, a method comprises: obtaining an audio signal; dividing the audio signal into a plurality of buffers; determining time-frequency samples for each buffer of the audio signal; for each buffer and for each frequency, determining a median (or mean) and a measure of an amount of variation of energy based on the samples in the buffer and samples in neighboring buffers that together span a specified time range of the audio signal; combining the median (or mean) and the measure of the amount of variation of energy into a cost function; for each frequency: determining a signal energy of a particular buffer of the audio signal that corresponds to a minimum value of the cost function; selecting the signal energy as the estimated noise floor of the audio signal; and reducing, using the estimated noise floor, noise in the audio signal.
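
For a single frequency bin, the estimation steps in this abstract can be sketched as below. The sketch assumes the energies per buffer are already available, uses the population standard deviation as the measure of variation, combines the two terms as `median + weight * variation`, and takes the median energy of the winning window as the noise floor. All of these choices, the function names and the simple subtraction in `reduce_noise` are assumptions for illustration.

```python
from statistics import median, pstdev


def estimate_noise_floor(buffers, neighbors=1, weight=1.0):
    """Estimate the noise floor for one frequency bin.

    buffers: list of lists of energy values, one inner list per buffer.
    neighbors: how many buffers on each side are pooled with each buffer.
    """
    best_cost, best_energy = None, None
    for i in range(len(buffers)):
        lo = max(0, i - neighbors)
        hi = min(len(buffers), i + neighbors + 1)
        window = [e for buf in buffers[lo:hi] for e in buf]
        level = median(window)
        variation = pstdev(window)
        cost = level + weight * variation  # assumed form of the cost function
        if best_cost is None or cost < best_cost:
            best_cost, best_energy = cost, level
    return best_energy


def reduce_noise(energy, noise_floor):
    """Subtract the estimated floor from an energy value, clamped at zero."""
    return max(energy - noise_floor, 0.0)
```

Low, steady stretches of the signal minimize both terms of the cost, so the estimate tends to come from a quiet passage rather than from a loud or fluctuating one.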

IPC Classes

  • G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation

15.

Methods, devices and systems for parallel video encoding and decoding

      
Application Number 17900698
Grant Number 11838558
Status In Force
Filing Date 2022-08-31
First Publication Date 2023-03-16
Grant Date 2023-12-05
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Zhao, Jie
  • Segall, Christopher A.
  • Kerofsky, Louis J.

Abstract

A method for decoding a video bitstream is disclosed. The method comprises: entropy decoding a first portion of a video bitstream, wherein the first portion of the video bitstream is associated with a video frame, thereby producing a first portion of decoded data; entropy decoding a second portion of the video bitstream, wherein the second portion of the video bitstream is associated with the video frame, thereby producing a second portion of decoded data, wherein entropy decoding the second portion of the video bitstream is independent of entropy decoding the first portion of the video bitstream; and reconstructing a first portion of the video frame associated with the video bitstream using the first portion of decoded data and the second portion of decoded data.

IPC Classes

  • H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
  • H04N 19/15 - Data rate or code amount at the encoder output by monitoring actual compressed data size at the memory before deciding storage at the transmission buffer
  • H04N 19/192 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding the adaptation method, adaptation tool or adaptation type being iterative or recursive
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
  • H04N 19/174 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
  • H04N 19/17 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
  • H04N 19/436 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements
  • H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
  • H04N 19/43 - Hardware specially adapted for motion estimation or compensation
  • H04N 19/136 - Incoming video signal characteristics or properties
  • H04N 19/40 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/80 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
  • H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream

16.

Apparatus and method for processing an input audio signal using cascaded filterbanks

      
Application Number 18048810
Grant Number 11894002
Status In Force
Filing Date 2022-10-21
First Publication Date 2023-03-09
Grant Date 2024-02-06
Owner
  • Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung (Germany)
  • Dolby International AB (Netherlands)
Inventor
  • Villemoes, Lars
  • Ekstrand, Per
  • Disch, Sascha
  • Nagel, Frederik
  • Wilde, Stephan

Abstract

An apparatus for processing an input audio signal relies on a cascade of filterbanks, the cascade having a synthesis filterbank for synthesizing an audio intermediate signal from the input audio signal, the input audio signal being represented by a plurality of first subband signals generated by an analysis filterbank, wherein a number of filterbank channels of the synthesis filterbank is smaller than a number of channels of the analysis filterbank. The apparatus furthermore has a further analysis filterbank for generating a plurality of second subband signals from the audio intermediate signal, wherein the further analysis filterbank has a number of channels being different from the number of channels of the synthesis filterbank, so that a sampling rate of a subband signal of the plurality of second subband signals is different from a sampling rate of a first subband signal of the plurality of first subband signals.
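The sampling-rate relationship described in this abstract can be illustrated with a little arithmetic. The sketch below assumes uniform, critically sampled filterbanks, so that a bank with M channels runs each subband at 1/M of the full-band rate; the abstract does not state this. The function name and the channel counts 64, 32 and 8 are hypothetical.

```python
from fractions import Fraction

def cascade_rates(fs, analysis_channels, synthesis_channels, further_channels):
    """Return (first subband rate, intermediate rate, second subband rate) in Hz.

    Assumes critically sampled uniform filterbanks, which is an assumption of
    this sketch and not something the abstract states.
    """
    if synthesis_channels >= analysis_channels:
        raise ValueError("synthesis bank must have fewer channels than analysis bank")
    if further_channels == synthesis_channels:
        raise ValueError("further analysis bank must differ from synthesis bank")
    first_subband = Fraction(fs, analysis_channels)
    # Synthesizing with fewer channels yields a lower-rate intermediate signal.
    intermediate = first_subband * synthesis_channels
    # The further analysis bank splits the intermediate signal again.
    second_subband = intermediate / further_channels
    return first_subband, intermediate, second_subband

rates = cascade_rates(48000, 64, 32, 8)
print([float(r) for r in rates])
```

With these example numbers the first and second subband signals run at different rates, which is the property the abstract points to.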

IPC Classes

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
  • G10L 21/04 - Time compression or expansion
  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders

17.

METHOD AND APPARATUS FOR METADATA-BASED DYNAMIC PROCESSING OF AUDIO DATA

      
Document Number 03230363
Status Pending
Filing Date 2022-08-24
Open to Public Date 2023-03-02
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Fersch, Christof
  • Norcross, Scott Gregory

Abstract

Described herein is a method of metadata-based dynamic processing of audio data for playback, the method including: receiving, by a decoder, a bitstream including audio data and metadata for dynamic loudness adjustment; decoding, by the decoder, the audio data and the metadata to obtain decoded audio data and the metadata; determining, by the decoder, from the metadata, one or more processing parameters for dynamic loudness adjustment based on a playback condition; applying the determined one or more processing parameters to the decoded audio data to obtain processed audio data; and outputting the processed audio data for playback. Described is further a method of encoding audio data and metadata for dynamic loudness adjustment into a bitstream. Moreover, described are a respective decoder and encoder, a respective system and computer program products.
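The decoder-side steps in this abstract (pick parameters from the metadata for the current playback condition, then apply them to the decoded audio) can be sketched as follows. The metadata layout, the condition names, the "default" fallback key and the per-frame dB gains are all hypothetical; the abstract does not define a format.

```python
def select_parameters(metadata, playback_condition):
    """Pick the gain sequence the metadata carries for this playback condition."""
    # "default" is a hypothetical fallback key, not something the abstract defines.
    return metadata.get(playback_condition, metadata["default"])

def apply_gains(frames, gains_db):
    """Scale each decoded frame by its gain, given in dB."""
    processed = []
    for frame, gain_db in zip(frames, gains_db):
        linear = 10.0 ** (gain_db / 20.0)
        processed.append([sample * linear for sample in frame])
    return processed

# Stand-ins for the decoder output: two short frames plus loudness metadata.
decoded_audio = [[0.5, -0.5], [0.1, 0.2]]
loudness_metadata = {
    "default": [0.0, 0.0],
    "headphones": [-6.0, 0.0],
    "tv_speakers": [-20.0, 0.0],
}

gains = select_parameters(loudness_metadata, "tv_speakers")
processed_audio = apply_gains(decoded_audio, gains)
```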

IPC Classes

  • G10L 19/16 - Vocoder architecture
  • H03G 7/00 - Volume compression or expansion in amplifiers
  • G10L 21/0364 - Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
  • H03G 3/00 - Gain control in amplifiers or frequency changers
  • H03G 11/00 - Limiting amplitude; Limiting rate of change of amplitude

18.

METHODS, APPARATUS AND SYSTEMS FOR 6DOF AUDIO RENDERING AND DATA REPRESENTATIONS AND BITSTREAM STRUCTURES FOR 6DOF AUDIO RENDERING

      
Application Number 17896005
Status Pending
Filing Date 2022-08-25
First Publication Date 2023-03-02
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Terentiv, Leon
  • Fersch, Christof
  • Fischer, Daniel

Abstract

The present disclosure relates to methods, apparatus and systems for encoding an audio signal into a bitstream, in particular at an encoder, comprising: encoding or including audio signal data associated with 3DoF audio rendering into one or more first bitstream parts of the bitstream, and encoding or including metadata associated with 6DoF audio rendering into one or more second bitstream parts of the bitstream. The present disclosure further relates to methods, apparatus and systems for decoding an audio signal and audio rendering based on the bitstream.

IPC Classes

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/16 - Vocoder architecture
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic

19.

METHOD AND DEVICE FOR ADAPTIVE PLAYOUT OF MEDIA CONTENT

      
Application Number 17797569
Status Pending
Filing Date 2021-02-01
First Publication Date 2023-02-23
Owner Dolby International AB (Netherlands)
Inventor Larsen, Jonas Moeller

Abstract

Described herein is a method for controlling media data playout on a client device, wherein the method includes the steps of: (a) retrieving, by the client device, media data comprising a plurality of segments subdivided into one or more chunks for playout from at least one media server; (b) analyzing a current chunk of the one or more chunks of a current segment; and (c) adapting the playout of the media data in response to the result of the analysis prior to fully retrieving the current chunk. Described herein are further a client device having implemented a media player application configured to perform said method and a computer program product with instructions adapted to cause a device having processing capability to carry out said method.

IPC Classes

  • H04N 21/4402 - Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
  • H04N 21/845 - Structuring of content, e.g. decomposing content into time segments

20.

COMPANDING SYSTEM AND METHOD TO REDUCE QUANTIZATION NOISE USING ADVANCED SPECTRAL EXTENSION

      
Application Number 17820594
Status Pending
Filing Date 2022-08-18
First Publication Date 2023-02-16
Owner
  • DOLBY INTERNATIONAL AB (Netherlands)
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Hedelin, Per
  • Biswas, Arijit
  • Schug, Michael
  • Melkote, Vinay

Abstract

Embodiments are directed to a companding method and system for reducing coding noise in an audio codec. A compression process reduces an original dynamic range of an initial audio signal: it divides the initial audio signal into a plurality of segments using a defined window shape, calculates a wideband gain in the frequency domain using a non-energy based average of frequency domain samples of the initial audio signal, and applies individual gain values to amplify segments of relatively low intensity and attenuate segments of relatively high intensity. The compressed audio signal is then expanded back to substantially the original dynamic range by an expansion process that applies inverse gain values to amplify segments of relatively high intensity and attenuate segments of relatively low intensity. A QMF filterbank is used to analyze the initial audio signal to obtain a frequency domain representation.
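The compress-then-expand round trip can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the patented method: it works on non-overlapping time-domain segments instead of windowed QMF-domain samples, it uses the mean absolute value as the non-energy based average, and the power-law gain with exponent `gamma` is a hypothetical choice.

```python
def mean_abs(segment):
    """Non-energy based level: mean absolute value (assumed for this sketch)."""
    return sum(abs(s) for s in segment) / len(segment)

def compress(segments, gamma=0.5):
    """Amplify quiet segments and attenuate loud ones (gamma < 1)."""
    out = []
    for seg in segments:
        level = mean_abs(seg)
        gain = level ** (gamma - 1.0) if level > 0 else 1.0
        out.append([s * gain for s in seg])
    return out

def expand(segments, gamma=0.5):
    """Apply the inverse gain, recomputed from the compressed segment itself."""
    out = []
    for seg in segments:
        level = mean_abs(seg)  # equals the original level raised to gamma
        gain = level ** ((1.0 - gamma) / gamma) if level > 0 else 1.0
        out.append([s * gain for s in seg])
    return out

original = [[0.01, -0.01], [4.0, -4.0]]
compressed = compress(original)
restored = expand(compressed)
```

Because the compressed level is the original level raised to `gamma`, the expander can recover the inverse gain without side information in this idealized, noise-free setting.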

IPC Classes

  • G10L 21/0232 - Processing in the frequency domain
  • H04B 1/66 - TRANSMISSION - Details of transmission systems not characterised by the medium used for transmission for improving efficiency of transmission
  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
  • G10L 19/032 - Quantisation or dequantisation of spectral components
  • G10L 21/034 - Automatic adjustment
  • G10L 25/45 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of analysis window
  • H03G 7/00 - Volume compression or expansion in amplifiers

21.

ADAPTIVE STREAMING OF MEDIA CONTENT WITH BITRATE SWITCHING

      
Application Number 17792829
Status Pending
Filing Date 2021-01-12
First Publication Date 2023-02-16
Owner Dolby International AB (Netherlands)
Inventor Schmidt, Malte

Abstract

A method for adaptive streaming of media content with bitrate switching is described, wherein the media content comprises a plurality of consecutive media segments. The method comprises, at a media streaming server: transmitting a segment of the media content encoded in a first coding mode having a first bitrate; receiving an indication for a coding mode switch to a second coding mode having a second bitrate and in response transmitting a transition segment for transitioning between the first coding mode and the second coding mode; and transmitting another segment of the media content encoded in the second coding mode.
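The server-side sequence in this abstract can be sketched as a small generator. Everything here is hypothetical scaffolding: the mode names, the idea that a switch request is keyed by the index of the next segment to send, and the tuple layout are assumptions made for illustration only.

```python
def stream_segments(num_segments, switch_requests, initial_mode="low"):
    """Yield (index, kind, mode) for each segment the server transmits.

    switch_requests maps a segment index to the coding mode requested
    before that segment is sent (a hypothetical signalling model).
    """
    mode = initial_mode
    for index in range(num_segments):
        requested = switch_requests.get(index)
        if requested is not None and requested != mode:
            # Bridge the two coding modes with a transition segment.
            yield (index, "transition", f"{mode}->{requested}")
            mode = requested
        else:
            yield (index, "regular", mode)

sent = list(stream_segments(4, {2: "high"}))
for item in sent:
    print(item)
```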

IPC Classes

  • H04N 21/2662 - Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities

22.

VIDEO DECODER WITH REDUCED DYNAMIC RANGE TRANSFORM WITH INVERSE TRANSFORM SHIFTING MEMORY

      
Application Number 17874773
Status Pending
Filing Date 2022-07-27
First Publication Date 2023-02-16
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Kerofsky, Louis J.
  • Misra, Kiran
  • Segall, Christopher A.

Abstract

A method for decoding video includes receiving quantized coefficients representative of a block of video representative of a plurality of pixels. The quantized coefficients are dequantized based upon a function of a remainder. The dequantized coefficients are inverse transformed to determine a decoded residue.
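One common way to dequantize "based upon a function of a remainder" is the scheme used in HEVC-style codecs, where the quantization parameter is split into a remainder (which selects a scale factor) and a quotient (which becomes a bit shift). The sketch below borrows that scheme as an assumption; the abstract does not confirm it is the claimed method, and normalization shifts, scaling lists and clipping are left out.

```python
# HEVC-style scale table, indexed by qp % 6 (borrowed as an assumption).
LEVEL_SCALE = [40, 45, 51, 57, 64, 72]

def dequantize(levels, qp):
    """Scale quantized levels using the remainder and quotient of qp / 6."""
    scale = LEVEL_SCALE[qp % 6]  # depends only on the remainder
    shift = qp // 6              # the quotient becomes a left shift
    return [(level * scale) << shift for level in levels]

print(dequantize([1, -2, 0], 12))
print(dequantize([3], 7))
```

The dequantized values would then go through the inverse transform to give the decoded residue; that step is not shown.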

IPC Classes

  • H04N 19/132 - Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
  • H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/182 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel

23.

SYSTEMS, METHODS AND APPARATUS FOR CONVERSION FROM CHANNEL-BASED AUDIO TO OBJECT-BASED AUDIO

      
Application Number 17781978
Status Pending
Filing Date 2020-12-02
First Publication Date 2023-01-26
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Ward, Michael C.
  • Sanchez, Freddie
  • Fersch, Christof Joseph

Abstract

Embodiments are disclosed for channel-based audio (CBA) (e.g., 22.2-ch audio) to object-based audio (OBA) conversion. The conversion includes converting CBA metadata to object audio metadata (OAMD) and reordering the CBA channels based on channel shuffle information derived in accordance with channel ordering constraints of the OAMD. The OBA with reordered channels is rendered in a playback device using the OAMD or in a source device, such as a set-top box or audio/video recorder. In an embodiment, the CBA metadata includes signaling that indicates a specific OAMD representation to be used in the conversion of the metadata. In an embodiment, pre-computed OAMD is transmitted in a native audio bitstream (e.g., AAC) for transmission (e.g., over HDMI) or for rendering in a source device. In an embodiment, pre-computed OAMD is transmitted in a transport layer bitstream (e.g., ISO BMFF, MPEG4 audio bitstream) to a playback device or source device.
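The channel reordering step can be pictured as applying a permutation to the list of channels. In this minimal sketch the function name, the channel labels and the convention that `shuffle[i]` names the input channel placed at output position `i` are all hypothetical; the abstract does not specify how the shuffle information is encoded.

```python
def reorder_channels(channels, shuffle):
    """Return channels reordered so that output i takes input shuffle[i]."""
    if sorted(shuffle) != list(range(len(channels))):
        raise ValueError("shuffle must be a permutation of the channel indices")
    return [channels[src] for src in shuffle]

cba_channels = ["L", "R", "C", "LFE"]
reordered = reorder_channels(cba_channels, [2, 0, 1, 3])
print(reordered)
```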

IPC Classes

24.

PROJECTION SYSTEM AND METHOD OF DRIVING A PROJECTION SYSTEM

      
Application Number 17784610
Status Pending
Filing Date 2020-12-11
First Publication Date 2023-01-26
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • Dolby International AB (Netherlands)
Inventor
  • Pertierra, Juan Pablo
  • Richards, Martin J.
  • Le Barbenchon, Clement Luc Carol
  • Pires Arrifano, Angelo Miguel

Abstract

A projection system and method includes a light source configured to emit a light in response to an image data; a phase light modulator configured to receive the light from the light source and to apply a spatially-varying phase modulation on the light; and a controller configured to determine, for a frame of the image data, a plurality of phase configurations, respective ones of the plurality of phase configurations corresponding to solutions of a phase algorithm and representing the same image with a different modulation pattern, and provide a phase control signal to the phase light modulator, the phase control signal configured to cause the phase light modulator to modulate the plurality of phase configurations in a time-divisional manner within a time period of the frame, thereby to project a series of subframes within the time period.

IPC Classes

  • G03H 1/22 - Processes or apparatus for obtaining an optical image from holograms
  • H04N 9/31 - Projection devices for colour picture display

25.

Harmonic transposition in an audio coding method and system

      
Application Number 17954179
Grant Number 11594234
Status In Force
Filing Date 2022-09-27
First Publication Date 2023-01-26
Grant Date 2023-02-28
Owner Dolby International AB (Netherlands)
Inventor
  • Ekstrand, Per
  • Villemoes, Lars

Abstract

s, generating a frame of the output signal.

IPC Classes

  • G10L 19/022 - Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
  • G10L 19/24 - Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
  • G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
  • G10L 21/04 - Time compression or expansion
  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders

26.

METHODS AND DEVICES FOR PROVIDING PERSONALIZED AUDIO TO A USER

      
Application Number 17780950
Status Pending
Filing Date 2020-11-27
First Publication Date 2023-01-05
Owner Dolby International AB (Netherlands)
Inventor
  • Schildbach, Wolfgang
  • Schmidt, Malte

Abstract

The present application describes a method (400) for providing personalized audio to a user. The method (400) comprises receiving (401) a manifest file (140) for a media element from which audio is to be rendered, wherein the manifest file (140) comprises a description (141) for a plurality of different presentations (152) of audio content of the media element. In addition, the method (400) comprises selecting (402) a presentation (152) from the plurality of presentations (152) based on the manifest file (140). The method (400) further comprises receiving (403) a list of audio track objects comprised within the media element, and selecting (404) an audio track object from the list of audio track objects, in dependence of the selected presentation (152).
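The two selection steps (choose a presentation from the manifest, then choose an audio track object that matches it) can be sketched with plain dictionaries. The field names `presentations`, `language`, `track_id` and `id`, and the language-based selection rule, are hypothetical; the abstract does not describe the manifest schema or the selection criteria.

```python
def select_presentation(manifest, preferred_language):
    """Pick the first presentation matching the user's language, else the first one."""
    for presentation in manifest["presentations"]:
        if presentation["language"] == preferred_language:
            return presentation
    return manifest["presentations"][0]

def select_audio_track(audio_tracks, presentation):
    """Pick the audio track object that the selected presentation points to."""
    for track in audio_tracks:
        if track["id"] == presentation["track_id"]:
            return track
    return None

manifest = {
    "presentations": [
        {"name": "main", "language": "en", "track_id": "a1"},
        {"name": "commentary", "language": "de", "track_id": "a2"},
    ]
}
audio_tracks = [{"id": "a1", "label": "English"}, {"id": "a2", "label": "Deutsch"}]

chosen = select_presentation(manifest, "de")
track = select_audio_track(audio_tracks, chosen)
```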

IPC Classes

  • H04N 21/2343 - Processing of video elementary streams, e.g. splicing of video streams or manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
  • H04N 21/439 - Processing of audio elementary streams
  • H04N 21/2662 - Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities
  • H04N 21/84 - Generation or processing of descriptive data, e.g. content descriptors
  • H04N 21/8352 - Generation of protective data, e.g. certificates involving content or source identification data, e.g. UMID [Unique Material Identifier]

27.

METHODS, APPARATUS AND SYSTEM FOR RENDERING AN AUDIO PROGRAM

      
Application Number 17903906
Status Pending
Filing Date 2022-09-06
First Publication Date 2022-12-29
Owner Dolby International AB (Netherlands)
Inventor
  • Fersch, Christof
  • Stahlmann, Alexander

Abstract

A method for generating a bitstream indicative of an object based audio program is described. The bitstream comprises a sequence of containers. A first container of the sequence of containers comprises a plurality of substream entities for a plurality of substreams of the object based audio program and a presentation section. The method comprises determining a set of object channels. The method further comprises providing a set of object related metadata for the set of object channels. In addition, the method comprises inserting a first set of object channel frames and a first set of object related metadata frames into a respective set of substream entities of the first container. Furthermore, the method comprises inserting presentation data into the presentation section.

IPC Classes

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G11B 27/034 - Electronic editing of digitised analogue information signals, e.g. audio or video signals on discs
  • G11B 27/34 - Indicating arrangements
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • G10L 19/16 - Vocoder architecture
  • G11B 27/10 - Indexing; Addressing; Timing or synchronising; Measuring tape travel
  • G11B 27/32 - Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording on separate auxiliary tracks of the same or an auxiliary record carrier
  • G10L 19/018 - Audio watermarking, i.e. embedding inaudible data in the audio signal

28.

DEEP SOURCE SEPARATION ARCHITECTURE

      
Application Number 17770177
Status Pending
Filing Date 2020-10-20
First Publication Date 2022-12-22
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Netherlands)
Inventor
  • Kadioglu, Berkan
  • Horgan, Michael Getty
  • Puig, Jordi Pons
  • Liu, Xiaoyu

Abstract

A speech separation server comprises a deep-learning encoder with nonlinear activation. The encoder is programmed to take a mixture audio waveform in the time domain, learn generalized patterns from the mixture audio waveform, and generate an encoded representation that effectively characterizes the mixture audio waveform for speech separation.

IPC Classes

  • G10L 21/028 - Voice signal separating using properties of sound source
  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
  • G06N 3/04 - Architecture, e.g. interconnection topology

29.

METHODS AND SYSTEM FOR WAVEFORM CODING OF AUDIO SIGNALS WITH A GENERATIVE MODEL

      
Application Number 17770035
Status Pending
Filing Date 2020-10-16
First Publication Date 2022-12-08
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Klejsa, Janusz
  • Biswas, Arijit
  • Villemoes, Lars
  • Fejgin, Roy M.
  • Zhou, Cong

Abstract

Described herein is a method of waveform decoding, the method including the steps of: (a) receiving, by a waveform decoder, a bitstream including a finite bitrate representation of a source signal; (b) waveform decoding the finite bitrate representation of the source signal to obtain a waveform approximation of the source signal; (c) providing the waveform approximation of the source signal to a generative model that implements a probability density function, to obtain a probability distribution for a reconstructed signal of the source signal; and (d) generating the reconstructed signal of the source signal based on the probability distribution. Described are further a method and system for waveform coding and a method of training a generative model.

IPC Classes

  • G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis

30.

Picture coding method, picture decoding method, picture coding apparatus, picture decoding apparatus, and program thereof

      
Application Number 16148803
Grant Number RE049321
Status In Force
Filing Date 2018-10-01
First Publication Date 2022-11-29
Grant Date 2022-11-29
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor Kadono, Shinya

Abstract

A picture coding method of the present invention codes a picture signal and a ratio of a number of luminance pixels and a number of chrominance pixels for the picture signal, and then one coding method out of at least two coding methods is selected depending on the ratio. Next, data related to a picture size is coded in accordance with the selected coding method. The data related to the picture size indicates a size of the picture corresponding to the picture signal or an output area, which is a pixel area to be outputted in decoding in a whole pixel area coded in the picture signal coding.

IPC Classes

  • H04N 7/12 - Systems in which the television signal is transmitted via one channel or a plurality of parallel channels, the bandwidth of each channel being less than the bandwidth of the television signal
  • H04N 19/16 - Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter for a given display mode, e.g. for interlaced or progressive display mode
  • H04N 19/103 - Selection of coding mode or of prediction mode
  • H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
  • H04N 19/122 - Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type
  • H04N 19/182 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
  • H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
  • H04N 19/59 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
  • H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression

31.

TRANSFORMING AUDIO SIGNALS CAPTURED IN DIFFERENT FORMATS INTO A REDUCED NUMBER OF FORMATS FOR SIMPLIFYING ENCODING AND DECODING OPERATIONS

      
Application Number 17882900
Status Pending
Filing Date 2022-08-08
First Publication Date 2022-11-24
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Netherlands)
Inventor
  • Bruhn, Stefan
  • Eckert, Michael
  • Torres, Juan Felix
  • Brown, Stefanie
  • Mcgrath, David S.

Abstract

The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported by an encoding unit of the audio device. Based on the determining, the simplification unit converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding unit.

IPC Classes

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

32.

Subband block based harmonic transposition

      
Application Number 17751214
Grant Number 11646047
Status In Force
Filing Date 2022-05-23
First Publication Date 2022-11-17
Grant Date 2023-05-09
Owner Dolby International AB (Netherlands)
Inventor Villemoes, Lars

Abstract

The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion adds brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank configured to provide an analysis subband signal from the input signal, wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S. The subband processing unit performs a block based nonlinear processing wherein the magnitudes of samples of the synthesis subband signal are determined from the magnitudes of corresponding samples of the analysis subband signal and of a predetermined sample of the analysis subband signal. In addition, the system comprises a synthesis filterbank configured to generate the time stretched and/or frequency transposed signal from the synthesis subband signal.
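The block based nonlinear processing can be sketched for a single block of complex subband samples. The abstract only says that each output magnitude depends on the corresponding input magnitude and on that of a predetermined sample; the geometric weighting with `rho`, the choice of the block centre as the predetermined sample, and the phase rule below are assumptions of this sketch. The stretch factor S is not modelled here.

```python
import cmath

def process_block(block, q, rho=0.5, center=None):
    """Nonlinearly process one block of complex analysis subband samples.

    Assumed rules (not confirmed by the abstract):
      magnitude = |ref|**rho * |x|**(1 - rho)
      phase     = (q - 1) * phase(ref) + phase(x)
    where ref is the predetermined (here: centre) sample of the block.
    """
    if center is None:
        center = len(block) // 2
    ref = block[center]
    ref_mag, ref_phase = abs(ref), cmath.phase(ref)
    out = []
    for x in block:
        magnitude = (ref_mag ** rho) * (abs(x) ** (1.0 - rho))
        phase = (q - 1) * ref_phase + cmath.phase(x)
        out.append(cmath.rect(magnitude, phase))
    return out

# A constant block: magnitude 2, phase 0.3 rad, transposition factor Q = 2.
block = [cmath.rect(2.0, 0.3)] * 3
out = process_block(block, q=2)
```

For this constant block the magnitude is preserved and the phase is multiplied by Q, which is the behaviour one would expect from a transposer under these assumptions.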

IPC Classes

  • G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
  • G10L 19/022 - Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
  • G10L 21/04 - Time compression or expansion
  • G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
  • G10L 19/032 - Quantisation or dequantisation of spectral components

33.

DYNAMICS PROCESSING ACROSS DEVICES WITH DIFFERING PLAYBACK CAPABILITIES

      
Application Number 17630897
Status Pending
Filing Date 2020-07-27
First Publication Date 2022-11-10
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Seefeldt, Alan J.
  • Lando, Joshua B.
  • Arteaga, Daniel

Abstract

Individual loudspeaker dynamics processing configuration data, for each of a plurality of loudspeakers of a listening environment, may be obtained. Listening environment dynamics processing configuration data may be determined, based on the individual loudspeaker dynamics processing configuration data. Dynamics processing may be performed on received audio data based on the listening environment dynamics processing configuration data, to generate processed audio data. The processed audio data may be rendered for reproduction via a set of loudspeakers that includes at least some of the plurality of loudspeakers, to produce rendered audio signals. The rendered audio signals may be provided to, and reproduced by, the set of loudspeakers.

IPC Classes

  • H04R 5/04 - Circuit arrangements
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04R 5/02 - Spatial or constructional arrangements of loudspeakers
  • H04R 3/04 - Circuits for transducers for correcting frequency response

34.

System and method for non-destructively normalizing loudness of audio signals within portable devices

      
Application Number 17750803
Grant Number 11670315
Status In Force
Filing Date 2022-05-23
First Publication Date 2022-11-10
Grant Date 2023-06-06
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Netherlands)
Inventor
  • Riedmiller, Jeffrey
  • Mundt, Harald
  • Schug, Michael
  • Wolters, Martin

Abstract

Many portable playback devices cannot decode and playback encoded audio content having wide bandwidth and wide dynamic range with consistent loudness and intelligibility unless the encoded audio content has been prepared specially for these devices. This problem can be overcome by including with the encoded content some metadata that specifies a suitable dynamic range compression profile by either absolute values or differential values relative to another known compression profile. A playback device may also adaptively apply gain and limiting to the playback audio. Implementations in encoders, in transcoders and in decoders are disclosed.
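The idea of specifying a compression profile "by either absolute values or differential values relative to another known compression profile" can be sketched as a small resolver. The parameter names, the `mode` field and the reference profile values are hypothetical; the abstract does not define the metadata syntax.

```python
# A hypothetical profile that both encoder and decoder are assumed to know.
KNOWN_PROFILE = {"threshold_db": -20.0, "ratio": 2.0, "max_gain_db": 6.0}

def resolve_profile(metadata, reference=KNOWN_PROFILE):
    """Turn profile metadata into concrete compression parameters."""
    if metadata["mode"] == "absolute":
        # The metadata carries the full profile.
        return dict(metadata["values"])
    if metadata["mode"] == "differential":
        # The metadata carries only offsets from the known profile.
        profile = dict(reference)
        for key, delta in metadata["values"].items():
            profile[key] = reference[key] + delta
        return profile
    raise ValueError("unknown profile mode: " + str(metadata["mode"]))

portable = resolve_profile({"mode": "differential", "values": {"threshold_db": -10.0}})
print(portable)
```

Sending only the offsets keeps the metadata small when the target profile is close to the known one.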

IPC Classes  ?

  • G10L 19/22 - Mode decision, i.e. based on audio signal content versus external parameters
  • G10L 19/16 - Vocoder architecture
  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
  • H03G 7/00 - Volume compression or expansion in amplifiers
  • H03G 3/32 - Automatic control in amplifiers having semiconductor devices the control being dependent upon ambient noise level or sound level
  • G10L 19/26 - Pre-filtering or post-filtering
  • H03G 3/30 - Automatic control in amplifiers having semiconductor devices

35.

METHODS, APPARATUS AND SYSTEMS FOR MODELLING AUDIO OBJECTS WITH EXTENT

      
Application Number EP2022061331
Publication Number 2022/229319
Status In Force
Filing Date 2022-04-28
Publication Date 2022-11-03
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Terentiv, Leon
  • Fischer, Daniel
  • Setiawan, Panji
  • Fersch, Christof

Abstract

A method of modelling extended audio objects for audio rendering in a virtual or augmented reality environment is described. The method comprises obtaining an extent representation indicative of a geometric form of an extended audio object and information relating to one or more first audio sources that are associated with the extended audio object. Furthermore, the method comprises obtaining a relative point on the geometric form of the extended audio object based on a user position in the virtual or augmented reality environment. The method also comprises determining an extent parameter for the extent representation based on the user position and the relative point and determining positions of one or more second audio sources, relative to the user position, for modelling the extended audio object. In addition, the method comprises outputting a modified representation of the extended audio object for modelling the extended audio object.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

36.

COORDINATION OF AUDIO DEVICES

      
Application Number 17631024
Status Pending
Filing Date 2020-07-27
First Publication Date 2022-10-27
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Dickins, Glenn N.
  • Cartwright, Richard J.
  • Gunawan, David
  • Hines, Christopher Graham
  • Thomas, Mark R. P.
  • Seefeldt, Alan J.
  • Lando, Joshua B.
  • Dyonisio, Carlos Eduardo Medaglia
  • Arteaga, Daniel

Abstract

An audio session management method for an audio environment having multiple audio devices may involve receiving, from a first device implementing a first application and by a device implementing an audio session manager, a first route initiation request to initiate a first route for a first audio session. The first route initiation request may indicate a first audio source and a first audio environment destination. The first audio environment destination may correspond with at least a first person in the audio environment, but in some instances will not indicate an audio device. The method may involve establishing a first route corresponding to the first route initiation request. Establishing the first route may involve determining a first location of at least the first person in the audio environment, determining at least one audio device for a first stage of the first audio session and initiating or scheduling the first audio session.

IPC Classes  ?

  • H04R 5/04 - Circuit arrangements
  • G06N 20/00 - Machine learning
  • G10L 15/08 - Speech classification or search
  • H04R 3/00 - Circuits for transducers
  • H04R 3/12 - Circuits for transducers for distributing signals to two or more loudspeakers
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

37.

ADAPTABLE SPATIAL AUDIO PLAYBACK

      
Application Number 17630098
Status Pending
Filing Date 2020-07-16
First Publication Date 2022-10-20
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Seefeldt, Alan J.
  • Lando, Joshua B.
  • Arteaga, Daniel
  • Dickins, Glenn N.
  • Thomas, Mark Richard Paul

Abstract

A rendering mode may be determined for received audio data, including audio signals and associated spatial data. The audio data may be rendered for reproduction via a set of loudspeakers of an environment according to the rendering mode, to produce rendered audio signals. Rendering the audio data may involve determining relative activation of a set of loudspeakers in an environment. The rendering mode may be variable between a reference spatial mode and one or more distributed spatial modes. The reference spatial mode may have an assumed listening position and orientation. In the distributed spatial mode(s), one or more elements of the audio data may each be rendered in a more spatially distributed manner than in the reference spatial mode and spatial locations of remaining elements of the audio data may be warped such that they span a rendering space of the environment more completely than in the reference spatial mode.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04R 5/02 - Spatial or constructional arrangements of loudspeakers

38.

Methods, apparatus and systems for encoding and decoding of directional sound sources

      
Application Number 17727732
Grant Number 11887608
Status In Force
Filing Date 2022-04-23
First Publication Date 2022-10-13
Grant Date 2024-01-30
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Tsingos, Nicolas R.
  • Thomas, Mark R. P.
  • Fersch, Christof

Abstract

Some disclosed methods involve encoding or decoding directional audio data. Some encoding methods may involve receiving a mono audio signal corresponding to an audio object and a representation of a radiation pattern corresponding to the audio object. The radiation pattern may include sound levels corresponding to a plurality of sample times, a plurality of frequency bands and a plurality of directions. The methods may involve encoding the mono audio signal and encoding the source radiation pattern to determine radiation pattern metadata. Encoding the radiation pattern may involve determining a spherical harmonic transform of the representation of the radiation pattern and compressing the spherical harmonic transform to obtain encoded radiation pattern metadata.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

39.

ENCODING OF ENVELOPE INFORMATION OF AN AUDIO DOWNMIX SIGNAL

      
Application Number EP2022059005
Publication Number 2022/214480
Status In Force
Filing Date 2022-04-05
Publication Date 2022-10-13
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor Mundt, Harald

Abstract

A method for encoding envelope information is provided. In some implementations, the method involves determining a first downmixed signal associated with a downmixed channel associated with an audio signal to be encoded. In some implementations, the method involves determining energy levels of the first downmixed signal for a plurality of frequency bands. In some implementations, the method involves determining whether to encode information indicative of the energy levels in a bitstream. In some implementations, the method involves encoding the determined energy levels. In some implementations, the method involves generating an energy control value indicating that energy levels are encoded. In some implementations, the method involves generating the bitstream, wherein the energy control value and the information indicative of the energy levels are usable by the decoder to adjust energy levels associated with the first downmixed signal.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders

40.

MULTI-BAND DUCKING OF AUDIO SIGNALS

      
Application Number US2022023057
Publication Number 2022/216542
Status In Force
Filing Date 2022-04-01
Publication Date 2022-10-13
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Tyagi, Rishabh
  • Purnhagen, Heiko

Abstract

A method for multi-band ducking of audio signals is provided. In some implementations, the method involves receiving, at a decoder, an input audio signal, wherein the input audio signal is a downmixed audio signal. In some implementations, the method involves separating the input audio signal into a first set of frequency bands. In some implementations, the method involves determining a set of ducking gains, a ducking gain corresponding to a frequency band of the first set of frequency bands. In some implementations, the method involves generating a broadband decorrelated audio signal, wherein ducking gains of the set of ducking gains are applied to at least one of: 1) a second set of frequency bands prior to generating the at least one broadband decorrelated audio signal; or 2) a third set of frequency bands that separates the at least one broadband decorrelated audio signal.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

41.

RENDERING AUDIO OVER MULTIPLE SPEAKERS WITH MULTIPLE ACTIVATION CRITERIA

      
Application Number 17630910
Status Pending
Filing Date 2020-07-25
First Publication Date 2022-10-06
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Netherlands)
Inventor
  • Seefeldt, Alan J.
  • Lando, Joshua B.
  • Arteaga, Daniel

Abstract

Methods for rendering audio for playback by two or more speakers are disclosed. The audio includes one or more audio signals, each with an associated intended perceived spatial position. Relative activation of the speakers may be a cost function of a model of perceived spatial position of the audio signals when played back over the speakers, a measure of proximity of the intended perceived spatial position of the audio signals to positions of the speakers, and one or more additional dynamically configurable functions. The dynamically configurable functions may be based on at least one or more properties of the audio signals, one or more properties of the set of speakers and/or one or more external inputs.

IPC Classes  ?

42.

PROJECTION SYSTEM AND METHOD OF DRIVING A PROJECTION SYSTEM WITH FIELD MAPPING

      
Application Number US2022021823
Publication Number 2022/204446
Status In Force
Filing Date 2022-03-24
Publication Date 2022-09-29
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Pires-Arrifano, Angelo
  • Le Barbenchon, Clement, Luc, Carol
  • Pertierra, Juan, Pablo

Abstract

A projection system includes a light source configured to emit a light in response to an image data, a phase light modulator configured to receive the light from the light source and to apply a spatially-varying phase modulation on the light, thereby generating a projection light and steering the light on a reconstruction field, wherein the reconstruction field is a complex plane on which a reconstruction image is formed, and a controller configured to control the light source, control the phase light modulator, initialize (401) the reconstruction field to an initial value, and iteratively for each of a plurality of subframes within a frame of the image data: set (402) the reconstruction field to the initial value for the first iteration or set (402) the reconstruction field to a subsequent-iteration reconstruction field value for any subsequent-iteration, map (403) the reconstruction field to a modulation field, wherein the modulation field is a complex plane of the phase light modulator which modulates a phase of the light, set (404) an amplitude of the modulation field to a predetermined value, and map (405) the modulation field with the amplitude set to the predetermined value, to a subsequent-iteration reconstruction field, wherein the controller is further configured to provide (408) a phase control signal based on the modulation field mapped with the last iteration to the phase light modulator.
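
The iterative loop in the abstract, steps (401) to (408), resembles a Gerchberg-Saxton style phase retrieval and can be sketched in one dimension. The abstract does not say how the reconstruction field is mapped to the modulation field; the sketch assumes a discrete Fourier transform pair, a unit predetermined amplitude, and an initial field supplied by the caller. The function names are hypothetical.

```python
import cmath


def dft(field, inverse=False):
    """Naive discrete Fourier transform, used here as the assumed field mapping."""
    n = len(field)
    sign = 1 if inverse else -1
    out = [sum(v * cmath.exp(sign * 2j * cmath.pi * k * m / n)
               for m, v in enumerate(field)) for k in range(n)]
    return [v / n for v in out] if inverse else out


def phase_control(initial_field, subframes, amplitude=1.0):
    """Return the phase pattern for the modulator after the last iteration."""
    if subframes < 1:
        raise ValueError("at least one subframe is required")
    reconstruction = list(initial_field)                  # (401) initialise
    for _ in range(subframes):
        # (402) reconstruction holds the initial value on the first pass and
        # the subsequent-iteration value on later passes.
        modulation = dft(reconstruction, inverse=True)    # (403) map to modulation field
        modulation = [cmath.rect(amplitude, cmath.phase(v))
                      for v in modulation]                # (404) fix the amplitude
        reconstruction = dft(modulation)                  # (405) map back
    return [cmath.phase(v) for v in modulation]           # (408) phase control signal


print(phase_control([1, 0, 0, 0], subframes=2))
```

The sketch follows only the steps named in the abstract; any constraint applied to the reconstruction field between iterations is left out because the abstract does not describe it.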

IPC Classes  ?

  • H04N 9/31 - Projection devices for colour picture display
  • G03H 1/22 - Processes or apparatus for obtaining an optical image from holograms

43.

Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element

      
Application Number 17831080
Grant Number 11664038
Status In Force
Filing Date 2022-06-02
First Publication Date 2022-09-15
Grant Date 2023-05-30
Owner Dolby International AB (Netherlands)
Inventor
  • Villemoes, Lars
  • Purnhagen, Heiko
  • Ekstrand, Per

Abstract

Embodiments relate to an audio processing unit that includes a buffer, bitstream payload deformatter, and a decoding subsystem. The buffer stores at least one block of an encoded audio bitstream. The block includes a fill element that begins with an identifier followed by fill data. The fill data includes at least one flag identifying whether enhanced spectral band replication (eSBR) processing is to be performed on audio content of the block. A corresponding method for decoding an encoded audio bitstream is also provided.

IPC Classes  ?

  • G10L 19/16 - Vocoder architecture
  • G10L 19/035 - Scalar quantisation
  • G10L 19/24 - Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
  • G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques

44.

AUDIO CODEC WITH ADAPTIVE GAIN CONTROL OF DOWNMIXED SIGNALS

      
Application Number US2022019292
Publication Number 2022/192217
Status In Force
Filing Date 2022-03-08
Publication Date 2022-09-15
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Setiawan, Panji
  • Tyagi, Rishabh
  • Bruhn, Stefan

Abstract

A method for performing gain control on audio signals is provided. In some implementations, the method involves determining downmixed signals associated with one or more downmix channels associated with a current frame of an audio signal to be encoded. In some implementations, the method involves determining whether an overload condition exists for an encoder. In some implementations, the method involves determining a gain parameter. In some implementations, the method involves determining at least one gain transition function based on the gain parameter and a gain parameter associated with a preceding frame of the audio signal. In some implementations, the method involves applying the at least one gain transition function to one or more of the downmixed signals. In some implementations, the method involves encoding the downmixed signals in connection with information indicative of gain control applied to the current frame.
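
The sequence of overload check, gain parameter and gain transition function can be sketched for a single downmix channel. The overload test (peak above a limit), the choice of gain and the linear ramp are all assumptions made for illustration; the abstract does not specify any of them.

```python
def gain_transition(prev_gain, gain, frame_len):
    """Linear ramp from the preceding frame's gain to the current gain (assumed shape)."""
    step = (gain - prev_gain) / frame_len
    return [prev_gain + step * (n + 1) for n in range(frame_len)]


def apply_gain_control(frame, prev_gain, peak_limit=1.0):
    """Return the gain-controlled frame and the gain parameter for this frame."""
    if not frame:
        raise ValueError("frame must not be empty")
    peak = max(abs(s) for s in frame)
    overload = peak > peak_limit              # assumed overload condition
    gain = peak_limit / peak if overload else 1.0
    ramp = gain_transition(prev_gain, gain, len(frame))
    return [s * g for s, g in zip(frame, ramp)], gain


controlled, gain = apply_gain_control([0.5, 2.0, -1.0, 0.5], prev_gain=1.0)
print(gain, controlled)
```

With a plain linear ramp the early samples of a frame can still exceed the limit, so a practical transition function would probably be shaped or time-aligned differently. The returned gain stands in for the "information indicative of gain control" that a decoder would need to undo the scaling.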

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/22 - Mode decision, i.e. based on audio signal content versus external parameters
  • G10L 19/002 - Dynamic bit allocation
  • G10L 19/005 - Correction of errors induced by the transmission channel, if related to the coding algorithm
  • G10L 19/16 - Vocoder architecture
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic

45.

AUDIO CODEC WITH ADAPTIVE GAIN CONTROL OF DOWNMIXED SIGNALS

      
Document Number 03212631
Status Pending
Filing Date 2022-03-08
Open to Public Date 2022-09-15
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Setiawan, Panji
  • Tyagi, Rishabh
  • Bruhn, Stefan

Abstract

A method for performing gain control on audio signals is provided. In some implementations, the method involves determining downmixed signals associated with one or more downmix channels associated with a current frame of an audio signal to be encoded. In some implementations, the method involves determining whether an overload condition exists for an encoder. In some implementations, the method involves determining a gain parameter. In some implementations, the method involves determining at least one gain transition function based on the gain parameter and a gain parameter associated with a preceding frame of the audio signal. In some implementations, the method involves applying the at least one gain transition function to one or more of the downmixed signals. In some implementations, the method involves encoding the downmixed signals in connection with information indicative of gain control applied to the current frame.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/002 - Dynamic bit allocation
  • G10L 19/005 - Correction of errors induced by the transmission channel, if related to the coding algorithm
  • G10L 19/16 - Vocoder architecture
  • G10L 19/22 - Mode decision, i.e. based on audio signal content versus external parameters
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic

46.

APPARATUS AND METHOD FOR LEVELING MAIN AND SUPPLEMENTARY AUDIO FROM A HBBTV SERVICE

      
Application Number EP2022055717
Publication Number 2022/189341
Status In Force
Filing Date 2022-03-07
Publication Date 2022-09-15
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Lassure, Gael
  • Stahlmann, Alexander
  • Mueller, Jan

Abstract

Described is a method of audio processing in a HbbTV terminal device. The method includes receiving a decoded broadcast feed including a first audio track, receiving HbbTV content relating to the broadcast feed, the HbbTV content including a second audio track, extracting level-related information from the decoded broadcast feed, wherein the level-related information is embedded in the decoded broadcast feed and enables an indication of an original audio level of the first audio track to be obtained, analyzing the first audio track for determining an actual audio level of the first audio track, determining a gain factor based on the actual audio level and the original audio level, and generating a third audio track for output by the HbbTV terminal device based on the first audio track, the second audio track, and the gain factor. Also described is an apparatus for carrying out the method, as well as corresponding programs and computer-readable storage media.
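
The gain-factor step can be sketched as follows. The sketch assumes that levels are RMS levels in dB, that the gain factor is the linear ratio between the actual and the original level of the first track, and that it is applied to the second track before a simple sum; none of these choices is stated in the abstract, and the function names are hypothetical.

```python
import math


def rms_db(samples):
    """RMS level of a block of samples in dB relative to full scale."""
    mean_square = sum(s * s for s in samples) / len(samples)
    return 10 * math.log10(mean_square) if mean_square > 0 else float("-inf")


def level_and_mix(first_track, second_track, original_level_db):
    """Scale the second track by the level change seen on the first, then mix."""
    actual_level_db = rms_db(first_track)
    gain = 10 ** ((actual_level_db - original_level_db) / 20)
    third_track = [a + gain * b for a, b in zip(first_track, second_track)]
    return third_track, gain


# The broadcast track arrives 6 dB below its signalled original level of 0 dB.
mixed, gain = level_and_mix([0.5, -0.5, 0.5, -0.5], [0.2, 0.2, 0.2, 0.2], 0.0)
print(round(gain, 3))
```

Under these assumptions the supplementary audio follows whatever level change was applied to the broadcast audio upstream, so the two stay balanced in the output.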

IPC Classes  ?

  • H04N 21/426 - Internal components of the client
  • H04N 21/462 - Content or additional data management e.g. creating a master electronic program guide from data received from the Internet and a Head-end or controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabi
  • H04N 21/434 - Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams or extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
  • H04H 20/10 - Arrangements for replacing or switching information during the broadcast or during the distribution
  • H04N 21/44 - Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to MPEG-4 scene graphs
  • H04N 21/458 - Scheduling content for creating a personalised stream, e.g. by combining a locally stored advertisement with an incoming stream; Updating operations, e.g. for OS modules
  • H04N 21/81 - Monomedia components thereof

47.

Image decoding device and image coding device

      
Application Number 17699552
Grant Number 11924451
Status In Force
Filing Date 2022-03-21
First Publication Date 2022-09-08
Grant Date 2024-03-05
Owner DOLBY INTERNATIONAL AB (China)
Inventor
  • Yamamoto, Tomoyuki
  • Ikai, Tomohiro
  • Yasugi, Yukinobu
  • Tsukuba, Takeshi

Abstract

A reference picture information decoding unit (13) omits decoding of a reference list sorting presence or absence flag and/or a reference list sorting order based on the number of current picture referable pictures.

IPC Classes  ?

  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
  • H04N 19/50 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
  • H04N 19/51 - Motion estimation or motion compensation
  • H04N 19/52 - Processing of motion vectors by encoding by predictive encoding
  • H04N 19/537 - Motion estimation other than block-based
  • H04N 19/573 - Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
  • H04N 19/577 - Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
  • H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/96 - Tree coding, e.g. quad-tree coding

48.

LAYERED CODING AND DATA STRUCTURE FOR COMPRESSED HIGHER-ORDER AMBISONICS SOUND OR SOUND FIELD REPRESENTATIONS

      
Application Number 17749007
Status Pending
Filing Date 2022-05-19
First Publication Date 2022-09-08
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Kordon, Sven
  • Krueger, Alexander

Abstract

The present document relates to a method of layered encoding of a frame of a compressed higher-order Ambisonics, HOA, representation of a sound or sound field. The compressed HOA representation comprises a plurality of transport signals. The method comprises assigning the plurality of transport signals to a plurality of hierarchical layers, the plurality of layers including a base layer and one or more hierarchical enhancement layers, generating, for each layer, a respective HOA extension payload including side information for parametrically enhancing a reconstructed HOA representation obtainable from the transport signals assigned to the respective layer and any layers lower than the respective layer, assigning the generated HOA extension payloads to their respective layers, and signaling the generated HOA extension payloads in an output bitstream. The present document further relates to a method of decoding a frame of a compressed HOA representation of a sound or sound field, an encoder and a decoder for layered coding of a compressed HOA representation, and a data structure representing a frame of a compressed HOA representation of a sound or sound field.
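
The layered structure described above can be sketched with plain lists. Transport signals are assigned to a base layer and enhancement layers, and each layer receives its own extension payload computed from the signals of that layer and all lower layers. The layer sizes, the dictionary layout and the `make_payload` callback are hypothetical; the abstract does not define the payload contents.

```python
def assign_to_layers(transport_signals, layer_sizes):
    """Split the transport signals into consecutive layers (base layer first)."""
    if sum(layer_sizes) != len(transport_signals):
        raise ValueError("layer sizes must cover all transport signals")
    layers, start = [], 0
    for size in layer_sizes:
        layers.append(transport_signals[start:start + size])
        start += size
    return layers


def build_frame(transport_signals, layer_sizes, make_payload):
    """Attach to each layer a payload derived from that layer and all lower layers."""
    layers = assign_to_layers(transport_signals, layer_sizes)
    frame = []
    for index, own_signals in enumerate(layers):
        available = [s for layer in layers[:index + 1] for s in layer]
        frame.append({"layer": index,
                      "signals": own_signals,
                      "hoa_extension_payload": make_payload(available)})
    return frame


# Placeholder payload: just the number of signals a decoder would have at this layer.
frame = build_frame(["t0", "t1", "t2", "t3"], [2, 1, 1], make_payload=len)
print([entry["hoa_extension_payload"] for entry in frame])
```

A decoder that receives only the first k layers can then pick the payload of the highest layer it holds, since that payload was computed for exactly the signals available to it.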

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/24 - Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

49.

LAYERED CODING FOR COMPRESSED SOUND OR SOUND FIELD REPRESENTATIONS

      
Application Number 17751492
Status Pending
Filing Date 2022-05-23
First Publication Date 2022-09-01
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Kordon, Sven
  • Krueger, Alexander

Abstract

The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation. The method comprises sub-dividing the plurality of components into a plurality of groups of components and assigning each of the plurality of groups to a respective one of a plurality of hierarchical layers, the number of groups corresponding to the number of layers, and the plurality of layers including a base layer and one or more hierarchical enhancement layers, adding the basic side information to the base layer, and determining a plurality of portions of enhancement side information from the enhancement side information and assigning each of the plurality of portions of enhancement side information to a respective one of the plurality of layers, wherein each portion of enhancement side information includes parameters for improving a reconstructed sound representation obtainable from data included in the respective layer and any layers lower than the respective layer. The document further relates to a method of decoding a compressed sound representation of a sound or sound field, wherein the compressed sound representation is encoded in a plurality of hierarchical layers that include a base layer and one or more hierarchical enhancement layers, as well as to an encoder and a decoder for layered coding of a compressed sound representation.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/24 - Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

50.

METHODS AND DEVICES FOR GENERATION AND PROCESSING OF MODIFIED AUDIO BITSTREAMS

      
Application Number 17635183
Status Pending
Filing Date 2020-08-13
First Publication Date 2022-09-01
Owner
  • DOLBY INTERNATIONAL AB (Netherlands)
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Fersch, Christof
  • Fischer, Daniel
  • Terentiv, Leon
  • Mcgarry, Gregory John

Abstract

Described herein is a method for generating a modified bitstream on a source device, wherein the method includes the steps of: a) receiving, by a receiver, a bitstream including coded media data; b) generating, by an embedder, payload of additional media data and embedding the payload in the bitstream for obtaining, as an output from the embedder, a modified bitstream including the coded media data and the payload of the additional media data; and c) outputting the modified bitstream to a sink device. Described is further a method for processing said modified bitstream on a sink device. Described are moreover a respective source device and sink device as well as a system of a source device and a sink device and respective computer program products.

IPC Classes  ?

  • G10L 19/083 - Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being an excitation gain
  • G10L 19/16 - Vocoder architecture

51.

MULTI-LAG FORMAT FOR AUDIO CODING

      
Application Number 17636856
Status Pending
Filing Date 2020-08-18
First Publication Date 2022-09-01
Owner Dolby International AB (Netherlands)
Inventor
  • Villemoes, Lars
  • Lehtonen, Heidi-Maria
  • Purnhagen, Heiko
  • Hedelin, Per

Abstract

Described herein is a method of encoding an audio signal. The method comprises: generating a plurality of subband audio signals based on the audio signal; determining a spectral envelope of the audio signal; for each subband audio signal, determining autocorrelation information for the subband audio signal based on an autocorrelation function of the subband audio signal; and generating an encoded representation of the audio signal, the encoded representation comprising a representation of the spectral envelope of the audio signal and a representation of the autocorrelation information for the plurality of subband audio signals. Further described are methods of decoding the audio signal from the encoded representation, as well as corresponding encoders, decoders, computer programs, and computer-readable recording media.
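
The per-subband autocorrelation step can be sketched for one subband signal. The sketch assumes that the autocorrelation information consists of the lag with the largest autocorrelation and its value normalised by the signal energy; the abstract does not say which quantities are actually encoded, and the function name is hypothetical.

```python
def autocorrelation_info(subband, min_lag=1):
    """Return (lag, normalised autocorrelation) for the strongest lag, or None."""
    if len(subband) <= min_lag:
        raise ValueError("subband is too short for the requested lag range")
    energy = sum(x * x for x in subband)
    if energy == 0:
        return None  # silent subband: nothing to describe

    def acf(lag):
        return sum(subband[n] * subband[n - lag] for n in range(lag, len(subband)))

    best_lag = max(range(min_lag, len(subband)), key=acf)
    return best_lag, acf(best_lag) / energy


# A subband signal with a period of four samples.
print(autocorrelation_info([1, 0, -1, 0] * 4))
```

In a full encoder this would run once per subband, and the results would be packed next to the spectral envelope in the encoded representation.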

IPC Classes  ?

  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
  • G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
  • G10L 19/24 - Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
  • G10L 25/06 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being correlation coefficients

52.

MANAGING PLAYBACK OF MULTIPLE STREAMS OF AUDIO OVER MULTIPLE SPEAKERS

      
Application Number 17630917
Status Pending
Filing Date 2020-07-27
First Publication Date 2022-08-25
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Netherlands)
Inventor
  • Seefeldt, Alan J.
  • Lando, Joshua B.
  • Arteaga, Daniel
  • Thomas, Mark R. P.
  • Dickins, Glenn N.

Abstract

A multi-stream rendering system and method may render and play simultaneously a plurality of audio program streams over a plurality of arbitrarily placed loudspeakers. At least one of the program streams may be a spatial mix. The rendering of said spatial mix may be dynamically modified as a function of the simultaneous rendering of one or more additional program streams. The rendering of one or more additional program streams may be dynamically modified as a function of the simultaneous rendering of the spatial mix.

IPC Classes  ?

  • H04R 3/12 - Circuits for transducers for distributing signals to two or more loudspeakers
  • H04R 5/02 - Spatial or constructional arrangements of loudspeakers
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • G10L 25/78 - Detection of presence or absence of voice signals
  • G10L 15/08 - Speech classification or search
  • G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
  • H04R 5/04 - Circuit arrangements

53.

SPATIAL AUDIO SIGNAL MANIPULATION

      
Application Number 17694506
Status Pending
Filing Date 2022-03-14
First Publication Date 2022-08-25
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Breebaart, Dirk Jeroen
  • Mateos Sole, Antonio
  • Purnhagen, Heiko
  • Tsingos, Nicolas R.

Abstract

Described herein is a method (30) of rendering an audio signal (17) for playback in an audio environment (27) defined by a target loudspeaker system (23), the audio signal (17) including audio data relating to an audio object and associated position data indicative of an object position. Method (30) includes the initial step (31) of receiving the audio signal (17). At step (32) loudspeaker layout data for the target loudspeaker system (23) is received. At step (33) control data is received that is indicative of a position modification to be applied to the audio object in the audio environment (27). At step (38) in response to the position data, loudspeaker layout data and control data, rendering modification data is generated. Finally, at step (39) the audio signal (17) is rendered with the rendering modification data to output the audio signal (17) with the audio object at a modified object position that is between loudspeakers within the audio environment (27).

IPC Classes

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04R 5/02 - Spatial or constructional arrangements of loudspeakers
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic

54.

Method of coding and decoding images, coding and decoding device and computer programs corresponding thereto

      
Application Number 17693185
Grant Number 11889098
Status In Force
Filing Date 2022-03-11
First Publication Date 2022-08-25
Grant Date 2024-01-30
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Henry, Felix
  • Clare, Gordon

Abstract

A method is provided for coding at least one image split up into partitions, a current partition to be coded containing data, at least one data item of which is allotted a sign. The coding method includes, for the current partition, the following steps: calculating the value of a function representative of the data of the current partition with the exclusion of the sign; comparing the calculated value with a predetermined value of the sign; as a function of the result of the comparison, modifying or not modifying at least one of the data items of the current partition; and, in the case of modification, coding the at least one modified data item.
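
The steps in this abstract can be illustrated with a minimal sketch. It assumes that the "function representative of the data" is the parity of the sum of magnitudes and that the hidden sign belongs to the first non-zero data item; both choices, and the names `parity`, `hide_sign`, and `recover_sign`, are hypothetical and not taken from the patent.

```python
def parity(coeffs):
    """Parity (0 or 1) of the sum of magnitudes; signs are excluded."""
    return sum(abs(c) for c in coeffs) % 2


def hide_sign(coeffs):
    """Encoder side (assumed scheme): make the parity equal the hidden sign bit."""
    out = list(coeffs)
    nonzero = [i for i, c in enumerate(out) if c != 0]
    if not nonzero:
        return out  # no sign to hide
    first, last = nonzero[0], nonzero[-1]
    sign_bit = 1 if out[first] < 0 else 0
    if parity(out) != sign_bit:
        # Modify one data item: push the last non-zero item away from zero by 1.
        out[last] += 1 if out[last] > 0 else -1
    out[first] = abs(out[first])  # the sign itself is not coded
    return out


def recover_sign(coded):
    """Decoder side: infer the omitted sign from the parity."""
    out = list(coded)
    nonzero = [i for i, c in enumerate(out) if c != 0]
    if nonzero and parity(out) == 1:
        out[nonzero[0]] = -out[nonzero[0]]
    return out
```

In this sketch a partition such as `[-3, 0, 2, 1]` has even parity but a negative first sign, so one item is modified before coding; the decoder then recovers the sign from the parity alone.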

IPC Classes

  • H04N 19/50 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/463 - Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
  • H04N 19/467 - Embedding additional information in the video signal during the compression process characterised by the embedded information being invisible, e.g. watermarking
  • H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
  • H04N 19/117 - Filters, e.g. for pre-processing or post-processing
  • H04N 19/18 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
  • H04N 19/48 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using compressed domain processing techniques other than decoding, e.g. modification of transform coefficients, variable length coding [VLC] data or run-length data
  • H04N 19/147 - Data rate or code amount at the encoder output according to rate distortion criteria
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/196 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters
  • H04N 19/167 - Position within a video image, e.g. region of interest [ROI]
  • H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
  • H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]

55.

AUDIO DE-ESSER INDEPENDENT OF ABSOLUTE SIGNAL LEVEL

      
Application Number 17733397
Status Pending
Filing Date 2022-04-29
First Publication Date 2022-08-18
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Cengarle, Giulio
  • Sole, Antonio Mateos
  • Crockett, Brett G.

Abstract

Methods, systems, and computer program products for automatic de-essing are disclosed. An automatic de-esser can be used without manually setting parameters and can perform reliable sibilance detection and reduction regardless of absolute signal level, singer gender, and other extraneous factors. An audio processing device divides input audio signals into buffers each containing a number of samples, the buffers overlapping one another. The audio processing device transforms each buffer from the time domain into the frequency domain and implements de-essing as a multi-band compressor that only acts on a designated sibilance band. The audio processing device determines an amount of attenuation in the sibilance band based on a comparison of the energy level in the sibilance band of a buffer to the broadband energy level in a previous buffer. The amount of attenuation is also determined based on a zero-crossing rate, as well as a slope and onset of a compression curve.
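
A minimal sketch of why a ratio-based comparison is independent of absolute level, assuming band energies have already been computed per buffer. The function names, the gating on zero-crossing rate, and the default values of `onset_db`, `slope`, and `zcr_min` are illustrative assumptions, not values from the patent.

```python
import math


def zero_crossing_rate(samples):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0))
    return crossings / max(len(samples) - 1, 1)


def sibilance_attenuation_db(sib_energy, prev_broadband_energy, zcr,
                             onset_db=-20.0, slope=0.5, zcr_min=0.1):
    """Attenuation (dB) for the sibilance band of the current buffer.

    Only the ratio of sibilance-band energy to the previous buffer's
    broadband energy is used, so scaling the whole signal changes nothing.
    """
    if sib_energy <= 0 or prev_broadband_energy <= 0 or zcr < zcr_min:
        return 0.0
    ratio_db = 10.0 * math.log10(sib_energy / prev_broadband_energy)
    excess_db = ratio_db - onset_db  # how far above the curve's onset
    return slope * excess_db if excess_db > 0 else 0.0
```

Scaling both energies by the same factor, for example `(0.5, 1.0)` versus `(50.0, 100.0)`, yields the same attenuation in this sketch.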

IPC Classes

  • G10L 21/0232 - Processing in the frequency domain
  • G10L 21/0264 - Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
  • H03G 3/30 - Automatic control in amplifiers having semiconductor devices

56.

LATENCY MANAGEMENT FOR CONTENT DELIVERY

      
Application Number 17627017
Status Pending
Filing Date 2020-07-15
First Publication Date 2022-08-11
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Wolters, Martin
  • Krauss, Kurt

Abstract

The present document discloses a method for playback of media content via a delivery channel. The delivery channel may generally refer to the channels through which audio or video programs are delivered (transmitted) to the user (receiver). The media content may generally comprise consecutive media programs. In particular, for a specific media program within the media content, a respective content type for that specific media program is also provided. The method may comprise receiving an indication of the sensitivity of a media program to playback latency. The method may further comprise receiving at least a portion of the media program. The method may yet further comprise adapting the playback of the media program based on the indication of its sensitivity to playback latency.

IPC Classes

  • H04N 21/4402 - Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
  • H04N 21/435 - Processing of additional data, e.g. decrypting of additional data or reconstructing software from modules extracted from the transport stream

57.

SELECTABLE LINEAR PREDICTIVE OR TRANSFORM CODING MODES WITH ADVANCED STEREO CODING

      
Application Number 17728692
Status Pending
Filing Date 2022-04-25
First Publication Date 2022-08-04
Owner Dolby International AB (Netherlands)
Inventor
  • Purnhagen, Heiko
  • Carlsson, Pontus
  • Kjoerling, Kristofer

Abstract

Methods and systems for advanced stereo processing of an audio signal are disclosed. The methods and systems include selecting a coding mode of either transform coding or linear predictive coding and performing advanced stereo processing when in the selected coding mode. Both encoding and decoding operations are provided.

IPC Classes

  • G10L 19/002 - Dynamic bit allocation
  • H04S 5/02 - Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation of the pseudo four-channel type, e.g. in which rear channel signals are derived from two-channel stereo signals
  • H04S 5/00 - Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
  • H04S 3/02 - Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

58.

PROJECTION SYSTEM AND METHOD WITH DYNAMIC TARGET GEOMETRY

      
Application Number US2022014793
Publication Number 2022/165441
Status In Force
Filing Date 2022-02-01
Publication Date 2022-08-04
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Pertierra, Juan Pablo
  • Pires Arrifano, Angelo Miguel
  • Le Barbenchon, Clement Luc Carol
  • Richards, Martin J.
  • Lippey, Barret

Abstract

A projection system and method includes a light source configured to emit a light in response to an image data; a phase light modulator configured to receive the light from the light source and to apply a spatially-varying phase modulation on the light, thereby to steer the light and generate a projection light; and a controller configured to dynamically determine, based on at least one of a user input or a sensor signal, a target geometry of a projection surface on which the projection light is projected, determine, based on the target geometry, a phase configuration for a frame of the image data, and provide a phase control signal to the phase light modulator, the phase control signal configured to cause the phase light modulator to generate the projection light in accordance with the phase configuration for the frame.

IPC Classes

  • H04N 9/31 - Projection devices for colour picture display

59.

DETECTION AND ENHANCEMENT OF SPEECH IN BINAURAL RECORDINGS

      
Application Number US2022012128
Publication Number 2022/155205
Status In Force
Filing Date 2022-01-12
Publication Date 2022-07-21
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Cengarle, Giulio
  • Ma, Yuanxing

Abstract

Disclosed herein are methods, systems, and computer-program products for segmenting a binaural recording of speech into parts containing self-speech and parts containing external speech, and processing each category with different settings, to obtain an enhanced overall presentation. The segmentation is based on a combination of: i) feature-based frame-by-frame classification, and ii) detecting dissimilarity by statistical methods. The segmentation information is then used by a speech enhancement chain, where independent settings are used to process the self- and external speech parts.

IPC Classes

  • G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination
  • G10L 25/87 - Detection of discrete points within a voice signal
  • G10L 21/0208 - Noise filtering
  • G10L 25/78 - Detection of presence or absence of voice signals

60.

METHODS AND DEVICES FOR GENERATION AND PROCESSING OF MODIFIED BITSTREAMS

      
Application Number 17633829
Status Pending
Filing Date 2020-08-13
First Publication Date 2022-07-21
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Netherlands)
Inventor
  • Fersch, Christof
  • Fischer, Daniel
  • Terentiv, Leon
  • Mcgarry, Gregory John

Abstract

Described herein is a method for generating a modified bitstream on a source device, wherein the method includes the steps of: a) receiving, by a receiver, a bitstream including coded media data; b) generating, by an embedder, payload of additional media data and embedding the payload in the bitstream for obtaining, as an output from the embedder, a modified bitstream including the coded media data and the payload of the additional media data; and d) outputting the modified bitstream to a sink device. Described is further a method for processing said modified bitstream on a sink device. Described are moreover a respective source device and sink device as well as a system of a source device and a sink device and respective computer program products.

IPC Classes

  • G10L 19/16 - Vocoder architecture
  • G10L 19/018 - Audio watermarking, i.e. embedding inaudible data in the audio signal
  • H04W 4/80 - Services using short range communication, e.g. near-field communication [NFC], radio-frequency identification [RFID] or low energy communication

61.

System and method for displaying high quality images in a dual modulation projection system

      
Application Number 17572060
Grant Number 11611727
Status In Force
Filing Date 2022-01-10
First Publication Date 2022-07-07
Grant Date 2023-03-21
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Netherlands)
Inventor
  • Pertierra, Juan P.
  • Richards, Martin J.
  • Orlick, Christopher John
  • Le Barbenchon, Clement
  • Pires Arrifano, Angelo M.

Abstract

A novel high efficiency image projection system includes a beam-steering modulator, an amplitude modulator, and a controller. In a particular embodiment the controller generates beam-steering drive values from image data and uses the beam-steering drive values to drive the beam-steering modulator. Additionally, the controller utilizes the beam-steering drive values to generate a lightfield simulation of a lightfield projected onto the amplitude modulator by the beam-steering modulator. The controller utilizes the lightfield simulation to generate amplitude drive values for driving the amplitude modulator in order to project a high quality version of the image described by the image data.

IPC Classes

  • H04N 9/31 - Projection devices for colour picture display

62.

BINAURAL SIGNAL POST-PROCESSING

      
Application Number US2021063878
Publication Number 2022/133128
Status In Force
Filing Date 2021-12-16
Publication Date 2022-06-23
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Breebaart, Dirk Jeroen
  • Cengarle, Giulio
  • Brown, C. Phillip

Abstract

A method of audio processing includes performing spatial analysis on a binaural signal to estimate level differences and phase differences characteristic of a binaural filter of the binaural signal, and performing object extraction on the binaural audio signal using the estimated level and phase differences to generate a left/right main component signal and a left/right residual component signal. The system may process the left/right main and left/right residual components differently using different object processing parameters for e.g. repositioning, equalization, compression, upmixing, channel remapping or storage to generate a processed binaural signal that provides an improved listening experience. Repositioning may be based on head tracking sensor data.

IPC Classes

  • H04S 5/00 - Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

63.

DIALOGUE ENHANCEMENT IN AUDIO CODEC

      
Application Number 17603149
Status Pending
Filing Date 2020-04-15
First Publication Date 2022-06-23
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Gorlow, Stanislaw
  • Samuelsson, Leif Jonas
  • Hoerich, Holger
  • Friedrich, Tobias

Abstract

Dialogue enhancement of an audio signal, comprising: obtaining a set of time-varying parameters configured to estimate a dialogue component present in said audio signal; estimating the dialogue component from the audio signal; applying a compressor only to the estimated dialogue component, to generate a processed dialogue component; and applying a user-determined gain to the processed dialogue component, to provide an enhanced dialogue component. The processing of the estimated dialogue may be performed on the decoder side or encoder side. The invention enables an improved dialogue enhancement.
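
The processing chain in this abstract (estimate, compress, apply gain) can be sketched as follows. The sketch assumes one scalar parameter per sample and a simple static compressor; the names `compress` and `enhance_dialogue` and the default `threshold` and `ratio` are hypothetical and stand in for whatever the codec actually uses.

```python
def compress(x, threshold=0.5, ratio=4.0):
    """Static compressor on one sample: the part above threshold is divided by ratio."""
    magnitude = abs(x)
    if magnitude <= threshold:
        return x
    reduced = threshold + (magnitude - threshold) / ratio
    return reduced if x > 0 else -reduced


def enhance_dialogue(mix, params, user_gain):
    """Estimate dialogue from the mix, compress only that estimate, then apply the gain."""
    estimated = [p * s for p, s in zip(params, mix)]   # time-varying estimate
    processed = [compress(d) for d in estimated]        # compressor on dialogue only
    return [user_gain * d for d in processed]           # user-determined gain
```

Because the compressor sees only the estimated dialogue, the rest of the mix is left untouched by the dynamics processing in this sketch.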

IPC Classes

  • G10L 21/0308 - Voice signal separating characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • H03G 5/16 - Automatic control
  • H03G 9/18 - Combinations of two or more types of control, e.g. gain control and tone control in untuned amplifiers having semiconductor devices for tone control and volume expansion or compression

64.

METHOD AND APPARATUS FOR PROCESSING OF AUDIO DATA USING A PRE-CONFIGURED GENERATOR

      
Application Number EP2021085852
Publication Number 2022/129146
Status In Force
Filing Date 2021-12-15
Publication Date 2022-06-23
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor Biswas, Arijit

Abstract

Described herein is a method for setting up a decoder for generating processed audio data from an audio bitstream, the decoder comprising a Generator of a Generative Adversarial Network, GAN, for processing of the audio data, wherein the method includes the steps of (a) pre-configuring the Generator for processing of audio data with a set of parameters for the Generator, the parameters being determined by training, at training time, the Generator using the full concatenated distribution; and (b) pre-configuring the decoder to determine, at decoding time, a truncation mode for modifying the concatenated distribution and to apply the determined truncation mode to the concatenated distribution. Described are further a method of generating processed audio data from an audio bitstream using a Generator of a Generative Adversarial Network, GAN, for processing of the audio data and a respective apparatus. Moreover, described are also respective systems and computer program products.

IPC Classes

  • G10L 21/0208 - Noise filtering
  • G10L 19/26 - Pre-filtering or post-filtering
  • G10L 19/005 - Correction of errors induced by the transmission channel, if related to the coding algorithm
  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
  • G06N 3/04 - Architecture, e.g. interconnection topology

65.

IMMERSIVE VOICE AND AUDIO SERVICES (IVAS) WITH ADAPTIVE DOWNMIX STRATEGIES

      
Document Number 03203960
Status Pending
Filing Date 2021-12-02
Open to Public Date 2022-06-09
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Mundt, Harald
  • Mcgrath, David S.
  • Tyagi, Rishabh

Abstract

Disclosed is an audio signal encoding/decoding method that uses an encoding downmix strategy applied at an encoder that is different than a decoding re-mix/upmix strategy applied at a decoder. Based on the type of downmix coding scheme, the method comprises: computing input downmixing gains to be applied to the input audio signal to construct a primary downmix channel; determining downmix scaling gains to scale the primary downmix channel; generating prediction gains based on the input audio signal, the input downmixing gains and the downmix scaling gains; determining residual channel(s) from the side channels by using the primary downmix channel and the prediction gains to generate side channel predictions and subtracting the side channel predictions from the side channels; determining decorrelation gains based on energy in the residual channels; encoding the primary downmix channel, the residual channel(s), the prediction gains and the decorrelation gains; and sending the bitstream to a decoder.
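
The residual step in this abstract (predict each side channel from the primary downmix channel, then subtract the prediction) can be sketched as below. The sketch assumes a plain least-squares prediction gain and ignores the input downmixing gains and downmix scaling gains that the abstract also feeds into the prediction; `prediction_gain` and `residual_channel` are hypothetical names.

```python
def prediction_gain(side, primary):
    """Least-squares gain g that minimises the energy of side - g * primary."""
    energy = sum(p * p for p in primary)
    if energy == 0.0:
        return 0.0
    return sum(s * p for s, p in zip(side, primary)) / energy


def residual_channel(side, primary):
    """Return (gain, residual), where residual = side - gain * primary."""
    gain = prediction_gain(side, primary)
    residual = [s - gain * p for s, p in zip(side, primary)]
    return gain, residual
```

With this choice of gain the residual is uncorrelated with the primary channel, so a fully predictable side channel leaves a residual of zeros.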

IPC Classes

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • H04S 5/00 - Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
  • G10L 19/24 - Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

66.

METHOD AND APPARATUS FOR DECODING STEREO LOUDSPEAKER SIGNALS FROM A HIGHER-ORDER AMBISONICS AUDIO SIGNAL

      
Application Number 17521762
Status Pending
Filing Date 2021-11-08
First Publication Date 2022-06-09
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Boehm, Johannes
  • Keiler, Florian

Abstract

Decoding of Ambisonics representations for a stereo loudspeaker setup is known for first-order Ambisonics audio signals. However, such first-order Ambisonics approaches have either high negative side lobes or poor localisation in the frontal region. The invention deals with the processing for stereo decoders for higher-order Ambisonics (HOA).

IPC Classes

  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04S 3/02 - Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
  • H04S 1/00 - Two-channel systems
  • G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis

67.

PERVASIVE ACOUSTIC MAPPING

      
Application Number IB2021000788
Publication Number 2022/118072
Status In Force
Filing Date 2021-12-02
Publication Date 2022-06-09
Owner
  • DOLBY INTERNATIONAL AB (Netherlands)
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Thomas, Mark R.P.
  • Southwell, Benjamin John
  • Bruni, Avery
  • Townsend, Olha Michelle
  • Arteaga, Daniel
  • Scaini, Davide
  • Hines, Christopher Graham
  • Seefeldt, Alan J.
  • Gunawan, David
  • Brown, C. Phillip

Abstract

Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first calibration signals, generating first modified audio playback signals by inserting the first calibration signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth calibration signals) played back by second through Nth audio devices, extracting second through Nth calibration signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth calibration signals.

IPC Classes

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04R 3/00 - Circuits for transducers

68.

AUDIBILITY AT USER LOCATION THROUGH MUTUAL DEVICE AUDIBILITY

      
Application Number US2021061506
Publication Number 2022/119990
Status In Force
Filing Date 2021-12-02
Publication Date 2022-06-09
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Thomas, Mark R. P.
  • Arteaga, Daniel
  • Hines, Christopher Graham
  • Scaini, Davide
  • Southwell, Benjamin John
  • Bruni, Avery
  • Townsend, Olha Michelle

Abstract

Some methods involve causing a plurality of audio devices in an audio environment to reproduce audio data, each audio device of the plurality of audio devices including at least one loudspeaker and at least one microphone, determining audio device location data including an audio device location for each audio device of the plurality of audio devices and obtaining microphone data from each audio device of the plurality of audio devices. Some methods involve determining a mutual audibility for each audio device of the plurality of audio devices relative to each other audio device of the plurality of audio devices, determining a user location of a person in the audio environment, determining a user location audibility of each audio device of the plurality of audio devices at the user location and controlling one or more aspects of audio device playback based, at least in part, on the user location audibility.

IPC Classes

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

69.

IMMERSIVE VOICE AND AUDIO SERVICES (IVAS) WITH ADAPTIVE DOWNMIX STRATEGIES

      
Application Number US2021061671
Publication Number 2022/120093
Status In Force
Filing Date 2021-12-02
Publication Date 2022-06-09
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Mundt, Harald
  • Mcgrath, David S.
  • Tyagi, Rishabh

Abstract

Disclosed is an audio signal encoding/decoding method that uses an encoding downmix strategy applied at an encoder that is different than a decoding re-mix/upmix strategy applied at a decoder. Based on the type of downmix coding scheme, the method comprises: computing input downmixing gains to be applied to the input audio signal to construct a primary downmix channel; determining downmix scaling gains to scale the primary downmix channel; generating prediction gains based on the input audio signal, the input downmixing gains and the downmix scaling gains; determining residual channel(s) from the side channels by using the primary downmix channel and the prediction gains to generate side channel predictions and subtracting the side channel predictions from the side channels; determining decorrelation gains based on energy in the residual channels; encoding the primary downmix channel, the residual channel(s), the prediction gains and the decorrelation gains; and sending the bitstream to a decoder.

IPC Classes

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • H04S 5/00 - Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
  • G10L 19/24 - Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

70.

Layered coding for compressed sound or sound field representations

      
Application Number 17561262
Grant Number 11626119
Status In Force
Filing Date 2021-12-23
First Publication Date 2022-06-09
Grant Date 2023-04-11
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Kordon, Sven
  • Krueger, Alexander

Abstract

The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation. The method comprises sub-dividing the plurality of components into a plurality of groups of components and assigning each of the plurality of groups to a respective one of a plurality of hierarchical layers, the number of groups corresponding to the number of layers, and the plurality of layers including a base layer and one or more hierarchical enhancement layers, adding the basic side information to the base layer, and determining a plurality of portions of enhancement side information from the enhancement side information and assigning each of the plurality of portions of enhancement side information to a respective one of the plurality of layers, wherein each portion of enhancement side information includes parameters for improving a reconstructed sound representation obtainable from data included in the respective layer and any layers lower than the respective layer. The document further relates to a method of decoding a compressed sound representation of a sound or sound field, wherein the compressed sound representation is encoded in a plurality of hierarchical layers that include a base layer and one or more hierarchical enhancement layers, as well as to an encoder and a decoder for layered coding of a compressed sound representation.

IPC Classes

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic

71.

AUTOMATIC LOCALIZATION OF AUDIO DEVICES

      
Application Number US2021061533
Publication Number 2022/120005
Status In Force
Filing Date 2021-12-02
Publication Date 2022-06-09
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Arteaga, Daniel
  • Scaini, Davide
  • Thomas, Mark R.P.
  • Bruni, Avery
  • Townsend, Olha Michelle

Abstract

A method may involve: receiving direction of arrival (DOA) data corresponding to sound emitted by at least a first smart audio device of the audio environment that includes a first audio transmitter and a first audio receiver, the DOA data corresponding to sound received by at least a second smart audio device of the audio environment that includes a second audio transmitter and a second audio receiver, the DOA data corresponding to sound emitted by at least the second smart audio device and received by at least the first smart audio device; receiving one or more configuration parameters corresponding to the audio environment, to one or more audio devices, or both; and minimizing a cost function based at least in part on the DOA data and the configuration parameter(s), to estimate a position and an orientation of at least the first smart audio device and the second smart audio device.

IPC Classes

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04R 3/00 - Circuits for transducers

72.

ROTATION OF SOUND COMPONENTS FOR ORIENTATION-DEPENDENT CODING SCHEMES

      
Application Number US2021061549
Publication Number 2022/120011
Status In Force
Filing Date 2021-12-02
Publication Date 2022-06-09
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Bruhn, Stefan
  • Mundt, Harald
  • Mcgrath, David S.
  • Brown, Stefanie

Abstract

A method for encoding scene-based audio is provided. In some implementations, the method involves determining, by an encoder, a spatial direction of a dominant sound component in a frame of an input audio signal. In some implementations, the method involves determining rotation parameters based on the determined spatial direction and a direction preference of a coding scheme to be used to encode the input audio signal. In some implementations, the method involves rotating sound components of the frame based on the rotation parameters such that, after being rotated, the dominant sound component has a spatial direction that aligns with the direction preference of the coding scheme. In some implementations, the method involves encoding the rotated sound components of the frame of the input audio signal using the coding scheme in connection with an indication of the rotation parameters or an indication of the spatial direction of the dominant sound component.
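
A minimal sketch of the rotation step, assuming a first-order scene with components `w`, `x`, `y`, `z` given as sample lists, a rotation about the vertical axis only, and a dominant direction estimated from the products `w*x` and `w*y`. The estimator, the function names, and the default `preferred_azimuth` are illustrative assumptions rather than the method claimed.

```python
import math


def dominant_azimuth(w, x, y):
    """Azimuth (radians) of the dominant component, from summed w*x and w*y."""
    ix = sum(a * b for a, b in zip(w, x))
    iy = sum(a * b for a, b in zip(w, y))
    return math.atan2(iy, ix)


def rotate_frame(w, x, y, z, preferred_azimuth=0.0):
    """Rotate the frame about the vertical axis so the dominant direction
    lands on the coding scheme's preferred azimuth.

    Returns the rotated components and the rotation angle, which would be
    signalled so that a decoder can apply the inverse rotation.
    """
    angle = preferred_azimuth - dominant_azimuth(w, x, y)
    c, s = math.cos(angle), math.sin(angle)
    x_rot = [c * a - s * b for a, b in zip(x, y)]
    y_rot = [s * a + c * b for a, b in zip(x, y)]
    return (w, x_rot, y_rot, z), angle
```

For a source at 90 degrees azimuth and a preferred azimuth of 0, the sketch rotates the frame by -90 degrees, after which the dominant component sits on the preferred direction.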

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
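
As a rough illustration of the rotation step described in the abstract above, the sketch below rotates the first-order directional components of a frame about the vertical axis so that the dominant direction lands on a preferred azimuth. The function names, the restriction to a yaw-only rotation, and the (W, X, Y, Z) channel layout are assumptions made for illustration; none of them is taken from the application.

```python
import math

def yaw_rotation_angle(dominant_azimuth, preferred_azimuth):
    """Angle in radians that moves the dominant azimuth onto the preferred one."""
    return preferred_azimuth - dominant_azimuth

def rotate_frame_yaw(frame, angle):
    """Rotate first-order components about the vertical axis.

    frame is a list of (w, x, y, z) samples (assumed layout).
    A yaw rotation leaves W and Z unchanged and mixes X and Y.
    """
    c, s = math.cos(angle), math.sin(angle)
    return [(w, c * x - s * y, s * x + c * y, z) for (w, x, y, z) in frame]

# Hypothetical frame: one source at 90 degrees azimuth (energy on +Y only).
frame = [(1.0, 0.0, 1.0, 0.0)]
angle = yaw_rotation_angle(math.radians(90), math.radians(0))
rotated = rotate_frame_yaw(frame, angle)  # source now lies on +X
```

Under these assumptions, a decoder that receives the signalled angle could undo the rotation by calling `rotate_frame_yaw` with `-angle`.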

73.

AUTOMATIC GENERATION AND SELECTION OF TARGET PROFILES FOR DYNAMIC EQUALIZATION OF AUDIO CONTENT

      
Application Number US2021059827
Publication Number 2022/115303
Status In Force
Filing Date 2021-11-18
Publication Date 2022-06-02
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Cengarle, Giulio
  • Engel, Nicholas Laurence
  • Scannell, Patrick Winfrey
  • Scaini, Davide

Abstract

In an embodiment, a method comprises: filtering reference audio content items to separate the reference audio content items into different frequency bands; for each frequency band, extracting a first feature vector from at least a portion of each of the reference audio content items, wherein the first feature vector includes at least one audio characteristic of the reference audio content items; obtaining at least one semantic label from at least a portion of each of the reference audio content items; obtaining a second feature vector consisting of the first feature vectors per frequency band and the at least one semantic label; generating, based on the second feature vector, cluster feature vectors representing centroids of clusters; separating the reference audio content items according to the cluster feature vectors; and computing an average target profile for each cluster based on the reference audio content items in the cluster.

IPC Classes  ?

  • H03G 5/16 - Automatic control
  • G06K 9/62 - Methods or arrangements for recognition using electronic means
  • H03G 5/00 - Tone control or bandwidth control in amplifiers

74.

ROBUST INTRUSIVE PERCEPTUAL AUDIO QUALITY ASSESSMENT BASED ON CONVOLUTIONAL NEURAL NETWORKS

      
Application Number EP2021083531
Publication Number 2022/112594
Status In Force
Filing Date 2021-11-30
Publication Date 2022-06-02
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Biswas, Arijit
  • Jiang, Guanxin

Abstract

Described herein is a computer-implemented deep-learning-based system for determining an indication of an audio quality of an input audio frame. The system comprises at least one inception block configured to receive at least one representation of an input audio frame and to map the at least one representation of the input audio frame into a feature map; and at least one fully connected layer configured to receive a feature map corresponding to the at least one representation of the input audio frame from the at least one inception block, wherein the at least one fully connected layer is configured to determine the indication of the audio quality of the input audio frame. Described are further respective methods of operating and training said system.

IPC Classes  ?

  • G10L 25/60 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
  • G06N 3/04 - Architecture, e.g. interconnection topology

75.

Method and apparatus for processing of auxiliary media streams embedded in a MPEGH 3D audio stream

      
Application Number 17544959
Grant Number 11830508
Status In Force
Filing Date 2021-12-08
First Publication Date 2022-05-26
Grant Date 2023-11-28
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Schreiner, Stephan
  • Fersch, Christof

Abstract

The disclosure relates to methods, apparatus and systems for side load processing of packetized media streams. In an embodiment, the apparatus comprises: a receiver for receiving a bitstream, and a splitter for identifying a packet type in the bitstream and splitting the bitstream, based on the identification of a value of the packet type in the bitstream, into a main stream and an auxiliary stream.

IPC Classes  ?

  • G10L 19/16 - Vocoder architecture
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • H04N 21/426 - Internal components of the client
  • H04N 21/434 - Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams or extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
  • H04N 21/4363 - Adapting the video stream to a specific local network, e.g. a IEEE 1394 or Bluetooth® network
  • H04N 21/439 - Processing of audio elementary streams
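
The splitter described in the abstract above can be pictured with a minimal sketch. The packet layout (a `(packet_type, payload)` tuple) and the type value `0x7F` used to mark auxiliary packets are hypothetical, chosen only to make the example runnable; the actual packet syntax is not given in this listing.

```python
from typing import Iterable, List, Tuple

Packet = Tuple[int, bytes]  # (packet_type, payload); assumed layout

AUX_PACKET_TYPE = 0x7F  # hypothetical value marking auxiliary packets

def split_stream(
    packets: Iterable[Packet], aux_type: int = AUX_PACKET_TYPE
) -> Tuple[List[Packet], List[Packet]]:
    """Route each packet to the main or auxiliary stream by its type value."""
    main: List[Packet] = []
    aux: List[Packet] = []
    for packet in packets:
        packet_type, _payload = packet
        if packet_type == aux_type:
            aux.append(packet)
        else:
            main.append(packet)
    return main, aux

# Hypothetical received bitstream with one auxiliary packet in the middle.
bitstream = [(1, b"cfg"), (2, b"frame0"), (0x7F, b"side0"), (2, b"frame1")]
main_stream, aux_stream = split_stream(bitstream)
```

Packet order within each output stream is preserved, so the main stream can be passed to a regular decoder unchanged.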

76.

Post filter for audio signals

      
Application Number 17532775
Grant Number 11610595
Status In Force
Filing Date 2021-11-22
First Publication Date 2022-05-19
Grant Date 2023-03-21
Owner Dolby International AB (Netherlands)
Inventor
  • Resch, Barbara
  • Kjörling, Kristofer
  • Villemoes, Lars

Abstract

In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from either (i) an active mode, where the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, or (ii) an inactive mode, where the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and the pitch filter is capable of being selectively operated in either the active mode or the inactive mode while operating in the coding mode based on control information.

IPC Classes  ?

  • G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
  • G10L 19/26 - Pre-filtering or post-filtering
  • G10L 19/20 - Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
  • G10L 19/12 - Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
  • G10L 19/125 - Pitch excitation, e.g. pitch synchronous innovation CELP [PSI-CELP] 
  • G10L 21/003 - Changing voice quality, e.g. pitch or formants
  • G10L 19/09 - Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
  • G10L 21/013 - Adapting to target pitch
  • G10L 19/22 - Mode decision, i.e. based on audio signal content versus external parameters
  • G10L 21/007 - Changing voice quality, e.g. pitch or formants characterised by the process used
  • G10L 19/032 - Quantisation or dequantisation of spectral components
  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
  • G10L 19/107 - Sparse pulse excitation, e.g. by using algebraic codebook

77.

METHOD AND APPARATUS FOR UPDATING A NEURAL NETWORK

      
Application Number 17438908
Status Pending
Filing Date 2020-03-05
First Publication Date 2022-05-19
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Fersch, Christof
  • Biswas, Arijit

Abstract

Described herein is a method of generating a media bitstream to transmit parameters for updating a neural network implemented in a decoder, wherein the method includes the steps of: (a) determining at least one set of parameters for updating the neural network; (b) encoding the at least one set of parameters and media data to generate the media bitstream; and (c) transmitting the media bitstream to the decoder for updating the neural network with the at least one set of parameters. Described herein are further a method for updating a neural network implemented in a decoder, an apparatus for generating a media bitstream to transmit parameters for updating a neural network implemented in a decoder, an apparatus for updating a neural network implemented in a decoder and computer program products comprising a computer-readable storage medium with instructions adapted to cause a device having processing capability to carry out said methods when executed by the device.

IPC Classes  ?

78.

Decoded picture buffer removal

      
Application Number 17353445
Grant Number 11653011
Status In Force
Filing Date 2021-06-21
First Publication Date 2022-05-12
Grant Date 2023-05-16
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor Deshpande, Sachin G.

Abstract

A system for decoding a video bitstream includes receiving a bitstream and a plurality of enhancement bitstreams together with receiving a video parameter set and a video parameter set extension. The system also receives an output layer set change message including information indicating a change in at least one output layer set.

IPC Classes  ?

  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
  • H04N 19/12 - Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
  • H04N 19/423 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
  • H04N 19/187 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
  • H04N 21/845 - Structuring of content, e.g. decomposing content into time segments
  • H04N 21/2343 - Processing of video elementary streams, e.g. splicing of video streams or manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
  • H04N 21/44 - Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to MPEG-4 scene graphs
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/177 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
  • H04N 19/597 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding

79.

METHODS AND DEVICES FOR CONTROLLING AUDIO PARAMETERS

      
Application Number 17278081
Status Pending
Filing Date 2019-09-18
First Publication Date 2022-04-28
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Saule, Lucas E.
  • Chen, Eugene
  • Derreveaux, Julien Guy Pierre
  • Siwak, Jakub
  • Brinkley, Daniel Christian

Abstract

A method of controlling headphones having external microphone signal pass-through functionality may involve controlling a display to present a geometric shape on the display and receiving an indication of digit motion from a sensor system associated with the display. The sensor system may include a touch sensor system or a gesture sensor system. The indication may be an indication of a direction of digit motion relative to the display. The method may involve controlling the display to present a sequence of images indicating that the geometric shape either enlarges or contracts, depending on the direction of digit motion, and changing a headphone transparency setting according to a current size of the geometric shape. The headphone transparency setting may correspond to an external microphone signal gain setting and/or a media signal gain setting of the headphones.

IPC Classes  ?

80.

SIGNAL CODING USING A GENERATIVE MODEL AND LATENT DOMAIN QUANTIZATION

      
Application Number EP2021078053
Publication Number 2022/078960
Status In Force
Filing Date 2021-10-11
Publication Date 2022-04-21
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Klejsa, Janusz
  • Villemoes, Lars
  • Hedelin, Per

Abstract

The present disclosure provides a decoder configured to receive a finite bitrate stream that includes a quantized latent frame, where the quantized latent frame includes a quantized representation of a current frame of a signal in a latent domain different from a first domain; to generate a reconstructed latent frame from the quantized latent frame; to use a generative neural network model to perform a task for which the generative neural network model has been trained, wherein the task includes to generate parameters for an invertible mapping from the latent domain to the first domain; to reconstruct a current frame of the signal in the first domain, which includes to map the reconstructed latent frame to the first domain by use of the invertible mapping, and to use the reconstructed current frame of the signal in the first domain to update a state of the generative neural network model.

IPC Classes  ?

  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
  • G06N 3/04 - Architecture, e.g. interconnection topology

81.

A GENERATIVE NEURAL NETWORK MODEL FOR PROCESSING AUDIO SAMPLES IN A FILTER-BANK DOMAIN

      
Application Number EP2021078652
Publication Number 2022/079263
Status In Force
Filing Date 2021-10-15
Publication Date 2022-04-21
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Ekstrand, Per
  • Klejsa, Janusz
  • Tinajero, Pedro Jafeth Villasana
  • Villemoes, Lars

Abstract

A neural network system is provided, implementing a generative model for autoregressively generating a distribution for a plurality of current filter-bank samples of an audio signal, wherein the current samples correspond to a current time slot, and each current sample corresponds to a channel of the filter-bank. The system includes a hierarchy of a plurality of neural network processing tiers ordered from a top to a bottom tier, each tier trained to generate conditioning information based on previous filter-bank samples and, for at least each tier but the top tier, also on the conditioning information from a tier higher up in the hierarchy, and an output stage trained to generate the probability distribution based on previous samples for one or more previous time slots and the conditioning information from the lowest processing tier.

IPC Classes  ?

  • G06N 3/08 - Learning methods
  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
  • G10L 21/00 - Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
  • H03H 17/02 - Frequency-selective networks

82.

A GENERAL MEDIA NEURAL NETWORK PREDICTOR AND A GENERATIVE MODEL INCLUDING SUCH A PREDICTOR

      
Application Number US2021054617
Publication Number 2022/081599
Status In Force
Filing Date 2021-10-12
Publication Date 2022-04-21
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Zhou, Cong
  • Vinton, Mark S.
  • Davidson, Grant A.
  • Villemoes, Lars

Abstract

A neural network system for predicting frequency coefficients of a media signal, the neural network system comprising a time predicting portion including at least one neural network trained to predict a first set of output variables representing a specific frequency band of a current time frame given coefficients of one or several previous time frames, and a frequency predicting portion including at least one neural network trained to predict a second set of output variables representing a specific frequency band given coefficients of one or several frequency bands adjacent to the specific frequency band in said current time frame. Such a neural network system forms a predictor capable of capturing both temporal and frequency dependencies occurring in time-frequency tiles of a media signal.

IPC Classes  ?

  • G10L 19/04 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
  • G10L 21/038 - Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
  • G06N 3/02 - Neural networks

83.

FRAME-LEVEL PERMUTATION INVARIANT TRAINING FOR SOURCE SEPARATION

      
Application Number US2021054737
Publication Number 2022/081678
Status In Force
Filing Date 2021-10-13
Publication Date 2022-04-21
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Liu, Xiaoyu
  • Pons Puig, Jordi

Abstract

Described is a method of training a deep-learning-based system for sound source separation. The system comprises a separation stage for frame-wise extraction of representations of sound sources from a representation of an audio signal, and a clustering stage for generating, for each frame, a vector indicative of an assignment permutation of extracted frames of representations of sound sources to respective sound sources. The representation of the audio signal is a waveform-based representation. The separation stage is trained using frame-level permutation invariant training. Further, the clustering stage is trained to generate embedding vectors for the frames of the audio signal that make it possible to determine estimates of respective assignment permutations between extracted sound signals and labels of sound sources that had been used for the frames. Also described is a method of using the deep-learning-based system for sound source separation.

IPC Classes  ?

  • G10L 21/0308 - Voice signal separating characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
  • G06N 3/08 - Learning methods
  • G06N 3/02 - Neural networks
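
Frame-level permutation invariant training, as named in the abstract above, can be sketched as follows: for every frame, the loss is evaluated under each assignment of network outputs to reference sources, and the cheapest assignment is kept. The squared-error loss, the nested-list data layout, and the function names are assumptions for illustration and are not taken from the application.

```python
from itertools import permutations

def squared_error(a, b):
    """Sum of squared differences between two equal-length sample lists."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def frame_level_pit_loss(estimates, targets):
    """Return (total_loss, best_permutation_per_frame).

    estimates[k][t] and targets[s][t] hold the samples of output k / source s
    in frame t. perm[s] is the output index assigned to source s in a frame.
    """
    n_sources = len(targets)
    n_frames = len(targets[0])
    total = 0.0
    chosen = []
    for t in range(n_frames):
        best_loss, best_perm = None, None
        for perm in permutations(range(n_sources)):
            loss = sum(
                squared_error(estimates[perm[s]][t], targets[s][t])
                for s in range(n_sources)
            )
            if best_loss is None or loss < best_loss:
                best_loss, best_perm = loss, perm
        total += best_loss
        chosen.append(best_perm)
    return total, chosen

# Two sources, two frames; the outputs swap sources in the second frame.
targets = [[[1, 1], [1, 1]], [[0, 0], [0, 0]]]
estimates = [[[1, 1], [0, 0]], [[0, 0], [1, 1]]]
loss, perms = frame_level_pit_loss(estimates, targets)
```

Because the assignment may change from frame to frame, a later stage has to recover which output belongs to which source in each frame; in the abstract this is the role given to the clustering stage.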

84.

REAL-TIME PACKET LOSS CONCEALMENT USING DEEP GENERATIVE NETWORKS

      
Application Number EP2021078443
Publication Number 2022/079164
Status In Force
Filing Date 2021-10-14
Publication Date 2022-04-21
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Pascual, Santiago
  • Serra, Joan
  • Pons Puig, Jordi

Abstract

The present disclosure relates to a method and system for performing packet loss concealment using a neural network system. The method comprises obtaining a representation of an incomplete audio signal, inputting the representation of the incomplete audio signal to an encoder neural network and outputting a latent representation of a predicted complete audio signal. The latent representation is input to a decoder neural network which outputs a representation of a predicted complete audio signal comprising a reconstruction of the original portion of the complete audio signal, wherein said encoder neural network and said decoder neural network have been trained with an adversarial neural network.

IPC Classes  ?

  • G10L 19/005 - Correction of errors induced by the transmission channel, if related to the coding algorithm
  • G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis

85.

METHOD AND APPARATUS FOR GENERATING AN INTERMEDIATE AUDIO FORMAT FROM AN INPUT MULTICHANNEL AUDIO SIGNAL

      
Application Number EP2021078444
Publication Number 2022/079165
Status In Force
Filing Date 2021-10-14
Publication Date 2022-04-21
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Arteaga, Daniel
  • Pons Puig, Jordi

Abstract

Described herein is a method for training a machine learning algorithm. The method may comprise receiving a first input multichannel audio signal. The method may comprise generating, using the machine learning algorithm, an intermediate audio signal based on the first input multichannel audio signal. The method may comprise rendering the intermediate audio signal into a first output multichannel audio signal. Further, the method may comprise improving the machine learning algorithm based on a difference between the first input multichannel audio signal and the first output multichannel audio signal. Described herein are further an apparatus for generating an intermediate audio format from an input multichannel audio signal as well as a respective computer program product comprising a computer-readable storage medium with instructions adapted to carry out said method when executed by a device having processing capability.

IPC Classes  ?

  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
  • H04S 1/00 - Two-channel systems
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
  • G10L 21/0272 - Voice signal separating
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

86.

METHOD AND APPARATUS FOR NEURAL NETWORK BASED PROCESSING OF AUDIO USING SINUSOIDAL ACTIVATION

      
Application Number EP2021078653
Publication Number 2022/079264
Status In Force
Filing Date 2021-10-15
Publication Date 2022-04-21
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor Biswas, Arijit

Abstract

Described herein is a method of processing an audio signal using a deep-learning-based generator, wherein the method includes the steps of: (a) inputting the audio signal into the generator for processing the audio signal; (b) mapping a time segment of the audio signal to a latent feature space representation, using an encoder stage of the generator; (c) upsampling the latent feature space representation using a decoder stage of the generator, wherein at least one layer of the decoder stage applies sinusoidal activation; and (d) obtaining, as an output from the decoder stage of the generator, a processed audio signal. Described are further a method for training said generator and respective apparatus, systems and computer program products.

IPC Classes  ?

87.

ATTENUATING WAVEFRONT DETERMINATION FOR NOISE REDUCTION

      
Application Number 17424155
Status Pending
Filing Date 2020-01-16
First Publication Date 2022-03-31
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Pires Arrifano, Angelo Miguel
  • Pertierra, Juan P.

Abstract

A system and method comprise a light source; a spatial light modulator including a substantially transparent material layer and a phase modulation layer; an imaging device configured to receive light from the light source as reflected by the spatial light modulator, and to generate image data; and a controller. The controller provides a phase-drive signal to the spatial light modulator and determines an attenuating wavefront of the substantially transparent material layer based on the image data.

IPC Classes  ?

  • G09G 3/36 - Control arrangements or circuits, of interest only in connection with visual indicators other than cathode-ray tubes for presentation of an assembly of a number of characters, e.g. a page, by composing the assembly by combination of individual elements arranged in a matrix by control of light from an independent source using liquid crystals
  • G02B 26/06 - Optical devices or arrangements for the control of light using movable or deformable optical elements for controlling the phase of light
  • G03H 1/04 - Processes or apparatus for producing holograms
  • G03H 1/22 - Processes or apparatus for obtaining an optical image from holograms

88.

ADAPTIVE NOISE ESTIMATION

      
Application Number US2021051162
Publication Number 2022/066590
Status In Force
Filing Date 2021-09-21
Publication Date 2022-03-31
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Scaini, Davide
  • Yeh, Chunghsin
  • Cengarle, Giulio
  • De Burgh, Mark David

Abstract

In some embodiments, a method comprises: dividing, using at least one processor, an audio input into speech and non-speech segments; for each frame in each non-speech segment, estimating, using the at least one processor, a time-varying noise spectrum of the non-speech segment; for each frame in each speech segment, estimating, using the at least one processor, a speech spectrum of the speech segment; for each frame in each speech segment, identifying one or more non-speech frequency components in the speech spectrum; comparing the one or more non-speech frequency components with one or more corresponding frequency components in a plurality of estimated noise spectra; and selecting the estimated noise spectrum from the plurality of estimated noise spectra based on a result of the comparing.

IPC Classes  ?

  • G10L 21/0232 - Processing in the frequency domain
  • G10L 25/78 - Detection of presence or absence of voice signals

89.

METHOD AND DEVICE FOR PROCESSING A BINAURAL RECORDING

      
Application Number US2021050534
Publication Number 2022/060891
Status In Force
Filing Date 2021-09-15
Publication Date 2022-03-24
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Shuang, Zhiwei
  • Ma, Yuanxing
  • Liu, Yang
  • Yang, Ziyu
  • Cengarle, Giulio

Abstract

The present invention relates to a method and device for processing a first and a second audio signal representing an input binaural audio signal acquired by a binaural recording device. The present invention further relates to a method for rendering a binaural audio signal on a speaker system. The method for processing a binaural signal comprises extracting audio information from the first audio signal, computing band gains for reducing noise in the first audio signal, and applying the band gains to respective frequency bands of the first audio signal in accordance with a dynamic scaling factor, to provide a first output audio signal, wherein the dynamic scaling factor has a value between zero and one and is selected so as to reduce quality degradation for the first audio signal.

IPC Classes  ?

90.

PROCESSING PARAMETRICALLY CODED AUDIO

      
Application Number US2021049285
Publication Number 2022/055883
Status In Force
Filing Date 2021-09-07
Publication Date 2022-03-17
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Breebaart, Dirk Jeroen
  • Eckert, Michael
  • Purnhagen, Heiko

Abstract

A method comprises receiving a first input bit stream for a first parametrically coded input audio signal, the first input bit stream including data representing a first input core audio signal and a first set including at least one spatial parameter relating to the first parametrically coded input audio signal. A first covariance matrix of the first parametrically coded audio signal is determined based on the spatial parameter(s) of the first set. A modified set including at least one spatial parameter is determined based on the determined first covariance matrix, wherein the modified set is different from the first set. An output core audio signal is determined, which is based on, or constituted by, the first input core audio signal. An output bit stream for a parametrically coded output audio signal is generated, the output bit stream including data representing the output core audio signal and the modified set.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/16 - Vocoder architecture

91.

Methods, Apparatus and Systems for Dual-Ended Media Intelligence

      
Application Number 17312011
Status Pending
Filing Date 2019-12-10
First Publication Date 2022-02-24
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Bai, Yanning
  • Gerrard, Mark William
  • Han, Richard
  • Wolters, Martin

Abstract

A method of encoding audio content comprises performing a content analysis of the audio content, generating classification information indicative of a content type of the audio content based on the content analysis, encoding the audio content and the classification information in a bitstream, and outputting the bitstream. A method of decoding audio content from a bitstream including audio content and classification information for the audio content, wherein the classification information is indicative of a content classification of the audio content, comprises receiving the bitstream, decoding the audio content and the classification information, and selecting, based on the classification information, a post processing mode for performing post processing of the decoded audio content. Selecting the post processing mode can involve calculating one or more control weights for post processing of the decoded audio content based on the classification information.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/16 - Vocoder architecture
  • G06F 16/65 - Clustering; Classification
  • G06F 16/68 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

92.

Method of Coding and Decoding Images, Coding and Decoding Device and Computer Programs Corresponding Thereto

      
Application Number 17460992
Status Pending
Filing Date 2021-08-30
First Publication Date 2022-02-24
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Henry, Felix
  • Clare, Gordon

Abstract

A method for encoding an image having been cut up into partitions. The method includes: predicting data of a current partition based on an already encoded and then decoded reference partition, generating a predicted partition; determining residual data by comparing data relating to the current partition with the predicted partition, the residual data associated with various digital data items. Prior to producing a signal containing the encoded information, performing the following steps: determining, from the predetermined residual data, a subset containing residual data capable of being modified; calculating the value of a function representative of the residual data; comparing the calculated value with a value of at least one of the digital data items; based on the comparison, modification or non-modification of at least one of the residual data items of the subset; and, in the event of a modification, entropy encoding the at least one modified residual data item.
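
The compare-and-modify steps can be illustrated with a parity-based sketch. The abstract does not name the function computed over the residual data, so the parity of the summed magnitudes is an assumption here, as are the function names and the choice of which coefficient to adjust.

```python
def read_bit(residuals):
    """Assumed representative function: parity of the sum of magnitudes."""
    return sum(abs(r) for r in residuals) % 2

def hide_bit(residuals, bit):
    """Return a copy of the residuals whose parity equals the data bit to convey."""
    out = list(residuals)
    if read_bit(out) != bit:
        # Mismatch: adjust one coefficient by a single step so the parity flips.
        # Picking the largest magnitude is an arbitrary choice for this sketch.
        idx = max(range(len(out)), key=lambda i: abs(out[i]))
        out[idx] += 1 if out[idx] >= 0 else -1
    return out

modified = hide_bit([3, -1, 0, 2], 1)   # parity 0 does not match, one item changes
unchanged = hide_bit([3, -1, 0, 2], 0)  # parity already matches, nothing changes
```

A decoder applying `read_bit` to the received residuals recovers the bit without it being transmitted explicitly, which is the general idea behind this family of data-hiding techniques.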

IPC Classes

  • H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/463 - Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
  • H04N 19/467 - Embedding additional information in the video signal during the compression process characterised by the embedded information being invisible, e.g. watermarking
  • H04N 19/117 - Filters, e.g. for pre-processing or post-processing
  • H04N 19/18 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
  • H04N 19/48 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using compressed domain processing techniques other than decoding, e.g. modification of transform coefficients, variable length coding [VLC] data or run-length data
  • H04N 19/147 - Data rate or code amount at the encoder output according to rate distortion criteria
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/14 - Coding unit complexity, e.g. amount of activity or edge presence estimation
  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
  • H04N 19/124 - Quantisation
  • H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
  • H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
  • H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
  • H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
  • H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
  • H04N 19/122 - Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type
  • H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field

93.

Binaural dialogue enhancement

      
Application Number 17465733
Grant Number 11641560
Status In Force
Filing Date 2021-09-02
First Publication Date 2022-02-24
Grant Date 2023-05-02
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Netherlands)
Inventor
  • Samuelsson, Leif Jonas
  • Breebaart, Dirk Jeroen
  • Cooper, David Matthew
  • Koppens, Jeroen

Abstract

Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.
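
The apply-then-combine structure can be sketched as a small matrix operation. This assumes, purely for illustration, that the dialogue estimation parameters form a mixing matrix applied to the channels of the first presentation and that the result is added to the second presentation with a user gain; the abstract does not specify either detail, and the function names are hypothetical.

```python
def apply_matrix(matrix, channels):
    """Multiply an (out x in) matrix with input channels given as lists of samples."""
    n = len(channels[0])
    return [
        [sum(row[c] * channels[c][i] for c in range(len(channels))) for i in range(n)]
        for row in matrix
    ]

def enhance_dialogue(first, second, dialogue_params, gain):
    """Estimate dialogue from the first presentation and add it to the second."""
    dialogue = apply_matrix(dialogue_params, first)
    return [
        [s + gain * d for s, d in zip(sec_ch, dlg_ch)]
        for sec_ch, dlg_ch in zip(second, dialogue)
    ]

first = [[1.0, 2.0], [3.0, 4.0]]    # e.g. a stereo presentation, two samples
second = [[0.0, 0.0], [1.0, 1.0]]   # e.g. a binaural presentation
params = [[0.5, 0.5], [0.5, 0.5]]   # hypothetical dialogue estimation matrix
enhanced = enhance_dialogue(first, second, params, gain=2.0)
```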

IPC Classes

  • H04S 1/00 - Two-channel systems
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04S 3/02 - Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
  • H04R 5/04 - Circuit arrangements

94.

AUTOMATIC DETECTION AND ATTENUATION OF SPEECH-ARTICULATION NOISE EVENTS

      
Application Number EP2021072384
Publication Number 2022/034139
Status In Force
Filing Date 2021-08-11
Publication Date 2022-02-17
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Yeh, Chunghsin
  • Cengarle, Giulio
  • De Burgh, Mark David

Abstract

Described is a method of performing automatic audio enhancement on an input audio signal including at least one speech-articulation noise event. The method comprises: segmenting the input audio signal into a number of audio frames; obtaining at least one feature parameter from the audio frames; and determining, based at least in part on the obtained feature parameter, a respective type of the speech-articulation noise event and a respective time-frequency range associated with the speech-articulation noise event within the input audio signal.

IPC Classes

  • G10L 15/04 - Segmentation; Word boundary detection
  • G10L 21/0264 - Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
  • G10L 21/034 - Automatic adjustment
  • G10L 25/93 - Discriminating between voiced and unvoiced parts of speech signals
  • G10L 21/0308 - Voice signal separating characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
  • G10L 25/09 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being zero crossing rates
  • G10L 25/21 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being power information
  • G10L 25/24 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being the cepstrum
  • G10L 25/84 - Detection of presence or absence of voice signals for discriminating voice from noise
  • G10L 21/0316 - Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude

95.

Methods and apparatus for rate quality scalable coding with generative models

      
Application Number 17290193
Grant Number 11621011
Status In Force
Filing Date 2019-10-29
First Publication Date 2022-02-10
Grant Date 2023-04-04
Owner Dolby International AB (Netherlands)
Inventor
  • Klejsa, Janusz
  • Hedelin, Per

Abstract

Described herein is a method of decoding an audio or speech signal, the method including the steps of: (a) receiving, by a decoder, a coded bitstream including the audio or speech signal and conditioning information; (b) providing, by a bitstream decoder, decoded conditioning information in a format associated with a first bitrate; (c) converting, by a converter, the decoded conditioning information from the format associated with the first bitrate to a format associated with a second bitrate; and (d) providing, by a generative neural network, a reconstruction of the audio or speech signal according to a probabilistic model conditioned by the conditioning information in the format associated with the second bitrate. Described are further an apparatus for decoding an audio or speech signal, a respective encoder, a system of the encoder and the apparatus for decoding an audio or speech signal as well as a respective computer program product.

IPC Classes

  • G10L 19/06 - Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
  • G10L 19/032 - Quantisation or dequantisation of spectral components
  • G10L 19/24 - Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks

96.

TRACKING A REFERENCE PICTURE ON AN ELECTRONIC DEVICE

      
Application Number 17408936
Status Pending
Filing Date 2021-08-23
First Publication Date 2022-02-10
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Misra, Kiran
  • Deshpande, Sachin G.
  • Segall, Christopher A.

Abstract

A method for tracking a reference picture on an electronic device is described. The method includes receiving a bitstream. The method also includes decoding a portion of the bitstream to produce a decoded reference picture. The method further includes tracking the decoded reference picture in a decoded picture buffer (DPB) with reduced overhead referencing. The method additionally includes decoding a picture based on the decoded reference picture.

IPC Classes

  • H04N 19/503 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
  • H04N 19/423 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
  • H04N 19/587 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal sub-sampling or interpolation, e.g. decimation or subsequent interpolation of pictures in a video sequence
  • H04N 19/573 - Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
  • H04N 19/58 - Motion compensation with long-term prediction, i.e. the reference frame for a current frame not being the temporally closest one

97.

HUM NOISE DETECTION AND REMOVAL FOR SPEECH AND MUSIC RECORDINGS

      
Application Number EP2021071148
Publication Number 2022/023415
Status In Force
Filing Date 2021-07-28
Publication Date 2022-02-03
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor Yeh, Chunghsin

Abstract

Described are methods of processing audio data for hum noise detection and/or removal. The audio data comprises a plurality of frames. One method includes: classifying frames of the audio data as either content frames or noise frames, using one or more content activity detectors; determining a noise spectrum from one or more frames of the audio data that are classified as noise frames; determining one or more hum noise frequencies based on the determined noise spectrum; generating an estimated hum noise signal based on the one or more hum noise frequencies; and removing hum noise from at least one frame of the audio data based on the estimated hum noise signal. Also described are apparatus for carrying out the methods, as well as corresponding programs and computer-readable storage media.
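
The detect, estimate, and subtract steps can be sketched with a single-tone model. Several simplifications are assumptions of this sketch and not taken from the abstract: frame classification is taken as already done, only 50 Hz and 60 Hz are tested as candidate hum frequencies, and every frame spans a whole number of hum cycles so that all frames start at the same hum phase. The function names are hypothetical.

```python
import math

def tone_projection(frame, freq, sample_rate):
    """Project a frame onto cosine and sine at freq; returns both amplitudes."""
    n = len(frame)
    w = 2.0 * math.pi * freq / sample_rate
    a = 2.0 / n * sum(x * math.cos(w * i) for i, x in enumerate(frame))
    b = 2.0 / n * sum(x * math.sin(w * i) for i, x in enumerate(frame))
    return a, b

def detect_hum(noise_frame, sample_rate, candidates=(50.0, 60.0)):
    """Pick the candidate frequency with the strongest component in a noise frame."""
    return max(
        candidates,
        key=lambda f: math.hypot(*tone_projection(noise_frame, f, sample_rate)),
    )

def remove_hum(frame, noise_frame, sample_rate, candidates=(50.0, 60.0)):
    """Estimate the hum from a noise frame and subtract it from another frame."""
    freq = detect_hum(noise_frame, sample_rate, candidates)
    a, b = tone_projection(noise_frame, freq, sample_rate)
    w = 2.0 * math.pi * freq / sample_rate
    # Estimated hum signal, valid only under the phase-alignment assumption above.
    return [x - (a * math.cos(w * i) + b * math.sin(w * i)) for i, x in enumerate(frame)]
```

A practical system would track several harmonics and the hum phase over time; the sketch only shows how a noise-only frame can supply the estimate that is then removed from a content frame.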

IPC Classes

  • G10L 21/0216 - Noise filtering characterised by the method used for estimating noise
  • G10L 25/78 - Detection of presence or absence of voice signals
  • G10L 21/0232 - Processing in the frequency domain
  • G10L 21/0208 - Noise filtering

98.

AUDIO PROCESSING IN IMMERSIVE AUDIO SERVICES

      
Application Number 17292457
Status Pending
Filing Date 2019-11-12
First Publication Date 2022-01-20
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Bruhn, Stefan
  • Torres, Juan Felix
  • Mcgrath, David S.
  • Lee, Brian

Abstract

The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to received spatial data.

IPC Classes

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04R 3/00 - Circuits for transducers
  • H04R 5/027 - Spatial or constructional arrangements of microphones, e.g. in dummy heads

99.

PACKET LOSS CONCEALMENT

      
Application Number EP2021068774
Publication Number 2022/008571
Status In Force
Filing Date 2021-07-07
Publication Date 2022-01-13
Owner DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Mundt, Harald
  • Bruhn, Stefan
  • Purnhagen, Heiko
  • Plain, Simon
  • Schug, Michael

Abstract

Described are methods of processing an audio signal for packet loss concealment. The audio signal comprises a sequence of frames, each frame containing representations of a plurality of audio channels and reconstruction parameters for upmixing the plurality of audio channels to a predetermined channel format. One method includes: receiving the audio signal; and generating a reconstructed audio signal in the predefined channel format based on the received audio signal. Generating the reconstructed audio signal comprises: determining whether at least one frame of the audio signal has been lost; and if a number of consecutively lost frames exceeds a first threshold, fading the reconstructed audio signal to a predefined spatial configuration. Also described is a method of encoding an audio signal. Yet further described are apparatus for carrying out the methods, as well as corresponding programs and computer-readable storage media.
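
The threshold-and-fade behaviour can be sketched as a small state machine over the reconstruction parameters. The representation is an assumption made for illustration: each frame carries a list of numeric parameters, a lost frame is marked with `None`, and fading is a linear step toward a default parameter set. The names, threshold, and step size are hypothetical.

```python
def conceal(frames, default_params, threshold=2, fade_step=0.5):
    """Return the parameters used per frame, fading to defaults on long losses."""
    used = []
    current = list(default_params)
    lost = 0  # number of consecutively lost frames
    for params in frames:
        if params is not None:
            lost = 0
            current = list(params)
        else:
            lost += 1
            if lost > threshold:
                # Too many losses in a row: move toward the predefined configuration.
                current = [p + fade_step * (d - p) for p, d in zip(current, default_params)]
            # Otherwise the last good parameters are simply repeated.
        used.append(list(current))
    return used

received = [[1.0, 0.0], None, None, None, None]
trace = conceal(received, default_params=[0.0, 0.0])
```

In this run the first two losses repeat the last good parameters, and from the third loss onward the parameters move halfway to the default on each frame.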

IPC Classes

  • G10L 19/005 - Correction of errors induced by the transmission channel, if related to the coding algorithm
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

100.

TRANSMISSION-AGNOSTIC PRESENTATION-BASED PROGRAM LOUDNESS

      
Application Number 17372295
Status Pending
Filing Date 2021-07-09
First Publication Date 2022-01-06
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Netherlands)
Inventor
  • Koppens, Jeroen
  • Norcross, Scott Gregory

Abstract

This disclosure falls into the field of audio coding, in particular it is related to the field of providing a framework for providing loudness consistency among differing audio output signals. In particular, the disclosure relates to methods, computer program products and apparatus for encoding and decoding of audio data bitstreams in order to attain a desired loudness level of an output audio signal.

IPC Classes

  • G10L 19/16 - Vocoder architecture
  • G10L 19/24 - Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
  • G10L 21/034 - Automatic adjustment