Dolby Laboratories Licensing Corporation

United States of America

Back to Profile

1-100 of 2,337 for Dolby Laboratories Licensing Corporation Sort by
Query
Patent
United States - USPTO
Aggregations Reset Report
Date
New (last 4 weeks) 21
2024 April (MTD) 7
2024 March 20
2024 February 14
2024 January 16
See more
IPC Class
H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control 357
G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing 347
H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic 294
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock 141
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding 130
See more
Status
Pending 258
Registered / In Force 2,079
Found results for  patents
  1     2     3     ...     24        Next Page

1.

CANVAS SIZE SCALABLE VIDEO CODING

      
Application Number 18544411
Status Pending
Filing Date 2023-12-18
First Publication Date 2024-04-11
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Lu, Taoran
  • Pu, Fangjun
  • Yin, Peng
  • Mccarthy, Sean Thomas
  • Chen, Tao

Abstract

Methods and systems for canvas size scalability across the same or different bitstream layers of a video coded bitstream are described. Offset parameters for a conformance window, a reference region of interest (ROI) in a reference layer, and a current ROI in a current layer are received. The width and height of a current ROI and a reference ROI are computed based on the offset parameters and they are used to generate a width and height scaling factor to be used by a reference picture resampling unit to generate an output picture based on the current ROI and the reference ROI.

IPC Classes  ?

  • H04N 19/513 - Processing of motion vectors
  • H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/33 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the spatial domain

2.

INTRA-PREDICTION FOR HEXAGONALLY-SAMPLED VIDEO AND IMAGE COMPRESSION

      
Application Number 18264311
Status Pending
Filing Date 2022-02-10
First Publication Date 2024-04-04
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Zhang, Zhaobin
  • Gadgil, Neeraj J.
  • Su, Guan-Ming

Abstract

Methods, systems, and devices implement intra-prediction for hexagonally-sampled compression and decompression of videos and images having a regular grid of hexagonally-shaped pixels. For encoding, a prediction unit (PU) shape is selected at a sequence level from the group consisting of parallelogram, zigzag-square, hexagonal super-pixel, a rectangular zigzag and an arrow, and the hexagonally-sampled image is divided into regions based on the PU shape. For each region: a prediction mode and a PU size are determined; reference pixels are determined for each predicted pixel in the PU shape based on the prediction mode; a weighted factor is determined for each of the reference pixels based on a distance between the reference pixel and the predicted pixel; and a predicted value of each of the predicted pixels in the PU shape is determined using the corresponding reference pixels and the weighted factors.

IPC Classes  ?

  • H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
  • H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
  • H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock

3.

REPRESENTING SPATIAL AUDIO BY MEANS OF AN AUDIO SIGNAL AND ASSOCIATED METADATA

      
Application Number 18465636
Status Pending
Filing Date 2023-09-12
First Publication Date 2024-04-04
Owner
  • DOLBY INTERNATIONAL AB (Ireland)
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor Bruhn, Stefan

Abstract

There is provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.

IPC Classes  ?

  • H04S 3/02 - Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other

4.

AUDIO FILTERBANK WITH DECORRELATING COMPONENTS

      
Application Number 17683762
Status Pending
Filing Date 2020-09-02
First Publication Date 2024-04-04
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor Mcgrath, David S.

Abstract

An multi-input, multi-output audio process is implemented as a linear system for use in an audio filterbank to convert a set of frequency-domain input audio signals into a set of frequency-domain output audio signals. A transfer function from one input to one output is defined as a frequency dependent gain function. In some implementations, the transfer function includes a direct component that is substantially defined as a frequency dependent gain, and one or more decorrelated components that have frequency-varying group phase response. The transfer function is formed from a set of sub-band functions, with each sub-band function being formed from a set of corresponding component transfer functions including direct component and one or more decorrelated components.

IPC Classes  ?

  • H04S 3/02 - Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
  • H04S 5/00 - Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation

5.

CROSS-ASSET GUIDE CHROMA REFORMATTING FOR MULTI-ASSET IMAGING FORMAT

      
Application Number 18460377
Status Pending
Filing Date 2023-09-01
First Publication Date 2024-04-04
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Choudhury, Anustup Kumar Atanu
  • Su, Guan-Ming

Abstract

A first image and a second image of different dynamic ranges are derived from the same source image. Based on a chroma sampling format of the first image, it is determined whether edge preserving filtering is to be used to generate chroma upsampled image data in a reconstructed image. If so, image metadata for performing the edge preserving filtering is generated. The first image, the second image and the image metadata are encoded into an image data container to enable a recipient device to generate the reconstructed image.

IPC Classes  ?

  • H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
  • G06V 10/25 - Determination of region of interest [ROI] or a volume of interest [VOI]
  • H04N 19/117 - Filters, e.g. for pre-processing or post-processing
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/59 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
  • H04N 19/98 - Adaptive-dynamic-range coding [ADRC]

6.

PROGRESSIVE CALCULATION AND APPLICATION OF RENDERING CONFIGURATIONS FOR DYNAMIC APPLICATIONS

      
Application Number 18255582
Status Pending
Filing Date 2021-12-02
First Publication Date 2024-04-04
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Lando, Joshua B.
  • Seefeldt, Alan J.

Abstract

Some examples involve rendering received audio data by determining a first relative activation of a set of loudspeakers in an environment according to a first rendering configuration corresponding to a first set of speaker activations, receiving a first rendering transition indication indicating a transition from the first rendering configuration to a second rendering configuration and determining a second set of speaker activations corresponding to a simplified version of the second rendering configuration. Some examples involve performing a first transition from the first set of speaker activations to the second set of speaker activations, determining a third set of speaker activations corresponding to a complete version of the second rendering configuration and performing a second transition to the third set of speaker activations without requiring completion of the first transition.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

7.

FREQUENCY DOMAIN MULTIPLEXING OF SPATIAL AUDIO FOR MULTIPLE LISTENER SWEET SPOTS

      
Application Number 18255309
Status Pending
Filing Date 2021-12-02
First Publication Date 2024-04-04
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Seefeldt, Alan J.
  • Brown, C. Phillip

Abstract

Some methods involve receiving, by a control system that is configured for implementing a plurality of renderers, audio data and listening configuration data for a plurality of listening configurations, each listening configuration of the plurality of listening configurations corresponding to a listening position and a listening orientation in an audio environment, and rendering, by each renderer and according to the listening configuration data, the received audio data to obtain a set of renderer-specific loudspeaker feed signals for a corresponding listening configuration. Each renderer may be configured to render the audio data for a different listening configuration. Some such methods may involve decomposing each set of renderer-specific loudspeaker feed signals into a renderer-specific set of frequency bands and combining the renderer-specific frequency bands of each renderer to produce an output set of loudspeaker feed signals.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

8.

FREQUENCY DOMAIN MULTIPLEXING OF SPATIAL AUDIO FOR MULTIPLE LISTENER SWEET SPOTS

      
Application Number 18255251
Status Pending
Filing Date 2021-12-02
First Publication Date 2024-03-28
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Seefeldt, Alan J.
  • Brown, C. Phillip

Abstract

Some methods involve receiving, by a control system configured for implementing a plurality of Tenderers, audio data and listening configuration data for a plurality of listening configurations, each listening configuration of the plurality of listening configurations corresponding to a listening position and a listening orientation in an audio environment, and rendering, by each Tenderer and according to the listening configuration data, the received audio data to obtain a set of Tenderer-specific loudspeaker feed signals for a corresponding listening configuration. Each Tenderer may be configured to render the audio data for a different listening configuration. Some such methods may involve decomposing each set of renderer-specific loudspeaker feed signals into a Tenderer-specific set of frequency bands and combining the renderer-specific frequency bands of each Tenderer to produce an output set of loudspeaker feed signals. Some such methods may involve outputting the output set of loudspeaker feed signals to a plurality of loudspeakers.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

9.

SPATIAL NOISE FILLING IN MULTI-CHANNEL CODEC

      
Application Number 18255506
Status Pending
Filing Date 2021-12-01
First Publication Date 2024-03-28
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Tyagi, Rishabh
  • Eckert, Michael

Abstract

Embodiments are disclosed for spatial noise filling in multi-channel codecs. In an embodiment, a method of regenerating background noise ambience in a multi-channel codec by generating spatial hole filling noise comprises: computing noise estimates based on a primary downmix channel generated from an input audio signal representing a spatial audio scene with background noise ambience; computing spectral shaping filter coefficients based on the noise estimates; spectrally shaping the multi-channel noise signal using the spectral shaping filter coefficients and a noise distribution, the spectral shaping resulting in a diffused, multi-channel noise signal with uncorrelated channels; spatially shaping the diffused, uncorrelated multi-channel noise signal with uncorrelated channels based on a noise ambience of the spatial audio scene; and adding the spatially and spectrally shaped multi-channel noise to a multi-channel codec output to synthesize the background noise ambience of the spatial audio scene.

IPC Classes  ?

  • G10L 19/03 - Spectral prediction for preventing pre-echo; Temporary noise shaping [TNS], e.g. in MPEG2 or MPEG4
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 21/0216 - Noise filtering characterised by the method used for estimating noise

10.

SYSTEM AND METHOD FOR OPTIMIZING LOUDNESS AND DYNAMIC RANGE ACROSS DIFFERENT PLAYBACK DEVICES

      
Application Number 18483082
Status Pending
Filing Date 2023-10-09
First Publication Date 2024-03-28
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Riedmiller, Jeffrey
  • Norcross, Scott Gregory
  • Roeden, Karl Jonas

Abstract

Embodiments are directed to a method and system for receiving, in a bitstream, metadata associated with the audio data, and analyzing the metadata to determine whether a loudness parameter for a first group of audio playback devices are available in the bitstream. Responsive to determining that the parameters are present for the first group, the system uses the parameters and audio data to render audio. Responsive to determining that the loudness parameters are not present for the first group, the system analyzes one or more characteristics of the first group, and determines the parameter based on the one or more characteristics.

IPC Classes  ?

  • G06F 3/16 - Sound input; Sound output
  • H03G 9/00 - Combinations of two or more types of control, e.g. gain control and tone control
  • H04R 29/00 - Monitoring arrangements; Testing arrangements

11.

Audio Encoding and Decoding Using Presentation Transform Parameters

      
Application Number 18487232
Status Pending
Filing Date 2023-10-16
First Publication Date 2024-03-28
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Breebaart, Dirk Jeroen
  • Cooper, David Matthew
  • Samuelsson, Leif Jonas
  • Koppens, Jeroen
  • Wilson, Rhonda J.
  • Purnhagen, Heiko
  • Stahlmann, Alexander

Abstract

A method for encoding an input audio stream including the steps of obtaining a first playback stream presentation of the input audio stream intended for reproduction on a first audio reproduction system, obtaining a second playback stream presentation of the input audio stream intended for reproduction on a second audio reproduction system, determining a set of transform parameters suitable for transforming an intermediate playback stream presentation to an approximation of the second playback stream presentation, wherein the transform parameters are determined by minimization of a measure of a difference between the approximation of the second playback stream presentation and the second playback stream presentation, and encoding the first playback stream presentation and the set of transform parameters for transmission to a decoder.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G06F 3/16 - Sound input; Sound output
  • H04L 65/70 - Media network packetisation
  • H04L 65/75 - Media network packet handling
  • H04S 1/00 - Two-channel systems
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

12.

INSERTION OF FORCED GAPS FOR PERVASIVE LISTENING

      
Application Number 18254962
Status Pending
Filing Date 2021-12-02
First Publication Date 2024-03-28
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Hines, Christopher Graham
  • Southwell, Benjamin John

Abstract

An attenuation or “gap” may be inserted into at least a first frequency range of at least first and second audio playback signals of a content stream during at least a first time interval to generate at least first and second modified audio playback signals. Corresponding audio device playback sound may be provided by at least first and second audio devices. At least one microphone may detect at least the first audio device playback sound and the second audio device playback sound and may generate corresponding microphone signals. Audio data may be extracted from the microphone signals in at least the first frequency range, to produce extracted audio data. A far-field audio environment impulse response and/or audio environment noise may be estimated based, at least in part, on the extracted audio data.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic

13.

IMAGE ENHANCEMENT VIA GLOBAL AND LOCAL RESHAPING

      
Application Number 18262611
Status Pending
Filing Date 2022-01-26
First Publication Date 2024-03-21
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Su, Guan-Ming
  • Kadu, Harshad
  • Klittmark, Per Jonas Andreas
  • Chen, Tao

Abstract

A first reshaping mapping is performed on a first image represented in a first domain to generate a second image represented in a second domain. The first domain is of a first dynamic range different from a second dynamic range of which the second domain is. A second reshaping mapping is performed on the second image represented in the second domain to generate a third image represented in the first domain. The third image is perceptually different from the first image in at least one of: global contrast, global saturation, local contrast, local saturation, etc. A display image is derived from the third image and rendered on a display device.

IPC Classes  ?

  • G06T 5/00 - Image enhancement or restoration
  • G06V 10/60 - Extraction of image or video features relating to illumination properties, e.g. using a reflectance or lighting model

14.

METHOD AND DEVICE FOR APPLYING DYNAMIC RANGE COMPRESSION TO A HIGHER ORDER AMBISONICS SIGNAL

      
Application Number 18505494
Status Pending
Filing Date 2023-11-09
First Publication Date 2024-03-21
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Boehm, Johannes
  • Keiler, Florian

Abstract

A method for performing DRC on a HOA signal comprises transforming the HOA signal to the spatial domain, analyzing the transformed HOA signal, and obtaining, from results of said analyzing, gain factors that are usable for dynamic compression. The gain factors can be transmitted together with the HOA signal. When applying the DRC, the HOA signal is transformed to the spatial domain, the gain factors are extracted and multiplied with the transformed HOA signal in the spatial domain, wherein a gain compensated transformed HOA signal is obtained. The gain compensated transformed HOA signal is transformed back into the HOA domain, wherein a gain compensated HOA signal is obtained. The DRC may be applied in the QMF-filter bank domain.

IPC Classes  ?

  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • H04S 3/02 - Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other

15.

SYSTEMS AND METHODS FOR LOCAL DIMMING IN MULTI-MODULATION DISPLAYS

      
Application Number 18518082
Status Pending
Filing Date 2023-11-22
First Publication Date 2024-03-21
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Shields, Jerome
  • Richards, Martin J.
  • Pertierra, Juan P.

Abstract

Dual and multi-modulator projector display systems and techniques are disclosed. In one embodiment, a projector display system comprises a light source; a controller, a first modulator, receiving light from the light source and rendering a halftone image of said the input image; a blurring optical system that blurs said halftone image with a Point Spread Function (PSF); and a second modulator receiving the blurred halftone image and rendering a pulse width modulated image which may be projected to form the desired screen image. Systems and techniques for forming a binary halftone image from input image, correcting for misalignment between the first and second modulators and calibrating the projector system—e.g. over time—for continuous image improvement are also disclosed.

IPC Classes  ?

  • H04N 9/31 - Projection devices for colour picture display
  • B65B 11/04 - Wrapping articles or quantities of material, without changing their position during the wrapping operation, e.g. in moulds with hinged folders the articles being rotated
  • B65B 11/48 - Enclosing articles, or quantities of material, by folding a wrapper, e.g. a pocketed wrapper, and securing its opposed free margins to enclose contents
  • B65B 11/58 - Applying two or more wrappers, e.g. in succession
  • B65B 49/08 - Reciprocating or oscillating folders
  • B65B 51/06 - Applying adhesive tape
  • B65B 55/00 - Preserving, protecting or purifying packages or package contents in association with packaging
  • B65B 61/06 - Auxiliary devices, not otherwise provided for, for operating on sheets, blanks, webs, binding material, containers or packages for severing webs, or for separating joined packages by cutting
  • B65B 61/26 - Auxiliary devices, not otherwise provided for, for operating on sheets, blanks, webs, binding material, containers or packages for marking or coding completed packages
  • G03B 21/00 - Projectors or projection-type viewers; Accessories therefor
  • G03B 21/13 - Projectors for producing special effects at the edges of picture, e.g. blurring
  • G03B 21/20 - Lamp housings

16.

HEAD TRACKED SPATIAL AUDIO AND/OR VIDEO RENDERING

      
Application Number 18520413
Status Pending
Filing Date 2023-11-27
First Publication Date 2024-03-21
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Ninan, Ajit
  • Rozzi, William Anthony

Abstract

Images are acquired through image sensors operating in conjunction with a media consumption system. The acquired images are used to determine a user's movement in a plurality of degrees of freedom. Sound images depicted in spatial audio rendered by audio speakers operating in conjunction with the media consumption system are adapted based at least in part on the user's movement in the plurality of degrees of freedom.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • G06F 3/01 - Input arrangements or combined input and output arrangements for interaction between user and computer
  • G06T 7/20 - Analysis of motion
  • G06T 7/73 - Determining position or orientation of objects or cameras using feature-based methods

17.

METHOD OF RENDERING ONE OR MORE CAPTURED AUDIO SOUNDFIELDS TO A LISTENER

      
Application Number 18469498
Status Pending
Filing Date 2023-09-18
First Publication Date 2024-03-21
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Cartwright, Richard J.
  • Mcgrath, David S.
  • Dickins, Glenn N.

Abstract

A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.

IPC Classes  ?

  • H04S 1/00 - Two-channel systems
  • H04M 3/56 - Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
  • H04R 3/12 - Circuits for transducers for distributing signals to two or more loudspeakers
  • H04R 5/033 - Headphones for stereophonic communication
  • H04R 5/04 - Circuit arrangements
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

18.

AUDIO CHANNEL SPATIAL TRANSLATION

      
Application Number 18474170
Status Pending
Filing Date 2023-09-25
First Publication Date 2024-03-21
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor Davis, Mark F.

Abstract

The present invention is directed to methods and apparatus for translating a first plurality of audio input channels to a second plurality of audio output channels. This includes determining that there is pair-wise coding among any of the first plurality of audio input channels, determining an input/output-mapping matrix for mapping at least a first set of the first plurality of audio input channels to at least a second set of the second plurality of audio output channels; and deriving the second plurality of audio output channels based on first plurality of audio input channels, the input/output-mapping matrix and the determined pair-wise coding. The first plurality of audio input channels represent the same soundfield represented by the second plurality of audio output channels.

IPC Classes  ?

  • H04S 5/00 - Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation

19.

VIDEO CODING METHOD AND APPARATUS USING ANY TYPES OF BLOCK PARTITIONING

      
Application Number 18523309
Status Pending
Filing Date 2023-11-29
First Publication Date 2024-03-21
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Ryu, Ho Chan
  • Ahn, Yong Jo

Abstract

The present invention relates to a block partitioning structure in video coding technology, and a video encoding and decoding method and apparatus using the same, wherein the video encoding and decoding method includes the steps of: acquiring quad-partitioning information of a block; acquiring bi-partitioning information of the block when the acquired quad-partitioning information of the block does not indicate four partitions; acquiring partitioning direction information for bi-partitioning of the block when the acquired bi-partitioning information of the block indicates two partitions; acquiring information on whether to perform any other type of partitioning, when the acquired bi-partitioning information of the block does not indicate two partitions; and acquiring additional information required for the any other type of partitioning, when the acquired information on whether to perform any other type of partitioning indicates that the any other type of partitioning is performed.

IPC Classes  ?

  • H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/66 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience involving data partitioning, i.e. separation of data into packets or partitions according to importance

20.

Frame-rate scalable video coding

      
Application Number 18506758
Grant Number 11936888
Status In Force
Filing Date 2023-11-10
First Publication Date 2024-03-14
Grant Date 2024-03-19
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Atkins, Robin
  • Yin, Peng
  • Lu, Taoran
  • Pu, Fangjun
  • Mccarthy, Sean Thomas
  • Husak, Walter J.
  • Chen, Tao
  • Su, Guan-Ming

Abstract

Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

IPC Classes  ?

  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/187 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
  • H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
  • H04N 19/31 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the temporal domain
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

21.

IMAGE ENCODING AND DECODING APPARATUS, AND IMAGE ENCODING AND DECODING METHOD

      
Application Number 18516398
Status Pending
Filing Date 2023-11-21
First Publication Date 2024-03-14
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Han, Jong Ki
  • Seo, Chan Won
  • Choi, Kwang Hyun

Abstract

According to the present invention, an adaptive scheme is applied to an image encoding apparatus that includes an inter-predictor, an intra-predictor, a transformer, a quantizer, an inverse quantizer, and an inverse transformer, wherein input images are classified into two or more different categories, and two or more modules from among the inter-predictor, the intra-predictor, the transformer, the quantizer, and the inverse quantizer are implemented to perform respective operations in different schemes according to the category to which an input image belongs. Thus, the invention has the advantage of efficiently encoding an image without the loss of important information as compared to a conventional image encoding apparatus which adopts a packaged scheme.

IPC Classes  ?

  • H04N 19/124 - Quantisation
  • H04L 45/745 - Address table lookup; Address filtering
  • H04N 19/11 - Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
  • H04N 19/117 - Filters, e.g. for pre-processing or post-processing
  • H04N 19/12 - Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
  • H04N 19/136 - Incoming video signal characteristics or properties
  • H04N 19/14 - Coding unit complexity, e.g. amount of activity or edge presence estimation
  • H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding

22.

METHODS AND SYSTEMS FOR RENDERING OBJECT BASED AUDIO

      
Application Number 18470165
Status Pending
Filing Date 2023-09-19
First Publication Date 2024-03-07
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Mehta, Sripal S.
  • Ziegler, Thomas
  • Baker, Giles
  • Riedmiller, Jeffrey
  • Saungsomboon, Prinyar

Abstract

Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G06F 3/16 - Sound input; Sound output
  • G10L 19/20 - Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

23.

CODING AND DECODING OF INTERLEAVED IMAGE DATA

      
Application Number 18503711
Status Pending
Filing Date 2023-11-07
First Publication Date 2024-03-07
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Tourapis, Alexandros
  • Husak, Walter J.
  • Pahalawatta, Peshala V.
  • Leontaris, Athanasios

Abstract

Sampled data is packaged in checkerboard format for encoding and decoding. The sampled data may be quincunx sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the decoded sub-images of each frame reformats the images for a display device. A 3D video may be encoded using a most advantageous interleaving format such that a preferred quality and compression ratio is reached. In one embodiment, the invention includes a display device that accepts data in multiple formats.

IPC Classes  ?

  • H04N 19/597 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
  • H04N 13/139 - Format conversion, e.g. of frame-rate or size
  • H04N 13/161 - Encoding, multiplexing or demultiplexing different image signal components
  • H04N 13/194 - Transmission of image signals
  • H04N 19/112 - Selection of coding mode or of prediction mode according to a given display mode, e.g. for interlaced or progressive display mode
  • H04N 19/132 - Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
  • H04N 19/16 - Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter for a given display mode, e.g. for interlaced or progressive display mode
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/33 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the spatial domain
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/587 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal sub-sampling or interpolation, e.g. decimation or subsequent interpolation of pictures in a video sequence
  • H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
  • H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
  • H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
  • H04N 21/2365 - Multiplexing of several video streams
  • H04N 21/2383 - Channel coding of digital bit-stream, e.g. modulation
  • H04N 21/434 - Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams or extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
  • H04N 21/438 - Interfacing the downstream path of the transmission network originating from a server, e.g. retrieving MPEG packets from an IP network

24.

QUANTIZATION PARAMETER SIGNALING

      
Application Number 18506828
Status Pending
Filing Date 2023-11-10
First Publication Date 2024-03-07
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Pu, Fangjun
  • Lu, Taoran
  • Yin, Peng
  • Mccarthy, Sean Thomas

Abstract

A quantization parameter signalling mechanism for both SDR and HDR content in video coding is described using two approaches. The first approach is to send the user-defined QpC table directly in high level syntax. This leads to more flexible and efficient QP control for future codec development and video content coding. The second approach is to signal luma and chroma QPs independently. This approach eliminates the need for QpC tables and removes the dependency of chroma quantization parameter on luma QP.

IPC Classes  ?

  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
  • H04N 19/46 - Embedding additional information in the video signal during the compression process

25.

PERCEPTUALLY-BASED LOSS FUNCTIONS FOR AUDIO ENCODING AND DECODING BASED ON MACHINE LEARNING

      
Application Number 18507824
Status Pending
Filing Date 2023-11-13
First Publication Date 2024-03-07
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Fejgin, Roy M.
  • Davidson, Grant A.
  • Wu, Chih-Wei
  • Kumar, Vivek

Abstract

Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.

IPC Classes  ?

  • G10L 19/022 - Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
  • G06F 3/16 - Sound input; Sound output
  • G06N 3/048 - Activation functions
  • G06N 3/084 - Backpropagation, e.g. using gradient descent

26.

PERCEPTUAL ENHANCEMENT FOR BINAURAL AUDIO RECORDING

      
Application Number 18257862
Status Pending
Filing Date 2021-12-14
First Publication Date 2024-03-07
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Ma, Yuanxing
  • Shuang, Zhiwei
  • Liu, Yang

Abstract

A method of audio processing includes capturing a binaural audio signal, calculating noise reduction gains using a machine learning model, and generating a modified binaural audio signal. The method may further including performing various corrections to the audio to account for video captured by different cameras such as a front camera and a rear camera. The method may further include performing smooth switching of the binaural audio when switching between the front camera and the rear camera. In this manner, noise may be reduced in the binaural audio, and the user perception of the combined video and binaural audio may be improved.

IPC Classes  ?

  • H04R 1/10 - Earpieces; Attachments therefor
  • H04R 5/04 - Circuit arrangements
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

27.

FRAME-RATE SCALABLE VIDEO CODING

      
Application Number 18508088
Status Pending
Filing Date 2023-11-13
First Publication Date 2024-03-07
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Atkins, Robin
  • Yin, Peng
  • Lu, Taoran
  • Pu, Fangjun
  • Mccarthy, Sean Thomas
  • Husak, Walter J.
  • Chen, Tao
  • Su, Guan-Ming

Abstract

Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

IPC Classes  ?

  • H04N 19/31 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the temporal domain
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/187 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
  • H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

28.

DETERMINING DIALOG QUALITY METRICS OF A MIXED AUDIO SIGNAL

      
Application Number 18259848
Status Pending
Filing Date 2022-01-04
First Publication Date 2024-02-29
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Sun, Jundai
  • Lu, Lie
  • Yang, Shaofan
  • Wilson, Rhonda J.
  • Breebaart, Dirk Jeroen

Abstract

Disclosed is a method for determining one or more dialog quality metrics of a mixed audio signal comprising a dialog component and a noise component, the method comprising separating an estimated dialog component from the mixed audio signal by means of a dialog separator using a dialog separating model determined by training the dialog separator based on the one or more quality metrics; providing the estimated dialog component from the dialog separator to a quality metrics estimator; and determining the one or more quality metrics by means of the quality metrics estimator based on the mixed signal and the estimated dialog component. Further disclosed is a method for training a dialog separator, a system comprising circuitry configured to perform the method, and a non-transitory computer-readable storage medium.

IPC Classes  ?

  • G10L 25/60 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
  • G10L 21/0272 - Voice signal separating

29.

METHOD AND DEVICE FOR ENCODING AND DECODING IMAGE USING MOTION VECTOR RESOLUTION SCALING

      
Application Number 18504337
Status Pending
Filing Date 2023-11-08
First Publication Date 2024-02-29
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Han, Jong Ki
  • Lee, Jae Yung

Abstract

A video encoding method according to an embodiment of the present invention includes generating header information that includes information about resolutions of motion vectors of respective blocks, determined based on motion prediction for a unit image. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions. Further, a video decoding method according to another embodiment of the present invention includes extracting information about resolutions of motion vectors of each unit image from header information included in a target bitstream to be decoded; and a decoding unit for decoding the unit image based on the resolution information. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions.

IPC Classes  ?

  • H04N 19/53 - Multi-resolution motion estimation; Hierarchical motion estimation
  • H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
  • H04N 19/136 - Incoming video signal characteristics or properties
  • H04N 19/17 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
  • H04N 19/27 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding involving both synthetic and natural picture components, e.g. synthetic natural hybrid coding [SNHC]
  • H04N 19/50 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
  • H04N 19/51 - Motion estimation or motion compensation
  • H04N 19/523 - Motion estimation or motion compensation with sub-pixel accuracy
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

30.

MULTIPLE STAGE MODULATION PROJECTOR DISPLAY SYSTEMS HAVING EFFICIENT LIGHT UTILIZATION

      
Application Number 17589736
Status Pending
Filing Date 2022-01-31
First Publication Date 2024-02-29
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor Richards, Martin J.

Abstract

Dual or multi-modulation display systems comprising a first modulator and a second modulator are disclosed. The first modulator may comprise a plurality of analog mirrors (e.g. MEMS array) and the second modulator may comprise a plurality of mirrors (e.g., DMD array). The display system may further comprise a controller that sends control signals to the first and second modulator. The display system may render highlight features within a projected image by affecting a time multiplexing scheme. In one embodiment, the first modulator may be switched on a sub-frame basis such that a desired proportion of the available light may be focused or directed onto the second modulator to form the highlight feature on a sub-frame rendering basis.

IPC Classes  ?

  • H04N 5/74 - Projection arrangements for image reproduction, e.g. using eidophor
  • H04N 9/31 - Projection devices for colour picture display

31.

Signal reshaping for high dynamic range signals

      
Application Number 18385724
Grant Number 11910025
Status In Force
Filing Date 2023-10-31
First Publication Date 2024-02-20
Grant Date 2024-02-20
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Atkins, Robin
  • Yin, Peng
  • Lu, Taoran
  • Pytlarz, Jaclyn Anne

Abstract

In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.

IPC Classes  ?

  • H04N 19/87 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving scene cut or scene change detection in combination with video compression
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/98 - Adaptive-dynamic-range coding [ADRC]
  • H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
  • H04N 1/60 - Colour correction or control

32.

ORCHESTRATION OF ACOUSTIC DIRECT SEQUENCE SPREAD SPECTRUM SIGNALS FOR ESTIMATION OF ACOUSTIC SCENE METRICS

      
Application Number 18255550
Status Pending
Filing Date 2021-12-02
First Publication Date 2024-02-15
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Southwell, Benjamin John
  • Gunawan, David
  • Thomas, Mark R.P.
  • Hines, Christopher Graham

Abstract

Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first direct sequence spread spectrum (DSSS) signals, generating first modified audio playback signals by inserting the first DSSS signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth DSSS signals) played back by second through Nth audio devices, extracting second through Nth DSSS signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth DSSS signals.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

33.

METHOD FOR AND APPARATUS FOR DECODING/RENDERING AN AMBISONICS AUDIO SOUNDFIELD REPRESENTATION FOR AUDIO PLAYBACK USING 2D SETUPS

      
Application Number 18457030
Status Pending
Filing Date 2023-08-28
First Publication Date 2024-02-15
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Keiler, Florian
  • Boehm, Johannes

Abstract

Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor ℊ = 1 L . Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor ℊ = 1 L . The first matrix is determined based on positions of the L loudspeakers and at least a virtual position of at least a virtual loudspeaker that is added to the positions of the L loudspeakers.

IPC Classes  ?

  • H04S 3/02 - Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

34.

BINAURAL SIGNAL POST-PROCESSING

      
Application Number 18258041
Status Pending
Filing Date 2021-12-16
First Publication Date 2024-02-15
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Ireland)
Inventor
  • Breebaart, Dirk Jeroen
  • Cengarle, Giulio
  • Brown, C. Phillip

Abstract

A method of audio processing includes performing spatial analysis on a binaural signal to estimate level differences and phase differences characteristic of a binaural filter of the binaural signal, performing object extraction on the binaural audio signal using the estimated level and phase differences to generate a left/right main component signal and a left/right residual component signal. The system may process the left/right main and left/right residual components differently using different object processing parameters for e.g. repositioning, equalization, compression, upmixing, channel remapping or storage to generate a processed binaural signal that provides an improved listening experience. Repositioning may be based on head tracking sensor data.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic

35.

MULTISOURCE MEDIA DELIVERY SYSTEMS AND METHODS

      
Application Number 18256987
Status Pending
Filing Date 2021-12-16
First Publication Date 2024-02-15
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Riedmiller, Jeffrey
  • Yu, Mingchao
  • Cloud, Jason Michael

Abstract

A method for delivering media content to one or more clients over a distributed system is disclosed. The method may include generating a plurality of network-coded symbols from a plurality of original symbols representing a first media asset. The method may further include generating an original plurality of coded variants of the first media asset. The method may further include distributing a first coded variant of the original plurality of coded variants to a first cache on a first server device for storage in the first cache. The method may further include distributing a second coded variant of the original plurality of coded variants to a second cache on a second server device for storage in the second cache.

IPC Classes  ?

  • H04N 21/60 - Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client; Communication details between server and client
  • H04N 21/2183 - Cache memory

36.

SOURCE COLOR VOLUME INFORMATION MESSAGING

      
Application Number 18486697
Status Pending
Filing Date 2023-10-13
First Publication Date 2024-02-15
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Chen, Tao
  • Yin, Peng
  • Lu, Taoran
  • Husak, Walter J.

Abstract

Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.

IPC Classes  ?

  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/14 - Coding unit complexity, e.g. amount of activity or edge presence estimation
  • H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
  • H04N 19/20 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding

37.

METHOD FOR ENCODING AND DECODING IMAGE USING ADAPTIVE DEBLOCKING FILTERING, AND APPARATUS THEREFOR

      
Application Number 18493447
Status Pending
Filing Date 2023-10-24
First Publication Date 2024-02-15
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Jeong, Je Chang
  • Kim, Ki Baek

Abstract

Disclosed is an encoding/decoding method and apparatus related to adaptive deblocking filtering. There is provided an image decoding method performing adaptive filtering in inter-prediction, the method including: reconstructing, from a bitstream, an image signal including a reference block on which block matching is performed in inter-prediction of a current block to be encoded; obtaining, from the bitstream, a flag indicating whether the reference block exists within a current picture where the current block is positioned; reconstructing the current block by using the reference block; adaptively applying an in-loop filter for the reconstructed current block based on the obtained flag; and storing the current block to which the in-loop filter is or is not applied in a decoded picture buffer (DPB).

IPC Classes  ?

  • H04N 19/82 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
  • H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
  • H04N 19/51 - Motion estimation or motion compensation
  • H04N 19/117 - Filters, e.g. for pre-processing or post-processing
  • H04N 19/50 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
  • H04N 19/58 - Motion compensation with long-term prediction, i.e. the reference frame for a current frame not being the temporally closest one
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/577 - Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/137 - Motion inside a coding unit, e.g. average field, frame or block difference
  • H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
  • H04N 19/107 - Selection of coding mode or of prediction mode between spatial and temporal predictive coding, e.g. picture refresh
  • H04N 19/124 - Quantisation
  • H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
  • H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding

38.

PERSONALIZED HRTFS VIA OPTICAL CAPTURE

      
Application Number 18455565
Status Pending
Filing Date 2023-08-24
First Publication Date 2024-02-08
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Joyner, Mcgregor Steele
  • Brandmeyer, Alex
  • Daly, Scott
  • Baker, Jeffrey Ross
  • Fanelli, Andrea
  • Crum, Poppy Anne Carrie

Abstract

An apparatus and method of generating personalized HRTFs. The system is prepared by calculating a model for HRTFs described as the relationship between a finite example set of input data, namely anthropometric measures and demographic information for a set of individuals, and a corresponding set of output data, namely HRTFs numerically simulated using a high-resolution database of 3D scans of the same set of individuals. At the time of use, the system queries the user for their demographic information, and then from a series of images of the user, the system detects and measures various anthropometric characteristics. The system then applies the prepared model to the anthropometric and demographic data as part of generating a personalized HRTF. In this manner, the personalized HRTF can be generated with more convenience than by performing a high-resolution scan or an acoustic measurement of the user, and with less computational complexity than by numerically simulating their HRTF.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • G06T 7/11 - Region-based segmentation
  • G06T 7/70 - Determining position or orientation of objects or cameras
  • H04S 1/00 - Two-channel systems
  • G06V 40/10 - Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
  • G06F 18/214 - Generating training patterns; Bootstrap methods, e.g. bagging or boosting

39.

ORCHESTRATION OF ACOUSTIC DIRECT SEQUENCE SPREAD SPECTRUM SIGNALS FOR ESTIMATION OF ACOUSTIC SCENE METRICS

      
Application Number 18255499
Status Pending
Filing Date 2021-12-02
First Publication Date 2024-02-08
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Southwell, Benjamin John
  • Gunawan, David
  • Thomas, Mark R.P.
  • Hines, Christopher Graham

Abstract

Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first direct sequence spread spectrum (DSSS) signals, generating first modified audio playback signals by inserting the first DSSS signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth DSSS signals) played back by second through Nth audio devices, extracting second through Nth DSSS signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth DSSS signals.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04R 5/02 - Spatial or constructional arrangements of loudspeakers
  • H04R 5/04 - Circuit arrangements

40.

AUDIO CONTENT IDENTIFICATION

      
Application Number 18022125
Status Pending
Filing Date 2021-08-18
First Publication Date 2024-02-01
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Wang, Guiping
  • Lu, Lie

Abstract

A method of audio content identification includes using a two-stage classifier. The first stage includes previously-existing classifiers and the second stage includes a new classifier. The outputs of the first and second stages calculated over different time periods are combined to generate a steering signal. The final classification results from a combination of the steering signal and the outputs of the first and second stages. In this manner, a new classifier may be added without disrupting existing classifiers.

IPC Classes  ?

  • G10L 25/81 - Detection of presence or absence of voice signals for discriminating voice from music
  • G10L 15/08 - Speech classification or search
  • G10L 15/02 - Feature extraction for speech recognition; Selection of recognition unit

41.

ACOUSTIC FEEDBACK MANAGEMENT IN REAL-TIME AUDIO COMMUNICATION

      
Application Number 18258302
Status Pending
Filing Date 2021-12-22
First Publication Date 2024-02-01
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Fang, Qianqian
  • Li, Kai
  • Guo, Yanmeng
  • Huang, Wei
  • Liu, Yang

Abstract

Disclosed is a method for managing acoustic feedback in real-time audio communications in a communications system, the method comprising determining, by means of a detection module, whether a first communication device is in loudspeaker mode, whether the first communication device is in real-time audio communications with a second communication, and whether the first communication device and the second communication device are in a same acoustic space. Upon determining that this is the case a request signal for requesting one or more measures against acoustic feedback is provided to a mitigation module. Further disclosed are a device and a system configured to perform the method, a non-transitory computer-readable medium, an encoder and a decoder.

IPC Classes  ?

  • H04M 9/08 - Two-way loud-speaking telephone systems with means for conditioning the signal, e.g.  for suppressing echoes for one or both directions of traffic
  • H04M 3/40 - Applications of speech amplifiers
  • H04R 3/02 - Circuits for transducers for preventing acoustic reaction

42.

VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD

      
Application Number 18356044
Status Pending
Filing Date 2023-07-20
First Publication Date 2024-02-01
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Wang, Jun
  • Lu, Lie
  • Seefeldt, Alan J.

Abstract

Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.

IPC Classes  ?

  • H03G 7/00 - Volume compression or expansion in amplifiers
  • H03G 3/30 - Automatic control in amplifiers having semiconductor devices
  • H03G 3/32 - Automatic control in amplifiers having semiconductor devices the control being dependent upon ambient noise level or sound level
  • H03G 5/16 - Automatic control
  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
  • G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination
  • G10L 21/0364 - Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility

43.

METHOD AND DEVICE FOR DECODING A HIGHER-ORDER AMBISONICS (HOA) REPRESENTATION OF AN AUDIO SOUNDFIELD

      
Application Number 18359198
Status Pending
Filing Date 2023-07-26
First Publication Date 2024-02-01
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Boehm, Johannes
  • Keiler, Florian

Abstract

The invention discloses rendering sound field signals, such as Higher-Order Ambisonics (HOA), for arbitrary loudspeaker setups, where the rendering results in highly improved localization properties and is energy preserving. This is obtained by rendering an audio sound field representation for arbitrary spatial loudspeaker setups and/or by a a decoder that decodes based on a decode matrix (D). The decode matrix (D) is based on smoothing and scaling of a first decode matrix {circumflex over (D)} with smoothing coefficients. The first decode matrix {circumflex over (D)} is based on a mix matrix G and a mode matrix {tilde over (ψ)}, where the mix matrix G was determined based on L speakers and positions of a spherical modelling grid related to a HOA order N, and the mode matrix {tilde over (ψ)} was determined based on the spherical modelling grid and the HOA order N.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic

44.

ACOUSTIC ENVIRONMENT SIMULATION

      
Application Number 18366385
Status Pending
Filing Date 2023-08-07
First Publication Date 2024-02-01
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor Breebaart, Dirk Jeroen

Abstract

Encoding/decoding an audio signal having one or more audio components, wherein each audio component is associated with a spatial location. A first audio signal presentation (z) of the audio components, a first set of transform parameters (w(f)), and signal level data (β2) are encoded and transmitted to the decoder. The decoder uses the first set of transform parameters (w(f)) to form a reconstructed simulation input signal intended for an acoustic environment simulation, and applies a signal level modification (α) to the reconstructed simulation input signal. The signal level modification is based on the signal level data (β2) and data (p2) related to the acoustic environment simulation. The attenuated reconstructed simulation input signal is then processed in an acoustic environment simulator. With this process, the decoder does not need to determine the signal level of the simulation input signal, thereby reducing processing load.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/012 - Comfort noise or silence coding
  • G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders

45.

METHODS, APPARATUS AND SYSTEMS FOR POSITION-BASED GAIN ADJUSTMENT OF OBJECT-BASED AUDIO

      
Application Number 18353063
Status Pending
Filing Date 2023-07-15
First Publication Date 2024-01-25
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Tsingos, Nicolas R.
  • Mcgrath, David S.
  • Sanchez, Freddie
  • Mateos Sole, Antonio

Abstract

The positions of a plurality of speakers at a media consumption site are determined. Audio information in an object-based format is received. Gain adjustment value for a sound content portion in the object-based format may be determined based on the position of the sound content portion and the positions of the plurality of speakers. Audio information in a ring-based channel format is received. Gain adjustment value for each ring-based channel in a set of ring-based channels may be determined based on the ring to which the ring-based channel belongs and the positions of the speakers at a media consumption site.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

46.

FRAME-RATE SCALABLE VIDEO CODING

      
Application Number 18477511
Status Pending
Filing Date 2023-09-28
First Publication Date 2024-01-25
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Atkins, Robin
  • Yin, Peng
  • Lu, Taoran
  • Pu, Fangjun
  • Mccarthy, Sean Thomas
  • Husak, Walter J.
  • Chen, Tao
  • Su, Guan-Ming

Abstract

Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

IPC Classes  ?

  • H04N 19/31 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the temporal domain
  • H04N 19/187 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

47.

PROCESSING OF EXTENDED DIMENSION LIGHT FIELD IMAGES

      
Application Number 18255583
Status Pending
Filing Date 2021-12-02
First Publication Date 2024-01-25
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor Atkins, Robin

Abstract

In one embodiment, methods, media, and systems process and display light field images using a view function that is based on pixel locations in the image and on the viewer's distance (observer's Z position) from the display. The view function can be an angular view function that specifies different angular views for different pixels in the light field image based on the inputs that can include: the x or y pixel location in the image, the viewer's distance from the display, and the viewer's angle relative to the display. In one embodiment, light field metadata, such as angular range metadata and/or angular offset metadata can be used to process and display the image. In one embodiment, color volume mapping metadata can be used to adjust color volume mapping based on the determined angular views; and the color volume mapping metadata can also be adjusted based on angular offset metadata.

IPC Classes  ?

  • H04N 13/117 - Transformation of image signals corresponding to virtual viewpoints, e.g. spatial image interpolation the virtual viewpoint locations being selected by the viewers or determined by viewer tracking
  • H04N 13/366 - Image reproducers using viewer tracking
  • H04N 13/388 - Volumetric displays, i.e. systems where the image is built up from picture elements distributed through a volume
  • H04N 13/232 - Image signal generators using stereoscopic image cameras using a single 2D image sensor using fly-eye lenses, e.g. arrangements of circular lenses
  • H04N 13/178 - Metadata, e.g. disparity information
  • H04N 23/957 - Light-field or plenoptic cameras or camera modules

48.

Alias cancelling during audio coding mode transitions

      
Application Number 17589228
Grant Number RE049813
Status In Force
Filing Date 2022-01-31
First Publication Date 2024-01-23
Grant Date 2024-01-23
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Oh, Hyen-O
  • Lee, Chang Heon
  • Kang, Hong-Goo
  • Song, Jeungook

Abstract

An apparatus for processing an audio signal and method thereof are disclosed. The present invention includes receiving, by an audio processing apparatus, an audio signal including a first data of a first block encoded with rectangular coding scheme and a second data of a second block encoded with non-rectangular coding scheme; receiving a compensation signal corresponding to the second block; estimating a prediction of an aliasing part using the first data; and, obtaining a reconstructed signal for the second block based on the second data, the compensation signal and the prediction of aliasing part.

IPC Classes  ?

  • G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
  • G10L 25/45 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of analysis window
  • G10L 21/00 - Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
  • G10L 19/04 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
  • G10L 19/022 - Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
  • G10L 19/18 - Vocoders using multiple modes
  • G10L 19/005 - Correction of errors induced by the transmission channel, if related to the coding algorithm

49.

AUTOMATIC GENERATION AND SELECTION OF TARGET PROFILES FOR DYNAMIC EQUALIZATION OF AUDIO CONTENT

      
Application Number 18253850
Status Pending
Filing Date 2021-11-18
First Publication Date 2024-01-18
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Cengarle, Giulio
  • Engel, Nicholas Laurence
  • Scannell, Patrick Winfrey
  • Scaini, Davide

Abstract

In an embodiment, a method comprises: filtering reference audio content items to separate the reference audio content items into different frequency bands; for each frequency band, extracting a first feature vector from at least a portion of each of the reference audio content items, wherein the first feature vector includes at least one audio characteristic of the reference audio content items; obtaining at least one semantic label from at least a portion of each of the reference audio content items; obtaining a second feature vector consisting of the first feature vectors per frequency band and the at least one semantic label; generating, based on the second feature vector, cluster feature vectors representing centroids of clusters; separating the reference audio content items according to the cluster feature vectors; and computing an average target profile for each cluster based on the reference audio content items in the cluster.

IPC Classes  ?

  • H03G 5/00 - Tone control or bandwidth control in amplifiers
  • H04R 3/04 - Circuits for transducers for correcting frequency response
  • G10L 25/21 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being power information
  • G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
  • G10L 15/18 - Speech classification or search using natural language modelling

50.

SYSTEM FOR MAINTAINING REVERSIBLE DYNAMIC RANGE CONTROL INFORMATION ASSOCIATED WITH PARAMETRIC AUDIO CODERS

      
Application Number 18355168
Status Pending
Filing Date 2023-07-19
First Publication Date 2024-01-18
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Riedmiller, Jeffrey
  • Roeden, Karl J.
  • Kjoerling, Kristofer
  • Purnhagen, Heiko
  • Melkote, Vinay
  • Sehlstrom, Leif

Abstract

On the basis of a bitstream (P), an n-channel audio signal (X) is reconstructed by deriving an m-channel core signal (Y) and multichannel coding parameters (a) from the bitstream, where 1≤m

IPC Classes  ?

  • E21B 33/138 - Plastering the borehole wall; Injecting into the formation
  • E21B 41/00 - Equipment or details not covered by groups
  • E21B 21/00 - Methods or apparatus for flushing boreholes, e.g. by use of exhaust air from motor
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/16 - Vocoder architecture
  • G10L 19/18 - Vocoders using multiple modes
  • G10L 19/24 - Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

51.

AUTOMATIC LOCALIZATION OF AUDIO DEVICES

      
Application Number 18255554
Status Pending
Filing Date 2021-12-02
First Publication Date 2024-01-18
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Ireland)
Inventor
  • Arteaga, Daniel
  • Scaini, Davide
  • Thomas, Mark R.P.
  • Bruni, Avery
  • Townsend, Olha Michelle

Abstract

A method may involve: receiving direction of arrival (DOA) data corresponding to sound emitted by at least a first smart audio device of the audio environment that includes a first audio transmitter and a first audio receiver, the DOA data corresponding to sound received by at least a second smart audio device of the audio environment that includes a second audio transmitter and a second audio receiver, the DOA data corresponding to sound emitted by at least the second smart audio device and received by at least the first smart audio device; receiving one or more configuration parameters corresponding to the audio environment, to one or more audio devices, or both; and minimizing a cost function based at least in part on the DOA data and the configuration parameter(s), to estimate a position and an orientation of at least the first smart audio device and the second smart audio device.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04R 5/02 - Spatial or constructional arrangements of loudspeakers
  • H04R 3/00 - Circuits for transducers

52.

PROCESSING OF MICROPHONE SIGNALS FOR SPATIAL PLAYBACK

      
Application Number 18352197
Status Pending
Filing Date 2023-07-13
First Publication Date 2024-01-11
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor Mcgrath, David S.

Abstract

Disclosed are methods and systems which convert a multi-microphone input signal to a multichannel output signal making use of a time- and frequency-varying matrix. For each time and frequency tile, the matrix is derived as a function of a dominant direction of arrival and a steering strength parameter. Likewise, the dominant direction and steering strength parameter are derived from characteristics of the multi-microphone signals, where those characteristics include values representative of the inter-channel amplitude and group-delay differences.

IPC Classes  ?

  • H04R 3/00 - Circuits for transducers
  • H04R 1/40 - Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
  • H04R 5/04 - Circuit arrangements

53.

ADAPTIVE NOISE ESTIMATION

      
Application Number 18044777
Status Pending
Filing Date 2021-09-21
First Publication Date 2024-01-11
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Scaini, Davide
  • Yeh, Chunghsin
  • Cengarle, Giulio
  • De Burgh, Mark David

Abstract

In some embodiments, a method, comprises: dividing, using at least one processor, an audio input into speech and non-speech segments; for each frame in each non-speech segment, estimating, using the at least one processor, a time-varying noise spectrum of the non-speech segment; for each frame in each speech segment, estimating, using the at least one processor, speech spectrum of the speech segment; for each frame in each speech segment, identifying one or more non-speech frequency components in the speech spectrum; comparing the one or more non-speech frequency components with one or more corresponding frequency components in a plurality of estimated noise spectra and selecting the estimated noise spectrum from the plurality of estimated noise spectra based on a result of the comparing.

IPC Classes  ?

  • G10L 21/0232 - Processing in the frequency domain
  • G10L 21/028 - Voice signal separating using properties of sound source
  • G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
  • G10L 25/84 - Detection of presence or absence of voice signals for discriminating voice from noise
  • G10L 21/034 - Automatic adjustment
  • G10L 21/0364 - Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
  • G10L 25/21 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being power information

54.

ROTATION OF SOUND COMPONENTS FOR ORIENTATION-DEPENDENT CODING SCHEMES

      
Application Number 18255232
Status Pending
Filing Date 2021-12-02
First Publication Date 2024-01-11
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Bruhn, Stefan
  • Mundt, Harald
  • Mcgrath, David S.
  • Brown, Stefanie

Abstract

Method for encoding scene-based audio is provided. In some implementations, the method involves determining, by an encoder, a spatial direction of a dominant sound component in a frame of an input audio signal. In some implementations, the method involves determining rotation parameters based on the determined spatial direction and a direction preference of a coding scheme to be used to encode the input audio signal. In some implementations, the method involves rotating sound components of the frame based on the rotation parameters such that, after being rotated, the dominant sound component has a spatial direction that aligns with the direction preference of the coding scheme. In some implementations, the method involves encoding the rotated sound components of the frame of the input audio signal using the coding scheme in connection with an indication of the rotation parameters or an indication of the spatial direction of the dominant sound component.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/002 - Dynamic bit allocation
  • G10L 19/032 - Quantisation or dequantisation of spectral components

55.

FRAME-RATE SCALABLE VIDEO CODING

      
Application Number 18334306
Status Pending
Filing Date 2023-06-13
First Publication Date 2024-01-11
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Atkins, Robin
  • Yin, Peng
  • Lu, Taoran
  • Pu, Fangjun
  • Mccarthy, Sean Thomas
  • Husak, Walter J.
  • Chen, Tao
  • Su, Guan-Ming

Abstract

Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

IPC Classes  ?

  • H04N 19/31 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the temporal domain
  • H04N 19/187 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards

56.

METHODS AND DEVICES FOR ENCODING AND/OR DECODING IMMERSIVE AUDIO SIGNALS

      
Application Number 18349427
Status Pending
Filing Date 2023-07-10
First Publication Date 2024-01-04
Owner
  • DOLBY LABORATORIES LICENSING CORPORATION (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Mcgrath, David S.
  • Eckert, Michael
  • Purnhagen, Heiko
  • Bruhn, Stefan

Abstract

The present document describes a method (700) for encoding a multi-channel input signal (201). The method (700) comprises determining (701) a plurality of downmix channel signals (203) from the multi-channel input signal (201) and performing (702) energy compaction of the plurality of downmix channel signals (203) to provide a plurality of compacted channel signals (404). Furthermore, the method (700) comprises determining (703) joint coding metadata (205) based on the plurality of compacted channel signals (404) and based on the multi-channel input signal (201), wherein the joint coding metadata (205) is such that it allows upmixing of the plurality of compacted channel signals (404) to an approximation of the multi-channel input signal (201). In addition, the method (700) comprises encoding (704) the plurality of compacted channel signals (404) and the joint coding metadata (205).

IPC Classes  ?

  • G10L 19/16 - Vocoder architecture
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/18 - Vocoders using multiple modes

57.

WRAPPED RESHAPING FOR CODEWORD AUGMENTATION WITH NEIGHBORHOOD CONSISTENCY

      
Application Number 18252357
Status Pending
Filing Date 2021-11-10
First Publication Date 2024-01-04
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Horvath, Janos
  • Kadu, Harshad
  • Su, Guan-Ming

Abstract

An input image of a first bit depth in an input domain is received. Forward reshaping operations are performed on the input image to generate a forward reshaped image of a second bit depth in a reshaping domain. An image container containing image data derived from the forward reshaped image is encoded into an output video signal of the second bit depth.

IPC Classes  ?

  • H04N 19/98 - Adaptive-dynamic-range coding [ADRC]
  • H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
  • H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component

58.

FRAME-LEVEL PERMUTATION INVARIANT TRAINING FOR SOURCE SEPARATION

      
Application Number 18248801
Status Pending
Filing Date 2021-10-13
First Publication Date 2024-01-04
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Liu, Xiaoyu
  • Pons Puig, Jordi

Abstract

Described is a method of training a deep-learning-based system for sound source separation. The system comprises a separation stage for frame-wise extraction of representations of sound sources from a representation of an audio signal, and a clustering stage for generating, for each frame, a vector indicative of an assignment permutation of extracted frames of representations of sound sources to respective sound sources. The representation of the audio signal is a waveform-based representation. The separation stage is trained using frame-level permutation invariant training. Further, the clustering stage is trained to generate embedding vectors for the frames of the audio signal that allow to determine estimates of respective assignment permutations between extracted sound signals and labels of sound sources that had been used for the frames. Also described is a method of using the deep-learning-based system for sound source separation.

IPC Classes  ?

  • G10L 21/028 - Voice signal separating using properties of sound source

59.

METHODS, APPARATUS AND SYSTEMS FOR DECOMPRESSING A HIGHER ORDER AMBISONICS (HOA) SIGNAL

      
Application Number 18339368
Status Pending
Filing Date 2023-06-22
First Publication Date 2024-01-04
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Kordon, Sven
  • Krueger, Alexander
  • Wuebbolt, Oliver

Abstract

A method for compressing a HOA signal being an input HOA representation with input time frames (C(k)) of HOA coefficient sequences comprises spatial HOA encoding of the input time frames and subsequent perceptual encoding and source encoding. Each input time frame is decomposed (802) into a frame of predominant sound signals (XPS(k−1)) and a frame of an ambient HOA component ({tilde over (C)}AMB(k−1)). The ambient HOA component ({tilde over (C)}AMB(k−1)) comprises, in a layered mode, first HOA coefficient sequences of the input HOA representation (cn(k−1)) in lower positions and second HOA coefficient sequences (cAMB,n(k−1)) in remaining higher positions. The second HOA coefficient sequences are part of an HOA representation of a residual between the input HOA representation and the HOA representation of the predominant sound signals.

IPC Classes  ?

  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/24 - Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control

60.

SIGNAL RESHAPING AND CODING FOR HDR AND WIDE COLOR GAMUT SIGNALS

      
Application Number 18470353
Status Pending
Filing Date 2023-09-19
First Publication Date 2024-01-04
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Yin, Peng
  • Lu, Taoran
  • Pu, Fangjun
  • Chen, Tao
  • Husak, Walter J.

Abstract

In a method to improve the coding efficiency of high-dynamic range (HDR) images, a decoder parses sequence processing set (SPS) data from an input coded bitstream to detect that an HDR extension syntax structure is present in the parsed SPS data. It extracts from the HDR extension syntax structure post-processing information that includes one or more of a color space enabled flag, a color enhancement enabled flag, an adaptive_reshaping_enabled_flag, a dynamic range conversion flag, a color correction enabled flag, or an SDR_viewable_flag. It decodes the input bitstream to generate a preliminary output decoded signal, and generates a second output signal based on the preliminary output signal and the post-processing information.

IPC Classes  ?

  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
  • H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
  • H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
  • G06T 5/00 - Image enhancement or restoration

61.

TIMESTAMP SMOOTHING TO REMOVE JITTER

      
Application Number 18252998
Status Pending
Filing Date 2021-11-17
First Publication Date 2023-12-28
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Prema Thasarathan, Shanush
  • Wang, Ning
  • Samarasekera, Senaka Chandranath

Abstract

Embodiments are disclosed for timestamp smoothing to remove jitter. In some embodiments, a method of smoothing timestamps associated with audio packets comprises: receiving, using at least one processor, a series of input timestamps for audio packets and their respective packet lengths; estimating, using the at least one processor, an initial timestamp based on the series of input timestamps, the packet lengths and a sample time; calculating, using the at least one processor, a predicted timestamp based on the estimated initial timestamp; and smoothing, using the at least one processor, the predicted timestamp.

IPC Classes  ?

  • H04L 47/283 - Flow control; Congestion control in relation to timing considerations in response to processing delays, e.g. caused by jitter or round trip time [RTT]
  • H04L 43/106 - Active monitoring, e.g. heartbeat, ping or trace-route using time related information in packets, e.g. by adding timestamps
  • H04L 41/147 - Network analysis or design for predicting network behaviour

62.

SUBBAND DOMAIN ACOUSTIC ECHO CANCELLER BASED ACOUSTIC STATE ESTIMATOR

      
Application Number 18255573
Status Pending
Filing Date 2021-12-02
First Publication Date 2023-12-28
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Southwell, Benjamin John
  • Gunawan, David
  • Hines, Christopher Graham

Abstract

Some implementations involve receiving, from a first subband domain acoustic echo canceller (AEC) of a first audio device in an audio environment, first adaptive filter management data from each of a plurality of first adaptive filter management modules, each first adaptive filter management module corresponding to a subband of the first subband domain AEC, each first adaptive filter management module being configured to control a first plurality of adaptive filters. The first plurality of adaptive filters may include at least a first adaptive filter type and a second adaptive filter type. Some implementations involve extracting, from the first adaptive filter management data, a first plurality of extracted features corresponding to a plurality of subbands of the first subband domain AEC and estimating a current local acoustic state based, at least in part, on the first plurality of extracted features.

IPC Classes  ?

  • H04R 3/02 - Circuits for transducers for preventing acoustic reaction

63.

POST-PROCESSING GAINS FOR SIGNAL ENHANCEMENT

      
Application Number 18344782
Status Pending
Filing Date 2023-06-29
First Publication Date 2023-12-28
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Sun, Xuejing
  • Dickins, Glenn N.

Abstract

A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The decision-directed gain smoothing comprises converting the raw gain to a signal-to-noise ratio, applying a smoothing filter with a smoothing factor to the signal-to-noise ratio to calculate a smoothed signal-to-noise ratio, and converting the smoothed signal-to-noise ratio to determine the second smoothed gain, with smoothing factor possibly dependent on the gain delta.

IPC Classes  ?

  • G10L 21/0364 - Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
  • G10L 21/0316 - Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
  • G10K 11/16 - Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
  • H03G 3/32 - Automatic control in amplifiers having semiconductor devices the control being dependent upon ambient noise level or sound level
  • G10L 21/0224 - Processing in the time domain
  • G10L 21/034 - Automatic adjustment
  • G10L 25/78 - Detection of presence or absence of voice signals
  • H03G 3/30 - Automatic control in amplifiers having semiconductor devices

64.

DIRECTED INTERPOLATION AND DATA POST-PROCESSING

      
Application Number 18466957
Status Pending
Filing Date 2023-09-14
First Publication Date 2023-12-28
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Tourapis, Alexandros
  • Leontaris, Athanasios
  • Pahalawatta, Peshala V.
  • Stec, Kevin J.

Abstract

An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on availability of the algorithm/method at the decoder/post-processor and/or cost of implementation. The video stream may comprise, for example, downsampled multiplexed stereoscopic images and the selected algorithm may include any of upconversion and/or error correction techniques that contribute to a restoration of the downsampled images.

IPC Classes  ?

  • H04N 19/597 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
  • H04N 19/80 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
  • H04N 19/17 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
  • H04N 13/161 - Encoding, multiplexing or demultiplexing different image signal components
  • H04N 13/172 - Processing image signals image signals comprising non-image signal components, e.g. headers or format information
  • H04N 13/178 - Metadata, e.g. disparity information
  • H04N 13/218 - Image signal generators using stereoscopic image cameras using a single 2D image sensor using spatial multiplexing
  • H04N 19/154 - Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
  • H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
  • H04N 19/895 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder in combination with error concealment
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/86 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
  • H04N 7/01 - Conversion of standards
  • H04N 21/434 - Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams or extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/587 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal sub-sampling or interpolation, e.g. decimation or subsequent interpolation of pictures in a video sequence
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/423 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements

65.

DIRECTED INTERPOLATION AND DATA POST-PROCESSING

      
Application Number 18466961
Status Pending
Filing Date 2023-09-14
First Publication Date 2023-12-28
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Tourapis, Alexandros
  • Leontaris, Athanasios
  • Pahalawatta, Peshala V.
  • Stec, Kevin J.

Abstract

An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on availability of the algorithm/method at the decoder/post-processor and/or cost of implementation. The video stream may comprise, for example, downsampled multiplexed stereoscopic images and the selected algorithm may include any of upconversion and/or error correction techniques that contribute to a restoration of the downsampled images.

IPC Classes  ?

  • H04N 19/597 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
  • H04N 19/80 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
  • H04N 19/17 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
  • H04N 13/161 - Encoding, multiplexing or demultiplexing different image signal components
  • H04N 13/172 - Processing image signals image signals comprising non-image signal components, e.g. headers or format information
  • H04N 13/178 - Metadata, e.g. disparity information
  • H04N 13/218 - Image signal generators using stereoscopic image cameras using a single 2D image sensor using spatial multiplexing
  • H04N 19/154 - Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
  • H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
  • H04N 19/895 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder in combination with error concealment
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/86 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
  • H04N 7/01 - Conversion of standards
  • H04N 21/434 - Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams or extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/587 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal sub-sampling or interpolation, e.g. decimation or subsequent interpolation of pictures in a video sequence
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/423 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements

66.

MULTI-HALF-TONE IMAGING AND DUAL MODULATION PROJECTION/DUAL MODULATION LASER PROJECTION

      
Application Number 18466976
Status Pending
Filing Date 2023-09-14
First Publication Date 2023-12-28
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Richards, Martin J.
  • Shields, Jerome

Abstract

Smaller halftone tiles are implemented on a first modulator of a dual modulation projection system. This techniques uses multiple halftones per frame in the pre-modulator synchronized with a modified bit sequence in the primary modulator to effectively increase the number of levels provided by a given tile size in the halftone modulator. It addresses the issue of reduced contrast ratio at low light levels for small tile sizes and allows the use of smaller PSFs which reduce halo artifacts in the projected image and may be utilized in 3D projecting and viewing.

IPC Classes  ?

  • H04N 9/31 - Projection devices for colour picture display
  • G09G 3/20 - Control arrangements or circuits, of interest only in connection with visual indicators other than cathode-ray tubes for presentation of an assembly of a number of characters, e.g. a page, by composing the assembly by combination of individual elements arranged in a matrix

67.

METHODS AND SYSTEMS FOR INTERACTIVE RENDERING OF OBJECT BASED AUDIO

      
Application Number 18346464
Status Pending
Filing Date 2023-07-03
First Publication Date 2023-12-28
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • France, Robert Andrew
  • Ziegler, Thomas
  • Mehta, Sripal S.
  • Dowell, Andrew Jonathan
  • Saungsomboon, Prinyar
  • Dwyer, Michael David
  • Farahani, Farhad
  • Tsingos, Nicolas R.
  • Sanchez, Freddie

Abstract

Methods for generating an object based audio program which is renderable in a personalizable manner, e.g., to provide an immersive, perception of audio content of the program. Other embodiments include steps of delivering (e.g., broadcasting), decoding, and/or rendering such a program. Rendering of audio objects indicated by the program may provide an immersive experience. The audio content of the program may be indicative of multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects, and typically also a default set of objects which will be rendered in the absence of a selection by a user) and a bed of speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/20 - Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • G06F 3/16 - Sound input; Sound output
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic

68.

IMPROVING BASS RESPONSE FOR A SPEAKER IN A PORTABLE COMPUTING DEVICE

      
Application Number 18037650
Status Pending
Filing Date 2021-11-17
First Publication Date 2023-12-28
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Xu, Xiaojun
  • Liu, Tiezhong

Abstract

Methods and systems of improving bass response for a speaker in a portable computing device are described. One portable computing device includes first and second cover parts that are joined together to form a casing of the portable computing device, wherein a speaker volume is formed between portions of the first and second cover parts; a speaker arranged within the speaker volume; and one or more elastic spacers arranged between the first and second cover parts. The one or more elastic spacers are arranged to counteract, by their elastic recoil forces, a compression of the speaker volume when the first and second cover parts are under external compressing forces. The one or more elastic spacers are arranged between the first and second cover parts to be partially compressed by the first and second cover parts in the absence of external compressing forces on the first and second cover parts.

IPC Classes  ?

  • H04R 3/04 - Circuits for transducers for correcting frequency response

69.

METHODS AND APPARATUS FOR DECODING A COMPRESSED HOA SIGNAL

      
Application Number 18464505
Status Pending
Filing Date 2023-09-11
First Publication Date 2023-12-28
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Kordon, Sven
  • Krueger, Alexander
  • Wuebbolt, Oliver

Abstract

Methods and apparatus for decoding a compressed Higher Order Ambisonics (HOA) representation of a sound or soundfield. The method may include receiving a bit stream containing the compressed HOA representation and decoding, based on a determination that there are multiple layers, the compressed HOA representation from the bitstream to obtain a sequence of decoded HOA representations. A first subset of the sequence of decoded HOA representations is determined based only on corresponding ambient HOA components. A second subset of the sequence of decoded HOA representations is determined based on corresponding ambient HOA components and corresponding predominant sound components. For a frame k, the sequence of decoded HOA representations are represented at least in part by Methods and apparatus for decoding a compressed Higher Order Ambisonics (HOA) representation of a sound or soundfield. The method may include receiving a bit stream containing the compressed HOA representation and decoding, based on a determination that there are multiple layers, the compressed HOA representation from the bitstream to obtain a sequence of decoded HOA representations. A first subset of the sequence of decoded HOA representations is determined based only on corresponding ambient HOA components. A second subset of the sequence of decoded HOA representations is determined based on corresponding ambient HOA components and corresponding predominant sound components. For a frame k, the sequence of decoded HOA representations are represented at least in part by c ^ ~ n ( k - 1 ) = { c ^ AMB , n ( k - 1 ) for ⁢ n ⁢ in ⁢ the ⁢ first ⁢ subset c ^ n ( k - 1 ) = c ^ PS , n ( k - 1 ) + c ^ AMB , n ( k - 1 ) , for ⁢ n ⁢ in ⁢ the ⁢ second ⁢ subset where ĉAMB,n(k−1) corresponds to the corresponding ambient HOA components and ĉPS,n(k−1) corresponds to the corresponding predominant sound components.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
  • G10L 19/24 - Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

70.

METHOD AND DEVICE FOR ARITHMETIC ENCODING OR ARITHMETIC DECODING

      
Application Number 18465479
Status Pending
Filing Date 2023-09-12
First Publication Date 2023-12-28
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor Wuebbolt, Oliver

Abstract

The invention proposes a method and a device for arithmetic encoding of a current spectral coefficient using preceding spectral coefficients. Said preceding spectral coefficients are already encoded and both, said preceding and current spectral coefficients, are comprised in one or more quantized spectra resulting from quantizing time-frequency-transform of video, audio or speech signal sample values. The invention proposes a method and a device for arithmetic encoding of a current spectral coefficient using preceding spectral coefficients. Said preceding spectral coefficients are already encoded and both, said preceding and current spectral coefficients, are comprised in one or more quantized spectra resulting from quantizing time-frequency-transform of video, audio or speech signal sample values. Said method comprises processing the preceding spectral coefficients, using the processed preceding spectral coefficients for determining a context class being one of at least two different context classes, using the determined context class and a mapping from the at least two different context classes to at least two different probability density functions for determining the probability density function, and arithmetic encoding the current spectral coefficient based on the determined probability density function wherein processing the preceding spectral coefficients comprises non-uniformly quantizing absolutes of the preceding spectral coefficients for use in determining of the context class.

IPC Classes  ?

  • H03M 7/40 - Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
  • H04N 19/124 - Quantisation
  • H04N 19/182 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel

71.

DIRECTED INTERPOLATION AND DATA POST-PROCESSING

      
Application Number 18466954
Status Pending
Filing Date 2023-09-14
First Publication Date 2023-12-28
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Tourapis, Alexandros
  • Leontaris, Athanasios
  • Pahalawatta, Peshala V.
  • Stec, Kevin J.

Abstract

An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on availability of the algorithm/method at the decoder/post-processor and/or cost of implementation. The video stream may comprise, for example, downsampled multiplexed stereoscopic images and the selected algorithm may include any of upconversion and/or error correction techniques that contribute to a restoration of the downsampled images.

IPC Classes  ?

  • H04N 19/597 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
  • H04N 19/80 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
  • H04N 19/17 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
  • H04N 13/161 - Encoding, multiplexing or demultiplexing different image signal components
  • H04N 13/172 - Processing image signals image signals comprising non-image signal components, e.g. headers or format information
  • H04N 13/178 - Metadata, e.g. disparity information
  • H04N 13/218 - Image signal generators using stereoscopic image cameras using a single 2D image sensor using spatial multiplexing
  • H04N 19/154 - Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
  • H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
  • H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
  • H04N 19/895 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder in combination with error concealment
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/86 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
  • H04N 7/01 - Conversion of standards
  • H04N 21/434 - Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams or extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/587 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal sub-sampling or interpolation, e.g. decimation or subsequent interpolation of pictures in a video sequence
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/423 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements

72.

MACHINE LEARNING ASSISTED SPATIAL NOISE ESTIMATION AND SUPPRESSION

      
Application Number 18251876
Status Pending
Filing Date 2021-11-04
First Publication Date 2023-12-21
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Cartwright, Richard J.
  • Wang, Ning

Abstract

In an embodiment, a method comprises: receiving bands of power spectra of an input audio signal and a microphone covariance, and for each band: estimating, using a classifier, respective probabilities of speech and noise; estimating, using a directionality model, a set of means for speech and noise, or a set of means and covariances for speech and noise, based on the microphone covariance for the band and the probabilities; estimating, using a level model, a mean and covariance of noise power based on the probabilities and the power spectra; determining a first noise suppression gain based on the directionality model; determining a second noise suppression gain based on the level model; selecting the first or second noise suppression gain or their sum based on a signal-to-noise ratio of the input audio signal; and scaling a time-frequency representation of the input signal by the selected noise suppression gain.

IPC Classes  ?

  • G10L 21/0232 - Processing in the frequency domain
  • G10L 25/84 - Detection of presence or absence of voice signals for discriminating voice from noise
  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks

73.

METHOD AND APPARATUS FOR AUDIO PROCESSING USING A CONVOLUTIONAL NEURAL NETWORK ARCHITECTURE

      
Application Number 18032322
Status Pending
Filing Date 2021-10-19
First Publication Date 2023-12-14
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Sun, Jundai
  • Lu, Lie
  • Shuang, Zhiwei

Abstract

Systems, methods, and computer program products for audio processing based on convolutional neural network (CNN) are described. A first CNN architecture may comprise a contracting path of a U-net, a multi-scale CNN, and an expansive path of a U-net. The contracting path may comprise a first encoding layer and may be configured to generate an output representation of the contracting path. The multi-scale CNN may be configured to generate, based on the output representation of the contracting path, an intermediate representation. The multi-scale CNN may comprise at least two parallel convolution paths. The expansive path may comprise a first decoding layer and may be configured to generate a final representation based on the intermediate representation generated by the multi-scale CNN. Within a second CNN architecture, the first encoding layer may comprise a first multi-scale CNN with at least two parallel convolution paths, and the first decoding layer may comprise a second multi-scale CNN with at least two parallel convolution paths.

IPC Classes  ?

  • G06N 3/0464 - Convolutional networks [CNN, ConvNet]
  • G10L 21/00 - Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility

74.

HYBRID CLOCKING SCHEME FOR TRANSMITTING PACKETIZED AUDIO AND POWER OVER A COMMON CONDUCTOR

      
Application Number 18248522
Status Pending
Filing Date 2021-10-07
First Publication Date 2023-12-14
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Butler, Joel
  • Sommerfeld, Jeremy

Abstract

A distributed amplification and packetized audio transmission system for clock synchronization and alignment between an audio/power source and endpoints with dedicated amplifiers and speakers. An Ethernet audio signal is combined with a Power-Line Communications (PLC) signal for transmission from the source to the endpoints over a common conductor. A single master clock in the source synchronizes the Ethernet audio transmitter with the PLC transmitter. Each end-point has a PLC receiver to recover the master clock for use by its Ethernet audio receiver to provide reliable clock synchronization between the source clock and the endpoint clocks. The endpoints can adjust and re-timestamp the PTP packetized clock based upon symbol and timing information from the PLC receiver.

IPC Classes  ?

  • H04J 3/06 - Synchronising arrangements
  • H04B 3/54 - Systems for transmission via power distribution lines

75.

METHOD AND APPARATUS FOR PROCESSING OF AUDIO USING A NEURAL NETWORK

      
Application Number 18031790
Status Pending
Filing Date 2021-10-14
First Publication Date 2023-12-07
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Vinton, Mark S.
  • Zhou, Cong
  • Fejgin, Roy M.
  • Davidson, Grant A.

Abstract

Described herein is a method of processing an audio signal using a neural network or using a first and a second neural network. Described is further a method of training said neural network or of jointly training a set of said first and said second neural network. Moreover, described is a method of obtaining and transmitting a latent feature space representation of a perceptual domain audio signal using a neural network and a method of obtaining an audio signal from a latent feature space representation of a perceptual domain audio signal using a neural network. Described are also respective apparatuses and computer program products.

IPC Classes  ?

  • G10L 19/032 - Quantisation or dequantisation of spectral components
  • G10L 19/06 - Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients

76.

PROJECTION SYSTEM AND METHOD WITH FOLD MIRROR AND INTEGRATING ROD ADJUSTMENT

      
Application Number 18249860
Status Pending
Filing Date 2021-10-20
First Publication Date 2023-12-07
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Jackson, John David
  • Hennigan, Darren
  • Wainwright, Nathan Shawn

Abstract

A projection system and calibration method therefore relate to a light source configured to emit a light in response to an image data, an illumination optical system configured to steer the light, the illumination optical system including a fold mirror and an integrating rod, a digital micromirror device (DMD) including a plurality of micromirrors respectively configured to reflect the steered light to a predetermined location as on-state light or to reflect the steered light as off-state light to a light dump; determining a deviation between an actual angle of orientation and an expected angle of orientation of a respective micromirror of the plurality of micromirrors; calculating a first amount of rotational adjustment corresponding to the fold mirror and a second amount of lateral adjustment corresponding to the integrating rod, and actuating the fold minor and integrating rod according to the corresponding first and second amount.

IPC Classes  ?

  • G03B 21/00 - Projectors or projection-type viewers; Accessories therefor
  • G03B 21/14 - Projectors or projection-type viewers; Accessories therefor - Details
  • G03B 21/20 - Lamp housings

77.

GENERAL MEDIA NEURAL NETWORK PREDICTOR AND A GENERATIVE MODEL INCLUDING SUCH A PREDICTOR

      
Application Number 18248805
Status Pending
Filing Date 2021-10-12
First Publication Date 2023-12-07
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Zhou, Cong
  • Vinton, Mark S.
  • Davidson, Grant A.
  • Villemoes, Lars

Abstract

A neural network system for predicting frequency coefficients of a media signal, the neural network system comprising a time predicting portion including at least one neural network trained to predict a first set of output variables representing a specific frequency band of a current time frame given coefficients of one or several previous time frames, and a frequency predicting portion including a at least one neural network trained to predict a second set of output variables representing a specific frequency band given coefficients of one or several frequency bands adjacent to the specific frequency band in said current time frame. Such a neural network system forms a predictor capable of capturing both temporal and frequency dependencies occurring in time-frequency tiles of a media signal.

IPC Classes  ?

78.

TRIM-PASS CORRECTION FOR CLOUD-BASED CODING OF HDR VIDEO

      
Application Number 18044775
Status Pending
Filing Date 2021-09-17
First Publication Date 2023-11-30
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Kadu, Harshad
  • Su, Guan-Ming

Abstract

In a cloud-based system for encoding high dynamic range (HDR) video, each node receives a video segment and bumper frames. Each segment is subdivided into primary scenes and secondary scenes to derive scene-based forward reshaping functions that minimize the amount of reshaping-related metadata when coding the video segment. When a parent scene of a secondary scene is processed by two or more neighboring nodes, initial forward reshaping functions and trim-pass correction parameters are adjusted using reference tone-mapping functions and updated scene-based trim-pass correction parameters.

IPC Classes  ?

  • H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
  • H04N 19/179 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scene or a shot
  • H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
  • H04N 19/136 - Incoming video signal characteristics or properties

79.

BINAURAL RENDERING FOR HEADPHONES USING METADATA PROCESSING

      
Application Number 18305618
Status Pending
Filing Date 2023-04-24
First Publication Date 2023-11-30
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Tsingos, Nicolas R.
  • Wilson, Rhonda
  • Bharitkar, Sunil
  • Brown, C. Phillip
  • Seefeldt, Alan J.
  • Audfray, Remi

Abstract

Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.

IPC Classes  ?

  • G06F 3/16 - Sound input; Sound output
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04R 5/04 - Circuit arrangements
  • H04S 1/00 - Two-channel systems

80.

METHOD AND APPARTUS FOR AUDIO PROCESSING USING A NESTED CONVOLUTIONAL NEURAL NETWORK ARCHITECHTURE

      
Application Number 18032325
Status Pending
Filing Date 2021-10-19
First Publication Date 2023-11-30
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Sun, Jundai
  • Lu, Lie
  • Shuang, Zhiwei

Abstract

Systems, methods, and computer program products for audio processing based on convolutional neural network (CNN) are described. The CNN architecture may comprise a multi-scale input block and a multi-scale nested block. The multi-scale input block may be configured to receive input data and to generate a first downsampled input data set by downsampling the input data. The multi-scale nested block may comprise a first encoding layer configured to generate a first encoded data set by performing a convolution based on the input data. The multi-scale nested block may comprise a second encoding layer configured to generate a second encoded data set by performing a convolution based on the first downsampled input data set. Furthermore, the multi-scale nested block may comprise a first convolutional layer configured to generate a first output data set by upsampling the second encoded data set, concatenating the first encoded data set and the upsampled second encoded data set, and performing a convolution. The first convolutional layer may be nested between the encoding layers and decoding layers, thereby increasing the number of communication channels with the CNN and simplifying the underlying optimization problem.

IPC Classes  ?

  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
  • G06N 3/0464 - Convolutional networks [CNN, ConvNet]
  • G10L 25/84 - Detection of presence or absence of voice signals for discriminating voice from noise

81.

SYSTEM AND TOOLS FOR ENHANCED 3D AUDIO AUTHORING AND RENDERING

      
Application Number 18141538
Status Pending
Filing Date 2023-05-01
First Publication Date 2023-11-30
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Tsingos, Nicolas R.
  • Robinson, Charles Q.
  • Scharpf, Jurgen W.

Abstract

Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular reproduction environment.

IPC Classes  ?

  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
  • H04R 5/02 - Spatial or constructional arrangements of loudspeakers
  • H04S 5/00 - Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation

82.

PREDICTIVE MOTION VECTOR CODING

      
Application Number 18234310
Status Pending
Filing Date 2023-08-15
First Publication Date 2023-11-30
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Tourapis, Alexandros
  • Leontaris, Athanasios

Abstract

Overlapped block disparity estimation and compensation is described. Compensating for images with overlapped block disparity compensation (OBDC) involves determining if OBDC is enabled in a video bit stream, and determining if OBDC is enabled for one or more macroblocks that neighbor a first macroblock within the video bit stream. The neighboring macroblocks may be transform coded. If OBDC is enabled in the video bit stream and for the one or more neighboring macroblocks, predictions may be made for a region of the first macroblock that has an edge adjacent with the neighboring macroblocks. OBDC can be causally applied. Disparity compensation parameters or modes may be shared amongst views or layers. A variety of predictions may be used with causally-applied OBDC.

IPC Classes  ?

  • H04N 19/139 - Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
  • H04N 19/597 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/46 - Embedding additional information in the video signal during the compression process
  • H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
  • H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
  • H04N 19/103 - Selection of coding mode or of prediction mode
  • H04N 19/573 - Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
  • H04N 19/513 - Processing of motion vectors
  • H04N 19/583 - Motion compensation with overlapping blocks
  • H04N 19/577 - Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
  • H04N 19/182 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
  • H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
  • H04N 19/152 - Data rate or code amount at the encoder output by measuring the fullness of the transmission buffer
  • H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction

83.

ADAPTIVE BLOCK SWITCHING WITH DEEP NEURAL NETWORKS

      
Application Number 18248294
Status Pending
Filing Date 2021-10-15
First Publication Date 2023-11-30
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Zhou, Cong
  • Davidson, Grant A.
  • Vinton, Mark S.

Abstract

The present invention relates to a method for predicting transform coefficients representing frequency content of an adaptive block length media signal, by receiving a frame and receiving block length information indicating a number of quantized transform coefficients for each block in the frame, the number of quantized transform coefficients being one of a first or second number, wherein the first number is greater than the second number, determining a first block has the second number of quantized transform coefficients, converting the first block into a converted block having the first number of quantized transform coefficients, conditioning a main neural network trained to predict at least one output variable given at least one conditioning variable, the at least one conditioning variable being based on information regarding the converted block and block length information for the first block, providing at least one predicted transform coefficients from an output stage of the main neural network.

IPC Classes  ?

  • G10L 19/022 - Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
  • G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
  • G10L 19/032 - Quantisation or dequantisation of spectral components
  • G10L 19/04 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques

84.

PROJECTION SYSTEM AND METHOD WITH ADJUSTABLE ANGLE ILLUMINATION USING LENS DECENTRATION

      
Application Number 18249748
Status Pending
Filing Date 2021-10-21
First Publication Date 2023-11-30
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Jackson, John David
  • Hennigan, Darren
  • Wainwright, Nathan Shawn

Abstract

A projection system and calibration method therefor relate to a light source configured to emit a light in response to an image data, an illumination optical system configured to steer the light, the illumination optical system including a first lens group and a second lens group, a digital micromirror device (DMD) including a plurality of micromirrors respectively configured to reflect the steered light to a predetermined location as on-state light or to reflect the steered light as off-state light to a light dump; determining a deviation between an actual angle of orientation and an expected angle of orientation of a respective micromirror of the plurality of micromirrors; calculating a first amount of lateral adjustment corresponding to the first lens group and a second amount of lateral adjustment corresponding to the second lens group, and actuating the first and second lens groups according to the corresponding first and second amount.

IPC Classes  ?

  • G03B 21/14 - Projectors or projection-type viewers; Accessories therefor - Details
  • G03B 21/00 - Projectors or projection-type viewers; Accessories therefor
  • G03B 21/20 - Lamp housings
  • H04N 9/31 - Projection devices for colour picture display
  • G03B 5/02 - Lateral adjustment of lens

85.

ASYMMETRICAL HIGH-FREQUENCY WAVEGUIDE, 3-AXIS RIGGING, AND SPHERICAL ENCLOSURE FOR SURROUND SPEAKERS

      
Application Number 18302552
Status Pending
Filing Date 2023-04-18
First Publication Date 2023-11-30
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Showalter, Garth Norman
  • Di Cola, Mario
  • Gott, John Michael
  • Spurlock, Patrick Ross
  • Carney, Gregory Lynn
  • Gott, Bryce Joseph

Abstract

Embodiments are described for a high-frequency waveguide that improves the performance of large-scale surround sound and immersive audio environments. A horn waveguide is configured to be asymmetric about one of a vertical axis and horizontal axis of the waveguide to form an asymmetric horn waveguide. A spherical enclosure surrounds the asymmetric horn waveguide to form a horn speaker, and a three-axis mounting system is configured to fix the horn speaker to one of a wall or ceiling surface of the venue, wherein the mounting system facilitates rotating the horn speaker to a location that provides maximum coverage of the venue within the passband of the asymmetric horn waveguide.

IPC Classes  ?

  • H04R 1/30 - Combinations of transducers with horns, e.g. with mechanical matching means
  • H04R 1/02 - Casings; Cabinets; Mountings therein
  • H04R 27/00 - Public address systems
  • F16M 11/04 - Means for attachment of apparatus; Means allowing adjustment of the apparatus relatively to the stand

86.

DEEP-LEARNING BASED SPEECH ENHANCEMENT

      
Application Number 18250393
Status Pending
Filing Date 2021-10-29
First Publication Date 2023-11-16
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Liu, Xiaoyu
  • Horgan, Michael Getty
  • Fejgin, Roy M.
  • Holmberg, Paul

Abstract

A system for suppressing noise and enhancing speech and a related method are disclosed. The system trains a neural network model that takes banded energies corresponding to an original noisy waveform and produces a speech value indicating the amount of speech present in each band at each frame. The neural model comprises a feature extraction block that implements some lookahead. The feature extraction block is followed by an encoder with steady down-sampling along the frequency domain forming a contracting path. The encoder is followed by a corresponding decoder with steady up-sampling along the frequency domain forming an expanding path. The decoder receives scaled output feature maps from the encoder at a corresponding level. The decoder is followed by a classification block that generates a speech value indicating an amount of speech present for each frequency band of the plurality of frequency bands at each frame of the plurality of frames.

IPC Classes  ?

  • G10L 21/0232 - Processing in the frequency domain
  • G10L 19/022 - Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring

87.

ADAPTIVE LOCAL RESHAPING FOR SDR-TO-HDR UP-CONVERSION

      
Application Number 18029901
Status Pending
Filing Date 2021-10-01
First Publication Date 2023-11-16
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Huang, Tsung-Wei
  • Su, Guan-Ming
  • Gadgil, Neeraj J.

Abstract

A global index value is generated for selecting a global reshaping function for an input image of a relatively low dynamic range using luma codewords in the input image. Image filtering is applied to the input image to generate a filtered image. The filtered values of the filtered image provide a measure of local brightness levels in the input image. Local index values are generated for selecting specific local reshaping functions for the input image using the global index value and the filtered values of the filtered image. A reshaped image of a relatively high dynamic range is generated by reshaping the input image with the specific local reshaping functions selected using the local index values.

IPC Classes  ?

  • H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
  • H04N 19/80 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
  • H04N 19/169 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
  • H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
  • H04N 19/117 - Filters, e.g. for pre-processing or post-processing
  • G06T 5/00 - Image enhancement or restoration
  • G06T 5/20 - Image enhancement or restoration by the use of local operators

88.

COLOR TRANSFORMATION FOR HDR VIDEO WITH A CODING-EFFICIENCY CONSTRAINT

      
Application Number 18248309
Status Pending
Filing Date 2021-10-14
First Publication Date 2023-11-16
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor Su, Guan-Ming

Abstract

Using a standard-based RGB to YCbCr color transform a new RGB to YCC 3×3 transformation matrix and a 3×1 offset vector are derived under a set of coding-efficiency constraints. The new RGB to YCC 3×3 transform comprises a luminance scaling factor and a 2×2 chroma sub-matrix that preserves the energy of the standard-based RGB to YCbCr transform while maintaining or improving coding efficiency. It also adds support for an authorization or watermarking mechanism in streaming video applications. Examples of using the new color transform using image reshaping are also provided.

IPC Classes  ?

  • G06T 5/00 - Image enhancement or restoration
  • H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
  • G06V 10/56 - Extraction of image or video features relating to colour
  • G06V 10/60 - Extraction of image or video features relating to illumination properties, e.g. using a reflectance or lighting model

89.

Speaker

      
Application Number 29800095
Grant Number D1004573
Status In Force
Filing Date 2021-07-19
First Publication Date 2023-11-14
Grant Date 2023-11-14
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Walcott, Drew Alexander
  • Byrd, Grayson H.
  • Michaelian, Peter
  • Renz, Brian Edward
  • Stewart, John Carson
  • Proksa, Cody Michael
  • Voron, Vincent
  • Mehta, Sripal S.
  • Seefeldt, Alan J.

90.

Speaker

      
Application Number 29800094
Grant Number D1004572
Status In Force
Filing Date 2021-07-19
First Publication Date 2023-11-14
Grant Date 2023-11-14
Owner DOLBY LABORATORIES LICENSING CORPORATION (USA)
Inventor
  • Walcott, Drew Alexander
  • Byrd, Grayson H.
  • Michaelian, Peter
  • Renz, Brian Edward
  • Stewart, John Carson
  • Proksa, Cody Michael
  • Voron, Vincent
  • Mehta, Sripal S.
  • Seefeldt, Alan J.

91.

INTRA PREDICTION MODE MAPPING METHOD AND DEVICE USING THE METHOD

      
Application Number 18222821
Status Pending
Filing Date 2023-07-17
First Publication Date 2023-11-09
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor Lee, Sun Young

Abstract

The present invention relates to an intra prediction mode mapping method and a device using the method. The intra prediction mode includes: decoding flag information providing information regarding whether an intra prediction mode of a plurality of candidate intra prediction modes for the current block is the same as the intra prediction mode for the current block, and decoding a syntax component including information regarding the intra prediction mode for the current block in order to induce the intra prediction mode for the current block if the intra prediction mode from among the plurality of candidate intra prediction modes for the current block is not the same as the intra prediction mode for the current block. Thus, it is possible to increase the efficiency with which are images are decoded.

IPC Classes  ?

  • H04N 19/11 - Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
  • H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
  • H04N 19/463 - Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
  • H04N 19/157 - Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
  • H04N 19/196 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters
  • H04N 19/147 - Data rate or code amount at the encoder output according to rate distortion criteria
  • H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
  • H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
  • H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
  • H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction

92.

MEDIA-COMPENSATED PASS-THROUGH AND MODE-SWITCHING

      
Application Number 18351357
Status Pending
Filing Date 2023-07-12
First Publication Date 2023-11-09
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Alexander, Mark
  • Li, Chunjian
  • Lando, Joshua Brandon
  • Seefeldt, Alan J.
  • Brown, C. Phillip
  • Breebaart, Dirk Jeroen

Abstract

Media input audio data corresponding to a media stream and microphone input audio data from at least one microphone may be received. A first level of at least one of a plurality of frequency bands of the media input audio data, as well as a second level of at least one of a plurality of frequency bands of the microphone input audio data, may be determined. Media output audio data and microphone output audio data may be produced by adjusting levels of one or more of the first and second plurality of frequency bands based on the perceived loudness of the microphone input audio data, of the microphone output audio data, of the media output audio data and the media input audio data. One or more processes may be modified upon receipt of a mode-switching indication.

IPC Classes  ?

  • H04B 15/00 - Suppression or limitation of noise or interference
  • G06F 3/16 - Sound input; Sound output
  • H04R 1/10 - Earpieces; Attachments therefor
  • H04R 3/04 - Circuits for transducers for correcting frequency response
  • H04R 29/00 - Monitoring arrangements; Testing arrangements

93.

METHOD AND DEVICE FOR PROCESSING A BINAURAL RECORDING

      
Application Number 18026281
Status Pending
Filing Date 2021-09-15
First Publication Date 2023-11-09
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Ireland)
Inventor
  • Shuang, Zhiwei
  • Ma, Yuanxing
  • Liu, Yang
  • Yang, Ziyu
  • Cengarle, Giulio

Abstract

The present invention relates to a method and device for processing a first and a second audio signal representing an input binaural audio signal acquired by a binaural recording device. The present invention further relates to a method for rendering a binaural audio signal on a speaker system. The method for processing a binaural signal comprising extracting audio information from the first audio signal, computing a band gain for reducing noise in the first audio signal and applying the band gains to respective frequency bands of the first audio signal in accordance with a dynamic scaling factor, to provide a first output audio signal. Wherein the dynamic scaling factor has a value between zero and one and is selected so as to reduce quality degradation for the first audio signal.

IPC Classes  ?

94.

AUDIO DECODER AND DECODING METHOD

      
Application Number 18351769
Status Pending
Filing Date 2023-07-13
First Publication Date 2023-11-09
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • Dolby International AB (Ireland)
Inventor
  • Breebaart, Dirk Jeroen
  • Cooper, David Matthew
  • Samuelsson, Leif Jonas

Abstract

A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.

IPC Classes  ?

  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

95.

Scalable systems for controlling color management comprising varying levels of metadata

      
Application Number 18219344
Grant Number 11917171
Status In Force
Filing Date 2023-07-07
First Publication Date 2023-11-02
Grant Date 2024-02-27
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Messmer, Neil W.
  • Atkins, Robin
  • Margerm, Steve
  • Longhurst, Peter W.

Abstract

Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.

IPC Classes  ?

  • H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
  • G06T 9/00 - Image coding
  • G09G 5/02 - Control arrangements or circuits for visual indicators common to cathode-ray tube indicators and other visual indicators characterised by the way in which colour is displayed
  • H04N 1/60 - Colour correction or control
  • H04N 9/69 - Circuits for processing colour signals for controlling the amplitude of colour signals, e.g. automatic chroma control circuits for modifying the colour signals by gamma correction
  • H04N 21/235 - Processing of additional data, e.g. scrambling of additional data or processing content descriptors
  • H04N 21/4402 - Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
  • G06F 3/14 - Digital output to display device

96.

METHOD, APPARATUS OR SYSTEMS FOR PROCESSING AUDIO OBJECTS

      
Application Number 18349704
Status Pending
Filing Date 2023-07-10
First Publication Date 2023-11-02
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Breebaart, Dirk Jeroen
  • Lu, Lie
  • Tsingos, Nicolas R.
  • Mateos Sole, Antonio

Abstract

Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.

IPC Classes  ?

  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
  • G10L 19/20 - Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  • G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
  • G10L 19/018 - Audio watermarking, i.e. embedding inaudible data in the audio signal

97.

System and method for non-destructively normalizing loudness of audio signals within portable devices

      
Application Number 18303919
Grant Number 11948592
Status In Force
Filing Date 2023-04-20
First Publication Date 2023-10-26
Grant Date 2024-04-02
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Riedmiller, Jeffrey
  • Mundt, Harald
  • Schug, Michael
  • Wolters, Martin

Abstract

Many portable playback devices cannot decode and playback encoded audio content having wide bandwidth and wide dynamic range with consistent loudness and intelligibility unless the encoded audio content has been prepared specially for these devices. This problem can be overcome by including with the encoded content some metadata that specifies a suitable dynamic range compression profile by either absolute values or differential values relative to another known compression profile. A playback device may also adaptively apply gain and limiting to the playback audio. Implementations in encoders, in transcoders and in decoders are disclosed.

IPC Classes  ?

  • G10L 19/22 - Mode decision, i.e. based on audio signal content versus external parameters
  • G10L 19/02 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
  • G10L 19/16 - Vocoder architecture
  • G10L 19/26 - Pre-filtering or post-filtering
  • H03G 3/30 - Automatic control in amplifiers having semiconductor devices
  • H03G 3/32 - Automatic control in amplifiers having semiconductor devices the control being dependent upon ambient noise level or sound level
  • H03G 7/00 - Volume compression or expansion in amplifiers

98.

RECURSIVE SEGMENT TO SCENE SEGMENTATION FOR CLOUD-BASED CODING OF HDR VIDEO

      
Application Number 18044771
Status Pending
Filing Date 2021-09-17
First Publication Date 2023-10-26
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Kadu, Harshad
  • Su, Guan-Ming
  • Gadgil, Neeraj J.
  • Huang, Tsung-Wei

Abstract

In a cloud-based system for encoding high dynamic range (HDR) video, each node receives a video segment and bumper frames. Each segment is subdivided into primary scenes and secondary scenes to derive scene-based forward reshaping functions that minimize the amount of reshaping-related metadata when coding the video segment, while maintaining temporal continuity among scenes processed by multiple nodes. Methods to generate scene-based forward and backward reshaping functions to optimize video coding and improve the coding efficiency of reshaping-related metadata are also examined.

IPC Classes  ?

  • G06V 20/40 - Scenes; Scene-specific elements in video content
  • G06T 5/40 - Image enhancement or restoration by the use of histogram techniques
  • H04N 19/142 - Detection of scene cut or scene change
  • H04N 19/192 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding the adaptation method, adaptation tool or adaptation type being iterative or recursive
  • H04N 19/98 - Adaptive-dynamic-range coding [ADRC]

99.

QUANTIZATION AND ENTROPY CODING OF PARAMETERS FOR A LOW LATENCY AUDIO CODEC

      
Application Number 18008445
Status Pending
Filing Date 2021-06-10
First Publication Date 2023-10-26
Owner Dolby Laboratories Licensing Corporation (USA)
Inventor
  • Mcgrath, David S.
  • Tyagi, Rishabh
  • Brown, Stefanie
  • Torres, Juan Felix

Abstract

Described is a method of frame-wise encoding metadata for an input signal, the metadata comprising a plurality of at least partially interrelated parameters calculable from the input signal. The method comprises, for each frame: iteratively performing, by using a looping process, steps of: determining a processing strategy from a plurality of processing strategies for calculating and quantizing the parameters; calculating and quantizing the parameters based on the determined processing strategy to obtain quantized parameters; and encoding the quantized parameters. In particular, each of the plurality of processing strategies comprises a respective first indication indicative of an ordering related to the calculation and quantization of individual parameters; and the processing strategy is determined based on at least one bitrate threshold.

IPC Classes  ?

  • G10L 19/032 - Quantisation or dequantisation of spectral components
  • G10L 19/008 - Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

100.

Binaural dialogue enhancement

      
Application Number 18309099
Grant Number 11950078
Status In Force
Filing Date 2023-04-28
First Publication Date 2023-10-26
Grant Date 2024-04-02
Owner
  • Dolby Laboratories Licensing Corporation (USA)
  • DOLBY INTERNATIONAL AB (Ireland)
Inventor
  • Samuelsson, Leif Jonas
  • Breebaart, Dirk Jeroen
  • Cooper, David Matthew
  • Koppens, Jeroen

Abstract

Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.

IPC Classes  ?

  • H04S 1/00 - Two-channel systems
  • H04R 5/04 - Circuit arrangements
  • H04S 3/00 - Systems employing more than two channels, e.g. quadraphonic
  • H04S 3/02 - Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
  • H04S 7/00 - Indicating arrangements; Control arrangements, e.g. balance control
  1     2     3     ...     24        Next Page