Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a video unit of a video and a bitstream of the video unit, a quantization approach of a latent sample based on whether the latent sample and a neighbor quantized latent sample is in a same region; obtaining, using a neural network, a quantized latent sample comprising a quantized luma latent sample and a quantized chroma latent sample by applying the quantization approach to the latent sample; and performing the conversion based on the quantized latent sample and one of: a synthesis transform network or an analysis transform network.
Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing comprises: determining, for a conversion between a current visual unit of visual data and a bitstream of the visual data, precision information indicating at least one precise level for a plurality of modules, at least one of the plurality of modules being based on a neural network model; and performing the conversion by applying the plurality of modules to the current visual unit based on the precision information.
Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: determining, for a conversion between visual data and a bitstream of the visual data, a target weight for use by a target module in a coding system based on information associated with the visual data, the coding system being implemented with at least one neural network; and performing the conversion by using the coding system based on the target weight.
Embodiments of the present disclosure provide a solution for point cloud coding. A method for point cloud coding is proposed. The method comprises: determining, during a conversion between a current frame of a point cloud sequence and a bitstream of the point cloud sequence, a plurality of azimuthal angle metric values of a plurality of key points associated with a node of the current frame, the node representing a spatial partition of the current frame; determining a prediction of azimuthal angle metric value of the node; determining a context of entropy coding for a plane position of at least one axis associated with the node based on the plurality of azimuthal angle metric values and the prediction of azimuthal angle metric value; and performing the conversion based on the context of entropy coding.
A mechanism for processing media data is disclosed. The mechanism includes changing a quantization step size of a region-adaptive hierarchical transform (RAHT) coefficient in a point cloud video unit. A conversion is performed between the media data and a media data file based on the quantization step size.
H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
H04N 19/54 - Motion estimation other than block-based using feature points or meshes
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: performing, using a neural network (NN) model and during a conversion between a current block of a video and a bitstream of the video, at least one coding operation on the current block based at least in part on first information associated with one or multiple previously coded frames of the video, wherein the at least one coding operation comprises at least one of the following: a filtering operation, a super-resolution operation, an inter prediction operation, a virtual reference frame generation operation; and performing the conversion based on a result of the at least one coding operation.
H04N 19/50 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
H04N 19/59 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
7.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, a reference template for the current video block based on a current template of the current video block and motion information of at least one subblock of the current video block; and performing the conversion based on the reference template.
Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: performing a conversion between visual data and a bitstream of the visual data with a neural network (NN) -based model, wherein the bitstream comprises a first indication indicating whether a set of values for a set of parameters for the NN-based model is common to processing of a plurality of components of the visual data.
H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
9.
METHOD, APPARATUS, AND MEDIUM FOR VISUAL DATA PROCESSING
Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: determining, for a conversion between visual data and a bitstream of the visual data, whether to enable a first module implemented with a first neural network in a coding system, the coding system being implemented with at least one neural network; and performing the conversion by using the coding system based on the determining.
H04N 19/10 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
H04N 19/90 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups , e.g. fractals
10.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining, for a conversion between a current video block of a video and a bitstream of the video, an intra mode for the current video block, the intra mode being determined based on at least one of the following: information associated with neighboring samples of the current video block, or a prediction associated with the current video block; applying a transform process on the current video block based on the intra mode; and performing the conversion based on the applying.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining, for a conversion between a current video block of a video and a bitstream of the video, a set of motion vectors for the current video block, the current video block being coded with a subblock-based coding tool; applying a decoder side motion vector refinement (DMVR) process on the set of motion vectors; and performing the conversion based on the applying.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, a metric for intra prediction of the current video block; determining an intra prediction of a sample at a first position in the current video block based on the metric and the first position; and performing the conversion based on the intra prediction.
H04N 19/11 - Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/196 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/577 - Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/52 - Processing of motion vectors by encoding by predictive encoding
13.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, during a conversion between a target block of a video and a bitstream of the target block, a target motion vector from a set of candidate motion vectors based on information of a neighbor block associated with the target block; performing a motion estimation of a filtering process based on the target motion vector; and performing the conversion according to the motion estimation.
H04N 19/86 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
H04N 19/51 - Motion estimation or motion compensation
14.
METHOD, APPARATUS, AND MEDIUM FOR POINT CLOUD CODING
Embodiments of the present disclosure provide a solution for point cloud coding. A method for point cloud coding is proposed. The method comprises: obtaining, for a conversion between a current point cloud (PC) sample of a point cloud sequence and a bitstream of the point cloud sequence, a first indication indicating whether a multi-reference inter prediction in which a plurality of reference PC samples are used is enabled for the point cloud sequence; and performing the conversion based on the first indication.
H04N 19/597 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
15.
SWITCHABLE INPUT SOURCES BASED EXTENDED TAPS FOR ADAPTIVE LOOP FILTER IN VIDEO CODING
A mechanism for processing video data is disclosed. The mechanism includes determining to apply an extended tap in an adaptive loop filter (ALF). The ALF may also include a spatial tap, and the extended tap may be different from the spatial tap. A conversion is performed between a visual media data and a bitstream based on the ALF.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining, for a conversion between a current video block of a video and a bitstream of the video, a plurality of predictions for the current video block, the plurality of predictions being determined based on a plurality of different prediction schemes; generating a target prediction for the current video block by fusing the plurality of predictions; and performing the conversion based on the target prediction.
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
17.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining, for a conversion between a current video block of a video and a bitstream of the video, a plurality of affine candidates for the current video block; applying a pruning check on the plurality of affine candidates in accordance with a checking procedure, the checking procedure being common to a plurality of different types of affine candidates; and performing the conversion based on the applying.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: dividing, for a conversion between a video unit of a video and a bitstream of the video, the video unit into a plurality of sub-partitions using a predefined approach, wherein the video unit is coded with an intra block copy (IBC) -geometric partition mode (GPM); obtaining a prediction of at least one sub-partition of the video unit using intra block copy; and performing the conversion based on the prediction of the at least one sub-partition of the video unit.
A mechanism for processing video data is disclosed. The mechanism includes determining to apply neural network (NN) based super resolution. A chroma format of an input is changed due to different down-sampling ratios of color components. A conversion is performed between a visual media data and a bitstream based on the chroma format.
H04N 19/59 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
H04N 19/82 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
20.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining a neural network (NN) model for processing a video, the NN model comprising at least one basic block, wherein a basic block comprises: a plurality of branches for parallel processing an input of the basic block, a branch comprising at least one convolutional layer and at least one activation layer, and a plurality of layers for serial processing a combination of outputs of the plurality of branch, the plurality of layers comprising at least one convolutional layer and at least one activation layer; and performing, according to the NN model, a conversion between a current video block of the video and a bitstream of the video.
G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
G06V 20/40 - Scenes; Scene-specific elements in video content
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
H04N 19/82 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
21.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a video unit of a video and a bitstream of the video unit, whether to apply a combination of intra block copy (IBC) and intra prediction (CIBCIP) to the video unit based on at least one of: coding information of the video unit, a color format, a color component or a syntax element; in accordance with a determination that the CIBCIP is applied to the video unit, deriving a prediction of the video unit by combining an IBC prediction signal and an intra prediction signal; and performing the conversion based on the prediction of the video unit.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
22.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: deriving, for a conversion between a video unit of a video and a bitstream of the video, a prediction sample of the video unit; deriving a refined prediction sample of the video unit by applying a refinement process to the prediction sample; and performing the conversion based on the refined prediction sample.
A mechanism for processing video data is disclosed. The mechanism includes determining to apply an adaptive looper filter (ALF) with an extended tap to a picture in a video. An intermediate filtering result of a second filter is used as input for the extended tap. A conversion is performed between a visual media data and a bitstream based on the ALF.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a video unit of a video and a bitstream of the video unit, whether to apply at least one neural network (NN) model for NN-filtering during a process of the video unit; processing the video unit by applying the process to the video unit based on the determining; and performing the conversion based on the processed video unit.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a video unit of a video and a bitstream of the video unit, whether to apply at least one neural network (NN) filter model or determine a rate distortion cost during a rate distortion optimization (RDO) process of the video unit based on at least one of: a distortion without NN filter model, a distortion with n-th NN filter model, a combination of distortions of a plurality of NN filter models, or coding statistics of the video unit, and wherein n is an integer number; determining a coding mode of the video unit based on a rate distortion optimization (RDO) criterion in the RDO process; and performing the conversion based on the coding mode.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: performing a conversion between a current video unit of a video and a bitstream of the video, wherein the bitstream comprises at least one set of syntax elements for activating at least one neural-network post-filter (NNPF) for the current video unit, and the current video unit is a portion of a current picture of the video.
Embodiments of the present disclosure provide a method for point cloud coding. The method comprises: determining, for a conversion between a point cloud sequence comprising a current point cloud (PC) sample associated with a transform block and a bitstream of the point cloud sequence, a transform result of an attribute residual between a neighbor attribute of at least one subblock of the transform block and a predicted attribute of the at least one subblock of the transform block, the neighbor attribute being predicted based on an attribute of at least one neighbor block of the transform block; and performing the conversion at least based on the transform result of the attribute residual.
Embodiments of the present disclosure provide a method for point cloud coding. The method comprises: determining, for a conversion between a point cloud sequence comprising at least one point cloud (PC) sample associated with a plurality of nodes and a bitstream of the point cloud sequence, a node index of a first node of the plurality of nodes, wherein the first node is stored in a data structure and is indicated in the data structure by the node index; and performing the conversion based on the position indication and the occupancy information.
H04N 19/59 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, a plurality of motion vector prediction (MVP) candidates of the current video block; determining a candidate list of the current video block by applying a plurality of pruning processes to the plurality of MVP candidates; and performing the conversion based on the candidate list.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
30.
ON COEFFICIENT VALUE PREDICTION AND COST DEFINITION
A mechanism for processing video data is disclosed. The mechanism determines to predict a value of a residual coefficient based on a cost. A conversion is performed between a visual media data and the media data file based on the residual coefficient. The coefficient may be used in transform coding or transform-skip coding.
H04N 19/18 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
H04N 19/192 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding the adaptation method, adaptation tool or adaptation type being iterative or recursive
H04N 19/103 - Selection of coding mode or of prediction mode
H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
31.
ENHANCED SIGNALLING OF PRESELECTION IN A MEDIA FILE
A mechanism for processing video data is disclosed. The mechanism determines information from a Preselection Group Box. A segment_order field is omitted from the Preselection Group Box and sub-boxes thereof. A conversion is performed between a visual media data and the media data file based on the Preselection Group Box.
H04N 19/136 - Incoming video signal characteristics or properties
H04N 19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
Embodiments of the present disclosure provide a method for point cloud coding. The method comprises: determining, for a conversion between a point cloud sequence comprising a current point cloud (PC) sample associated with a transform block and a bitstream of the point cloud sequence, the transform block comprising a predetermined number of subblocks, each subblock containing at least one point of the point cloud sequence; and performing the conversion by skipping at least one of prediction or transform of the transform block.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video unit of a video and a bitstream of the video, a target value of a parameter for ordering a plurality of motion candidates for the current video unit from a set of candidate values of the parameter, wherein the parameter is associated with cost differences related to the plurality of motion candidates, and the current video unit is a portion of a slice of the video; and performing the conversion based on the target value.
H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
34.
ENHANCED SIGNALLING OF EXTENDED DEPENDENT RANDOM ACCESS SAMPLE POINT SAMPLES IN A MEDIA FILE
A mechanism for processing media data is disclosed. In one aspect, a method includes determining that a track identifier (ID) identifying a reference track of the media data is included in a track reference type box (TrackReferenceTypeBox) with a reference type (reference_type) equal to associated external stream track ('aest'). The method also includes performing a conversion between the media data and a media data file based on the track identifier.
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
H04N 19/10 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
H04N 19/597 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
H04N 21/234 - Processing of video elementary streams, e.g. splicing of video streams or manipulating MPEG-4 scene graphs
35.
ENHANCED SIGNALLING OF LOST OR CORRUPTED SAMPLES IN A MEDIA FILE
A mechanism for processing video data is disclosed. The mechanism includes determining one or more indications for a media unit. A first indication indicates that one or more parameter-set-like network abstraction layer (NAL) units in associated data that are needed for decoding a bitstream are corrupted. A conversion is performed between a visual media data and a bitstream based on the first indication.
H04N 19/89 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder
A mechanism for processing video data is disclosed. The mechanism includes determining a referencing track contains a 'supm' track reference, which indicates video in the referencing track is usable as a supplementary video, and that a corresponding main video can be contained in a referenced track. A conversion is performed between a visual media data and a bitstream based on the 'supm' track reference.
H04N 19/46 - Embedding additional information in the video signal during the compression process
H04N 21/431 - Generation of visual interfaces; Content or additional data rendering
H04N 21/234 - Processing of video elementary streams, e.g. splicing of video streams or manipulating MPEG-4 scene graphs
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: performing a conversion between a current video unit of a video and a bitstream of the video, wherein the bitstream comprises information of a downsampling filter for obtaining the current video unit.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: performing a conversion between a current video unit of a video and a bitstream of the video, wherein the bitstream comprises at least one set of syntax elements for activating at least one neural-network post-filter (NNPF) for the current video unit, and the current video unit is a portion of a picture of the video.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: applying, for a conversion between a current video unit of a video and a bitstream of the video, a neural network filter to the current video unit at least based on auxiliary information associated with the current video unit, the auxiliary information including at least one of: prediction information of the current video unit, partitioning information of the current video unit, or coding information of a previously coded video unit; and performing the conversion based on the applying.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: selecting, for a conversion between a current video block of a video and a bitstream of the video, a set of samples in a first color component of the current video block based on information regarding applying a geometric transformation on the current video block; determining a prediction of at least one sample in a second color component of the current video block based on the set of samples and a gradient linear model (GLM), the second color component being different from the first color component; and performing the conversion based on the prediction.
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
41.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: performing a conversion between a current video unit of a video and a bitstream of the video, wherein the bitstream comprises a first indication being allowed to activate a target neural -network post- processing filter (NNPF), the target NNPF being applied to a plurality of video units of the video.
H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
H04N 19/86 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
H04N 19/117 - Filters, e.g. for pre-processing or post-processing
H04N 19/14 - Coding unit complexity, e.g. amount of activity or edge presence estimation
G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
Embodiments of the present disclosure provide a solution for point cloud coding. A method for point cloud coding is proposed. The method comprises: determining, for a conversion between a current point cloud (PC) sample of a point cloud sequence and a bitstream of the point cloud sequence, a prediction of a first coefficient for attribute information of a current node in the current PC sample based on a second coefficient for attribute information of a reference node in a reference PC sample associated with the current PC sample; and performing the conversion based on the prediction.
H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
H04N 19/597 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
43.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining, for a conversion between a video unit of a video and a bitstream of the video unit, a prediction of at least one object of the video unit by using an intra block copy (IBC), wherein the video unit is coded with an IBC-geometric partition mode (GPM) where the video unit is geometrically divided into a plurality of sub-partitions or the video unit is predicated by a plurality of hypothesizes; and performing the conversion based on the prediction of the at least one object of the video unit.
H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
A mechanism for processing video data is disclosed. The mechanism includes determining parameters for a sample adaptive offset (SAO) filter for a video unit, where the parameters are selected from an explicitly signaled set, or a predefined set, or an adaptive set, and where the adaptive set is derived based on parameters from one or more video units coded before the video unit. A conversion is performed between a visual media data and a bitstream based on the SAO filter.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, motion information and location information of at least one subblock of a temporal block in a collocated frame of the current video block; determining an affine candidate of the current video block by applying a regression process to the current video block based on the motion information and the location information of the at least one subblock; and performing the conversion based on the affine candidate.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
47.
USING DIFFERENT SOURCES FOR HADAMARD DOMAIN FILTER IN VIDEO CODING
A mechanism for processing video data is disclosed. The mechanism includes determining to apply a Hadamard Domain Filter (HDF) process to samples of a picture, wherein the HDF process applies an average sum, a weighted sum, a linear sum, a non-linear sum, a Wiener function, or combinations thereof. A conversion is performed between a visual media data and a bitstream based on the HDF.
H04N 19/82 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
H04N 19/80 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
48.
USING DIFFERENT SOURCES FOR BILATERAL FILTER IN VIDEO CODING
A mechanism for processing video data is disclosed. The mechanism includes determining to apply a bilateral filter (BF) process to samples of a picture, wherein the BF process applies an average sum, a weighted sum, a linear sum, a non-linear sum, a Wiener function, or combinations thereof. A conversion is performed between a visual media data and a bitstream based on the BF process.
H04N 19/82 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
49.
PRESENCE AND RELATIVE DECODING ORDER OF NEURAL-NETWORK POST-FILTER SEI MESSAGES
A mechanism for processing video data is disclosed. The mechanism includes performing a conversion between a visual media data and a bitstream based on a rule. The rule specifies that a neural-network post-filter activation (NNPFA) Supplemental Enhancement Information (SEI) message with a first particular value of an NNPFA identifier is only present in a current Picture Unit (PU) when one or both of the following conditions are met. First, a current Coded Layer Video Sequence (CLVS) contains a neural-network post-filter characteristics (NNPFC) SEI message with a NNPFC identifier (nnpfc_id) equal to the first particular value of the NNPFA identifier in a preceding PU that precedes the current PU in decoding order. Second, an NNPFC SEI message with nnpfc_id equal to the first particular value of the NNPFA identifier is contained in the current PU.
H04N 19/157 - Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
H04N 19/102 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
50.
SIGNALING OF NEURAL-NETWORK POST-FILTER PURPOSES AND OUTPUT CHROMA FORMAT
A mechanism for processing video data is disclosed. The mechanism includes determining a value of a neural-network post-filter characteristics (NNPFC) output chroma component block flag (nnpfc_out_sub_c_flag) in a NNPFC Supplemental Enhancement Information (SEI) message, wherein when a chroma format identification code (ChromaFormatIdc) is equal to two, which indicates that a colour format is a 4:2:2 format, and a NNPFC output chroma component block flag (nnpfc_out_sub_c_flag) is present, the value of nnpfc_out_sub_c_flag shall be equal to one. A conversion is performed between a visual media data and a bitstream based on the nnpfc_out_sub_c_flag.
A method, apparatus, and system for processing media data are disclosed. An example method for processing media data includes obtaining a first parameter or a second parameter used to determine a ratio of a neural-network post-filter characteristics (NNPFC) picture relative to a cropped width or a cropped width, where a value of the ratio is constrained to a range with an endpoint, and where the endpoint of the range is based on a value of 16; and performing a conversion between a visual media data and a bitstream based on the ratio.
H04N 19/187 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
H04N 19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
H04N 19/635 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using sub-band based transform, e.g. wavelets - characterised by filter definition or implementation details
A method for processing media data is disclosed. In an embodiment, the method includes determining a value of a neural-network post-filter characteristics (NNPFC) purpose (nnpfc_purpose) in a NNPFC supplemental enhancement information (SEI) message, wherein the nnpfc_purpose is configured to be set to include output picture size downsampling, and wherein at least one of (1) an output picture width is not equal to an input picture width and (2) an output picture height is not equal to an input picture height; and performing a conversion between a visual media data and a bitstream based on the nnpfc_purpose.
H04N 19/187 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
H04N 19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
H04N 19/635 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using sub-band based transform, e.g. wavelets - characterised by filter definition or implementation details
A mechanism for processing video data is disclosed. The mechanism includes determining a value of a neural-network post-filter activation (NNPFA) identifier (nnpfa_id) in a NNPFA supplemental enhancement information (SEI) message. The value of the nnpfa_id is constrained to be in a range of 0 to 232 − 2, inclusive. A conversion is performed between a visual media data and a bitstream based on the nnpfa_id.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a video unit of a video and a bitstream of the video unit, a Local illumination compensation (LIC) type of the video unit based on at least one neighbouring reconstructed sample of the video unit; and performing the conversion based on the determined LIC type.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a video unit of a video and a bitstream of the video unit, whether the video unit is non-intra coded or intra coded; in accordance with a determination that the video unit is non-intra coded, derive one or more intra prediction modes (IPMs) for the video unit; in accordance with a determination that the video unit is intra coded, obtain a prediction of the video unit without applying an IPM; and performing the conversion based on the one or more IPMs or the obtained.
H04N 19/157 - Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
56.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, whether a candidate frame associated with the current video block is a co-located frame based on temporal information of a current frame comprising the current video block, the co-located frame being co-located with the current frame; and performing the conversion based on the determining.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, a first intra prediction sample of the current video block; determining an adjustment value associated with a further video block in the video, the further video block being coded before the current video block; and performing the conversion based on the first intra prediction sample and the adjustment value.
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/56 - Motion estimation with initialisation of the vector search, e.g. estimating a good candidate to initiate a search
H04N 19/57 - Motion estimation characterised by a search window with variable size or shape
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
58.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining, for a conversion between a current video block of a video and a bitstream of the video, information regarding applying a geometric transformation on the current video block; selecting, based on the information, a set of samples in a first color component of the current video block, the set of samples being used for coding at least one sample in a second color component of the current video block, the second color component being different from the first color component; and performing the conversion based on the set of samples.
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/82 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
59.
USING NON-ADJACENT SAMPLES FOR ADAPTIVE LOOP FILTER IN VIDEO CODING
A method of processing video data, comprising obtaining a non-adjacent reconstruction sample of a picture of a video, inputting the non-adjacent reconstruction sample of the picture as input for an adaptive loop filter (ALF), and performing a conversion between the video and a bitstream of the video in response to inputting the non-adjacent reconstruction sample of the current picture as the input for the ALF.
H04N 19/82 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
60.
ENHANCED SIGNALLING OF SEI PROCESSING ORDER IN VIDEO BITSTREAMS
A mechanism for processing video data is disclosed. A conversion is performed between a visual media data and a bitstream based on a rule that includes a bitstream conformance requirement which requires that a supplemental enhancement information (SEI) processing order SEI message shall contain a SEI payload type syntax element with at least a first entry and a second entry.
H04N 7/12 - Systems in which the television signal is transmitted via one channel or a plurality of parallel channels, the bandwidth of each channel being less than the bandwidth of the television signal
H04N 11/02 - Colour television systems with bandwidth reduction
61.
METHOD, APPARATUS, AND MEDIUM FOR VISUAL DATA PROCESSING
Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: determining, for a conversion between at least one bitstream of visual data and the visual data, a residual representation of the visual data at least based on a first probability distribution parameter of the visual data and a gain parameter, the residual representation representing a residual value compared to a second probability distribution representation of the visual data, the gain parameter adjusting a value range of the residual representation; and performing the conversion based on the residual representation.
H04N 19/59 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
A method of processing video data. The method includes generating a sequence of samples (sampleSeq) comprising a closest preceding stream access point (SAP) of type 1, type 2, or type 3, a first dependent random access point (DRAP) sample (sampleA), and all samples following the first DRAP sample in both decoding order and output order in a track including at least one second DRAP sample (sampleB); ensuring, for each of the second DRAP samples in the sequence of samples, that all data for processing the second DRAP sample is accessible in a referenced sample entry, in the second DRAP sample itself, or in one of the samples preceding the second DRAP sample in decoding order and present in the sequence of samples; and performing a conversion between a video comprising the video data and a bitstream of the video data based on the data.
H04N 19/46 - Embedding additional information in the video signal during the compression process
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/107 - Selection of coding mode or of prediction mode between spatial and temporal predictive coding, e.g. picture refresh
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
H04N 21/84 - Generation or processing of descriptive data, e.g. content descriptors
H04N 7/24 - Systems for the transmission of television signals using pulse code modulation
H04N 21/845 - Structuring of content, e.g. decomposing content into time segments
63.
NEURAL NETWORK-BASED ADAPTIVE IMAGE AND VIDEO COMPRESSION METHOD
An image decoding method including transforming an input image into latent samples using an analysis transform; quantizing the latent samples y using a hyper encoder to generate quantized hyper latent samples; encoding the quantized hyper latent samples into a bitstream using entropy encoding; applying a latent sample prediction process to obtain quantized latent samples and quantized residual latent samples based on the latent samples y using the quantized hyper latent samples; obtaining prediction samples following the latent sample prediction process; and entropy encoding the quantized hyper latent samples and the quantized residual latent samples into the bitstream.
H04N 19/12 - Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
G06V 10/422 - Global feature extraction by analysis of the whole pattern, e.g. using frequency domain transformations or autocorrelation for representing the structure of the pattern or shape of an object therefor
H04N 19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
A mechanism for processing video data is in a neural network disclosed. The mechanism includes obtaining quantized residual latent samples. The quantized residual latent samples are processed to obtain processed quantized residual latent samples. A reconstructed latent sample can then be acquired based on the processed quantized residual latent sample.
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
H04N 19/103 - Selection of coding mode or of prediction mode
65.
METHOD, APPARATUS, AND MEDIUM FOR VISUAL DATA PROCESSING
Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: obtaining, for a conversion between visual data and a bitstream of the visual data, region information indicating positions and sizes of a plurality of regions in a quantized latent representation of the visual data; selecting, based on the region information, a set of target neighboring samples from a plurality of candidate neighboring samples of a current sample in the quantized latent representation, the set of target neighboring samples being in the same region as the current sample; determining statistical information of the current sample based on the set of target neighboring samples; and performing the conversion based on the statistical information.
H04N 19/90 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups , e.g. fractals
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, at least one context model associated with the current video block based on initiation information of the at least one context model, the initiation information of the at least one context model being included in the bitstream; and performing the conversion based on the at least one context model.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: applying, for a conversion between a video unit of a video and a bitstream of the video unit, a combination of intra block copy (IBC) and intra prediction (CIBCIP) to the video unit; deriving a prediction of the video unit by combining an IBC predicted signal and an intra predicted signal; and performing the conversion based on the prediction of the video unit.
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
68.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, a sign prediction of a block vector difference (BVD) of the current video block; determining the BVD at least based on the sign prediction of the BVD; and performing the conversion based on the BVD.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: generating, for a conversion between a video unit of a video and a bitstream of the video, an intra mode for the video unit based on coding information associated with the video unit, wherein the video unit is an intra template matching (TM) coded block or the video unit is an intra copy block (IBC) coded block; and performing the conversion based on the generated intra mode.
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/117 - Filters, e.g. for pre-processing or post-processing
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
70.
NEURAL NETWORK-BASED IMAGE AND VIDEO COMPRESSION METHOD WITH PARALLEL PROCESSING
An image decoding method including obtaining reconstructed latents ŷ[:,:,:] using an arithmetic decoder; feeding the reconstructed latents into a synthesis neural network; tile partitioning output feature maps into multiple parts based on decoded parameters at one or multiple locations; separately feeding each of the multiple parts into a next stage of a plurality of convolutional layers to obtain spatially partitioned feature maps at an output; and cropping and stitching the spatially partitioned feature maps back to a whole feature map spatially until an image is reconstructed.
H04N 19/17 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
H04N 19/436 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements
H04N 19/46 - Embedding additional information in the video signal during the compression process
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
H04N 19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a video unit of a video and a bitstream of the video, that the video unit is coded by a motion pair from a same direction relative to a current picture, wherein the video unit is applied with an advanced motion vector prediction (AMVP) merge mode and the video unit belongs to a low-delay picture; and performing the conversion based on the motion pair.
Embodiments of the present disclosure provide a solution for point cloud coding. A method for point cloud coding is proposed. The method comprises: determining, for a conversion between a current frame of a point cloud sequence and a bitstream of the point cloud sequence, first attribute information of a first node of the current frame based on second attribute information of a second node of the current frame, a node representing a spatial partition of the current frame, a first partition depth of the first node being different from a second partition depth of the second node; and performing the conversion based on the first attribute information.
H04N 19/597 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
73.
A NEURAL NETWORK-BASED IMAGE AND VIDEO COMPRESSION METHOD WITH CONDITIONAL CODING
H04N 19/12 - Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
G06V 10/422 - Global feature extraction by analysis of the whole pattern, e.g. using frequency domain transformations or autocorrelation for representing the structure of the pattern or shape of an object therefor
H04N 19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: generating, for a conversion between a video unit of a video and a bitstream of the video unit, a sample value of a first color component of the video unit that is corresponding to a sample of a second color component by applying a plurality of filters to at least one sample of the first color component; and performing the conversion based on the generated sample value.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, whether a candidate frame associated with the current video block is a co-located frame based on a frame type of the candidate frame, the co-located frame being co-located with a frame comprising the current video block; and performing the conversion based on the determining.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, a plurality of co-located frames of the current video block, the current video block being in a current frame co-located with the plurality of co-located frame; and performing the conversion based on the plurality of co-located frames.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/50 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
H04N 19/573 - Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
H04N 19/52 - Processing of motion vectors by encoding by predictive encoding
77.
MULTI-SOURCE BASED EXTENDED TAPS FOR ADAPTIVE LOOP FILTER IN VIDEO CODING
A mechanism for processing video data is disclosed. The mechanism includes determining to apply an adaptive loop filter (ALF) to a first component of a video unit. The ALF includes one or more extended taps. The one or more extended taps utilize an input source other than spatial neighbor samples of the first component. A conversion is performed between a visual media data and a bitstream based on the ALF.
H04N 19/82 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
H04N 19/117 - Filters, e.g. for pre-processing or post-processing
78.
GEOMETRIC TRANSFORM IN NEURAL NETWORK-BASED CODING TOOLS FOR VIDEO CODING
A mechanism for processing video data is disclosed. The mechanism determines to modify a video unit attendant to applying a video compression function. The modification may include applying a geometric conversion to the video unit. A conversion is performed between a visual media data and a bitstream based on the modified video unit.
H04N 19/136 - Incoming video signal characteristics or properties
H04N 19/20 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
H04N 19/48 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using compressed domain processing techniques other than decoding, e.g. modification of transform coefficients, variable length coding [VLC] data or run-length data
79.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A. method for video processing is proposed. The method comprises: obtaining, for a conversion between a current video block of a video and a bitstream of the video, values for a set of adjusting parameters associated with values for a set of model parameters of a local illumination compensation (LIC) model for coding the current video block; updating the values for the set of model parameters based on the values for the set of adjusting parameters; and performing the conversion based on the updated values for the set of model parameters.
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/117 - Filters, e.g. for pre-processing or post-processing
H04N 19/139 - Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
H04N 19/51 - Motion estimation or motion compensation
80.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining, for a conversion between a video and a bitstream of the video, a first video block, the first video block being generated based on a first adjustment of a current video block of the video, the first adjustment comprising at least one of adjusting an orientation of the current video block or adjusting positions of samples in the current video block; generating a. second video block based on a second adjustment of the first video block, the second adjustment being an inverse process of the first adjustment; and performing the conversion based on the second video block.
H04N 19/17 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
H04N 19/88 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving rearrangement of data among different coding units, e.g. shuffling, interleaving, scrambling or permutation of pixel data or permutation of transform coefficient data among different blocks
H04N 19/10 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
H04N 19/147 - Data rate or code amount at the encoder output according to rate distortion criteria
81.
METHOD, APPARATUS, AND MEDIUM FOR POINT CLOUD CODING
Embodiments of the present disclosure provide a solution for point cloud coding. A method for point cloud coding is proposed. The method comprises: obtaining, for a conversion between a current point cloud (PC) sample of a point cloud sequence and a bitstream of the point cloud sequence, target information regarding whether an attribute inter prediction is enabled for the current PC sample, the target information being determined based on at least one of rate information or distortion information associated with coding at least one target PC sample with the attribute inter prediction, wherein the at least one target PC sample comprises at least one of: the current PC sample, or at least one PC sample of the point cloud sequence coded before the current PC sample; and performing the conversion based on the target information.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current block of a video and a bitstream of the video, motion information of the current block based on motion information of a neighboring block of the current block and an intra block copy merge mode with block vector difference (IBC-MBVD) mode, the neighboring block being coded with a reconstruction-reordered intra block copy (RRIBC) mode; and performing the conversion based on the motion information of the current block.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/51 - Motion estimation or motion compensation
H04N 19/52 - Processing of motion vectors by encoding by predictive encoding
83.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, during a conversion be-tween a target block of a video and a bitstream of the target block, that a combination of an intra block copy (IBC) and an intra prediction (CIBCIP) mode is applied to the target block; obtaining an IBC predicted signal and an intra predicted signal based on the CIBCIP mode; deriving a prediction or a reconstruction of the target block by combining the IBC predicted signal and the intra predicted signal; and performing the conversion based on the prediction or the reconstruction of the target block.
A mechanism for processing video data is disclosed. The mechanism determines to apply a preprocessing function to visual media data as part of an image compression framework. A conversion can then be performed between a visual media data and a bitstream based on the image compression framework and the corresponding preprocessing function.
H04N 19/122 - Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type
H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
A mechanism for processing video data is disclosed. A padding process is determined for application to derive out of boundary samples for use by a filter applied to in boundary samples within a video unit. A conversion is performed between a visual media data and a bitstream based on the filter.
H04N 19/117 - Filters, e.g. for pre-processing or post-processing
H04N 19/182 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
A mechanism for processing video data is disclosed. An indication is determined in an Addressable Resource Index (ARI) track configuration box in an ARI track. The ARI track indexes a Common Media Application Format (CMAF) switching set. The indication indicates whether extended dependent random access point (EDRAP) samples are allowed to be present in one or more tracks of the CMAF switching set. A conversion is performed between a media data and a media data file based on the indication.
G06F 16/71 - Indexing; Data structures therefor; Storage structures
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
H04N 21/2343 - Processing of video elementary streams, e.g. splicing of video streams or manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
H04N 21/4402 - Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
A mechanism for processing video data is disclosed. A media track is determined to include a track reference type of associated external stream track (aest) A conversion is performed between a media data and a media data file based on a media track. The media track with the track reference type of aest may include an extended dependent random access point (EDRAP) sample.
G06F 16/71 - Indexing; Data structures therefor; Storage structures
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
H04N 21/2343 - Processing of video elementary streams, e.g. splicing of video streams or manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
H04N 21/4402 - Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
88.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: applying, for a conversion between a video unit of a video and a bitstream of the video unit, a wrap around motion compensation (WAMC) during a derivation of motion information for the video unit; and performing the conversion based on the derived motion information.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
89.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between the current video block and a bitstream of the video, a target number for a set of motion candidates, the target number being smaller than or equal to an upper limit value; determining, based on the target number and a fast rate distortion optimization (RDO) process, the set of motion candidates from a plurality of motion candidates for an intra block copy (IBC) based mode for coding the current video block; and performing the conversion based on the set of motion candidates.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining, for a conversion between the current video block and a bitstream of the video, a target number for a set of intra block copy merge mode with block vector difference (IBC-MBVD) candidates, the target number being indicated in the bitstream and dependent on a target configuration of a coding process for coding the current video block; selecting, based on the target number, the set of IBC-MBVD candidates from a plurality of IBC-MBVD candidates associated with an intra block copy (IBC) base candidate for the current video block; and performing the conversion based on the set of IBC-MBVD candidates.
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
91.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining, for a conversion between a current video block of a video and a bitstream of the video, a motion candidate list for the current video block; determining, based on a similarity metric between a first motion candidate and a second motion candidate in the motion candidate list, whether to update the motion candidate list; and performing the conversion based on the determination.
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
92.
PARTITIONING INFORMATION IN NEURAL NETWORK-BASED VIDEO CODING
A method implemented by a video coding apparatus. The method includes applying a neural network (NN) filter to an unfiltered sample of a video unit to generate a filtered sample. The NN filter includes an NN filter model generated based on partitioning information of the video unit. Usage of the NN filter is indicated by one or more syntax elements in a bitstream. A conversion is performed between a video media file and a bitstream based on the filtered sample.
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
H04N 19/82 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: applying, for a conversion between a video unit of a video and a bitstream of the video, a connection to a plurality of filters; applying the plurality of filters with the connection in combination to the video unit; and performing the conversion based on the filtered video unit.
H04N 19/117 - Filters, e.g. for pre-processing or post-processing
H04N 19/82 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
A mechanism for processing video data is disclosed. A determination is made to resize an image with a first size to create a resized image with a second size. A conversion is performed between a visual media data and a bitstream based on the resized image. The conversion includes applying a neural network-based coding model to the resized image to achieve variable rate neural network-based compression.
H04N 19/149 - Data rate or code amount at the encoder output by estimating the code amount by means of a model, e.g. mathematical model or statistical model
H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
95.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: generating, based on a neighboring block of a current block of a video for a conversion between the current block and a bitstream of the video, a motion candidate list for an intra block copy template matching merge (IBC-TM-MERGE) mode for coding the current block, the neighboring block being coded with a reconstruction-reordered intra block copy (RRIBC) mode; and performing the conversion based on the motion candidate list.
H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
96.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, whether to adjust a reconstructed reordered intra block copy (RRIBC) coded history-based motion vector prediction (HMVP) candidate of the current video block; and performing the conversion based on the determining.
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
97.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a video unit of a video and a bitstream of the video unit, whether an adaptive reordering-based motion compensation (ARMC) is applied to a prediction list of the video unit, wherein the prediction list comprises at least one a reconstruction reordered intra block copy (RRIBC) coded motion candidate; and performing the conversion based on the determining.
H04N 19/189 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
98.
Extended Taps Using Different Sources for Adaptive Loop Filter in Video Coding
A method for processing video data includes using one or more extended taps in an ALF or a CCALF and performing a conversion between a visual media data and a bitstream based on the extended taps used in the ALF. A syntax element structure in the bitstream contains one or more filters with at least one extended tap, a first syntax element is in the bitstream to indicate whether a filter with at least one extended tap is enabled. The first syntax element is binarized by unary code, truncated unary code, fixed length code, exponential Golomb code, truncated exponential Golomb code, or bypass code. An extended tap takes information from one or more reference frames in a list zero, a list one, or both. Whether to take information from previously coded frames is dependent on a slice type, a picture type, or a temporal layer index.
H04N 19/82 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
H04N 19/503 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
H04N 19/463 - Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
99.
METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a video unit of a video and a bitstream of the video unit, whether a template based processing is applied to the video unit, wherein the template based processing is based on at least one template in at least one of: a current picture or a reference picture of the video unit; and performing the conversion based on the determining.
Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current block of a video and a bitstream of the video, that a first prediction mode is applied to the current block, deriving, in the first prediction mode, prediction samples of the current block based on reference values in a virtual buffer that are determined by a block vector for the current block; wherein reference values in the virtual buffer are derived from blocks of sample values of a same video region of the current video block; performing the conversion based on the prediction samples; determining a set of blocks from a plurality of blocks of the same video region based on coded information of the plurality of blocks, the plurality of blocks being coded before the current block; and updating the virtual buffer based on the set of blocks.
H04N 19/50 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
H04N 19/103 - Selection of coding mode or of prediction mode
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals