In recent years learning-based codecs have gained a lot of momentum in the field of multimedia compression. The reason for their success is the ability to learn very informative yet entropy efficient representations of the data. This advantage is what allows them to outperform traditional codecs in most coding tasks. However, most of these codecs are still in early development and are lacking some of the features that are common in traditional codecs. Some examples are high and low complexity operation modes, quality scalability, semantic information exploitation, and, although mostly relevant when talking about learned solutions, the ability to choose if the sample should be decoded with high fidelity or high perceptual quality. In this thesis, some codec-agnostic algorithms that address some of the aforementioned problems are presented. The algorithms should be as independent as possible from the codec design and they should not impair the ability of the codec to be used in the standard way. This should allow them to be easily implemented into any new codec without requiring ad-hoc designs thus allowing to deploy efficient and flexible codecs in a short time. The algorithms proposed in this thesis follow the aforementioned principles since they increase the flexibility of the codecs they were implemented on without considerably affecting their compression efficiency and encoding/decoding time. This constitutes a step forward in the usability of the new generation of codecs in real-case scenarios.

Codec Agnostic Strategies for Enhanced Learned Media Compression

MARI, DANIELE
2025

Abstract

In recent years learning-based codecs have gained a lot of momentum in the field of multimedia compression. The reason for their success is the ability to learn very informative yet entropy efficient representations of the data. This advantage is what allows them to outperform traditional codecs in most coding tasks. However, most of these codecs are still in early development and are lacking some of the features that are common in traditional codecs. Some examples are high and low complexity operation modes, quality scalability, semantic information exploitation, and, although mostly relevant when talking about learned solutions, the ability to choose if the sample should be decoded with high fidelity or high perceptual quality. In this thesis, some codec-agnostic algorithms that address some of the aforementioned problems are presented. The algorithms should be as independent as possible from the codec design and they should not impair the ability of the codec to be used in the standard way. This should allow them to be easily implemented into any new codec without requiring ad-hoc designs thus allowing to deploy efficient and flexible codecs in a short time. The algorithms proposed in this thesis follow the aforementioned principles since they increase the flexibility of the codecs they were implemented on without considerably affecting their compression efficiency and encoding/decoding time. This constitutes a step forward in the usability of the new generation of codecs in real-case scenarios.
24-mar-2025
Inglese
MILANI, SIMONE
Università degli studi di Padova
File in questo prodotto:
File Dimensione Formato  
TesiDanieleMari.pdf

accesso aperto

Dimensione 6.57 MB
Formato Adobe PDF
6.57 MB Adobe PDF Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/199686
Il codice NBN di questa tesi è URN:NBN:IT:UNIPD-199686