Codec Agnostic Strategies for Enhanced Learned Media Compression

Mari, Daniele

In recent years learning-based codecs have gained a lot of momentum in the field of multimedia compression. The reason for their success is the ability to learn very informative yet entropy efficient representations of the data. This advantage is what allows them to outperform traditional codecs in most coding tasks. However, most of these codecs are still in early development and are lacking some of the features that are common in traditional codecs. Some examples are high and low complexity operation modes, quality scalability, semantic information exploitation, and, although mostly relevant when talking about learned solutions, the ability to choose if the sample should be decoded with high fidelity or high perceptual quality. In this thesis, some codec-agnostic algorithms that address some of the aforementioned problems are presented. The algorithms should be as independent as possible from the codec design and they should not impair the ability of the codec to be used in the standard way. This should allow them to be easily implemented into any new codec without requiring ad-hoc designs thus allowing to deploy efficient and flexible codecs in a short time. The algorithms proposed in this thesis follow the aforementioned principles since they increase the flexibility of the codecs they were implemented on without considerably affecting their compression efficiency and encoding/decoding time. This constitutes a step forward in the usability of the new generation of codecs in real-case scenarios.

Codec Agnostic Strategies for Enhanced Learned Media Compression

MARI, DANIELE

2025

Abstract

In recent years learning-based codecs have gained a lot of momentum in the field of multimedia compression. The reason for their success is the ability to learn very informative yet entropy efficient representations of the data. This advantage is what allows them to outperform traditional codecs in most coding tasks. However, most of these codecs are still in early development and are lacking some of the features that are common in traditional codecs. Some examples are high and low complexity operation modes, quality scalability, semantic information exploitation, and, although mostly relevant when talking about learned solutions, the ability to choose if the sample should be decoded with high fidelity or high perceptual quality. In this thesis, some codec-agnostic algorithms that address some of the aforementioned problems are presented. The algorithms should be as independent as possible from the codec design and they should not impair the ability of the codec to be used in the standard way. This should allow them to be easily implemented into any new codec without requiring ad-hoc designs thus allowing to deploy efficient and flexible codecs in a short time. The algorithms proposed in this thesis follow the aforementioned principles since they increase the flexibility of the codecs they were implemented on without considerably affecting their compression efficiency and encoding/decoding time. This constitutes a step forward in the usability of the new generation of codecs in real-case scenarios.

Scheda breve

Scheda completa

Scheda completa (DC)

	Corso di studio
	
				INGEGNERIA DELL'INFORMAZIONE
			
	Data di pubblicazione
	
				24-mar-2025
			
	Lingua
	
				Inglese
			
	Relatore, Supervisor, Advisor o Tutor
	
				MILANI, SIMONE
			
	Nome Editore
	
				Università degli studi di Padova
			
	Collezione di appartenenza
	
				Università degli Studi di Padova

File in questo prodotto:

File	Dimensione	Formato
TesiDanieleMari.pdf accesso aperto Dimensione 6.57 MB Formato Adobe PDF Visualizza/Apri	6.57 MB	Adobe PDF	Visualizza/Apri

I documenti in UNITESI sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14242/199686

Il codice NBN di questa tesi è URN:NBN:IT:UNIPD-199686