• Detailed Program


15th April, Monday
11:00 AM - 1:00 PM Registration
1:00 PM - 2:00 PM Welcome cocktail at Politecnico di Bari
2:00 PM - 3:30 PM
Oral presentations

MMSys Research Track
Session 1: Delivering immersive experiences

Session Chair: Carsten Griwodz
Just-in-Time Transcoding of Live 360° Video Streams
F. Hechler, M. Rudolph, A. Rizk
VP9 bitstream-based Tiled Multipoint Control Unit: Scaling simultaneous RGBD user streams in an immersive 3D communication system
S. N. Gunkel, R. Hindriks, Y. Shiferaw, S. Dijkstra-Soudarissanane, O. Niamut
Scalable MDC-Based Volumetric Video Delivery for Real-Time One-to-Many WebRTC Conferencing
M. De Fré, J. van der Hooft, T. Wauters, F. De Turck
Reliability Groups with Standby Flying Light Specks
H. Alimohammadzadeh, S. Zhu, J. Bai, S. Ghandeharizadeh
3:30 PM - 4:00 PM Coffee Break
4:00 PM - 5:20 PM
Oral presentations

MMSys Research Track
Session 2: Adaptive Streaming of Multimedia Content

Session Chair: Maria Martini
QoE Metrics for Interactivity in Video Conferencing Applications: Definition and Evaluation Methodology
J. He, M. Ammar, E. Zegura, E. Halepovic, T. Karagioules
Study-based Models of Game Player Quality of Experience with Frame Display Variation
X. Xu, M. Claypool
How do Users Experience Asynchrony between Visual and Haptic Information?
S. Zoltanski, Ç. Erdem, K. Kousias, O. Alay, C. Griwodz
Automatic Preparation of Sensory Effects: Managing Synchronization in Mulsemedia Applications
M. Josué, D. C. Muchaluat-Saade, M. F. Moreno
7:00 PM Welcome Reception at Hotel delle Nazioni rooftop, Bari seafront
16th April, Tuesday
8:30 AM - 9:00 AM Registration
9:00 AM - 10:00 AM

Keynote: Robotics and Wearable Haptics in Virtual and Augmented Reality

Domenico Prattichizzo, University of Siena
10:00 AM - 10:30 AM Coffee Break
10:30 AM - 11:50 AM
Oral presentations

MMSys Research Track
Session 3: Adaptive Video Streaming

Session Chair: Silvia Rossi
BOLA360: Near-optimal View and Bitrate Adaptation for 360-degree Video Streaming
A. Zeynali, M. Hajiesmaili, R. K. Sitaraman
QV4: QoE-based Viewpoint-Aware V-PCC-encoded Volumetric Video Streaming
Y. Shi, B. Clement, W. T. Ooi
FovOptix: Human Vision-Compatible Video Encoding and Adaptive Streaming in VR Cloud Gaming
A. Alhilal, Z. Wu, Y. H. Tsui, P. Hui
Low Latency Live Video Streaming over a Low-Earth-Orbit Satellite Network with DASH
J. Zhao, J. Pan
11:50 AM - 12:30 PM
Oral presentations given by organizers

Grand Challenges

360-degree Video on-demand Streaming
Yongqiang Gui (Bytedance)
Offline Reinforcement Learning for Bandwidth Estimation in Real Time Communications
Ezra Ameri (Microsoft)
12:30 PM - 2:00 PM Lunch (ODS and GC posters setup)
2:00 PM - 3:30 PM

Open Datasets and Software

Ceasefire Hierarchical Weapon Dataset
T. Malon, S. Chambon, A. Crouzil, L. Lechelek, G. Jalabert, C. Brocard, N. Bernardeau, L. Abadie, B. Sera, T. Hartmann, M. Le Bras
TACDEC: Dataset for Automatic Tackle Detection in Soccer Game Videos
E. J. Kassab, H. M. Solberg, S. Gautam, C. Midoglu, S. S. Sabet, T. Torjusen, M. Riegler, P. Halvorsen
Nagare Media Engine: Task Error Recovery in MPEG NBMP Workflows Through Event Sourcing
M. Neugebauer
GREEM: An Open-Source Energy Measurement Tool for Video Processing
C. Bauer, S. Afzal, S. Linder, R. Prodan, C. Timmerer
An Open Software Suite for Event-Based Video
A. Freeman
LENS: A LEO Satellite Network Measurement Dataset
J. Zhao, J. Pan
EVCA: Enhanced Video Complexity Analyzer
H. Amirpour, M. Ghasempour, L. Qu, W. Hamidouche, C. Timmerer
QADRA: Quality-Aware Dynamic Resolution Adaptation Framework for Adaptive Video Streaming
A. Premkumar, P. T. Rajendran, V. V. Menon, A. Wieckowski, B. Bross, D. Marpe
Pandia: Open-source Framework for DRL-based Real-time Video Streaming Control
X. Li, E. Vikberg, B. Cho, Y. Xiao
vRetention: A User Viewing Dataset for Popular Video Streaming Services
B. Chen, J. Zhu, Y. Jiang, Y. Hu
Panonut360: A Head and Eye Tracking Dataset for Panoramic Video
Y. Xu, J. Du, J. Wang, Y. Ning, S. Zhou, Y. Cao
VEED: Video Encoding Energy and CO2 Emissions Dataset for AWS EC2 instances
S. Linder, S. Afzal, C. Bauer, H. Amirpour, R. Prodan, C. Timmerer
COCONUT: Content Consumption Energy Measurement Dataset for Adaptive Video Streaming
F. Tashtarian, D. Lorenzi, H. Amirpour, S. Afzal, C. Timmerer
The SoccerSum Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch
M. H. Sarkhoosh, S. Gautam, C. Midoglu, S. S. Sabet, T. Torjusen, P. Halvorsen
A Driver Activity Dataset with Multiple RGB-D Cameras and mmWave Radars
G. Li, H. Chiang, Y. Li, S. Shirmohammadi, C. Hsu
ComPEQ-MR: Compressed Point Cloud Dataset with Eye-tracking and Quality Assessment in Mixed Reality
M. Nguyen, S. Vats, X. Zhou, I. Viola, P. Cesar, C. Timmerer, H. Hellwagner
uvgRTP 3.0: Towards V3C Volumetric Video Communication
H. Tampio, J. Räsänen, M. Viitanen, A. Mercat, G. Gautier, J. Vanne
APEIRON: a Multimodal Drone Dataset Bridging Perception and Network Data in Outdoor Environments
N. Barone, W. Brescia, L. De Cicco, S. Mascolo
Questset: A VR Dataset for Network and Quality of Experience Studies
S. Baldoni, F. Battisti, F. Chiariotti, F. Mistrorigo, A. B. Shofi, P. Testolina, A. Traspadini, A. Zanella, M. Zorzi
StreetLens: An In-Vehicle Video Dataset for Public Facility Monitoring in Urban Streets
A. A. Jabal, A. Alfarrarjeh, S. Alsaggar, R. AbuRumman, K. Abuqaoud, L. Abuhejleh, I. Almatar, S. H. Kim
MilliNoise: a Millimeter-wave Radar Sparse Point Cloud Dataset in Indoor Scenarios
W. Brescia, P. Gomes, L. Toni, S. Mascolo, L. De Cicco
BostonTwin: the Boston Digital Twin for Ray-Tracing in 6G Networks
P. Testolina, M. Polese, P. Johari, T. Melodia
2:00 PM - 3:30 PM
Posters session

Grand Challenges

360-degree Video on-demand Streaming
Organized and sponsored by Bytedance
Efficient viewport prediction and tiling schemes for 360 degree video streaming
J. Adhuran, M. G. Martini
OMMS: Multiple Control based Adaptive 360° Video Streaming
R. Xu, C. Liu, M. Hu, S. Qian, Y. Zhang, T. Lin
Efficient Tile-Based Adaptive Streaming for 360-Degree Video-on-Demand Systems
D. Sheng, B. Gao, G. Liu, Q. Qi, J. Wang
Offline Reinforcement Learning for Bandwidth Estimation in Real Time Communications
Organized and sponsored by Microsoft
Pioneer: Offline Reinforcement Learning based Bandwidth Estimation for Real-Time Communication
B. Lu, K. Wang, J. Xu, L. Song, R. Xie, W. Zhang
NAORL: Network Feature Aware Offline Reinforcement Learning for Real Time Bandwidth Estimation
W. Zhang, X. Tao, J. Wang
ACM MMSys 2024 Bandwidth Estimation in Real Time Communications Challenge
S. Khairy, G. Mittag, V. Gopal, F. Y. Yan, Z. Niu, E. Ameri, S. Inglis, M. Golestaneh, R. Cutler
Accurate Bandwidth Prediction for Real-Time Media Streaming with Offline Reinforcement Learning
Q. Tan, G. Lv, X. Fang, J. Zhang, Z. Yang, Y. Jiang, Q. Wu
Offline Reinforcement Learning for Bandwidth Estimation in RTC Using a Fast Actor and Not-So-Furious Critic
E. Cetinkaya, A. Pehlivanoglu, I. U. Ayten, B. Yumakogullari, M. E. Ozgun, Y. K. Erinc, E. Deniz, A. C. Begen
3:30 PM - 4:00 PM Coffee Break
4:00 PM - 5:00 PM
Oral presentations

MMSys Research Track
Session 4: Watermarking, Video Filtration, and Neural Radiance Fields

Session Chair: Debora Muchaluat-Saade
FlexMark: Adaptive Watermarking Method for Images
M. A. Arab, A. Ghorbanpour, M. Hefeeda
CVF: Cross-Video Filtration on the Edge
A. Rahmanian, A. Ali-Eldin, S. Amin, H. Gustafsson
Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces
E. Skartados, M. K. Yucel, B. Manganelli, A. Drosou, A. Saa-Garriga
5:30 PM Bus leaves for social dinner at Covo dei Saraceni (Polignano a Mare)
17th April, Wednesday
8:30 AM - 9:00 AM Registration
9:00 AM - 10:00 AM

Keynote: Media, in the trenches

Dan Jenkins, Nimble Ape and Everycast Labs
10:00 AM - 10:30 AM Coffee Break
10:30 AM - 11:50 AM
Oral presentations

MMSys Research Track
Session 5: AI for Multimedia understanding and Video streaming

Session Chair: Wassim Hamidouche
A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation
F. Barbato, U. Michieli, M. K. Yucel, P. Zanuttigh, M. Ozay
AGiLE: Enhancing Adaptive GOP in Live Video Streaming
C. Chen, W. Yin, Z. Huang, S. Shi
OASIS: Collaborative Neural-Enhanced Mobile Video Streaming
S. Jin, R. Zhu, A. Hassan, X. Zhu, X. Zhang, Z. M. Mao, F. Qian, Z. Zhang
DIGITWISE: Digital Twin-based Modeling of Adaptive Video Streaming Engagement
E. Artioli, F. Tashtarian, C. Timmerer
11:50 AM - 12:30 PM
Pitches

Doctoral Symposium pitches

Session Chair: Mario Montagud
Generative AI for HTTP Adaptive Streaming
E. Artioli
Analysis and Development of Deep Learning Depth Estimation Techniques for Volumetric Capture and Free Viewpoint Video
J. Usón, J. Cabrera
Design and Implementation of a Low-Latency Origin and Relay for Media-over-QUIC Transport
Z. Gurel
Smart radio access selection and slice allocation for differentiated traffic management over 6G heterogeneous networks
C. Carballo Gonzalez, M. Murroni
Evaluating Visual Attention and QoE for 360° videos with non-spatial and spatial audio
A. Hirway, Y. Qiao, N. Murray
Multiuser Virtual Experiences powered by Holoportation Technologies and Multimodal Human-Computer Interaction (HCI)
M. Hjeij, M. Montagud, D. Rincón
12:30 PM - 2:00 PM Lunch (Demo setup)
2:00 PM - 3:30 PM

Technical Demos

Media-over-QUIC Transport vs. Low-Latency DASH: a Deathmatch Testbed
Z. Gurel, T. E. Civelek, D. Ugur, Y. K. Erinc, A. C. Begen
Demonstrating Adaptive Many-To-Many Immersive Teleconferencing For Volumetric Video
M. De Fré, J. van der Hooft, T. Wauters, F. De Turck
Context-aware chatbot using MLLMs for Cultural Heritage
P. K. Rachabatuni, F. Principi, P. Mazzanti, M. Bertini
PyStream: Enhancing Video Streaming Evaluation
S. Radler, L. Prüller, E. Artioli, F. Tashtarian, C. Timmerer
SmartCrop-H: AI-Based Cropping of Ice Hockey Videos
M. Majidi, M. H. Sarkhoosh, C. Midoglu, S. S. Sabet, T. Kupka, D. Johansen, P. Halvorsen
Power Efficient Multi-CDN Communication over Content Steering Server
B. Kara, G. Simon
Multimodal AI-Based Summarization and Storytelling for Soccer on Social Media
M. H. Sarkhoosh, S. Gautam, C. Midoglu, S. S. Sabet, P. Halvorsen
Integrating Content Authenticity with DASH Video Streaming
S. Petrangeli, H. Wang, M. Fisher, D. Kozma, M. Mahamli, P. Blumenthal, A. Parsons
Looking Beyond the Screen: Natural Eye Contact as a Key to Relatedness in Teleconferences
M. Lux, S. Rabung, P. Herrmann, T. Albert, S. Andreas, S. Hohnwald
Management And Performance of Multiple Video Decoding Instances in Mobile Devices
E. Potetsianakis, E. Alexiou, E. Thomas
Collaborative Cooking in VR: Effects of Network Distortion in Multi-User Virtual Environments
J. Sameri, S. Van Damme, S. Schwarzmann, Q. Wei, R. Trivisonno, F. De Turck, M. T. Vega

Doctoral Symposium posters

Generative AI for HTTP Adaptive Streaming
E. Artioli
Analysis and Development of Deep Learning Depth Estimation Techniques for Volumetric Capture and Free Viewpoint Video
J. Usón, J. Cabrera
Design and Implementation of a Low-Latency Origin and Relay for Media-over-QUIC Transport
Z. Gurel
Smart radio access selection and slice allocation for differentiated traffic management over 6G heterogeneous networks
C. Carballo Gonzalez, M. Murroni
Evaluating Visual Attention and QoE for 360° videos with non-spatial and spatial audio
A. Hirway, Y. Qiao, N. Murray
Multiuser Virtual Experiences powered by Holoportation Technologies and Multimodal Human-Computer Interaction (HCI)
M. Hjeij, M. Montagud, D. Rincón
3:30 PM - 4:00 PM Coffee Break
4:00 PM - 5:00 PM
Oral presentations

MMSys Research Track
Session 6: Visual information analysis, encoding and streaming

Session Chair: Maria Torres Vega
Accelerated Event-Based Feature Detection and Compression for Surveillance Video Systems
A. Freeman, K. Mayer-Patel, M. Singh
Vesper: Learning to Manage Uncertainty in Video Streaming
B. Chen, M. Wu, H. Guo, Z. Yan, K. Nahrstedt
Inter-Frame Parallelization in an Open Optimized VVC Encoder
V. George, J. Brandenburg, G. Hege, T. Hinz, A. Wieckowski, B. Bross, T. Schierl, D. Marpe
5:00 PM - 5:20 PM Closing and Awards
18th April, Thursday
8:30 AM - 9:00 AM Registration
9:00 AM - 10:00 AM

Keynote: Quality management and sustainability in video streaming

Luigi Atzori, University of Cagliari
10:00 AM - 10:30 AM Coffee Break
10:30 AM - 12:20 PM Track 1 - NOSSDAV Workshop
Oral Session 1
Track 2 - MMVE Workshop
Oral Session 1
12:20 PM - 1:50 PM Lunch (MMVE posters setup)
1:50 PM - 3:30 PM Track 1 - NOSSDAV Workshop
Oral Session 2
Track 2 - MMVE Workshop
Poster Session
3:30 PM - 4:00 PM Coffee Break
4:00 PM - 5:40 PM Track 1 - GMSys Workshop
Oral Session
Track 2 - MMVE Workshop
Oral Session 2