
FedLive: CNN for saliency, RL for FoV prediction


Open-source software name:

FedLive

Open-source software URL:

https://gitee.com/uglyghost123/FedLive

Open-source software description:

FedLive: A Federated Transmission Framework for Panoramic Livecast with Reinforced Variational Inference

1. Abstract

Providing premium panoramic livecast services for worldwide viewers against the negative effects of its ultra-high data rate and delay-sensitive demands is still a significant challenge to the current delivery system. Therefore, it is imperative to explore an efficient way of improving the quality of experience while conserving the bandwidth resources for panoramic livecast. This paper expands the design space of both the Field of View (FoV) prediction and the multi-viewer 360° live streaming by presenting a novel cost-efficient federated transmission framework called FedLive. We first propose a gradient-based clustering method to group the geo-distributed viewers with similar viewing behavior into content delivery alliances by exploiting geometric properties of the gradient loss. By integrating viewers' resources, a Reinforced Variational Inference (RVI) structure-based approach is proposed to assist in the collaborative training of the FoV prediction model while accelerating the delivery of multiple multi-rate tiles. We further design a prediction-based asynchronous delivery algorithm, in which both high-accuracy FoV prediction and efficient live 360° video transmission are achieved in a decentralized manner. Finally, we use the synchronized algorithm as a benchmark to evaluate the performance of our solution over a real-world dataset. Additionally, prototype-based experimental results reveal that our approach provides the highest prediction accuracy, reduces delay, and saves bandwidth compared with state-of-the-art solutions.

2. Framework

[Figure: FedLive framework diagram]

The figure above presents the diagram of FedLive, the proposed federated transmission framework for PLS. FedLive involves multiple types of nodes, including content providers, CDN servers, and viewers with HMDs. It contains two major phases, the distributed learning phase and the transmission phase, with three blocks (viewer side, CDN server side, and content provider side). This repository covers the distributed learning phase.

In the distributed learning phase, each viewer trains a local FoV prediction model whose inputs are its local viewing records and the prefetching records provided by the FoV prediction model. The loss information it outputs is used for local backward propagation and is also captured by the nearby CDN servers as the input of the user clustering algorithm. In particular, the loss matrix is calculated by adding the predicted FoV binary matrix to the ground truth binary matrix. Once the CDN servers have collected the loss information from all viewers, they invoke the gradient-based user clustering algorithm to divide the viewers into multiple viewer clusters. Meanwhile, the CDN servers continuously summarize the loss value for each cluster and send it to the content provider, along with the clustering results, as the input for the unified model training. With the loss information provided by the CDN servers, the content provider updates the unified model for each cluster with the weighted average loss value. This process extends the concept of federated learning by adding clustering. Further, the unified models are distributed to viewers as part of the live stream, while the prefetching priority of different tiles is determined by the predicted results of the unified models.
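The loss bookkeeping described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code in this repository: the function names, the scalar summary (`mismatch_ratio`), and the per-viewer weights are hypothetical. Only the element-wise addition of the two binary matrices and the per-cluster weighted average come from the description.

```python
from typing import Dict, List

Matrix = List[List[int]]


def loss_matrix(predicted: Matrix, truth: Matrix) -> Matrix:
    """Element-wise sum of the predicted and ground-truth binary tile matrices.

    Resulting values: 2 = tile in both, 1 = tile in exactly one, 0 = tile in neither.
    """
    if len(predicted) != len(truth) or any(
        len(p) != len(t) for p, t in zip(predicted, truth)
    ):
        raise ValueError("predicted and truth must have the same shape")
    return [[p + t for p, t in zip(prow, trow)] for prow, trow in zip(predicted, truth)]


def mismatch_ratio(loss: Matrix) -> float:
    """Assumed scalar summary: share of tiles marked by exactly one matrix."""
    cells = [v for row in loss for v in row]
    return sum(1 for v in cells if v == 1) / len(cells)


def cluster_weighted_loss(
    losses: Dict[str, float],    # viewer id -> scalar loss
    weights: Dict[str, float],   # viewer id -> weight (assumed, e.g. record count)
    clusters: Dict[str, int],    # viewer id -> cluster id
) -> Dict[int, float]:
    """Weighted average loss per cluster."""
    totals: Dict[int, float] = {}
    norm: Dict[int, float] = {}
    for viewer, cid in clusters.items():
        w = weights[viewer]
        totals[cid] = totals.get(cid, 0.0) + w * losses[viewer]
        norm[cid] = norm.get(cid, 0.0) + w
    return {cid: totals[cid] / norm[cid] for cid in totals}


predicted = [[1, 1, 0], [0, 0, 0]]
truth = [[0, 1, 1], [0, 0, 0]]
lm = loss_matrix(predicted, truth)  # [[1, 2, 1], [0, 0, 0]]
```

In this sketch a value of 1 marks a tile where prediction and ground truth disagree, so the share of such tiles serves as a simple scalar loss. The actual loss used by FedLive may be defined differently.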

3. Installation

3.1 Install dependent packages

We build FedLive with Torch and Gym. Install the following versions:

  • Torch: 1.10.1+cu113
  • Python: 3.8
  • Gym: 0.19.0
  • OpenCV: 4.5.4.60
  • Other Python packages: please refer to <requestments.txt>
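To confirm that an environment matches the versions listed above, a small check like the following may help. It is a sketch rather than part of FedLive: the pip distribution names (`torch`, `gym`, `opencv-python`) are assumptions, and `check_versions` is a hypothetical helper. It relies only on the standard-library `importlib.metadata` module, which is available from Python 3.8.

```python
from importlib import metadata

# Versions copied from the list above; the distribution names are assumptions.
EXPECTED = {
    "torch": "1.10.1+cu113",
    "gym": "0.19.0",
    "opencv-python": "4.5.4.60",
}


def check_versions(expected, lookup=metadata.version):
    """Return {package: (expected, installed_or_None)} for every mismatch."""
    problems = {}
    for name, wanted in expected.items():
        try:
            found = lookup(name)
        except metadata.PackageNotFoundError:
            found = None  # package is not installed at all
        if found != wanted:
            problems[name] = (wanted, found)
    return problems
```

Calling `check_versions(EXPECTED)` returns an empty dictionary when everything matches. Otherwise each entry gives the expected version and the installed one (or `None` if the package is missing).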

3.2 Download FedLive

First, download FedLive from Gitee:

git clone https://gitee.com/uglyghost123/FedLive.git

4. How To Use

4.1 Folders

  • ./game/: Files for RL agent and gym environment. It includes:
    • game/agent.py: RL agent
    • game/grid_video_world.py: the custom environment for 360-degree video FoV prediction
  • ./log/: Experiment logs.
  • ./nn_model/: Files for different RL policies.
  • ./pic/: Pictures of the experimental results.
  • ./save_model/: Saved RL models for different user clusters.
  • ./utils/: Data processing scripts.
  • ./main.py: Main function.
  • ./arguments.py: Argument configuration. For example, --policy SAC selects the soft actor-critic (SAC) policy.
  • ./get_frame.py: Get the video frames.
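To give a feel for what a custom environment for FoV prediction might look like, here is a toy tile-grid world that follows the classic Gym convention (`reset()` returns an observation, `step()` returns observation, reward, done, info). It is a hypothetical sketch, not the contents of game/grid_video_world.py: the class name, the grid size, and the state, action, and reward definitions are all assumptions. It is written in plain Python so it runs without Gym installed.

```python
from typing import Dict, List, Tuple

Tile = Tuple[int, int]


class TileGridWorld:
    """Toy environment: predict the tile that will hold the next FoV centre."""

    def __init__(self, trace: List[Tile], rows: int = 4, cols: int = 8):
        if len(trace) < 2:
            raise ValueError("trace needs at least two steps")
        self.trace = trace            # ground-truth FoV centre per step: (row, col)
        self.rows, self.cols = rows, cols
        self.t = 0

    def reset(self) -> Tile:
        self.t = 0
        return self.trace[0]

    def step(self, action: int) -> Tuple[Tile, float, bool, Dict[str, int]]:
        """`action` is a flat tile index (row * cols + col)."""
        pred = divmod(action, self.cols)
        self.t += 1
        target = self.trace[self.t]
        # Column distance wraps around because a panorama is cyclic in yaw.
        dc = abs(pred[1] - target[1])
        dist = abs(pred[0] - target[0]) + min(dc, self.cols - dc)
        reward = float(-dist)         # assumed reward: closer predictions score higher
        done = self.t == len(self.trace) - 1
        return target, reward, done, {"distance": dist}


env = TileGridWorld([(1, 0), (1, 7), (2, 7)])
obs = env.reset()
obs, reward, done, info = env.step(1 * 8 + 0)  # predict tile (1, 0); truth is (1, 7)
```

The wrap-around distance is a design choice specific to this sketch: tiles at the left and right edges of an equirectangular frame are neighbours on the sphere.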

4.2 Prepare the datasets

  • ./Saliency: 360-degree saliency dataset. link
  • ./Videos: Panoramic videos. link
  • ./frames: Frames extracted from the panoramic videos. (Note: configure the path first!)
python get_frames.py
  • ./VRdataset: A head-tracking dataset composed of 48 users (24 males and 24 females) watching 18 spherical videos from 5 categories. link
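For readers who want to see what the frame-extraction step for ./frames involves, the following is a minimal sketch using OpenCV's `cv2.VideoCapture` and `cv2.imwrite`. It is not the repository's script: the function names, the output naming scheme, and the `every` sampling parameter are assumptions made for illustration.

```python
import os


def frame_path(out_dir: str, video_name: str, index: int) -> str:
    """Build an output path such as <out_dir>/<video stem>/000012.jpg (assumed naming)."""
    stem = os.path.splitext(os.path.basename(video_name))[0]
    return os.path.join(out_dir, stem, f"{index:06d}.jpg")


def extract_frames(video_file: str, out_dir: str, every: int = 1) -> int:
    """Write every `every`-th frame of `video_file` as a JPEG; return the number written."""
    import cv2  # imported lazily so frame_path() works without OpenCV installed

    cap = cv2.VideoCapture(video_file)
    if not cap.isOpened():
        raise IOError(f"cannot open {video_file}")
    written = 0
    index = 0
    try:
        while True:
            ok, frame = cap.read()
            if not ok:          # end of stream or read error
                break
            if index % every == 0:
                target = frame_path(out_dir, video_file, index)
                os.makedirs(os.path.dirname(target), exist_ok=True)
                cv2.imwrite(target, frame)
                written += 1
            index += 1
    finally:
        cap.release()
    return written
```

A call such as `extract_frames("Videos/clip.mp4", "frames", every=30)` would keep one frame out of every 30; the file name here is only an example.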

4.3 Run with SAC policy

  • Set parameters and file path before running the code.
  • Start with SAC policy.
python main.py --policy SAC
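The --policy flag used above could be exposed with `argparse` roughly as follows. This is a hedged sketch, not the contents of arguments.py: `build_parser` is a hypothetical name, the list of choices is taken from the policy names in the results section of this README, and the default of SAC is an assumption.

```python
import argparse

# Policy names as they appear in the results of this README; whether
# arguments.py accepts all of them is an assumption.
POLICIES = ["SAC", "AC", "A3C", "DDPG", "TD3", "DDQN", "PPO", "DQN"]


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="FoV prediction training (illustrative)")
    parser.add_argument(
        "--policy",
        choices=POLICIES,
        default="SAC",  # assumed default
        help="RL policy used to train the FoV prediction model",
    )
    return parser


# Equivalent to running: python main.py --policy SAC
args = build_parser().parse_args(["--policy", "SAC"])
print(args.policy)
```

Restricting `choices` makes a misspelled policy name fail immediately with a usage message instead of surfacing later during training.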


5. Selected Results

5.1 Viewing experience

Results with the custom environment "grid_video_world.py":

[Figures: viewing results for RVI-SAC, RVI-AC, Ground truth, RVI-A3C, RVI-DDPG, RVI-TD3, RVI-DDQN, RVI-PPO, and RVI-DQN]

5.2 Accuracy, precision, and recall

Further, we evaluate the performance of our solution in terms of prediction accuracy, precision, and recall, and compare it in an asynchronous manner with three state-of-the-art solutions: LiveDeep, LiveObj, and PanoSalNet.
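For reference, accuracy, precision, and recall over binary tile matrices can be computed as in the sketch below. It uses the standard per-tile definitions; whether the reported numbers are aggregated exactly this way is an assumption, and the function names are hypothetical.

```python
from typing import List, Tuple

Matrix = List[List[int]]


def confusion_counts(pred: Matrix, truth: Matrix) -> Tuple[int, int, int, int]:
    """Count true positives, false positives, false negatives, true negatives per tile."""
    tp = fp = fn = tn = 0
    for prow, trow in zip(pred, truth):
        for p, t in zip(prow, trow):
            if p and t:
                tp += 1
            elif p:
                fp += 1
            elif t:
                fn += 1
            else:
                tn += 1
    return tp, fp, fn, tn


def scores(pred: Matrix, truth: Matrix) -> Tuple[float, float, float]:
    """Return (accuracy, precision, recall); empty denominators yield 0.0."""
    tp, fp, fn, tn = confusion_counts(pred, truth)
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return accuracy, precision, recall


pred = [[1, 1, 0, 0], [0, 1, 0, 0]]
truth = [[0, 1, 1, 0], [0, 1, 0, 0]]
accuracy, precision, recall = scores(pred, truth)
```

In this example two of the three predicted tiles are actually viewed and two of the three viewed tiles are predicted, so precision and recall are both 2/3, while accuracy is 6/8 because the four tiles outside both sets also count as correct.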

[Figures: prediction accuracy, precision, and recall]

5.3 The objective function (as QoE performance)

[Figures: objective function (QoE performance)]

6. Contributors

7. Citation

Contact

Xingyan Chen ([email protected]), Southwestern University of Finance and Economics

