03 Ef?cient Video Stitching Based on Fast Structure Deformation-有解無憂

1.論文題目和關鍵詞
Title:
Ef?cient Video Stitching Based on Fast Structure Deformation
基于快速結構變形的高效視頻拼接演算法
Keywords：
Computational ef?ciency計算效率；
computer vision計算機視覺；
intensity misalignment強度失準；
machine learning機器學習；
structure misalignment結構失準；
video stitching視頻拼接.

2.摘要大意
In computer vision, video stitching is a very challenging problem. In this paper, we proposed an ef?cient and effective wide-view video stitching method based on fast structure deformation that is capable of simultaneously achieving quality stitching and computational ef?ciency. For a group of synchronized frames, ?rstly, an effective double-seam selection scheme is designed to search two distinct but structurally corresponding seams in the two original images. The seam location of the previous frame is further considered to preserve the interframe consistency. Secondly, along the double seams, 1-D feature detection and matching is performed to capture the structural relationship between the two adjacent views. Thirdly, after feature matching, we propose an ef?cient algorithm to linearly propagate the deformation vectors to eliminate structure misalignment. At last, image intensity misalignment is corrected by rapid gradient fusion based on the successive over relaxation iteration (SORI) solver. A principled solution to the initialization of the SORI signi?cantly reduced the number of iterations required. We have compared favorably our method with seven state-of-the-art image and video stitching algorithms as well as traditional ones. Experimental results show that our method outperforms the existing ones compared in terms of overall stitching quality and computational ef?ciency.

在計算機視覺中，視頻拼接是一個非常具有挑戰性的問題，本文提出了一種基于快速結構變形的廣域視頻拼接方法，該方法能夠同時實作高質量的拼接和計算效率，首先，對于一組同步幀，設計一種有效的雙接縫選擇方案，以搜索兩個原始影像中兩個不同但結構上相對應的接縫，進一步考慮前一幀的接縫位置以保持幀間一致性，其次，沿著雙縫進行一維特征檢測與匹配，以捕捉相鄰兩視圖之間的結構關系，第三，在特征匹配后，提出了一種有效的變形向量線性傳播演算法來消除結構失準，最后，基于逐次超松弛迭代（SORI）求解器，通過快速梯度融合來校正影像強度失準，SORI初始化的原則解決方案顯著減少了所需的迭代次數，我們將我們的方法與七個最新的影像和視頻拼接演算法以及傳統演算法進行了比較，實驗結果表明，該方法在整體拼接質量和計算效率方面優于現有方法，

3.從不同角度分析該篇論文的創新點，并談談有什么學術價值
創新點：
（1）提出了一種基于快速結構變形的視頻拼接方法，該方法具有高質量、低計算量的特點，
（2）提出了一種具有空間和時間一致性的雙縫選擇模型，該模型具有線性復雜性，
（3）理論上證明了變形模型可以有效轉化為線性反插值問題，
（4）提出了一個有效梯度融合的逐次超松弛迭代（SORI）求解器的解決方案，能夠很好的消除強度和結構失準的問題，
學術價值：
（1）以往的研究注重影像拼接，該論文主要研究視頻拼接，且同時解決了前文獻未能解決的高質量拼接和低計算成本的問題，
（2）對于雙接縫選擇，該論文的模型同時考慮了空間和幀間的一致性（時間）約束，可以有效的傳播視頻拼接中的候選接縫，使模型的計算復雜度和重疊區域的面積成線性關系，而以往的研究只考慮了由圖割（graph cut）優化的梯度平滑度和相似性，且是非線性的，
（3）該論文將形變數子化和傳播問題轉化為變形向量的線性反插值，與以往研究中的基于圖割方法的最小化能量函式相比，在視覺上可以達到同樣的效果，

4.對該篇論文結論的理解及對學習作業的啟發
（1）該論文提出了一種有效梯度融合的逐次超松弛迭代（SORI）演算法，不僅在方法上有所創新，且全面提升了計算效率和拼接質量，具有較高的實際應用價值，適用于對計算速度要求較高的場合（人臉識別、物體追蹤、自動駕駛），
（2）文章對于實際場景中存在大量復雜相似結構的影像如何進行精準匹配的問題沒有展開闡述，可做進一步的研究，

轉載請註明出處，本文鏈接：https://www.uj5u.com/yidong/147939.html

標籤：其他

上一篇：ubuntu下撰寫C語言程式

下一篇：基于深度學習的三維語意理解（分割）綜述串列