• 
    

    
    

      99热精品在线国产_美女午夜性视频免费_国产精品国产高清国产av_av欧美777_自拍偷自拍亚洲精品老妇_亚洲熟女精品中文字幕_www日本黄色视频网_国产精品野战在线观看 ?

      基于失真?zhèn)鬟f的時(shí)域自適應(yīng)量化算法

      2017-02-07 09:54:56殷海兵王鴻奎王忠霄
      關(guān)鍵詞:碼率代價(jià)時(shí)域

      殷海兵,王鴻奎,王忠霄

      (中國(guó)計(jì)量大學(xué) 信息工程學(xué)院, 浙江 杭州 310018)

      基于失真?zhèn)鬟f的時(shí)域自適應(yīng)量化算法

      殷海兵,王鴻奎,王忠霄

      (中國(guó)計(jì)量大學(xué) 信息工程學(xué)院, 浙江 杭州 310018)

      碼率控制是視頻編碼器中的關(guān)鍵模塊,其算法直接決定編碼器率失真性能.視頻編碼幀間預(yù)測(cè)導(dǎo)致的編碼失真會(huì)在時(shí)域產(chǎn)生傳遞效應(yīng),考慮該傳遞效應(yīng)是優(yōu)化碼率控制算法性能的關(guān)鍵.宏塊樹碼率控制是一種典型的時(shí)域量化控制算法,核心是根據(jù)編碼單元失真?zhèn)鬟f量(相對(duì)傳遞代價(jià)ρ)自適應(yīng)地調(diào)整量化參數(shù)(偏移量δ),合適的δ-ρ映射關(guān)系是宏塊樹量化控制算法的核心.宏塊樹算法采用基于經(jīng)驗(yàn)的δ-ρ模型,對(duì)不同視頻序列的普適性有待改進(jìn),模型準(zhǔn)確度和精度也需進(jìn)一步優(yōu)化.針對(duì)上述問題,將競(jìng)爭(zhēng)決策方法用于探索最優(yōu)δ-ρ映射關(guān)系,提出了一種率失真性能優(yōu)化的失真時(shí)域傳遞自適應(yīng)量化δ-ρ模型,以改進(jìn)時(shí)域自適應(yīng)量化算法.實(shí)驗(yàn)結(jié)果表明,信噪比BD-PSNR較原模型提升了0.14 dB以上,SSIM性能提升了0.29 dB.算法能更好地控制碼率時(shí)域分配,降低失真時(shí)域傳遞惡化.

      視頻編碼;碼率控制;率失真優(yōu)化;失真?zhèn)鬟f;競(jìng)爭(zhēng)決策

      A temporally adaptive quantization algorithm with constrained distortion propagation in video coding. Journal of Zhejiang University(Science Edition), 2017,44(1):057-063

      0 引 言

      視頻編碼器是數(shù)字電視、視頻監(jiān)控、網(wǎng)絡(luò)流媒體等數(shù)字媒體應(yīng)用中重要的源端設(shè)備.碼率控制是視頻編碼器中非常重要的算法可定制模塊,其任務(wù)是為各編碼單元選擇合適的量化參數(shù)[1-2].量化直接決定編碼失真和碼率,碼率控制算法則直接決定編碼器的率失真性能[1-3].視頻編碼器通常將視頻依次分為GOP、幀和編碼塊等粒度編碼單元,碼率控制也常通過多層次量化調(diào)整實(shí)現(xiàn)量化控制[1,4].目前,已有的碼率控制算法通常不考慮或弱化編碼單元之間的依賴,單獨(dú)進(jìn)行算法設(shè)計(jì)和優(yōu)化[4].

      實(shí)際上,視頻編碼存在復(fù)雜度的多層次空時(shí)域依賴,如:幀內(nèi)預(yù)測(cè)導(dǎo)致的編碼單元失真會(huì)發(fā)生空域傳遞,幀間預(yù)測(cè)導(dǎo)致的失真則在時(shí)域產(chǎn)生傳遞效應(yīng);上下文自適應(yīng)熵編碼會(huì)導(dǎo)致鄰近編碼單元之間編碼碼率消耗相互依賴,不再滿足傳統(tǒng)編碼單元相互獨(dú)立的假設(shè).編碼單元之間依賴的全局優(yōu)化成為提高性能的關(guān)鍵.動(dòng)態(tài)規(guī)劃優(yōu)化適用于存在失真碼率傳遞效應(yīng)的編碼算法優(yōu)化,如考慮幀間依賴的幀級(jí)量化控制以及塊內(nèi)系數(shù)依賴的率失真優(yōu)化量化等[4].因編碼參數(shù)眾多,且關(guān)聯(lián)復(fù)雜度隨動(dòng)態(tài)規(guī)劃網(wǎng)格數(shù)的增加而劇增,故無法應(yīng)用于多層次和同時(shí)優(yōu)化多編碼單元的編碼參數(shù)[3-4].

      近年來,一些學(xué)者對(duì)空時(shí)域依賴全局算法優(yōu)化開展了研究.LIU等[5]提出了時(shí)域失真?zhèn)鬟f失真模型,優(yōu)化可伸縮編碼碼率分配.陳杰等[6]分析并重建了圖像失真與當(dāng)前幀及參考幀q的關(guān)系,并研究參考幀壓縮導(dǎo)致的失真對(duì)總體失真的影響.一些文獻(xiàn)基于幀級(jí)編碼失真和源端誤差及參考失真之間的關(guān)系,構(gòu)建了時(shí)域傳遞失真模型,優(yōu)化了幀級(jí)碼率控制算法:如PANG等[7]構(gòu)建了幀級(jí)失真?zhèn)鬟f模型[8],文獻(xiàn)[8]基于DCT系數(shù)混合分布模型,構(gòu)建了幀級(jí)依賴率失真模型;朱策等[9]建立了源端失真?zhèn)鞑ツP?,通過估計(jì)當(dāng)前編碼單元對(duì)后續(xù)單元的影響,調(diào)整當(dāng)前單元編碼參數(shù)以實(shí)現(xiàn)優(yōu)化.率失真優(yōu)化是視頻算法設(shè)計(jì)的理論基礎(chǔ)[10],率失真性能是視頻算法性能的評(píng)判標(biāo)準(zhǔn)[11].感知失真度量和感知編碼是視頻算法設(shè)計(jì)和優(yōu)化需要考慮的重要因素[11-14].

      工業(yè)界也在視頻編碼優(yōu)化方面做了有益探索,如廣泛應(yīng)用的x264編碼器,采用基于感知的模糊復(fù)雜度模型、時(shí)域MBTree和空域VAQ感知量化技術(shù)[15];這些算法雖在一定程度上提升了x264的編碼性能,但往往是基于經(jīng)驗(yàn)方法構(gòu)建的算法模型.MBTree算法度量反映每個(gè)編碼單元時(shí)域失真?zhèn)鬟f大小的參數(shù)ρ,即“相對(duì)傳遞代價(jià)”.根據(jù)ρ確定量化參數(shù)調(diào)節(jié)量δ.該算法的關(guān)鍵是如何度量ρ,以及采用何種自適應(yīng)調(diào)整策略,即確定δ-ρ函數(shù).ρ越大的編碼單元,失真時(shí)域傳遞量越大.從時(shí)域碼率分配優(yōu)化角度看,應(yīng)該控制δ-ρ函數(shù),減小量化參數(shù),減小失真以降低失真?zhèn)鬟f惡化,從而實(shí)現(xiàn)時(shí)域量化控制優(yōu)化.目前,默認(rèn)δ-ρ模型采用基于經(jīng)驗(yàn)的log函數(shù),缺少理論依據(jù);實(shí)驗(yàn)結(jié)果表明,該簡(jiǎn)單log函數(shù)不能準(zhǔn)確刻畫最優(yōu)函數(shù)關(guān)系,模型對(duì)于不同視頻序列的普適性有待改進(jìn),這一不足在很大程度上影響了MBTree算法性能的充分發(fā)揮.

      針對(duì)上述問題,將競(jìng)爭(zhēng)決策方法應(yīng)用于探索最優(yōu)δ-ρ模型,收集大量最優(yōu)(ρ,δ)樣本數(shù)據(jù),進(jìn)行離線分析,提出一種率失真性能優(yōu)化的δ-ρ模型.下文安排如下:第1節(jié)分析時(shí)域自適應(yīng)量化控制算法框架及其不足;第2節(jié)給出基于競(jìng)爭(zhēng)決策的δ-ρ新模型及其量化控制算法;第3節(jié)給出實(shí)驗(yàn)結(jié)果;第4節(jié)為總結(jié)與展望.

      1 MBtree算法分析

      1.1 相對(duì)傳遞代價(jià)ρ

      MBTree算法基于視頻預(yù)分析的結(jié)果,度量各編碼單元相對(duì)傳遞代價(jià)ρ.預(yù)編碼分析在幀滑動(dòng)窗內(nèi)進(jìn)行,滑動(dòng)窗由當(dāng)前幀及若干鄰近幀組成.基于滑動(dòng)窗內(nèi)降采樣原始圖像進(jìn)行預(yù)編碼分析,采用預(yù)測(cè)誤差SATD度量幀內(nèi)和幀間預(yù)測(cè)代價(jià)ζintra和ζinter,以及幀間運(yùn)動(dòng)預(yù)測(cè)的參考傳遞代價(jià)γpropagate,基于這些參量計(jì)算編碼單元時(shí)域失真的相對(duì)傳遞代價(jià)ρ:

      (1)算法的關(guān)鍵是如何度量參考傳遞代價(jià)γpropagate.假設(shè)當(dāng)前幀s中位于(i,j)處的編碼單元標(biāo)記為(s,i,j),其γpropagate的估計(jì)和傳遞過程如圖1所示,分析如下:

      ①估算用SATD度量的幀內(nèi)、幀間預(yù)測(cè)代價(jià)ζintra和ζinter;若ζinter<ζintra,令ζinter=ζintra;

      ②計(jì)算當(dāng)前單元的代價(jià)傳遞權(quán)重系數(shù)ωpropagate:

      (2)

      ③估算當(dāng)前單元可以傳遞給參考幀位移匹配塊的總體傳遞代價(jià)量γamount:

      γamount(s,i,j)=[ζintra(s,i,j)+γpropagate(s,i,j)]×

      ωpropagate(s,i,j).

      (3)

      γpropagate為當(dāng)前單元(失真?zhèn)鬟f目標(biāo)塊dst)被所有鄰近塊(失真?zhèn)鬟f源塊src)預(yù)測(cè)參考產(chǎn)生的傳入?yún)⒖即鷥r(jià)γin(dst,src)之和,如⑤步分析.

      ④當(dāng)前單元在鄰近幀中的匹配塊可能覆蓋4個(gè)塊(t1,pf,qf),f=1~4,按照實(shí)際參考像素面積估算代價(jià)傳遞權(quán)重Λ(dst,src),根據(jù)權(quán)重按比例分配γamount,計(jì)算t1幀4個(gè)塊(t1,pf,qf),參考當(dāng)前編碼單元(s,i,j)產(chǎn)生的傳入代價(jià)γin(t1,pf,qf;s,i,j),計(jì)算如下:

      γin(t1,pf,qf;s,i,j)=γamount(s,i,j)×

      Λ(t1,pf,qf;s,i,j).

      (4)

      ⑤如圖1所示,假設(shè)鄰近幀有k個(gè)圖像塊(rk,mk,nk),部分或全部地參考了當(dāng)前單元(s,i,j),按式(4)分別計(jì)算傳入?yún)⒖即鷥r(jià)γin(s,i,j;rk,mk,nk),然后計(jì)算鄰近參考幀匹配塊傳遞給當(dāng)前單元的參考傳遞代價(jià)γpropagate:

      (5)

      ⑥計(jì)算當(dāng)前圖像單元的ζintra,ζinter和γpropagate,根據(jù)式(1)計(jì)算相對(duì)傳遞代價(jià)ρ,然后根據(jù)式(6)計(jì)算量化參數(shù)偏移量δ,用于調(diào)整當(dāng)前編碼單元量化參數(shù).

      δ=5(1-qcompress)×log2(1+ρ).

      (6)

      圖1 參考傳遞代價(jià)估算及分配傳遞過程示意圖(k=1,2,…,k)Fig.1 Diagram of estimation, distribution and transferprocess for reference propagation cost(k=1,2,…,k)

      1.2 傳統(tǒng)δ-ρ模型的不足

      式(6)中采用log函數(shù)描述相對(duì)傳遞代價(jià)ρ和量化偏移量δ之間的關(guān)系.qcompress是基于經(jīng)驗(yàn)預(yù)先設(shè)置的值,默認(rèn)為0.6.圖2是qcompress分別取0.3,0.6,0.9時(shí)的δ-ρ圖.

      實(shí)驗(yàn)結(jié)果表明,MBTree算法的性能因序列特征不同而變化,少數(shù)序列性能提升很小甚至無提升,如表1所示.這是因?yàn)椴煌蛄袑?duì)應(yīng)的最優(yōu)δ-ρ不同,而MBTree算法默認(rèn)的δ-ρ模型基于經(jīng)驗(yàn)因而相對(duì)粗糙,從而限制了其應(yīng)用,不利于進(jìn)一步優(yōu)化建模.本文嘗試探索更精細(xì)的模型以描述δ-ρ的關(guān)系,這對(duì)于改進(jìn)MBTree時(shí)域量化控制算法有積極意義.

      圖2 不同qcompress情況下的δ-ρ映射曲線Fig.2 The δ-ρ mapping functions in the casesof different qcompress

      序列l(wèi)og模型PSNR提升量/dB本文未迭代PSNR提升量/dB一次迭代PSNR提升量/dB二次迭代PSNR提升量/dBlog模型SSIMdB提升量/dB未迭代SSIMdB提升量/dB一次迭代SSIMdB提升量/dB二次迭代SSIMdB提升量/dBBridge?close0.67140.66240.72340.75250.43220.47280.51820.5454Mobile0.38720.24030.0202-0.10690.85520.92190.92530.8824News1.25411.30271.30841.24551.20221.29131.36771.3248Paris1.81581.85031.87411.82991.47701.57831.66311.6902Highway-0.0891-0.0626-0.0388-0.09530.07720.09870.12390.1256Bridge?far-0.2056-0.04060.01830.08880.05230.13290.13870.1589Foreman0.42300.45950.42470.46910.34770.44460.50520.5917Coastguard0.21290.17580.14430.10670.33560.37360.34890.3265Container0.58930.66040.72770.72290.33570.47180.55170.5767Hall0.28960.42200.43800.45430.30050.44840.50880.5920平均0.53480.56700.56400.54670.54160.62340.66520.6814

      2 基于競(jìng)爭(zhēng)決策的δ-ρ模型

      2.1 ρ樣本統(tǒng)計(jì)特性

      基于多個(gè)視頻序列收集了大量ρ樣本,統(tǒng)計(jì)分析結(jié)果如圖3所示.發(fā)現(xiàn)ρ主要集中在30之內(nèi),大于30的樣本所占比例極小.實(shí)驗(yàn)測(cè)試證明,小比例范圍的δ-ρ函數(shù),模型精度對(duì)算法影響極小,因此本文重點(diǎn)關(guān)注ρ∈[0,30]區(qū)間,根據(jù)收集的最優(yōu)δ-ρ樣本對(duì),采用競(jìng)爭(zhēng)決策方法離線分析構(gòu)建的δ-ρ函數(shù)模型.

      圖3 ρ值柱狀分布圖Fig.3 The histogram of ρ

      2.2 基于競(jìng)爭(zhēng)決策的建模

      競(jìng)爭(zhēng)決策算法[16]基于競(jìng)爭(zhēng)機(jī)制和決策原理,利用競(jìng)爭(zhēng)決策優(yōu)化確定最優(yōu)結(jié)果,是一種尋優(yōu)算法.實(shí)際上,競(jìng)爭(zhēng)決策是多個(gè)競(jìng)爭(zhēng)者經(jīng)過多次競(jìng)爭(zhēng)和決策后達(dá)到最優(yōu)競(jìng)爭(zhēng)均衡狀態(tài)的過程.對(duì)于每個(gè)輸入的ρ值,有多個(gè)可能的候選競(jìng)爭(zhēng)者δ,有且僅有一個(gè)最優(yōu)的δ值與之對(duì)應(yīng),使得率失真性能最優(yōu).基于此,本文擬采用競(jìng)爭(zhēng)決策方法離線探索優(yōu)化的δ-ρ模型.

      為了得到最優(yōu)δ-ρ模型,擬對(duì)ρ進(jìn)行分段優(yōu)化競(jìng)爭(zhēng),分別分段尋找率失真性能最優(yōu)的δ值.假設(shè)ρ取值區(qū)間為[0,T],將該區(qū)間等分成N段(索引為i),第i段對(duì)應(yīng)的ρi可表示為

      (7)

      每段ρi對(duì)應(yīng)的δ值標(biāo)記為κi=f(ρi),初始值由式(6)確定,但初始值對(duì)應(yīng)的性能并非最優(yōu);本文試圖給第i段κi施加一個(gè)偏移量ωi,得到調(diào)整后的δ值κi′=κi+ωi.采用競(jìng)爭(zhēng)決策為κi選擇最優(yōu)ωi:假定在κi基礎(chǔ)上ωi可上下偏移的最大值為Δδmax,將這個(gè)動(dòng)態(tài)范圍分為M個(gè)子段(M為奇數(shù)),每個(gè)子段索引為j;第j子段對(duì)應(yīng)的ωi記為ωij,算式如下:

      (8)

      在第i段優(yōu)化過程中,其他段ρi取式(6)確定的默認(rèn)偏移量.因此,對(duì)于第i段ρi而言,M個(gè)子段確定了M個(gè)可能競(jìng)爭(zhēng)者ωij,在第i段M個(gè)競(jìng)爭(zhēng)者確定的子段優(yōu)化過程中,各段和子段對(duì)應(yīng)偏移量κl′(j):

      (9)

      (10)

      (11)

      (12)

      然后進(jìn)行第i+1段競(jìng)爭(zhēng)決策,直到得到所有N段對(duì)應(yīng)點(diǎn)(ρi,δi),即為δ-ρ新模型.

      算法流程如下:

      (1)確定競(jìng)爭(zhēng)者:根據(jù)式(7)將[0,T]內(nèi)ρ值分成N段,由式(9)對(duì)每段ρi計(jì)算M個(gè)競(jìng)爭(zhēng)者ωij,如圖4所示.實(shí)驗(yàn)表明,T,N和M分別取30,120和21時(shí),能獲得較滿意的精度,此時(shí)Δδmax為5.25.

      (3)計(jì)算競(jìng)爭(zhēng)力:由競(jìng)爭(zhēng)力函數(shù)式(10),計(jì)算各競(jìng)爭(zhēng)者的率失真性能,如圖5所示.

      圖4 競(jìng)爭(zhēng)者示意圖Fig.4 Competitors in CDA based model building

      圖5 決策函數(shù)示意圖Fig.5 The competition function

      圖6 競(jìng)爭(zhēng)決策示意圖Fig.6 Flow diagram of CDA based model building

      2.3 改進(jìn)的量化控制算法

      競(jìng)爭(zhēng)決策每次迭代得到新的δ-ρ模型,體現(xiàn)為N段分段函數(shù),各段縱坐標(biāo)δ值相同.以i為區(qū)間索引,創(chuàng)建大小為N的一維數(shù)組δ[i].對(duì)當(dāng)前編碼單元按照1.1節(jié)步驟計(jì)算ρ值,并計(jì)算數(shù)組索引i:

      (13)

      數(shù)組δ[i]中存儲(chǔ)的是時(shí)域量化偏移量δMBTree,方差自適應(yīng)量化(VAQ)算法確定空域量化偏移量δVAQ,兩者共同調(diào)節(jié)幀級(jí)量化參數(shù)Qpfrm,得到最終量化系數(shù)Qpfinal,如式(14)所示,實(shí)現(xiàn)了宏塊級(jí)自適應(yīng)量化控制.

      Qpfinal=Qpfrm+δVAQ+δMBTree.

      (14)

      3 實(shí)驗(yàn)結(jié)果

      文獻(xiàn)[9]提出的算法和時(shí)域MBTree量化算法相對(duì)類似,該算法應(yīng)用在AVS參考代碼中,不同標(biāo)準(zhǔn)的參考代碼性能之間存在的差異是由眾多因素決定的,跨平臺(tái)特定模塊算法性能之間的比較缺乏公平性.考慮到x264是工業(yè)界性能最好的H.264/AVC編碼器,本文算法在x264平臺(tái)MBTree算法基礎(chǔ)上做了改進(jìn),所以將MBTree作為比較算法性能時(shí)的參考.

      本文基于x264平臺(tái)進(jìn)行算法性能驗(yàn)證,設(shè)置編碼參數(shù)為:presetslower-bframes2-slow-firstpass.圖7~9分別給出了hall序列3次競(jìng)爭(zhēng)決策算法得到的δ-ρ模型,以MBTree算法中原始模型作為對(duì)比.圖10分別給出了關(guān)閉MBTree算法、使用原log模型和使用本文二次迭代模型時(shí)的幀級(jí)失真結(jié)果.圖11與12給出了上述3種算法的率失真性能對(duì)比.結(jié)果表明,二次迭代的模型對(duì)MBTree算法的主客觀性能都有較明顯的提升;另外,如圖10所示,在幀級(jí)失真波動(dòng)與log模型相似的情況下,序列前面圖像幀失真明顯降低,表明本文算法的時(shí)域碼率分配調(diào)節(jié)發(fā)揮了作用.

      圖7 未迭代的δ-ρ模型Fig.7 The δ-ρ model without iteration

      圖8 一次迭代的δ-ρ模型Fig.8 The δ-ρ model with one iteration

      圖9 二次迭代的δ-ρ模型Fig.9 The δ-ρ model with two iterations

      圖10 幀級(jí)失真波動(dòng)Fig.10 The frame-level distortion fluctuation result

      圖11 率失真曲線(PSNR)Fig.11 The rate distortion curve (PSNR)

      圖12 率失真曲線(SSIMdB)Fig.12 The rate distortion curve(SSIMdB)

      為了綜合評(píng)判算法性能,對(duì)10個(gè)常用的標(biāo)準(zhǔn)序列進(jìn)行測(cè)試,率失真性能結(jié)果如表1所示.SSIM被廣泛用于評(píng)價(jià)主觀質(zhì)量[12-13],這里參考PSNR的定義,將SSIM分?jǐn)?shù)經(jīng)-10×log10(1-SSIM)轉(zhuǎn)化為dB度量的SSIM值(SSIMdB),用于比較算法的主觀圖像質(zhì)量.實(shí)驗(yàn)結(jié)果表明:大多數(shù)序列情況下本文模型的主客觀性能都有提升,BD-PSNR最大可提高0.14dB;SSIMdB最大可提高0.29dB,平均可提高0.14dB,且隨著迭代次數(shù)的增多,性能也有所提升.對(duì)于一些原始MBTree算法客觀性能損害的bridge-far序列,本文模型相較于log模型,其BD-PSNR性能提升了0.28dB.另外,多次迭代模型在BD-PSNR方面提升較少,但SSIMdB提升較顯著.表明經(jīng)過迭代的模型能更好地實(shí)現(xiàn)時(shí)域感知量化控制,提升主觀視頻質(zhì)量.

      本文提出的模型可應(yīng)用于實(shí)時(shí)編碼器.將離線獲得的模型存儲(chǔ)在表格中,在實(shí)際視頻編碼時(shí),通過查表獲得不同ρ值對(duì)應(yīng)的δ值.該模型與傳統(tǒng)的log函數(shù)模型復(fù)雜度接近.

      4 結(jié) 語

      視頻編碼幀間預(yù)測(cè)導(dǎo)致的失真在時(shí)域發(fā)生傳遞效應(yīng),考慮此時(shí)域失真依賴是提高碼率控制算法性能的有效方法.x264中MBTree算法根據(jù)失真?zhèn)鬟f的相對(duì)代價(jià)ρ,度量宏塊被參考的重要性程度,由簡(jiǎn)單的log函數(shù)模型計(jì)算量化偏移量δ,實(shí)現(xiàn)了宏塊級(jí)自適應(yīng)碼率控制.但是,原算法中δ-ρ模型基于經(jīng)驗(yàn)獲得,尚待進(jìn)一步優(yōu)化.針對(duì)該問題,本文將競(jìng)爭(zhēng)決策方法應(yīng)用于探索優(yōu)化的δ-ρ映射關(guān)系,提出了一種率失真性能優(yōu)化的δ-ρ模型.實(shí)驗(yàn)證明,相較于原log模型,本文模型在主客觀性能上均有明顯提升,同時(shí)有效改善了序列頭部失真的狀況.本文δ-ρ模型針對(duì)各個(gè)序列單獨(dú)優(yōu)化,分析序列特征、構(gòu)建內(nèi)容自適應(yīng)的數(shù)學(xué)模型,是下一步研究的目標(biāo).

      [1]LIB,LIHQ,LIL,etal.λdomainratecontrolalgorithmforhighefficiencyvideocoding[J]. IEEE Transactions on Image Processing, 2014,23(9):23-50.

      [2] LEE B, KIM M, NGUYEN T Q. A frame-level rate control scheme based on texture and nontexture rate models for high efficiency video coding[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2014,24(3):465-479.

      [3] YIN H B, YANG E H, YU X. Fast soft decision quantization with adaptive preselection and dynamic trellis graph [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2015,25(8):1362-1375.

      [4] RAMCHANDRAN K, ORTEGA A, VETTERLI M. Bit allocation for dependent quantization with applications to multi resolution and MPEG video coders[J]. IEEE Transactions on Image Processing, 1994,3(5):533-545.

      [5] LIU J Y, CHO Y, GUO Z M, et al. Bit allocation for spatial scalability coding of H.264/SVC with dependent rate-distortion analysis [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2010,20(7):967-981.

      [6] 陳杰,虞露.視頻編碼中考慮參考幀質(zhì)量的重建圖像失真模型[D].杭州:浙江大學(xué),2012. CHEN J,YU L. The Distortion Model of the Reconstructed Picture Considering Reference Frame in Video Coding[D]. Hangzhou: Zhejiang University, 2012.

      [7] PANG C, AU O C, ZOU F, et al. An analytic framework for frame-level dependent bit allocation in hybrid video coding [J]. IEEE Transactions on Circuits and Systems for Video Technology,2013,23(6):990-1002.

      [8] WANG S S, MA S W, WANG S Q, et al. Rate-GOP based rate control for high efficiency video coding [J]. IEEE Journal of Selected Topics in Signal Processing, 2013,7(6):1101-1111.

      [9] 朱策,周益民,李帥,等,基于信源失真時(shí)域傳播的視頻編碼率失真優(yōu)化(AVS-M3406)[C]//The 49th Meeting of AVS. 大連:大連理工大學(xué),2014. ZHU C, ZHOU Y M, LI S, et al. Rate distortion optimization for video coding based on source-end distortion propagation Chain (AVS-M3406)[C]//The 49th Meeting of AVS. Dalian: Dalian University of Technology,2014.

      [10] ORTEGA A, RAMCHANDRAN K. Rate-distortion methods for image and video compression [J]. IEEE Signal Processing Magazine, 1998,15(6):23-50.

      [11] LEE J S, EBRAHIMI T. Perceptual video compression: A survey [J]. IEEE Journal of Selected Topics in Signal Processing, 2012,6(6):684-697.

      [12] 蔣剛毅,朱亞培,郁梅,等.基于感知的視頻編碼方法綜述[J].電子與信息學(xué)報(bào),2013,35(2):474-483.

      JIANG G Y, ZHU Y P, YU M, et al. Perceptual video coding: A survey[J]. Journal of Electronics & Information Technology, 2013,35(2):474-483.

      [13] 崔子冠,朱秀昌.基于結(jié)構(gòu)相似的H.264主觀率失真性能改進(jìn)機(jī)制[J].電子與信息學(xué)報(bào),2012,34(2):433-439. CUI Z G, ZHU X C. Subjective rate-distortion performance improvement scheme for H.264 based on SSIM[J]. Journal of Electronics & Information Technology,2012,34(2):433-439.

      [14] 鄭明魁,蘇凱雄,王衛(wèi)星,等.基于視覺感知的高效視頻編碼標(biāo)準(zhǔn)幀內(nèi)量化矩陣優(yōu)化方法[J].電子與信息學(xué)報(bào),2014,36(12):2861-2868. ZHENG M K, SU K X, WANG W X, et al. An improved intra quantization matrix for high efficiency video coding based on visual perception[J]. Journal of Electronics & Information Technology, 2014,36(12):2861-2868.

      [15] JASON G. A novel macroblock-tree algorithm for high-performance optimization of dependent video coding in H.264/AVC [EB]. http://x264.nl/developers/Dark_Shikari/MBtree%20paper.pdf.

      [16] FUSS I G, NAVARRO D J. Open parallel cooperative and competitive decision processes: A potential provenance for quantum probability decision models [J]. Topics in Cognitive Science, 2013,5(4):818-843.

      YIN Haibing, WANG Hongkui, WANG Zhongxiao

      (CollegeofInformationEngineering,ChinaJiliangUniversity,Hangzhou310018,China)

      Rate control is crucial to rate distortion performance optimization in video coding design. In video coder, temporal prediction bring about distortion propagation along adjacent frames, and it is an efficient way to further improve the video coding efficiency by taking the temporal distortion dependency into consideration. The MBTree rate control is a typical temporal quantization control algorithm, in which the quantization parameter offsetδis employed for quantization adjustment according to the distortion propagation amount, i.e. the relative propagation costρ. An appropriateδ-ρmodel is therefore the key for the MBTree-like adaptive quantization algorithm. Nevertheless, the currentδ-ρmodel in MBTree algorithm is designed in an empirical way with rough accuracy. This model has unsatisfactory universality to different video sequences, thus there is still room left to be improved. This paper focuses on this problem and applies the competitive decision mechanism in exploring the optimizedδ-ρmodel, and then proposes an improvedδ-ρmodel with rate distortion optimization. The simulation results show that the improved MBTree algorithm based on the proposed model can achieve up to 0.14 dB BD-PSNR improvement and 0.29 dB SSIM improvement. The proposed algorithm can also implement better bit allocation in temporal domain and reduce the temporal distortion fluctuation, achieving adaptive quantization control.

      video coding; rate control; rate distortion optimization; distortion propagation; competitive decision

      2016-01-03.

      國(guó)家自然科學(xué)基金資助項(xiàng)目(61572449);浙江省自然科學(xué)基金資助項(xiàng)目(LY15F020022, LY13H180011).

      殷海兵(1974-),ORCID:http://orcid.org/0000-0002-3025-0938,男,碩士,教授,主要從事數(shù)字視頻編解碼研究,E-mail:haibingyin@163.com.

      10.3785/j.issn.1008-9497.2017.01.009

      TN 919

      A

      1008-9497(2017)01-057-07

      猜你喜歡
      碼率代價(jià)時(shí)域
      基于時(shí)域信號(hào)的三電平逆變器復(fù)合故障診斷
      基于狀態(tài)機(jī)的視頻碼率自適應(yīng)算法
      愛的代價(jià)
      海峽姐妹(2017年12期)2018-01-31 02:12:22
      代價(jià)
      基于極大似然準(zhǔn)則與滾動(dòng)時(shí)域估計(jì)的自適應(yīng)UKF算法
      基于時(shí)域逆濾波的寬帶脈沖聲生成技術(shù)
      基于場(chǎng)景突變的碼率控制算法
      X264多線程下碼率控制算法的優(yōu)化
      基于時(shí)域波形特征的輸電線雷擊識(shí)別
      成熟的代價(jià)
      时尚| 赤水市| 甘谷县| 和田县| 怀仁县| 大化| 东海县| 巴青县| 会理县| 临朐县| 临湘市| 平谷区| 兴和县| 板桥市| 广汉市| 星子县| 大丰市| 海安县| 荆门市| 荃湾区| 黑山县| 扶余县| 麻江县| 兴海县| 长宁区| 威海市| 邹城市| 咸阳市| 新晃| 宾川县| 南川市| 砀山县| 安塞县| 江门市| 余江县| 九江县| 卫辉市| 玛纳斯县| 松潘县| 昭平县| 凌海市|