• <tr id="yyy80"></tr>
  • <sup id="yyy80"></sup>
  • <tfoot id="yyy80"><noscript id="yyy80"></noscript></tfoot>
  • 99热精品在线国产_美女午夜性视频免费_国产精品国产高清国产av_av欧美777_自拍偷自拍亚洲精品老妇_亚洲熟女精品中文字幕_www日本黄色视频网_国产精品野战在线观看 ?

    Unified FPGA Design for the HEVC Dequantization and Inverse Transform Modules

    2022-08-23 02:15:38TurkiAlanaziandAhmedBenAtitallah
    Computers Materials&Continua 2022年6期

    Turki M.Alanazi and Ahmed Ben Atitallah

    Department of Electrical Engineering,Jouf University,Sakaka,Aljouf,2014,Saudi Arabia

    Abstract:As the newest standard,the High Efficiency Video Coding(HEVC)is specially designed to minimize the bitrate for video data transfer and to support High Definition (HD) and ULTRA HD video resolutions at the cost of increasing computational complexity relative to earlier standards like the H.264.Therefore,real-time video decoding with HEVC decoder becomes a challenging task.However, the Dequantization and Inverse Transform(DE/IT) are one of the computationally intensive modules in the HEVC decoder which are used to reconstruct the residual block.Thus,in this paper,a unified hardware architecture is proposed to implement the HEVC DE/IT module for all Transform Unit(TU)block size,including 4×4,8×8,16×16 and 32×32.This architecture is designed using the High-Level Synthesis(HLS)and the Low-Level Synthesis(LLS)methods in order to compare and determine the best method to implement in real-time the DE/IT module.In fact, the C/C++ programming language is used to generate an optimized hardware design for DE/IT module through the Xilinx Vivado HLS tool.On the other hand, the LLS hardware architecture is designed by the VHSIC Hardware Description language(VHDL)and using the pipeline technique to decrease the processing time.The experimental results on the Xilinx XC7Z020 FPGA show that the LLS design increases the throughput in term of frame rate by 80% relative to HLS design with a 4.4% increase in the number of Look-Up Tables(LUTs).Compared with existing related works in literature,the proposed architectures demonstrate significant advantages in hardware cost and performance improvement.

    Keywords: HEVC decoder; dequantization; IDCT/IDST; LLS design; HLS design;FPGA

    1 Introduction

    Nowadays, several consumer electronic devices such as television [1], smartphone [2], tablet [3],etc use video standard codec for video compressing and transmitting data with minimum bitrate.In this context,a High Efficiency Video Coding(HEVC)video standard is developed[4,5].The HEVC is a video compression standard that can provide a good performance than the previous standard,i.e.,H.264/AVC [6].In fact, it can support a high video resolution which can reach to 8K (7680×4320)and reduces the bitrate approximately by 50%relative to H.264/AVC[7,8]with same video quality.

    In video standard codec the Dequantization and Inverse Transform(DE/IT)play a very important role to reconstruct the compressed video sequences [9].Nevertheless, in HEVC video decoder, the DE/IT module allows to reconstruct the residual block which can be with several size 4×4, 8×8,16×16, and 32×32.In fact, the largest coding unit in HEVC can be up to 64×64 in size, and the Transform Unit(TU)sizes can be 4×4,8×8,16×16,and 32×32[10].This multiple TU sizes improve the compression performance but increase the computational complexity to reach a real-time execution [11,12].In this context, based on the complexity analysis of the HEVC decoder modules for all-intra configuration performed in[13],we can notice that the entropy decoding(ED),the intra prediction(IP)and the DE/IT modules consume on average 38%,32%and 20%of the total decoding time,respectively.However,according to the analysis given in[14]for the ED module,it is obviously that the computational complexity presented in the regular CABAC algorithm makes it difficult to be pipelined and parallelized in such hardware context.This complexity is caused by the critical bin-tobin data dependencies and the use of feedback between entropy decoding steps.On the other hand,we have proposed in[13]a hardware architecture to implement the IP block.For this reason,we focus in this work to propose an efficient hardware design to reduce the computational complexity of the HEVC DE/IT module.

    Recently, field-programmable gate arrays (FPGAs) have been gaining popularity for image and video processing.Indeed, modern FPGAs have sufficient resources to implement a complex application[15,16].The traditional approach used to design and implement any algorithm in FPGA is Low-Level Synthesis(LLS)using hardware description language(HDL)such as VHSIC hardware description language (VHDL or Verilog).With such low-level design, it is possible to adjust the Register Transfer Level (RTL) description to generate an optimized hardware architecture [17–19].But,this kind of design requires a lot of time and effort specially for complex algorithm.To alleviate this problem,the High-Level Synthesis(HLS)is introduced[20–22].Nevertheless,the HLS flow uses the high-level programming language like C/C++, systemC, etc to generate automatically the RTL design.This approach makes the code more readable, shortens design and verification times, and increases the design reusability over those of handwritten HDL equivalents.

    In literature, many architectures are proposed to implement the dequantization and inverse transform modules for HEVC decoder.In fact, the design outlined in [23] presents an FPGA implementation of the HEVC 2D integer inverse discrete cosine transform(2D-IDCT)using different HLS tools.The developed design can decode 54 frames/sec(FPS)for 1080p video sequences.In[24],the authors provide a System-On-Chip FPGA platform based on Xilinx Zynq to integrate the DCT coding block as an accelerator with HLS tool.The proposed design is capable to perform the coding of 1080@30fps.In [25], a LLS hardware architecture for 2D-IDCT is proposed.This architecture can process 4K@28fps at 135 MHz under XC7Z045 Xilinx FPGA.In addition, Chen et al.[26]design a 2D-IDCT architecture that supports all TU sizes.This architecture can calculate two rows in parallel during the 1D-IDCT instead of only one.In this case,the maximum throughput achieved is about 4K@30fps with the Xilinx Zynq platform.But Liang et al.[27] present an architecture that supports the 2D integer inverse discrete sine transform (2D-IDST) and 2D-IDCT using two 1DIDCT/IDST units and memory block.This architecture calculates four residual pixels in parallel in each clock cycle.This design can decode 7680×4320@30fps.In [28] a hardware implementation of the 2D dequantization,IDCT and IDST(2D-DE/IDCT/IDST)for HEVC decoder is described.The proposed architecture can perform the decoding of 4K@30fps at 200 MHz clock frequency in TSMC 40 nm technology.However, such implementation presents a good compromise in terms of energy efficiency,resources utilization and performance but miss flexibility in term of programmability.On the other,in[29] an efficient OpenCL implementation of the HEVC 2D-DE/IDCT/IDST module is proposed.This software implementation is realized based on an exploitation of the parallel processing offered by modern Graphic Processing Unit (GPUs).The proposed solution can decode 4K@15fps on GeForce GTX 780Ti@1046 MHz.This implementation preserves the flexibility and accuracy of software but with an increase in power consumption.

    When he reached the first court of the castle he saw before him a flight of agate20 steps, and went up them, and passed through several splendidly furnished rooms

    Hence, the aim of this paper is to provide a unified and optimized hardware architecture to implement the 2D-DE/IDCT/IDST module for HEVC decoder.This architecture should support 4×4, 8×8, 16×16 and 32×32 HEVC TU block size and offer a trade-off between performance,hardware cost and processing time.For this, the LLS and HLS design flow are used and explored to design a hardware architecture for HEVC 2D-DE/IDCT/IDST module.However, the HLS 2DDE/IDCT/IDST design is explored through the Xilinx Vivado HLS 2018.1 tool by adding specific directives (e.g., PIPILINE, RESSOURCE, etc) to the high level programming language such as C/C++code.But,the LLS 2D-DE/IDCT/IDST design is developed by using the pipeline technique through the VHDL language.The hardware architectures for both methods are mapped and evaluated on Xilinx XC7Z020 FPGA for processing time and hardware cost in order to determine which design method(LLS or HLS)provides better design productivity when facing a complex algorithm like the 2D-DE/IDCT/IDST module for HEVC decoder.

    He nodded to them, saying, It must be rather dull standing28 on the stairs; I would rather go inside! The halls blazed with lights; councillors and ambassadors were walking about in noiseless shoes carrying gold dishes

    The remainder of the paper is structured as follows.Section 2 introduces an overview of HEVC 2D-DE/IDCT/IDST module.Section 3 describes the hardware architecture designed for 2D-DE/IDCT/IDST module using HLS and LLS design flow.The implementation results and performance evaluation are reported in Section 4.Finally,Section 5 concludes the paper.

    In order to improve the design performances, several directives are added incrementally to the HEVC 2D-DE/IDCT/IDST C code.A part of the C code developed and given as input to Xilinx Vivado HLS tool 18.1 is shown in Figs.4 and 5 used for dequantization and 1D-IDCT8,respectively.Thus, several designs are generated to implement the 2D-DE/IDCT/IDST module.In fact, Design 1 is generated without adding any directive.The synthesis results on the Xilinx XC7Z020 FPGA shows that this design uses 21466 (40.3%) of Look-Up Tables (LUTs), 10252 (19.3%) of Flip-Flops(FFs), 40 (18.2%) of DSP blocks and 30 (21.4%) of BRAM as illustrated in Fig.6 and needs in worst case 15562 clock cycles to reconstruct the 32×32 residual block (Fig.7).From these results,we notice that the number of clock cycles is high.For that, the PIPELINE directive should be used to decrease the number of clock cycles.In fact,from Figs.4 and 5,we can see that the dequantization and IDCT C codes use loop iteration to reconstruct the transom and residual coefficients.The length of the loop iterations depends of the TU size.So, the loop can reach 1024 iterations.However, the PIPLINE directive is applied to loop iterations with interval equal to 1 to reduce the cycles number of latencies required for an input data to reach its output.Consequently, the Design 2 is created.According to Figs.6 and 7,Design 2 allows to decrease by 91%the number of clock cycles but with an important increase in the hardware cost by 45%of LUTs,62.3%of FFs,47.4%of DSP blocks and 44.5%of BRAMs relative to Design 1.Therefore,to reduce the hardware cost,the Design 3 is generated by using the ALLOCATION and RESSOURCE directives.Indeed,the ALLOCATION directive is added to process the multiplication operations in the level of dequantization equation which allows to share the hardware resources between several operations.Furthermore,the RESOURCE directive is used to implement the residual and dequantization arrays by a specific memory block (BRAMs).This optimization has allowed to decrease the hardware cost by 42%of LUTs,30.8%of FFs,42.1%of DSP blocks and 55.5% of BRAMs but with an increase by 61% in the number of clock cycles compared to Design 2 as reported in Figs.6 and 7.On the other hand,we can constate that the Design 3 provides a decrease in number of clock cycles by about 67%in worst case but with an increase by 5%of LUTs,45%of FFs and 9%of DSP blocks relative to Design 1.Afterward,the Design 3 is used for the comparative study with LLS 2D-DE/IDCT/IDST because it offers a trade-off between hardware cost and processing time.

    2 Dequantization and Inverse Transform in HEVC Decoder

    In HEVC, each frame is partitioned into coding tree block structure involving different sizes of large coding units(LCUs)up to 64×64.As illustrated in Fig.1,each LCU can be recursively split into several sizes of coding units(CUs).Starting from a defined partition of CUs,it is possible to further generate two other types of units corresponding to prediction unit(PU),which is used for inter/intra prediction processes,and TU,which is the elementary unit of dequantization and inverse transform.However,the size of the TU block is defined by HEVC encoder and can be 4×4,8×8,16×16 and 32×32.

    The hardware architecture depicted in Fig.8 describes the LLS design of the HEVC 2DDE/IDCT/IDST module.In fact, this architecture is composed by four dequantization units(Unit_DQ0, Unit_DQ1, Unit_DQ2 and Unit_DQ3), 1D-IDCT/IDST unit, transpose memory unit and control unit.It receives as input four 16-bit coefficients(Coeff0,Coeff1,Coeff2 and Coeff3)each three clock cycles from the entropy decoder,size of TU block(selsignal),QP value(QPsignal)andstartsignal and generates as output the residual block andDone_DQ/IDCTsignal which indicates that the residual block is ready.However,the designed architecture contains the dequantization and 1D-IDCT/IDST units in one design in order to benefit of the pipeline technique which can decrease the processing time.

    If you do not come, and if I do not hear from you, then I shall know that you could never be happy so far from the people and the country which you love

    Figure 1:Coding structure in HEVC standard

    Table 1: Definition of f(QP%6)

    After dequantization,the 2D-IDCT is performed.In fact,the IDCT module takes dequantized coefficient and performs as two separate 1D-IDCT to obtain is outputs the residual block.The HEVC decoder supports two types of inverse transform which are IDCT and IDST.The IDST is applied only to the 4×4 TU block.During decoding,the transformed coefficients are converted back to the spatial domain via an inverse transform.According the HEVC,the 2D-IDCT/IDST can be expressed by Eq.(3).

    When the kids came in, he took them for walks along the pier21 near their office. Often she went along and watched Eric, who was becoming a master of sign language, talk and laugh with her boys as no one else had before.

    where SRC is the transformed block, DST is the residual block and M is a N×N IDCT/IDST coefficient matrix which defined by HEVC standard.The value N is depending of the TU size.For each TU size,it has its own transform matrix.But,the 32×32 TU’s transform matrix includes other sizes TU’s transform matrix.In HEVC,the 2D-IDCT/IDST is computed by two 1D-IDCT/IDST where a column inverse transform is followed by a row inverse transform.

    To decrease the implementation complexity of 2D-IDCT/IDST,Chen et al.[30]was proposed a fast algorithm for transform by decomposing the transform matrixMNto some zero matrixes.This decomposition technique is presented by Eq.(4).

    wherePNis the permutation matrix.BNis theN-point butterfly structure.MN,oddandMN,evenare the odd and even parts ofMN, respectively.For example, Eq.(5) illustrates the 8×8 coefficient matrix in HEVC and Eq.(6)the decomposition of this matrix.Fig.2 presents the flowchart of 8-points 1DIDCT based on Chen’s algorithm.

    where:

    Figure 2:Flowchart of 8-points 1D-IDCT based on chen’s algorithm

    3 2D-DE/IDCT/IDST Hardware Architecture

    In this section,we describe the HLS and LLS hardware architectures designed to implement the HEVC 2D-DE/IDCT/IDST algorithm on Xilinx XC7Z020 FPGA.In this work,the HEVC test Model(HM16.0)[31]is used as reference software.

    3.2.2 1D-IDCT/IDST

    3.1 HLS 2D-DE/IDCT/IDST Hardware Architecture

    HLS is gaining more and more popularity specially when using FPGA circuit.Nevertheless,with HLS,it becomes possible to reduce the conception and validation time of the hardware design.Therefore,the exploration and the simulation of multiple hardware architectures can be done in the shortest time.But,HLS requests designers to restructure programs,change a source code and add a specific directive to get a good result.In this context,Xilinx developed the Vivado HLS tool.This tool accepts as input a high-level programming language such as C/C++and generates automatically as output an RTL hardware description.Through this tool,it is possible to add several directives(such as LOOP unrolling,ALLOCATION,RESOURCE,etc)in order to generate an optimize RTL design in terms of hardware cost and processing time.

    3.2.1 Unit_DQ

    Figure 3:Dataflow of the HLS 2D-DE/IDCT/IDST module for HEVC decoder

    However, in the beginning, the HLS architecture receives as input the TU size, the correspond coefficients(maximum 1024 coefficients)and the QP value.Then,these coefficients are dequantized to generate the transform coefficients.After that,if the TU size is egal to 4×4,thus in the first step,the 1D-IDCT4/IDST4 will be applied to the columns of TU to generate the 1D-transfrom coefficients.In the second step,these coefficients are stored in transpose memory to be used for 2D-transfrom.In the last step,the 1D-IDCT4/IDST4 will be applied to the row of TU to reconstruct the residual block.But,if the TU size is equal to 8×8,16×16 or 32×32,so the 4-point odd,8-point odd and 16-point odd are used with 4-point even and butterfly module to produce 1D/2D-IDCT8/16/32 coefficients,respectively.

    “Dance you shall,” said he, “dance in your red shoes till you are pale and cold, till your skin shrivels up and you are a skeleton! Dance you shall, from door to door, and where proud and wicked children live you shall knock, so that they may hear you and fear you! Dance you shall, dance—!”

    Woe82 to him who chooses the middle path! if he had a thousand lives he would not save one; it is very hazardous83; it leads to the Caucasus, and is an endless road

    Figure 4:Dequantization Vivado HLS C code

    Figure 5:8-point IDCT Vivado HLS C code

    Figure 6:HLS synthesis results of HEVC 2D-DE/IDCT/IDST module

    3.2 LLS 2D-DE/IDCT/IDST Hardware Architecture

    Thus,the 2D-DE/IDCT/IDST module receives the coefficients of the TU block from the entropy decoder and applies the dequantization to restore the original Transform coefficients.The dequantization scheme as specified be HEVC is given by Eq.(1).

    Figure 7:Number of clock cycles to reconstruct 4/8/16/32 residual block

    Figure 8:LLS hardware architecture for HEVC 2D-DE/IDCT/IDST module

    For the HLS implementation of the HEVC 2D-DE/IDCT/IDST module, the C code of this module is extracted from HM16.0.The 2D-DE/IDCT/IDST algorithm is implemented with HLS based on the algorithm proposed in Fig.3.In fact, some characteristics are taken account in this algorithm to reduce the hardware cost and complexity and generate a performant HLS design for 2DDE/IDCT/IDST module.However,the HLS design supports 4/8/16/32 TU size.The dequantization and 2D-IDCT/IDST are integrated in one design to minimize the processing time.Moreover,the evenodd 1D-IDCT/IDST algorithm is used as shown in Fig.2.In addition,8-point 1D-IDCT is computed by using 4-point IDCT (even part), 4-point odd and butterfly module.Even for 16-point IDCT is calculated based-on 8-point 1D-IDCT, 8-point odd and butterfly module.Also, 32-point IDCT is determined from 16-point 1D-IDCT,16-point odd and butterfly module.Further,all multiplication operations in 1D-IDCT/IDST are replaced by shift and addition based on the study realized in[25].Furthermore,1D-IDCT/IDST module is called two time and the intermediate memory block is used to save and transpose coefficients to generate 2D-IDCT/IDST.All these characteristics allow to reduce the hardware cost and hardware complexity.

    Fig.9 presents the hardware architecture of Unit_DQ.This architecture is developed based on Eq.(1).Indeed, two Read-Only-Memory (ROM1 and ROM2) are used to store the precalculated values proportional to QP%6 and QP/6,respectively.These memories are addressed based on QP value through the control unit.However, the 16-bit level signal receives the quantized coefficient which is multiplied by the value collected for ROM1.Then,the obtained result is shifted by the value collected from ROM2, added to the offset value and shifted by offset1.In the end, the 16-bit dequantization coefficient is obtained in two clock cycles through CoeffQ signal.

    Figure 9:Hardware architecture of Unit_DQ

    When the Emperor heard from the Prince how he had gained possession of his fair prize, he at once recognized that he had been helped by some magic art, and on the spot gave up all claim to the beautiful mermaid

    Fig.10 depicts the 1D-IDCT/IDST hardware architecture.This architecture is designed to support 4×4, 8×8, 16×16 and 32×32 TU size and use even-odd IDCT decomposition.In fact, the 32-point 1D-IDCT is constructed by using a 4-point even, a 4-point odd, an 8-point odd and a 16-point odd.Moreover,to reduce the hardware cost,the multiplication operations are replaced by shift and addition.Also, when analyzing the IDCT equation, we constate that there are some repetitive coefficients (X2, X4, X9, X18, X36, X64, X90).The Xcoeff components is used to compute these coefficients which are used to deduce the other cofficients(i.e.,:X75=X64+X9+X2).However,1DIDCT/IDST architecture receives dequantized coefficients through 32 SRC signals and produces the transformed coefficients through 32 DST signals in 12 clock cycles, 24 clock cycles, 80 clock cycles and 160 clock cycles for 4/8/16/32 TU size,respectively.

    where coeffDQ is the dequantized coefficient, level is the quantized DCT coefficient, QP is the quantization parameter ranged from 0 to 51 (every TU has its own QP value), N is the size of TU block,B is the bit depth and the function f(QP%6)is determined from Tab.1.The coeffDQ is clipped to the range[-32768,32768]to guarantee that the dequantized coefficient is computed with 16-bit.

    Figure 10:1D-IDCT/IDST hardware architecture

    3.2.3 Transpose Memory

    The transpose memory is used to store the intermediate coefficients between column and row of the inverse transform.It can store the coefficients of all TU size.The access to memory is optimized by concatenation eight 16-bit coefficients.In fact,in one clock cycles,it is possible to write and read 128-bit which mean eight coefficients in same time.

    3.2.4 Control Unit

    The control unit serves to share and synchronize data between all units in our design as shown in Fig.11.Accordingly,for 4×4 TU size,in the first step the control unit send four 16-bit coefficients(1stcolumn)to the dequantized units.Then,in the second step,the dequantized units receive the 2sdcolumn after 3 clock cycles and the 1D-IDCT4/IDST4 process the 1stcolumn in 3 clock cycles.After that,in the third step,1stcolumn is concatenated and stored in transpose memory in one clock cycle,the 2sdcolumn is processed by inverse transform in 3 clock cycles and the 3thcolumn is treated by the dequantized units in 2 clock cycles.Thus,the pipeline technique is used between all units to optimize the processing time.So,for 4×4 TU size, firstly the TU is processed column by column by dequantized units and 1D-IDCT4/IDST4 and the output coefficients for each column are stored in transpose memory.This step needs 16 clock cycles.Then,the 1D-IDCT4/IDST4 is performed again row by row from transpose memory.In the end,the DE/IDCT of 4×4 TU size is obtained in 29 clock cycles.All these steps are used for 8/16/32 TU size and need 77 clock cycles,280 clock cycles and 938 clock cycles,respectively as shown in Fig.11.

    Figure 11:Timing diagram for HEVC 2D-DE/IDCT/IDST module

    4 Implementation Results and Performance Evaluation

    Tab.2 conducts a comparison of the synthesis results,number of clock cycles and clock frequency for HLS and LLS 2D-DE/IDCT/IDST designs under XC7Z020 FPGA for 4×4,8×8,16×16 and 32×32 TU size.It can be seen from this table that the HLS design uses less LUTs by 4.4%but more FFs,RAMs and DSPs by 9.5%,5.7%and 18.2%,respectively,relative to LLS design.Moreover,the LLS design allows to reduce the number of clock cycles by 64%compared to HLS design.

    Table 2: Synthesis results of HLS and LLS flow for 2D-DE/IDCT/IDST under XC7Z020 FPGA

    On the other hand,the performance of HLS and LLS design for HEVC 2D-DE/IDCT/IDST is measured for several class of video sequences such as Class A(2560×1600),Class B(1920×1080),Class C(1280×720)and Class D(832×480).So,from Fig.12,we can see that the frame rate of LLS design can reach 339 fps for class D and 33 fps for class A instead of 66 fps for class D and 6 fps for class A using HLS design.However, with LLS design the frame rate is increased by 80% relative to HLS design in worst case as shown in Fig.13.

    Figure 12:Frame rate comparison between HLS and LLS designs for A,B,C and D video class

    Figure 13:Gain in frame rate for LLS relative to HLS for A,B,C and D video class

    Comparing our HLS 2D-DE/IDCT/IDST design with the HLS design proposed in[23]and[24]used to implement only the 2D-IDCT and 2D-DCT,respectively,we can constate from Tab.3 that our design allows a gain in LUTs and DSP blocks reaching approximately 55% and 59% relative to [23]and[24],respectively.Moreover,the maximal throughput of our design could perform the decoding of 1080p@13fps at 100 MHz instead of the 1080p@54fps at 208 MHz and 1080p@30fps found in[23] for 2D-IDCT and [24] 2D-DCT only, respectively.On the other hand, the designs [25] and [26]propose the LLS design for the 2D-DCT and 2D-IDCT/IDST only which can achieve 4K@28fps and 4K@30fps,respectively.But,our proposed LLS 2D-DE/IDCT/IDST design can reach 1080p@65fps.Hence,our design has better performance than[25]and[26]in term of frame rate and also use lower LUTs and DSP blocks count than[25]and[26]by 28%and 96%,respectively,as reported in Tab.3.Further,comparing the LLS 2D-DE/IDCT/DST design with[21],we can constate that our design can process 4K@16fps at 145 MHz instead of 4K@15fps on GeForce GTX 780Ti@1046 MHz in [29].So, our design is more performant and can achieves lower power consumption.Besides, our design permits more flexibility than[27]and[28]with the almost same performance.

    Table 3: Literature comparison of the dequantization and inverse transform designs

    Table 3:Continued

    5 Conclusion

    In this work, a unified hardware architecture is proposed to implement the HEVC 2DDE/IDCT/IDST module for 4/8/16/32 TU block size.However,two design methods are used to design this hardware architecture which are the HLS and the LLS design flow.Our goal was to compare these two methods and to select the best architecture to implement the HEVC 2D-DE/IDCT/IDST module.It is clear from experimental results under Xilinx XC7Z020 FPGA that the LLS design is more performant than HLS design in terms of processing time and hardware cost.But, the performance of HLS design depends on the selected directives, and the algorithm complexity and can be a good solution to speed up the design time and time to market(TTM).

    Funding Statement:This work was funded by the Deanship of Scientific Research at Jouf University(Kingdom of Saudi Arabia)under grant No.DSR-2021-02-0391.

    The spring sun had chased away the last snow from its hiding place under the hedges; the fields were full of flowers; nightingales sang in the trees, and all the world was gay

    But his mother told him that it was an unhappy man who had lost everything, and had come all this way to consult him, and bade the young man not to be afraid, but to come forward and show himself

    Conflicts of Interest:The authors declare that they have no conflicts of interest to report regarding the present study.

    日韩亚洲欧美综合| 国产精品99久久99久久久不卡 | 内地一区二区视频在线| 久久av网站| 三级国产精品欧美在线观看| 亚洲自偷自拍三级| 亚洲av免费高清在线观看| 国产视频内射| 欧美+日韩+精品| 亚洲一区二区三区欧美精品| 午夜激情久久久久久久| 久久久久网色| 国产免费又黄又爽又色| 天天操日日干夜夜撸| 欧美bdsm另类| 少妇猛男粗大的猛烈进出视频| 国产日韩欧美视频二区| 91精品一卡2卡3卡4卡| 22中文网久久字幕| 久久ye,这里只有精品| 男人和女人高潮做爰伦理| 欧美成人午夜免费资源| 日日摸夜夜添夜夜添av毛片| 亚洲av欧美aⅴ国产| 国产欧美另类精品又又久久亚洲欧美| 人妻人人澡人人爽人人| 午夜影院在线不卡| 成人毛片60女人毛片免费| 亚洲一级一片aⅴ在线观看| 久久午夜福利片| 99热这里只有精品一区| 久久久久久久久久人人人人人人| 国产成人免费观看mmmm| 大香蕉97超碰在线| 少妇精品久久久久久久| 插逼视频在线观看| 国产亚洲欧美精品永久| 国产精品一区www在线观看| 欧美一级a爱片免费观看看| 肉色欧美久久久久久久蜜桃| 精品午夜福利在线看| 午夜老司机福利剧场| 亚洲图色成人| 欧美三级亚洲精品| 九色成人免费人妻av| 国产精品熟女久久久久浪| 国产精品麻豆人妻色哟哟久久| 欧美区成人在线视频| 搡女人真爽免费视频火全软件| 国产高清有码在线观看视频| 国产69精品久久久久777片| 亚洲三级黄色毛片| 你懂的网址亚洲精品在线观看| 精品久久久久久电影网| 插阴视频在线观看视频| 日本欧美视频一区| 国产女主播在线喷水免费视频网站| 亚州av有码| 午夜福利影视在线免费观看| 国产日韩欧美在线精品| 国产黄色免费在线视频| 一级毛片久久久久久久久女| 国产精品伦人一区二区| 色94色欧美一区二区| 又大又黄又爽视频免费| 99久久综合免费| 丝瓜视频免费看黄片| 亚洲经典国产精华液单| 色婷婷av一区二区三区视频| 婷婷色综合www| 国产成人精品一,二区| 寂寞人妻少妇视频99o| 自拍欧美九色日韩亚洲蝌蚪91 | 美女大奶头黄色视频| xxx大片免费视频| 一级毛片我不卡| 精品亚洲乱码少妇综合久久| 两个人的视频大全免费| 久久ye,这里只有精品| a级毛色黄片| 日韩成人伦理影院| 成人国产麻豆网| 久久韩国三级中文字幕| 国产一区有黄有色的免费视频| 欧美xxⅹ黑人| 激情五月婷婷亚洲| 啦啦啦啦在线视频资源| .国产精品久久| 精品一区二区免费观看| 在线 av 中文字幕| 成人综合一区亚洲| 91午夜精品亚洲一区二区三区| av一本久久久久| 久久精品国产亚洲网站| 日日摸夜夜添夜夜添av毛片| 国产一区二区在线观看av| videossex国产| 日本91视频免费播放| 欧美日韩一区二区视频在线观看视频在线| 一级毛片电影观看| 三上悠亚av全集在线观看 | 中文乱码字字幕精品一区二区三区| 日韩精品免费视频一区二区三区 | 两个人免费观看高清视频 | 高清毛片免费看| 99精国产麻豆久久婷婷| 免费av中文字幕在线| 国产日韩欧美视频二区| 少妇高潮的动态图| 欧美精品国产亚洲| 最近2019中文字幕mv第一页| 久久久欧美国产精品| 一区二区三区精品91| 综合色丁香网| av在线app专区| 日本免费在线观看一区| 亚洲国产精品一区三区| 亚洲在久久综合| av福利片在线| 亚洲综合精品二区| 久久毛片免费看一区二区三区| 自拍偷自拍亚洲精品老妇| 校园人妻丝袜中文字幕| 少妇精品久久久久久久| 天天躁夜夜躁狠狠久久av| 精品一区在线观看国产| 熟妇人妻不卡中文字幕| 精品国产一区二区三区久久久樱花| 18禁裸乳无遮挡动漫免费视频| 国产淫片久久久久久久久| 另类亚洲欧美激情| 草草在线视频免费看| 一区二区三区乱码不卡18| 青春草视频在线免费观看| 这个男人来自地球电影免费观看 | 亚洲国产成人一精品久久久| 日本-黄色视频高清免费观看| 色哟哟·www| 建设人人有责人人尽责人人享有的| 日韩,欧美,国产一区二区三区| 国产一区二区在线观看av| 色婷婷久久久亚洲欧美| a级片在线免费高清观看视频| 一区在线观看完整版| 99九九在线精品视频 | 久久久久久久精品精品| 亚洲三级黄色毛片| 少妇丰满av| 亚洲高清免费不卡视频| 亚洲人成网站在线观看播放| 国产 一区精品| videossex国产| 国产免费又黄又爽又色| av福利片在线| 女性被躁到高潮视频| 少妇的逼水好多| 欧美日韩精品成人综合77777| 一级毛片我不卡| 日韩欧美一区视频在线观看 | 老熟女久久久| 婷婷色麻豆天堂久久| 国产av国产精品国产| 99久久人妻综合| 午夜福利网站1000一区二区三区| 国产av国产精品国产| 91成人精品电影| 在线看a的网站| 国产视频首页在线观看| 极品少妇高潮喷水抽搐| 亚洲真实伦在线观看| 热re99久久国产66热| 涩涩av久久男人的天堂| 国产午夜精品久久久久久一区二区三区| 91精品国产国语对白视频| 一个人免费看片子| 少妇人妻一区二区三区视频| 日韩免费高清中文字幕av| 蜜桃在线观看..| 99热国产这里只有精品6| 亚洲欧美清纯卡通| 久久久欧美国产精品| 精品国产乱码久久久久久小说| 婷婷色综合www| a级片在线免费高清观看视频| 亚洲美女视频黄频| 久久精品国产亚洲网站| 99视频精品全部免费 在线| 亚洲内射少妇av| 国产无遮挡羞羞视频在线观看| 国产高清不卡午夜福利| 99九九线精品视频在线观看视频| 免费少妇av软件| 自拍偷自拍亚洲精品老妇| 欧美日韩国产mv在线观看视频| 另类亚洲欧美激情| 精品国产一区二区久久| 亚洲欧美日韩另类电影网站| 高清午夜精品一区二区三区| 久久久久久久精品精品| 成年女人在线观看亚洲视频| 久久99蜜桃精品久久| av国产久精品久网站免费入址| 精品一品国产午夜福利视频| 日本vs欧美在线观看视频 | 久久精品久久久久久噜噜老黄| 中文字幕精品免费在线观看视频 | 成人综合一区亚洲| 搡女人真爽免费视频火全软件| 曰老女人黄片| 美女脱内裤让男人舔精品视频| 久久ye,这里只有精品| 久久精品国产a三级三级三级| 两个人的视频大全免费| 久久久久久久精品精品| 少妇被粗大的猛进出69影院 | 黄色配什么色好看| 成人特级av手机在线观看| 亚洲av不卡在线观看| 国产免费福利视频在线观看| 国产av国产精品国产| 男女无遮挡免费网站观看| 精品久久久久久久久亚洲| 一本一本综合久久| 人人妻人人澡人人爽人人夜夜| 亚洲欧洲日产国产| 老女人水多毛片| 2022亚洲国产成人精品| 一级毛片我不卡| 国产伦理片在线播放av一区| 多毛熟女@视频| 国产爽快片一区二区三区| 纵有疾风起免费观看全集完整版| 色视频在线一区二区三区| 一级爰片在线观看| 亚洲av成人精品一二三区| 黄色视频在线播放观看不卡| 中文字幕人妻丝袜制服| 久久韩国三级中文字幕| 人人妻人人看人人澡| 亚洲综合色惰| 一级av片app| 亚洲第一区二区三区不卡| 午夜免费鲁丝| a级片在线免费高清观看视频| 成人国产av品久久久| 99热这里只有是精品50| 国产伦精品一区二区三区视频9| 少妇 在线观看| 人人妻人人澡人人爽人人夜夜| 国产亚洲最大av| 91成人精品电影| 亚洲天堂av无毛| 久久久久精品性色| 你懂的网址亚洲精品在线观看| 日本av手机在线免费观看| 美女中出高潮动态图| 成人国产麻豆网| 韩国av在线不卡| 精品国产一区二区三区久久久樱花| 青青草视频在线视频观看| 久久国产精品男人的天堂亚洲 | 麻豆成人午夜福利视频| 美女脱内裤让男人舔精品视频| 观看美女的网站| 青春草视频在线免费观看| 看非洲黑人一级黄片| 欧美xxxx性猛交bbbb| av天堂中文字幕网| 高清不卡的av网站| 高清视频免费观看一区二区| 国产亚洲91精品色在线| 亚洲精品一区蜜桃| 日韩精品有码人妻一区| 免费看日本二区| 日韩人妻高清精品专区| 99久国产av精品国产电影| 亚洲在久久综合| 欧美少妇被猛烈插入视频| 又爽又黄a免费视频| 久久国产乱子免费精品| 亚洲精品成人av观看孕妇| av.在线天堂| 久久6这里有精品| 两个人免费观看高清视频 | 人人妻人人看人人澡| 国产欧美另类精品又又久久亚洲欧美| 伊人久久国产一区二区| 亚洲欧洲国产日韩| 国产精品蜜桃在线观看| 午夜日本视频在线| 国产美女午夜福利| 三级国产精品欧美在线观看| 热99国产精品久久久久久7| 九草在线视频观看| 欧美日韩视频高清一区二区三区二| 久热这里只有精品99| 中国三级夫妇交换| 新久久久久国产一级毛片| 日本午夜av视频| 99热全是精品| 亚洲av中文av极速乱| 大香蕉97超碰在线| 成年美女黄网站色视频大全免费 | 国产乱人偷精品视频| 街头女战士在线观看网站| 在线 av 中文字幕| 亚洲av日韩在线播放| 性色avwww在线观看| 久久鲁丝午夜福利片| 中国国产av一级| 亚洲三级黄色毛片| 国产精品国产三级国产av玫瑰| 午夜精品国产一区二区电影| 69精品国产乱码久久久| 国产精品久久久久久久久免| 久久99蜜桃精品久久| 欧美xxxx性猛交bbbb| 永久网站在线| 高清欧美精品videossex| 亚洲经典国产精华液单| 国产成人a∨麻豆精品| 国产成人freesex在线| 成人特级av手机在线观看| 黄色欧美视频在线观看| 大片电影免费在线观看免费| 在线观看一区二区三区激情| 97超视频在线观看视频| 日本91视频免费播放| 国产精品秋霞免费鲁丝片| 欧美 日韩 精品 国产| 久久免费观看电影| 国产美女午夜福利| 午夜日本视频在线| 久久人人爽人人片av| 交换朋友夫妻互换小说| 国产精品熟女久久久久浪| 九九在线视频观看精品| 我要看黄色一级片免费的| 女性生殖器流出的白浆| 中文字幕av电影在线播放| 亚洲无线观看免费| 天堂8中文在线网| 日韩成人伦理影院| 一级毛片久久久久久久久女| 亚洲久久久国产精品| 偷拍熟女少妇极品色| 欧美高清成人免费视频www| 午夜久久久在线观看| 亚洲第一av免费看| 久久人人爽人人爽人人片va| 男女无遮挡免费网站观看| av又黄又爽大尺度在线免费看| 精品人妻一区二区三区麻豆| 一区二区av电影网| 乱码一卡2卡4卡精品| 国内揄拍国产精品人妻在线| 蜜桃久久精品国产亚洲av| 日本午夜av视频| 91精品国产九色| 久久99蜜桃精品久久| 国产日韩欧美视频二区| 91aial.com中文字幕在线观看| 亚洲精品第二区| 午夜日本视频在线| 国产视频内射| 久久国产乱子免费精品| 亚洲人成网站在线播| 欧美bdsm另类| 久久久久久久久久久丰满| 日韩精品有码人妻一区| 国产在线一区二区三区精| 亚洲欧洲国产日韩| 极品少妇高潮喷水抽搐| 五月玫瑰六月丁香| 亚洲丝袜综合中文字幕| 久久ye,这里只有精品| 少妇的逼水好多| 丰满乱子伦码专区| 尾随美女入室| 久久青草综合色| 国产欧美亚洲国产| 好男人视频免费观看在线| 99精国产麻豆久久婷婷| 2021少妇久久久久久久久久久| 成年美女黄网站色视频大全免费 | 国产色婷婷99| 多毛熟女@视频| 久久久精品免费免费高清| 精品一区在线观看国产| 人人妻人人爽人人添夜夜欢视频 | 成人午夜精彩视频在线观看| 久久久精品94久久精品| 亚洲国产精品999| 18禁在线播放成人免费| 国产欧美亚洲国产| 精品国产国语对白av| 黑丝袜美女国产一区| 大片免费播放器 马上看| 国产av精品麻豆| 亚洲国产最新在线播放| 国产有黄有色有爽视频| 日韩亚洲欧美综合| 日韩av在线免费看完整版不卡| 午夜激情福利司机影院| 如日韩欧美国产精品一区二区三区 | 国产成人免费无遮挡视频| 看十八女毛片水多多多| 国语对白做爰xxxⅹ性视频网站| 熟女电影av网| 在线 av 中文字幕| freevideosex欧美| 香蕉精品网在线| 少妇人妻 视频| 黑人巨大精品欧美一区二区蜜桃 | 看非洲黑人一级黄片| 内地一区二区视频在线| 成人国产av品久久久| .国产精品久久| 插阴视频在线观看视频| 国产女主播在线喷水免费视频网站| 亚洲欧美一区二区三区国产| 国产精品福利在线免费观看| 午夜精品国产一区二区电影| 亚洲一级一片aⅴ在线观看| 夜夜爽夜夜爽视频| 久久久精品94久久精品| 国产成人freesex在线| 一二三四中文在线观看免费高清| 99视频精品全部免费 在线| 大香蕉97超碰在线| 男女边摸边吃奶| 天堂俺去俺来也www色官网| 69精品国产乱码久久久| 精品人妻一区二区三区麻豆| 亚洲内射少妇av| 97超碰精品成人国产| 综合色丁香网| av天堂中文字幕网| 国产精品久久久久久精品电影小说| 日韩av不卡免费在线播放| 日本黄色片子视频| 国内揄拍国产精品人妻在线| 久久久久久久亚洲中文字幕| 亚洲精品国产成人久久av| 人人妻人人爽人人添夜夜欢视频 | 免费高清在线观看视频在线观看| 国产精品99久久99久久久不卡 | 少妇裸体淫交视频免费看高清| 日本黄色日本黄色录像| videossex国产| 制服丝袜香蕉在线| 午夜91福利影院| 精品国产露脸久久av麻豆| 国产片特级美女逼逼视频| 男女边摸边吃奶| 亚洲av综合色区一区| 人人妻人人澡人人看| 高清在线视频一区二区三区| 国产精品免费大片| 欧美人与善性xxx| av卡一久久| 国产白丝娇喘喷水9色精品| 亚洲va在线va天堂va国产| 人妻夜夜爽99麻豆av| 亚洲人与动物交配视频| av福利片在线| 午夜影院在线不卡| 99九九线精品视频在线观看视频| 色94色欧美一区二区| 国内精品宾馆在线| av在线观看视频网站免费| 熟妇人妻不卡中文字幕| 国产国拍精品亚洲av在线观看| 国产精品不卡视频一区二区| 国产精品久久久久久av不卡| 人妻夜夜爽99麻豆av| 国产亚洲一区二区精品| 日本wwww免费看| 日韩电影二区| 色视频在线一区二区三区| 成人亚洲欧美一区二区av| 中文乱码字字幕精品一区二区三区| 日产精品乱码卡一卡2卡三| 成年av动漫网址| 日韩大片免费观看网站| 国产成人午夜福利电影在线观看| 91精品国产国语对白视频| 亚洲国产精品一区二区三区在线| a级毛色黄片| 久久精品久久久久久噜噜老黄| 午夜影院在线不卡| 99九九线精品视频在线观看视频| 亚洲av二区三区四区| 春色校园在线视频观看| 一级,二级,三级黄色视频| 美女视频免费永久观看网站| 一本—道久久a久久精品蜜桃钙片| 国产精品国产av在线观看| 亚洲色图综合在线观看| 永久网站在线| 亚洲精品中文字幕在线视频 | 18禁动态无遮挡网站| 久久久久久伊人网av| 欧美变态另类bdsm刘玥| 日日爽夜夜爽网站| 天堂中文最新版在线下载| 日本黄色片子视频| 七月丁香在线播放| 在线观看免费视频网站a站| 人人妻人人添人人爽欧美一区卜| 亚洲成人av在线免费| 水蜜桃什么品种好| 最新中文字幕久久久久| 在线观看免费视频网站a站| 综合色丁香网| 久久人妻熟女aⅴ| 麻豆乱淫一区二区| 亚洲伊人久久精品综合| 日韩一区二区视频免费看| 啦啦啦视频在线资源免费观看| 高清午夜精品一区二区三区| 99热这里只有是精品在线观看| 久久久久久伊人网av| 午夜视频国产福利| 免费观看无遮挡的男女| 久久久久久久久久久久大奶| 极品教师在线视频| 国产一区亚洲一区在线观看| 麻豆成人av视频| 一本久久精品| 最新的欧美精品一区二区| 高清在线视频一区二区三区| 91久久精品国产一区二区三区| 国产精品.久久久| 曰老女人黄片| 国产欧美日韩综合在线一区二区 | 久久久久久伊人网av| 久久毛片免费看一区二区三区| 亚洲成人手机| 夜夜看夜夜爽夜夜摸| 成年人午夜在线观看视频| 91午夜精品亚洲一区二区三区| 51国产日韩欧美| 只有这里有精品99| 高清毛片免费看| 五月玫瑰六月丁香| 欧美高清成人免费视频www| 9色porny在线观看| 精品少妇黑人巨大在线播放| 日日摸夜夜添夜夜爱| 女人久久www免费人成看片| 99九九在线精品视频 | 高清欧美精品videossex| 日韩欧美 国产精品| 亚洲国产色片| 永久免费av网站大全| 欧美日韩av久久| 男女边摸边吃奶| 少妇熟女欧美另类| 一区二区三区精品91| a级毛色黄片| 日韩中字成人| 丝瓜视频免费看黄片| 我的女老师完整版在线观看| 亚洲国产毛片av蜜桃av| 777米奇影视久久| 大又大粗又爽又黄少妇毛片口| 大片电影免费在线观看免费| 国产午夜精品一二区理论片| 亚洲国产精品一区三区| 免费看光身美女| 91精品国产国语对白视频| 国产男女超爽视频在线观看| 久久韩国三级中文字幕| 国产高清不卡午夜福利| 男人和女人高潮做爰伦理| 精品人妻偷拍中文字幕| 国产视频内射| 视频区图区小说| 少妇精品久久久久久久| 天堂8中文在线网| 国产精品一区二区性色av| 国产爽快片一区二区三区| 亚洲精品国产色婷婷电影| 美女脱内裤让男人舔精品视频| 超碰97精品在线观看| 啦啦啦在线观看免费高清www| 看十八女毛片水多多多| 韩国av在线不卡| 亚洲精品色激情综合| 精品酒店卫生间| 少妇人妻精品综合一区二区| 97超视频在线观看视频| 黄色配什么色好看| 最近的中文字幕免费完整| 亚洲av.av天堂| 99re6热这里在线精品视频| 国产 精品1| 国产成人免费观看mmmm| 亚洲在久久综合| 最近2019中文字幕mv第一页| 日本vs欧美在线观看视频 | 亚洲综合精品二区| 日本av免费视频播放| 大又大粗又爽又黄少妇毛片口| 精品卡一卡二卡四卡免费| 一区二区三区精品91| 免费人成在线观看视频色| 六月丁香七月| 黄片无遮挡物在线观看| 午夜免费观看性视频| 成人无遮挡网站| 欧美日韩精品成人综合77777| 国产在线男女| 国语对白做爰xxxⅹ性视频网站|