Arzumanyan R.V., Sukhinov A.I. —
Factorization of the 8x8 HEVC Video Coding Inverse Discrete Transform Matrix and the Fast Algorithm Thereupon
// Software systems and computational methods. – 2016. – № 3.
– P. 287 - 298.
DOI: 10.7256/2454-0714.2016.3.19558
Abstract: The subject of this research is the development of a fast algorithm for the 8x8 inverse discrete transform of the ITU-T H.265 (HEVC) video coding standard. The authors analyze the differences between the structure of the HEVC inverse transform matrix and that of the inverse discrete cosine transform matrix, as well as approaches that can be applied to factorize the HEVC matrix. They also estimate the number of operations required to perform the transform. A numerical experiment is conducted to demonstrate the efficiency of the developed algorithm in terms of execution speed on a central processing unit (CPU). The research methods are theoretical analysis and numerical experiment, including the collection of relevant data and analysis of the results. For the numerical experiment the authors wrote a C program that implements both the standard inverse transform algorithm (direct multiplication of the transform matrix by the coefficient vectors) and the fast inverse transform algorithm described in the theoretical part of the research, and then compared their performance. The novelty of the research lies in the proposed fast algorithm for the 8x8 HEVC transform and the factorization scheme for the inverse transform matrix. Compared with previous studies and algorithms, the proposed algorithm requires fewer arithmetic operations and therefore takes less time. In conclusion, the authors discuss the feasibility of a fast inverse transform for the HEVC standard, present their own factorization scheme for the 8x8 matrix, and develop a fast inverse transform algorithm based on the factorization schemes they found.
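To make the comparison in the abstract concrete, the following C sketch contrasts direct multiplication by the 8x8 HEVC transform matrix with a fast variant. It is only an illustration: the fast routine uses the well-known even-odd (partial butterfly) decomposition, not the authors' factorization, which the abstract does not spell out. The function names are hypothetical, and the rounding shifts and clipping that a real decoder applies after each pass are omitted.

```c
#include <assert.h>
#include <stdint.h>

/* 8x8 HEVC core transform matrix; each row is one basis vector. */
static const int16_t M[8][8] = {
    {64,  64,  64,  64,  64,  64,  64,  64},
    {89,  75,  50,  18, -18, -50, -75, -89},
    {83,  36, -36, -83, -83, -36,  36,  83},
    {75, -18, -89, -50,  50,  89,  18, -75},
    {64, -64, -64,  64,  64, -64, -64,  64},
    {50, -89,  18,  75, -75, -18,  89, -50},
    {36, -83,  83, -36, -36,  83, -83,  36},
    {18, -50,  75, -89,  89, -75,  50, -18},
};

/* Direct inverse transform: y = M^T * x, 64 multiplications.
 * Scaling and clipping are intentionally left out of this sketch. */
void inverse_direct(const int32_t x[8], int32_t y[8])
{
    for (int n = 0; n < 8; n++) {
        int32_t sum = 0;
        for (int k = 0; k < 8; k++)
            sum += M[k][n] * x[k];
        y[n] = sum;
    }
}

/* Even-odd decomposition: even rows of M are symmetric and odd rows
 * are antisymmetric, so only half of each row has to be multiplied.
 * This needs 24 multiplications instead of 64. */
void inverse_butterfly(const int32_t x[8], int32_t y[8])
{
    int32_t O[4], E[4], EE[2], EO[2];

    /* Odd part: contributions of coefficients 1, 3, 5, 7. */
    for (int k = 0; k < 4; k++)
        O[k] = M[1][k] * x[1] + M[3][k] * x[3]
             + M[5][k] * x[5] + M[7][k] * x[7];

    /* Even part, split once more into coefficients {2, 6} and {0, 4}. */
    for (int k = 0; k < 2; k++) {
        EO[k] = M[2][k] * x[2] + M[6][k] * x[6];
        EE[k] = M[0][k] * x[0] + M[4][k] * x[4];
    }
    E[0] = EE[0] + EO[0];
    E[3] = EE[0] - EO[0];
    E[1] = EE[1] + EO[1];
    E[2] = EE[1] - EO[1];

    /* Combine the halves using the row symmetries. */
    for (int k = 0; k < 4; k++) {
        y[k]     = E[k] + O[k];
        y[7 - k] = E[k] - O[k];
    }
}

/* Hypothetical self-check: both routines must agree on a sample vector. */
void check_equivalence(void)
{
    const int32_t x[8] = {10, -3, 7, 0, 2, -5, 1, 4};
    int32_t a[8], b[8];

    inverse_direct(x, a);
    inverse_butterfly(x, b);
    for (int i = 0; i < 8; i++)
        assert(a[i] == b[i]);
}
```

Calling `check_equivalence()` from a test program confirms that the two routines produce identical output for the sample input. A factorization such as the one the article proposes would aim to reduce the operation count further than this baseline.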
Arzumanyan R.V., Sukhinov A.I. —
Study the feasibility of high-performance software Google VP9 decoder.
// Software systems and computational methods. – 2016. – № 2.
– P. 184 - 200.
DOI: 10.7256/2454-0714.2016.2.18430
Abstract: The article is devoted to the optimization and parallel execution of the stages of decoding a video signal compressed according to the Google VP9 specification. The authors discuss in detail the most time-consuming stages of decoding and reconstructing compressed video, and study how the algorithms underlying these stages can be optimized and executed in parallel on both CPUs and graphics cards that support general-purpose computing. The article gives a comprehensive assessment of the characteristics of the decoding stages, including their requirements for the processor and the memory subsystem. The main research method is a numerical experiment in which the data of interest are collected and then analyzed. The data are gathered by modifying the source code of the reference codec and building it into a software codec application. The novelty of the work lies in its comprehensive analysis of the potential of the computational methods on which the codec is based. The research evaluates the feasibility of parallel computation, taking into account the peculiarities of the target hardware (MCCPU and GPGPU). The authors optimized the arithmetic decoding step by taking into account the statistical distribution of the lengths of literals decoded from the compressed bitstream.
The authors draw conclusions about the most computationally complex decoding stages and the possibilities for their optimization and parallel implementation, and analyze the differences between the described codec and the competing codec H.265.