Machine Learning-Driven Pre-Broadcast Video Codec Validation: Ensuring Seamless Television Transmission

Khalid El Fayq, Said Tkatek, Lahcen Idouglid

Abstract


This study addresses the critical challenge of ensuring uninterrupted television broadcasting by proactively detecting video codec errors, focusing on TV Laayoune, a prominent Moroccan channel. We developed a machine learningbased methodology that identifies incompatible codecs before they disrupt live broadcasts. The approach involves data collection from multiple sources, including TV Laayoune's archives, metadata extraction via FFmpeg, and a hybrid model that combines Convolutional Neural Networks (CNNs) with Long Short-Term Memory (LSTM) networks. Integrated into the broadcasting pipeline, this model achieved a 95% accuracy rate, significantly enhancing broadcast reliability and operational efficiency. Additionally, we propose a user-friendly interface for real-time error detection, comprehensive workflow integration, and automated alerts. This innovative solution addresses common broadcast challenges, reducing operational risks and improving the viewer experience.

Keywords


Video Codec Errors; Machine Learning; Television Broadcasting; Video Compatibility; Real-time Detection; Broadcast Reliability

References


O. Izima, R. de Fréin, and A. Malik, “A survey of machine learning techniques for video quality prediction from quality of delivery metrics,” Nov. 01, 2021, MDPI. doi: 10.3390/electronics10222851.

D. Alexandre, H.-M. Hang, W.-H. Peng, and M. Domanski, “Deep Video Compression for Interframe Coding,” in 2021 IEEE International Conference on Image Processing (ICIP), IEEE, Sep. 2021, pp. 2124–2128. doi: 10.1109/ICIP42928.2021.9506275.

Alisha Muskaan, Nagarathna S, Sandhya C S, Viju J, and B Sumangala, “Exposing Deep Fake Face Detection using LSTM and CNN,” International Journal of Advanced Research in Science, Communication and Technology, pp. 231–234, May 2024, doi: 10.48175/IJARSCT-18434.

L. Kaur and P. K. Mishra, “Estimation of concise video summaries from long sequence videos using deep learning via LSTM,” Int J Health Sci (Qassim), pp. 9904–9914, Jun. 2022, doi: 10.53730/ijhs.v6ns3.9287.

N. Yan, D. Liu, H. Li, B. Li, L. Li, and F. Wu, “Convolutional Neural Network-Based Fractional-Pixel Motion Compensation,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 3, pp. 840–853, Mar. 2019, doi: 10.1109/TCSVT.2018.2816932.

K. Cui, A. B. Koyuncu, A. Boev, E. Alshina, and E. Steinbach, “Convolutional neural network-based post-filtering for compressed YUV420 images and video,” in 2021 Picture Coding Symposium (PCS), IEEE, Jun. 2021, pp. 1–5. doi: 10.1109/PCS50896.2021.9477486.

K. El Fayq, S. Tkatek, L. Idouglid, and J. Abouchabaka, “Detection and Extraction of Faces and Text Lower Third Techniques for an Audiovisual Archive System using Machine Learning,” International Journal of Advanced Computer Science and Applications, vol. 13, no. 9, p. 2022, 2022, doi: 10.14569/IJACSA.2022.0130974.

R. V. Bidwe et al., “Deep Learning Approaches for Video Compression: A Bibliometric Analysis,” Big Data and Cognitive Computing, vol. 6, no. 2, p. 44, Apr. 2022, doi: 10.3390/bdcc6020044.

A. Benoughidene and F. Titouna, “A novel method for video shot boundary detection using CNN-LSTM approach,” Int J Multimed Inf Retr, vol. 11, no. 4, pp. 653–667, Dec. 2022, doi: 10.1007/s13735-022-00251-8.

K. Panneerselvam, K. Mahesh, V. L. H. Josephine, and A. R. Kumar, “Effective and Efficient Video Compression by the Deep Learning Techniques,” Computer Systems Science and Engineering, vol. 45, no. 2, pp. 1047–1061, 2023, doi: 10.32604/csse.2023.030513.

M. Lu and S. Niu, “A detection approach using lstm‐cnn for object removal caused by exemplar‐based image inpainting,” Electronics (Switzerland), vol. 9, no. 5, May 2020, doi: 10.3390/electronics9050858.

S. Yadav et al., “A video compression-cum-classification network for classification from compressed video streams,” Vis Comput, Mar. 2024, doi: 10.1007/s00371-023-03242-w.

Z. Wang, J. Chen, and S. C. H. Hoi, “Deep Learning for Image Super-Resolution: A Survey,” IEEE Trans Pattern Anal Mach Intell, vol. 43, no. 10, pp. 3365–3387, Oct. 2021, doi: 10.1109/TPAMI.2020.2982166.

S. Bouaafia, R. Khemiri, S. Messaoud, O. Ben Ahmed, and F. E. Sayadi, “Deep learning-based video quality enhancement for the new versatile video coding,” Neural Comput Appl, vol. 34, no. 17, pp. 14135–14149, Sep. 2022, doi: 10.1007/s00521-021-06491-9.

K. Chen, H. Wang, S. Fang, X. Li, M. Ye, and H. J. Chao, “RL-AFEC,” in Proceedings of the 13th ACM Multimedia Systems Conference, New York, NY, USA: ACM, Jun. 2022, pp. 96–108. doi: 10.1145/3524273.3528184.

M. Darwich and M. Bayoumi, “Video quality adaptation using CNN and RNN models for cost-effective and scalable video streaming Services,” Cluster Comput, 2024, doi: 10.1007/s10586-024-04315-8.

Y. O. Sharrab, I. Alsmadi, and N. J. Sarhan, “Towards the availability of video communication in artificial intelligence-based computer vision systems utilizing a multi-objective function,” Cluster Comput, vol. 25, no. 1, pp. 231–247, Feb. 2022, doi: 10.1007/s10586-021-03391-4.

G. Ciaparrone, L. Chiariglione, and R. Tagliaferri, “A comparison of deep learning models for end-to-end face-based video retrieval in unconstrained videos,” Neural Comput Appl, vol. 34, no. 10, pp. 7489–7506, May 2022, doi: 10.1007/s00521-021-06875-x.

M. V. Gashnikov, “Video Codec Using Machine Learning Based on Parametric Orthogonal Filters,” Optical Memory and Neural Networks, vol. 32, no. 4, pp. 226–232, Dec. 2023, doi: 10.3103/S1060992X23040021.

F. Steinert and B. Stabernack, “Architecture of a Low Latency H.264/AVC Video Codec for Robust ML based Image Classification: How Region of Interests can Minimize the Impact of Coding Artifacts,” J Signal Process Syst, vol. 94, no. 7, pp. 693–708, Jul. 2022, doi: 10.1007/s11265-021-01727-2.

D. Liu, Y. Li, J. Lin, H. Li, and F. Wu, “Deep Learning-Based Video Coding,” ACM Comput Surv, vol. 53, no. 1, pp. 1–35, Jan. 2021, doi: 10.1145/3368405.

S. Ma, X. Zhang, C. Jia, Z. Zhao, S. Wang, and S. Wang, “Image and Video Compression With Neural Networks: A Review,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 30, no. 6, pp. 1683–1698, Jun. 2020, doi: 10.1109/TCSVT.2019.2910119.

Y. Zhang, S. Kwong, and S. Wang, “Machine learning based video coding optimizations: A survey,” Inf Sci (N Y), vol. 506, pp. 395–423, Jan. 2020, doi: 10.1016/j.ins.2019.07.096.

Z. Xu, X. Tang, C. Ma, and R. Zhang, “Research on Parking Space Detection and Prediction Model Based on CNN-LSTM,” IEEE Access, vol. 12, pp. 30085–30100, 2024, doi: 10.1109/ACCESS.2024.3368521.

X. Dou, X. Cao, and X. Zhang, “Region-of-Interest Based Coding Scheme for Live Videos,” Applied Sciences (Switzerland), vol. 14, no. 9, May 2024, doi: 10.3390/app14093823.

L. Chen, B. Cheng, H. Zhu, H. Qin, L. Deng, and L. Luo, “Fast Versatile Video Coding (VVC) Intra Coding for Power-Constrained Applications,” Electronics (Switzerland), vol. 13, no. 11, Jun. 2024, doi: 10.3390/electronics13112150.

S. K. Im and K. H. Chan, “Faster Intra-Prediction of Versatile Video Coding Using a Concatenate-Designed CNN via DCT Coefficients,” Electronics (Switzerland), vol. 13, no. 11, Jun. 2024, doi: 10.3390/electronics13112214.

N. Li, Z. Wang, and Q. Zhang, “Fast Coding Unit Partitioning Algorithm for Video Coding Standard Based on Block Segmentation and Block Connection Structure and CNN,” Electronics (Switzerland), vol. 13, no. 9, May 2024, doi: 10.3390/electronics13091767.


Full Text: PDF

Refbacks

  • There are currently no refbacks.


 

Indonesian Journal of Electrical Engineering and Informatics (IJEEI)
ISSN 2089-3272

Creative Commons Licence

This work is licensed under a Creative Commons Attribution 4.0 International License.

web analytics
View IJEEI Stats