14 March 2013 Semantic-preload video model based on VOP coding
Author Affiliations +
Proceedings Volume 8768, International Conference on Graphic and Image Processing (ICGIP 2012); 876861 (2013) https://doi.org/10.1117/12.2012827
Event: 2012 International Conference on Graphic and Image Processing, 2012, Singapore, Singapore
Abstract
In recent years, in order to reduce semantic gap which exists between high-level semantics and low-level features of video when the human understanding image or video, people mostly try the method of video annotation where in signal’s downstream, namely further (again) attach labels to the content in video-database. Few people focus on the idea that: Use limited interaction and the means of comprehensive segmentation (including optical technologies) from the front-end of collection of video information (i.e. video camera), with video semantics analysis technology and corresponding concepts sets (i.e. ontology) which belong in a certain domain, as well as story shooting script and the task description of scene shooting etc; Apply different-level semantic descriptions to enrich the attributes of video object and the attributes of image region, then forms a new video model which is based on Video Object Plan (VOP) Coding. This model has potential intellectualized features, and carries a large amount of metadata, and embedded intermediate-level semantic concept into every object. This paper focuses on the latter, and presents a framework of a new video model. At present, this new video model is temporarily named “Video Model of Semantic-Preloaded or Semantic-Preload Video Model (simplified into VMoSP or SPVM)”. This model mainly researches how to add labeling to video objects and image regions in real time, here video object and image region are usually used intermediate semantic labeling, and this work is placed on signal’s upstream (i.e. video capture production stage). Because of the research needs, this paper also tries to analyses the hierarchic structure of video, and divides the hierarchic structure into nine hierarchy semantic levels, of course, this nine hierarchy only involved in video production process. In addition, the paper also point out that here semantic level tagging work (i.e. semantic preloading) only refers to the four middle-level semantic. All in all, this research was unfolded is based on analyzed the characteristic of the existing video mode, and with reference to MPEG series standard.
© (2013) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jianping Yang, Jie Zhang, Xiangjun Chen, "Semantic-preload video model based on VOP coding", Proc. SPIE 8768, International Conference on Graphic and Image Processing (ICGIP 2012), 876861 (14 March 2013); doi: 10.1117/12.2012827; https://doi.org/10.1117/12.2012827
PROCEEDINGS
9 PAGES


SHARE
RELATED CONTENT

Statistical approach to scene change detection
Proceedings of SPIE (March 23 1995)
Depth-controlled 3D TV image coding
Proceedings of SPIE (April 30 1998)
Video parsing using compressed data
Proceedings of SPIE (March 23 1994)

Back to Top