Accommodating Hybrid Retrieval in a Comprehensive Video Data(4)
Figure 3.2 Activity Model and a “Football” example
In Figure 3.2, the left-hand side is the architecture of the Activity Model and the right-hand side is a “Football” example. The model consists of four levels: Activity, Event, Motion, and Object. The top three levels are mainly used by the query language processor to reason about what activity the user wants to retrieve, so that the processor can retrieve the video scene from the database according to the objects (i.e., features with spatio-temporal semantics) at the bottom level of the model.
A user may input an activity into the Specification Language Processing component (shown in Figure 3.1 (c.1)) by using terms such as “playing sports”, “playing football”, “kicking football”, or “kicking”. More specific terms yield more specific retrieval results. In general, the first word is a verb and the word that follows is a noun. The processor analyzes the Motion level first. After some keywords are matched, the processor searches up to the Event level using the second word. For example, if the term is “kicking football”, the processor searches for “kicking” at the Motion level, and then uses “football” to search the Event level. If the term is “playing football” and there is no “playing” at the Motion level, the processor tries to reason over the thesaurus entries of the word and then searches again. However, if no word in the model matches, the processor simply skips the verb and searches for the noun from the Event level up to the Activity level. After the threshold of the search is met, the processor goes down to the corresponding Object level. It then inputs those objects from the Object level into the Feature Index Tree as features and asks the user to input some spatio-temporal semantics (ST-Feature) into the database (shown in Figure 3.1 (c.2)).
EDICS
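The lookup order described above (Motion level first, then the Event level, with a thesaurus and a skip-the-verb fallback) can be illustrated with a minimal sketch. The nested dictionary, the thesaurus entries, and the function name `resolve_objects` are hypothetical and chosen only for illustration; the search threshold mentioned in the text is not modeled here.

```python
# Hypothetical Activity Model: activity -> event -> motion -> objects.
ACTIVITY_MODEL = {
    "sports": {
        "football": {
            "kicking": ["player", "ball"],
            "goalkeeping": ["goalkeeper", "ball", "goal"],
        },
        "tennis": {
            "serving": ["player", "racket", "ball"],
        },
    },
}

# Hypothetical thesaurus, consulted when a verb is absent from the Motion level.
THESAURUS = {"shooting": ["kicking"]}


def resolve_objects(term):
    """Map an activity term such as "kicking football" to Object-level entries."""
    words = term.lower().split()
    verb = words[0]
    noun = words[1] if len(words) > 1 else None

    # Flatten the model into (activity, event, motion, objects) paths.
    paths = [
        (activity, event, motion, objects)
        for activity, events in ACTIVITY_MODEL.items()
        for event, motions in events.items()
        for motion, objects in motions.items()
    ]

    # Step 1: search the Motion level with the verb, then with its synonyms.
    verbs = [verb] + THESAURUS.get(verb, [])
    matched = [p for p in paths if p[2] in verbs]

    if matched and noun is not None:
        # Step 2: search up to the Event level using the second word.
        matched = [p for p in matched if p[1] == noun]
    elif not matched and noun is not None:
        # Fallback: skip the verb and search the noun at the Event level,
        # then at the Activity level.
        matched = ([p for p in paths if p[1] == noun]
                   or [p for p in paths if p[0] == noun])

    # Step 3: go down to the Object level, collecting distinct objects in order.
    found = []
    for *_, objects in matched:
        for obj in objects:
            if obj not in found:
                found.append(obj)
    return found
```

In this toy model, a more specific term such as “kicking football” resolves to fewer objects than “playing football”, which mirrors the remark that more specific terms yield more specific results.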
At a later time, the user may want to retrieve video data from the database based on some activities. For example, the user may input an activity query like “kicking football”. The Query Language Processor first gets some collections of objects from the Activity Model (shown in Figure 3.1 (e)) and then retrieves the result as in the original query processing (CAROL/ST), treating the collections of objects as Features and ST-Features. Therefore, the main purpose of the Activity Model is to facilitate annotating all common and significant activities.
3.2 CBR Extension to CAROL/ST
While CAROL/ST can facilitate effective retrieval based on rich semantics, for multimedia data such as video, visual content is also an inseparable (and possibly more significant) part, which is difficult to describe with text. On the other hand, the content-based approach of automatically extracting and indexing visual features has been a main trend in the areas of computer vision and video processing. To combine the strengths of both areas, an extended version of VideoMAP, which we term VideoMAP+ [CWLZ01], is developed to support hybrid retrieval of videos through both query-based and content-based accesses. In our prototype, we adopt visual content only.
Figure 3.3 Architecture of VideoMAP+
The architecture of VideoMAP+ is shown in Figure 3.3 (which is a modified version of Figure 3.1). Here, the Feature Extraction Component (FEC) is newly added. During Video Segmentation (by VCC), visual feature vectors of the video and of the other defined objects, such as color, texture, and shape, are extracted. The Hybrid Query Language Processing module supports three kinds of retrieval format: CAROL/ST Retrieval, the original format, which mainly uses the semantic annotations and spatio-temporal relations of video; Content-based Retrieval, the newly added format, which mainly uses the visual information inherent in the video content; and their Hybrid Combination Retrieval. CBR query functions are incorporated to form a hybrid query language. Hence, the indices are now based on more video objects, and the returned result also includes more video object types.
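The three retrieval formats can be sketched as follows. This is a minimal illustration under stated assumptions, not the VideoMAP+ implementation: the segment records, the keyword-subset test standing in for CAROL/ST retrieval, and the Euclidean-distance ranking standing in for CBR are all hypothetical.

```python
import math

# Hypothetical segment records: semantic annotations plus a visual feature vector.
SEGMENTS = [
    {"id": "seg1", "annotations": {"football", "player"}, "feature": [0.9, 0.1, 0.0]},
    {"id": "seg2", "annotations": {"football", "goal"}, "feature": [0.2, 0.7, 0.1]},
    {"id": "seg3", "annotations": {"tennis", "player"}, "feature": [0.8, 0.2, 0.0]},
]


def semantic_retrieval(segments, keywords):
    """Stand-in for CAROL/ST retrieval: keep segments annotated with all keywords."""
    return [s for s in segments if keywords <= s["annotations"]]


def content_based_retrieval(segments, query_feature, top_k=None):
    """Stand-in for CBR: rank segments by Euclidean distance to the query feature."""
    ranked = sorted(segments, key=lambda s: math.dist(s["feature"], query_feature))
    return ranked if top_k is None else ranked[:top_k]


def hybrid_retrieval(segments, keywords, query_feature, top_k=None):
    """Hybrid combination: filter semantically, then rank the survivors visually."""
    candidates = semantic_retrieval(segments, keywords)
    return content_based_retrieval(candidates, query_feature, top_k)
```

Filtering before ranking is only one possible way to combine the two formats; a weighted score over both would be an equally plausible reading of "hybrid combination".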
3.2.1 Foundation Classes
VideoMAP+ extends a conventional OODB to define video objects through a specific hierarchy (video → scene → segment → keyframe). In addition, it includes the concept of CBR to build indices on the visual features of these objects. Their class attributes, methods, and corresponding relations form a complex network (or a "map", as shown in Figure 3.4). Below we enumerate the foundation classes of the VideoMAP+ objects at various granularities, namely Keyframe, Segment, Scene, Video, and Visual object (cf. Figure 3.4).
The bridge in VideoMAP+ is at the video segment level. This is not the only bridging level possible, as others (such as the keyframe and/or scene levels) are also meaningful for bridging the two. In VideoMAP+, the segment level is chosen as the direct bridge for reasons of simplicity and efficiency, because we regard video segments as the basic unit of retrieval.
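The hierarchy and the segment-level bridge can be pictured with a small object model. The class and attribute names below are assumptions made for illustration and are not taken from Figure 3.4; the sketch only shows a segment carrying both semantic annotations and the visual features of its keyframes.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ImageFeature:
    name: str              # e.g. "color", "texture", "shape"
    vector: List[float]


@dataclass
class Keyframe:
    frame_no: int
    features: List[ImageFeature] = field(default_factory=list)


@dataclass
class Segment:
    start_frame: int
    end_frame: int
    keyframes: List[Keyframe] = field(default_factory=list)
    annotations: List[str] = field(default_factory=list)  # semantic side

    def visual_features(self) -> List[ImageFeature]:
        """Visual side: gather the features of all keyframes in this segment."""
        return [f for k in self.keyframes for f in k.features]


@dataclass
class Scene:
    title: str
    segments: List[Segment] = field(default_factory=list)


@dataclass
class Video:
    name: str
    scenes: List[Scene] = field(default_factory=list)

    def segments(self) -> List[Segment]:
        """Flatten video -> scene -> segment, the assumed basic unit of retrieval."""
        return [seg for scene in self.scenes for seg in scene.segments]
```

Because each `Segment` exposes both `annotations` and `visual_features()`, a query processor could reach either kind of information from the same object, which is the role the text assigns to the segment level.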
3.2.2 Search paths with CBR
After integrating CBR with CAROL/ST, three main groups of objects (i.e., Keyframe, Visual-Object, and Image-Feature) are added to the VideoMAP+ system, as shown in the class diagram (Figure 3.4).
Image-Feature: A visual feature extracted from a video object, such as color, texture, or shape.
Keyframe: The fundamental image frame in a video sequence.
Visual-Object: All salient objects captured in a video’s physical space, represented visually or textually, are instances of a physical object. Furthermore, every object has a spatio-temporal layout in the image sequence.
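As an illustration of what a color Image-Feature might look like, the sketch below computes a quantized RGB histogram for a keyframe and compares two histograms by histogram intersection. Both are common techniques that we substitute here as assumptions; the text does not state which color descriptor or similarity measure VideoMAP+ uses, and the function names are hypothetical.

```python
def color_histogram(pixels, bins_per_channel=4):
    """Quantize RGB pixels (0-255 per channel) into a normalized histogram."""
    hist = [0.0] * (bins_per_channel ** 3)
    for r, g, b in pixels:
        # Map each channel value to a bin index in [0, bins_per_channel - 1].
        rb = r * bins_per_channel // 256
        gb = g * bins_per_channel // 256
        bb = b * bins_per_channel // 256
        hist[(rb * bins_per_channel + gb) * bins_per_channel + bb] += 1.0
    total = len(pixels)
    return [count / total for count in hist] if total else hist


def histogram_intersection(h1, h2):
    """Similarity in [0, 1] for two normalized histograms of equal length."""
    return sum(min(a, b) for a, b in zip(h1, h2))
```

A keyframe would be passed in as a flat list of `(r, g, b)` tuples; how frames are decoded into pixels is outside the scope of this sketch.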
Four new entry points to search for semantic-feature and visual-object are:
(a) Visual-Object,
(b) Image-Feature,
(c) Activity Model, and
(d) Object Level of the Activity Model.
The Object Level of the Activity Model [CL …