教学文库网 - 权威文档分享云平台
您的当前位置:首页 > 文库大全 > 高等教育 >

Accommodating Hybrid Retrieval in a Comprehensive Video Data(2)

来源:网络收集 时间:2025-12-24
导读: EDICS among various applications. They separated the concept of multimedia data into data definition, presentation andtemporal structure, but they do not provide conceptual structure for efficient qu

EDICS

among various applications. They separated the concept of multimedia data into data definition, presentation andtemporal structure, but they do not provide conceptual structure for efficient query processing. Jiang andElmagarmid [JE98] presented a video database system called WVTDB which supports semantic modeling andcontent-based search on the world wide web (WWW). Lee et. al. [Lee+99] developed an icon-based, graphicalquery language GVISUAL which allows a user to specify temporal queries using iconic/graphicalrepresentations, but no support facility for content-based retrieval is provided.

2.3 Hybrid image and video retrieval

In addition, some work on hybrid query retrieval of image and video was surveyed and reviewed in

[YM98]. Chabot [OS95] is a picture retrieval system and its object identification is based on color analysissemi automatically. It uses both keywords and image features for retrieval. In [CH92, GWJ91], the authorsgave an idea of building a hierarchy of image representations from raw image data to objects and relations atthe user semantic level. Chang and Hsu [CH92] analyzed raw image data in terms of their geometric patterns,scenes with semantics, and some meaningful entities. The overall information could be utilized in spatialreasoning and image retrieval. Gupta, Weymouth and Jain [GWJ91] have developed a VIMSYS model forquerying a pictorial database. SEMCOG [LCH97] is an object-based image retrieval system whichintegrated semantic and cognition-based information for retrieval. Besides hybrid image retrieval, there havebeen some hybrid video querying systems [ZLSW95, ZWLS95, CA96]. Zhang et. al. [ZLSW95, ZWLS95]parsed and decomposed raw video data into shots and scenes automatically; while JACOB [CA96] usedcamera operations to split a video into shots. They both use keywords and low-level image features for videoretrieval and browsing.

2.4 Relevance to MPEG-7

MPEG-7 standard, a means of attaching metadata to multimedia content, is often called “MultimediaContent Description Interface”. It aims at providing a rich set of audiovisual description tools to describemultimedia content data, which will support some degree of interpretation of the information’s meaning. It isintended to describe audiovisual information regardless of storage, coding, display, transmission, medium, ortechnology. Audiovisual data content may include still pictures, graphics, 3D models, audio, speech, video,and composition information about how these elements are combined in a multimedia presentation. Specialcases of these general data types may include facial expressions and personal characteristics. MPEG-7 workcan be separated into three parts: Descriptors, Description Schemes, and a Description Definition Language.Descriptors are the representations of low-level features. Description Schemes are structured combinationsof Descriptors, and they can be used to form a richer expression of a higher-level concept. The DescriptionDefinition Language is the language that allows the creation of new Description Schemes and Descriptors. Italso allows the extension and modification of existing Description Schemes.

EDICS

There are many MPEG-7-related projects being undertaken within commercial enterprises, particularlybroadcasting and digital imaging companies.ψ [HARMONY] is a three-way International Digital LibrariesInitiative project between Cornell University, the Distributed Systems Technology Centre, and theUniversity of Bristol’s Institute for Learning and Research Technology. Its objective is to develop aframework to deal with the challenge of describing networked collections of highly complex and mixed-media digital objects. The research draws together works on the RDF, XML, Dublin Core, MPEG-7 andINDECS standards, and focuses on the problem of allowing multiple communities of expertise (e.g., library,education, rights management) to define overlapping descriptive vocabularies for annotating multimediacontent.

DICEMAN (Distributed Internet Content Exchange with MPEG-7 and Agent Negotiations) project is anEC-funded project between Teltec Ireland DCU, CSELT (Italy), IBM (Germany), INA (France), IST(Portugal), KPN Research (Netherlands), Riverland (Britain) and UPC (Spain) [DICEMAN]. Its objective isto develop an end-to-end chain for indexing, storage, search and trading of digital audiovisual content. Thetechnical aspects of this project are mainly: MPEG-7 indexing through a COntent Provider's Application(COPA); the use of Foundation for Intelligent Physical Agents (FIPA) to search and locate the best content;and support for electronic commerce and rights management.

The A4SM project, which is based on GMD's IPSI (Integrated Publication and Information SystemsInstitute), is currently researching the application of IT support to all stages of the video production process

[IPSI]. The purpose is to seamlessly integrate an IT support framework into the production process, i.e., pre-production (e.g., script development, story boarding, etc.), production (e.g., collection of media-data byusing an MPEG-2/7 camera, etc.), and the post-production (support of non-linear editing). In collaborationwith TV-reporters, cameramen and editors they have designed an MPEG-7 camera in combination with amobile annotation device for the reporter, and a mobile editing suite suitable for the generation of news-clips.Overall, the MPEG-7 standard and its related projects concentrate on content description and metadataattachment to multimedia data. Few facilities and little support have been provided by them in terms ofvideo query formulation, processing, and retrieval, which are exactly the main theme of this paper.

3. Hybrid Approach to Video Retrieval

In a Video Database Management system (VDBMS), there exists an important need for efficient retrievalfacility of the voluminous data. Accordingly, many ways are pu …… 此处隐藏:6265字,全部文档内容请下载后查看。喜欢就下载吧 ……

Accommodating Hybrid Retrieval in a Comprehensive Video Data(2).doc 将本文的Word文档下载到电脑,方便复制、编辑、收藏和打印
本文链接:https://www.jiaowen.net/wenku/128086.html(转载请注明文章来源)
Copyright © 2020-2025 教文网 版权所有
声明 :本网站尊重并保护知识产权,根据《信息网络传播权保护条例》,如果我们转载的作品侵犯了您的权利,请在一个月内通知我们,我们会及时删除。
客服QQ:78024566 邮箱:78024566@qq.com
苏ICP备19068818号-2
Top
× 游客快捷下载通道(下载后可以自由复制和排版)
VIP包月下载
特价:29 元/月 原价:99元
低至 0.3 元/份 每月下载150
全站内容免费自由复制
VIP包月下载
特价:29 元/月 原价:99元
低至 0.3 元/份 每月下载150
全站内容免费自由复制
注:下载文档有可能出现无法下载或内容有问题,请联系客服协助您处理。
× 常见问题(客服时间:周一到周五 9:30-18:00)