Accommodating Hybrid Retrieval in a Comprehensive Video Data(2)

来源：网络收集时间：2026-07-13

导读： EDICS among various applications. They separated the concept of multimedia data into data definition, presentation andtemporal structure, but they do not provide conceptual structure for efficient qu

EDICS

among various applications. They separated the concept of multimedia data into data definition, presentation andtemporal structure, but they do not provide conceptual structure for efficient query processing. Jiang andElmagarmid [JE98] presented a video database system called WVTDB which supports semantic modeling andcontent-based search on the world wide web (WWW). Lee et. al. [Lee+99] developed an icon-based, graphicalquery language GVISUAL which allows a user to specify temporal queries using iconic/graphicalrepresentations, but no support facility for content-based retrieval is provided.

2.3 Hybrid image and video retrieval

In addition, some work on hybrid query retrieval of image and video was surveyed and reviewed in

[YM98]. Chabot [OS95] is a picture retrieval system and its object identification is based on color analysissemi automatically. It uses both keywords and image features for retrieval. In [CH92, GWJ91], the authorsgave an idea of building a hierarchy of image representations from raw image data to objects and relations atthe user semantic level. Chang and Hsu [CH92] analyzed raw image data in terms of their geometric patterns,scenes with semantics, and some meaningful entities. The overall information could be utilized in spatialreasoning and image retrieval. Gupta, Weymouth and Jain [GWJ91] have developed a VIMSYS model forquerying a pictorial database. SEMCOG [LCH97] is an object-based image retrieval system whichintegrated semantic and cognition-based information for retrieval. Besides hybrid image retrieval, there havebeen some hybrid video querying systems [ZLSW95, ZWLS95, CA96]. Zhang et. al. [ZLSW95, ZWLS95]parsed and decomposed raw video data into shots and scenes automatically; while JACOB [CA96] usedcamera operations to split a video into shots. They both use keywords and low-level image features for videoretrieval and browsing.

2.4 Relevance to MPEG-7

MPEG-7 standard, a means of attaching metadata to multimedia content, is often called “MultimediaContent Description Interface”. It aims at providing a rich set of audiovisual description tools to describemultimedia content data, which will support some degree of interpretation of the information’s meaning. It isintended to describe audiovisual information regardless of storage, coding, display, transmission, medium, ortechnology. Audiovisual data content may include still pictures, graphics, 3D models, audio, speech, video,and composition information about how these elements are combined in a multimedia presentation. Specialcases of these general data types may include facial expressions and personal characteristics. MPEG-7 workcan be separated into three parts: Descriptors, Description Schemes, and a Description Definition Language.Descriptors are the representations of low-level features. Description Schemes are structured combinationsof Descriptors, and they can be used to form a richer expression of a higher-level concept. The DescriptionDefinition Language is the language that allows the creation of new Description Schemes and Descriptors. Italso allows the extension and modification of existing Description Schemes.

EDICS

There are many MPEG-7-related projects being undertaken within commercial enterprises, particularlybroadcasting and digital imaging companies.ψ [HARMONY] is a three-way International Digital LibrariesInitiative project between Cornell University, the Distributed Systems Technology Centre, and theUniversity of Bristol’s Institute for Learning and Research Technology. Its objective is to develop aframework to deal with the challenge of describing networked collections of highly complex and mixed-media digital objects. The research draws together works on the RDF, XML, Dublin Core, MPEG-7 andINDECS standards, and focuses on the problem of allowing multiple communities of expertise (e.g., library,education, rights management) to define overlapping descriptive vocabularies for annotating multimediacontent.

DICEMAN (Distributed Internet Content Exchange with MPEG-7 and Agent Negotiations) project is anEC-funded project between Teltec Ireland DCU, CSELT (Italy), IBM (Germany), INA (France), IST(Portugal), KPN Research (Netherlands), Riverland (Britain) and UPC (Spain) [DICEMAN]. Its objective isto develop an end-to-end chain for indexing, storage, search and trading of digital audiovisual content. Thetechnical aspects of this project are mainly: MPEG-7 indexing through a COntent Provider's Application(COPA); the use of Foundation for Intelligent Physical Agents (FIPA) to search and locate the best content;and support for electronic commerce and rights management.

The A4SM project, which is based on GMD's IPSI (Integrated Publication and Information SystemsInstitute), is currently researching the application of IT support to all stages of the video production process

[IPSI]. The purpose is to seamlessly integrate an IT support framework into the production process, i.e., pre-production (e.g., script development, story boarding, etc.), production (e.g., collection of media-data byusing an MPEG-2/7 camera, etc.), and the post-production (support of non-linear editing). In collaborationwith TV-reporters, cameramen and editors they have designed an MPEG-7 camera in combination with amobile annotation device for the reporter, and a mobile editing suite suitable for the generation of news-clips.Overall, the MPEG-7 standard and its related projects concentrate on content description and metadataattachment to multimedia data. Few facilities and little support have been provided by them in terms ofvideo query formulation, processing, and retrieval, which are exactly the main theme of this paper.

3. Hybrid Approach to Video Retrieval

In a Video Database Management system (VDBMS), there exists an important need for efficient retrievalfacility of the voluminous data. Accordingly, many ways are pu …… 此处隐藏：6265字，全部文档内容请下载后查看。喜欢就下载吧 ……

Accommodating Hybrid Retrieval in a Comprehensive Video Data(2).doc 将本文的Word文档下载到电脑，方便复制、编辑、收藏和打印

下载这篇word文档

本文链接：https://www.jiaowen.net/wenku/128086.html（转载请注明文章来源）

上一篇：2010福建省驾校考试科目一自动档最新考试试题库(完整版)
下一篇：2003版增值税纳税申报表(适用于营改增地区一般纳税人)