To see the other types of publications on this topic, follow the link: Semantic multimedia representation.

Journal articles on the topic 'Semantic multimedia representation'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 50 journal articles for your research on the topic 'Semantic multimedia representation.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.

1

Mylonas, Phivos, Thanos Athanasiadis, Manolis Wallace, Yannis Avrithis, and Stefanos Kollias. "Semantic representation of multimedia content: Knowledge representation and semantic indexing." Multimedia Tools and Applications 39, no. 3 (September 4, 2007): 293–327. http://dx.doi.org/10.1007/s11042-007-0161-4.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Zhang, Hong, Yu Huang, Xin Xu, Ziqi Zhu, and Chunhua Deng. "Latent semantic factorization for multimedia representation learning." Multimedia Tools and Applications 77, no. 3 (August 30, 2017): 3353–68. http://dx.doi.org/10.1007/s11042-017-5135-6.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Duan, Yiping, Qiyuan Du, Xin Fang, Zhipeng Xie, Zhijin Qin, Xiaoming Tao, Chengkang Pan, and Guangyi Liu. "Multimedia Semantic Communications: Representation, Encoding and Transmission." IEEE Network 37, no. 1 (January 2023): 44–50. http://dx.doi.org/10.1109/mnet.001.2200468.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Wagenpfeil, Stefan, Paul Mc Kevitt, and Matthias Hemmje. "Towards Automated Semantic Explainability of Multimedia Feature Graphs." Information 12, no. 12 (December 2, 2021): 502. http://dx.doi.org/10.3390/info12120502.

Full text
Abstract:
Multimedia feature graphs are employed to represent features of images, video, audio, or text. Various techniques exist to extract such features from multimedia objects. In this paper, we describe the extension of such a feature graph to represent the meaning of such multimedia features and introduce a formal context-free PS-grammar (Phrase Structure grammar) to automatically generate human-understandable natural language expressions based on such features. To achieve this, we define a semantic extension to syntactic multimedia feature graphs and introduce a set of production rules for phrases of natural language English expressions. This explainability, which is founded on a semantic model provides the opportunity to represent any multimedia feature in a human-readable and human-understandable form, which largely closes the gap between the technical representation of such features and their semantics. We show how this explainability can be formally defined and demonstrate the corresponding implementation based on our generic multimedia analysis framework. Furthermore, we show how this semantic extension can be employed to increase the effectiveness in precision and recall experiments.
APA, Harvard, Vancouver, ISO, and other styles
5

Al-Khatib, W., Y. F. Day, A. Ghafoor, and P. B. Berra. "Semantic modeling and knowledge representation in multimedia databases." IEEE Transactions on Knowledge and Data Engineering 11, no. 1 (1999): 64–80. http://dx.doi.org/10.1109/69.755616.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

Petridis, K., S. Bloehdorn, C. Saathoff, N. Simou, S. Dasiopoulou, V. Tzouvaras, S. Handschuh, Y. Avrithis, Y. Kompatsiaris, and S. Staab. "Knowledge representation and semantic annotation of multimedia content." IEE Proceedings - Vision, Image, and Signal Processing 153, no. 3 (2006): 255. http://dx.doi.org/10.1049/ip-vis:20050059.

Full text
APA, Harvard, Vancouver, ISO, and other styles
7

Smith, Roger W., Dorota Kieronska, and Svetha Venkatesh. "Conceptual Representation for Multimedia Information." International Journal of Pattern Recognition and Artificial Intelligence 11, no. 02 (March 1997): 303–27. http://dx.doi.org/10.1142/s0218001497000147.

Full text
Abstract:
Multimedia information is now routinely available in the forms of text, pictures, animation and sound. Although text objects are relatively easy to deal with (in terms of information search and retrieval), other information bearing objects (such as sound, images, animation) are more difficult to index. Our research is aimed at developing better ways of representing multimedia objects by using a conceptual representation based on Schank's conceptual dependencies. Moreover, the representation allows for users' individual interpretations to be embedded in the system. This will alleviate the problems associated with traditional semantic networks by allowing for coexistence of multiple views of the same information. The viability of the approach is tested, and the preliminary results reported.
APA, Harvard, Vancouver, ISO, and other styles
8

Chang, Xiaojun, Zhigang Ma, Yi Yang, Zhiqiang Zeng, and Alexander G. Hauptmann. "Bi-Level Semantic Representation Analysis for Multimedia Event Detection." IEEE Transactions on Cybernetics 47, no. 5 (May 2017): 1180–97. http://dx.doi.org/10.1109/tcyb.2016.2539546.

Full text
APA, Harvard, Vancouver, ISO, and other styles
9

Yang, Bo, and Ali R. Hurson. "Similarity-Based Clustering Strategy for Mobile Ad Hoc Multimedia Databases." Mobile Information Systems 1, no. 4 (2005): 253–73. http://dx.doi.org/10.1155/2005/317136.

Full text
Abstract:
Multimedia data are becoming popular in wireless ad hoc environments. However, the traditional content-based retrieval techniques are inefficient in ad hoc networks due to the multiple limitations such as node mobility, computation capability, memory space, network bandwidth, and data heterogeneity. To provide an efficient platform for multimedia retrieval, we propose to cluster ad hoc multimedia databases based on their semantic contents, and construct a virtual hierarchical indexing infrastructure overlaid on the mobile databases. This content-aware clustering scheme uses a semantic-aware framework as the theoretical foundation for data organization. Several novel techniques are presented to facilitate the representation and manipulation of multimedia data in ad hoc networks: 1) using concise distribution expressions to represent the semantic similarity of multimedia data, 2) constructing clusters based on the semantic relationships between multimedia entities, 3) reducing the cost of content-based multimedia retrieval through the restriction of semantic distances, and 4) employing a self-adaptive mechanism that dynamically adjusts to the content and topology changes of the ad hoc networks. The proposed scheme is scalable, fault-tolerant, and efficient in performing content-based multimedia retrieval as demonstrated in our combination of theoretical analysis and extensive experimental studies.
APA, Harvard, Vancouver, ISO, and other styles
10

Luan, Xi Dao, Yu Xiang Xie, Yi Hong Tan, Sai Hu, Zhi Ping Chen, and Jing Wang. "Description Logic Based Objects and Space Relations Representation." Applied Mechanics and Materials 48-49 (February 2011): 366–72. http://dx.doi.org/10.4028/www.scientific.net/amm.48-49.366.

Full text
Abstract:
This theme focuses on representing and reasoning high-level semantic based on concepts and their space relations. As to multimedia data, such as image and video, acquiring, representing and retrieving high-level semantic information has been a confused problem for a long time. Without the support of knowledge database, it is an impossible mission to carry out the simple synonymous retrieval, let alone retrieving the abstract semantic. This paper proposes some algorithms to translate restored concepts and their relations into a Concept Semantic Network, which is visualized by SVG finally. The paper also introduces the method of recording concepts distribution by description logic, which services users with concepts and distribution retrieval.
APA, Harvard, Vancouver, ISO, and other styles
11

Yokota, Masao. "Aware Computing in Spatial Language Understanding Guided by Cognitively Inspired Knowledge Representation." Applied Computational Intelligence and Soft Computing 2012 (2012): 1–10. http://dx.doi.org/10.1155/2012/184103.

Full text
Abstract:
Mental image directed semantic theory (MIDST) has proposed an omnisensory mental image model and its description languageLmd. This language is designed to represent and compute human intuitive knowledge of space and can provide multimedia expressions with intermediate semantic descriptions in predicate logic. It is hypothesized that such knowledge and semantic descriptions are controlled by human attention toward the world and therefore subjective to each human individual. This paper describesLmdexpression of human subjective knowledge of space and its application to aware computing in cross-media operation between linguistic and pictorial expressions as spatial language understanding.
APA, Harvard, Vancouver, ISO, and other styles
12

Penta, Antonio. "A multimedia semantic framework for image understanding and retrieval." Encyclopedia with Semantic Computing and Robotic Intelligence 01, no. 01 (March 2017): 1650001. http://dx.doi.org/10.1142/s2425038416500012.

Full text
Abstract:
On the grounds, ontologies have been shown to be a powerful resource for the interpretation and translation of the terminological and semantic relationships within domains of interest but it is still unclear how they can be applied in the context of multimedia data. In this paper, we describe a framework which can capture and manage semantic information related to the multimedia data by modeling in the ontology their features. In particular, the proposed ontology-based framework is organized in the following way: at the lower levels, spatial objects, colors, shapes are represented, and semantic relationships can be established among them; at the higher levels, objects with semantic properties are put into relationship among themselves as well as with the corresponding low-level objects. On this basis, we have designed an ontological system particularly suitable for image retrieval. We have also taken into account the inherent uncertainty related to the representation and detection of multimedia properties in this complex domain. Along this work, we have provided examples from the image domain; moreover, since ontologies provide a semantic means for the semantic comparison of objects and relationships across different formats, the system is easily extensible to other, heterogeneous data sources.
APA, Harvard, Vancouver, ISO, and other styles
13

Afef, Zwidi, Ameni Yengui, and Neji Mahmoud. "Research system of semantic information in medical videoconference based on conceptual graphs and domain ontologies." INTERNATIONAL JOURNAL OF MANAGEMENT & INFORMATION TECHNOLOGY 7, no. 2 (November 30, 2013): 979–99. http://dx.doi.org/10.24297/ijmit.v7i2.703.

Full text
Abstract:
The multiplication of the number of AudioVisual Documents (AVD) engendered a problem while searching for information within gigantic databases of which we are incapable to index their contents completely manually. Indeed, several complex difficulties are put by these documents because of the vertiginous increase of the quantity of the multimedia data to be treated and the specification met in the representation and the extraction of their contents in particular semantics of the fact that these documents contain three types of media (text, sound, image). AVDs can be classified in professional broadcasted videos (movies, emissions), sporting videos, video controlling, videoconference etc. In this paper, we propose a model of representation of the semantic contents of videoconferences documents in medicine based on the conceptual graphs taking into account the different modalities. This model is based on the concepts extraction and the semantic relations between them and appeals ontology domain.
APA, Harvard, Vancouver, ISO, and other styles
14

Latif, Afshan, Aqsa Rasheed, Umer Sajid, Jameel Ahmed, Nouman Ali, Naeem Iqbal Ratyal, Bushra Zafar, Saadat Hanif Dar, Muhammad Sajid, and Tehmina Khalil. "Content-Based Image Retrieval and Feature Extraction: A Comprehensive Review." Mathematical Problems in Engineering 2019 (August 26, 2019): 1–21. http://dx.doi.org/10.1155/2019/9658350.

Full text
Abstract:
Multimedia content analysis is applied in different real-world computer vision applications, and digital images constitute a major part of multimedia data. In last few years, the complexity of multimedia contents, especially the images, has grown exponentially, and on daily basis, more than millions of images are uploaded at different archives such as Twitter, Facebook, and Instagram. To search for a relevant image from an archive is a challenging research problem for computer vision research community. Most of the search engines retrieve images on the basis of traditional text-based approaches that rely on captions and metadata. In the last two decades, extensive research is reported for content-based image retrieval (CBIR), image classification, and analysis. In CBIR and image classification-based models, high-level image visuals are represented in the form of feature vectors that consists of numerical values. The research shows that there is a significant gap between image feature representation and human visual understanding. Due to this reason, the research presented in this area is focused to reduce the semantic gap between the image feature representation and human visual understanding. In this paper, we aim to present a comprehensive review of the recent development in the area of CBIR and image representation. We analyzed the main aspects of various image retrieval and image representation models from low-level feature extraction to recent semantic deep-learning approaches. The important concepts and major research studies based on CBIR and image representation are discussed in detail, and future research directions are concluded to inspire further research in this area.
APA, Harvard, Vancouver, ISO, and other styles
15

Lemos, Daniela Lucas da Silva, and Renato Rocha Souza. "Knowledge Organization Systems for the Representation of Multimedia Resources on the Web: A Comparative Analysis." KNOWLEDGE ORGANIZATION 47, no. 4 (2020): 300–319. http://dx.doi.org/10.5771/0943-7444-2020-4-300.

Full text
Abstract:
The lack of standardization in the production, organization and dissemination of information in documentation centers and institutions alike, as a result from the digitization of collections and their availability on the internet has called for integration efforts. The sheer availability of multimedia content has fostered the development of many distinct and, most of the time, independent metadata standards for its description. This study aims at presenting and comparing the existing standards of metadata, vocabularies and ontologies for multimedia annotation and also tries to offer a synthetic overview of its main strengths and weaknesses, aiding efforts for semantic integration and enhancing the findability of available multimedia resources on the web. We also aim at unveiling the characteristics that could, should and are perhaps not being highlighted in the characterization of multimedia resources.
APA, Harvard, Vancouver, ISO, and other styles
16

Cai, Liewu, Lei Zhu, Hongyan Zhang, and Xinghui Zhu. "DA-GAN: Dual Attention Generative Adversarial Network for Cross-Modal Retrieval." Future Internet 14, no. 2 (January 27, 2022): 43. http://dx.doi.org/10.3390/fi14020043.

Full text
Abstract:
Cross-modal retrieval aims to search samples of one modality via queries of other modalities, which is a hot issue in the community of multimedia. However, two main challenges, i.e., heterogeneity gap and semantic interaction across different modalities, have not been solved efficaciously. Reducing the heterogeneous gap can improve the cross-modal similarity measurement. Meanwhile, modeling cross-modal semantic interaction can capture the semantic correlations more accurately. To this end, this paper presents a novel end-to-end framework, called Dual Attention Generative Adversarial Network (DA-GAN). This technique is an adversarial semantic representation model with a dual attention mechanism, i.e., intra-modal attention and inter-modal attention. Intra-modal attention is used to focus on the important semantic feature within a modality, while inter-modal attention is to explore the semantic interaction between different modalities and then represent the high-level semantic correlation more precisely. A dual adversarial learning strategy is designed to generate modality-invariant representations, which can reduce the cross-modal heterogeneity efficiently. The experiments on three commonly used benchmarks show the better performance of DA-GAN than these competitors.
APA, Harvard, Vancouver, ISO, and other styles
17

Yuan, Xu, Hua Zhong, Zhikui Chen, Fangming Zhong, and Yueming Hu. "Multimedia Feature Mapping and Correlation Learning for Cross-Modal Retrieval." International Journal of Grid and High Performance Computing 10, no. 3 (July 2018): 29–45. http://dx.doi.org/10.4018/ijghpc.2018070103.

Full text
Abstract:
This article describes how with the rapid increasing of multimedia content on the Internet, the need for effective cross-modal retrieval has attracted much attention recently. Many related works ignore the latent semantic correlations of modalities in the non-linear space and the extraction of high-level modality features, which only focuses on the semantic mapping of modalities in linear space and the use of low-level artificial features as modality feature representation. To solve these issues, the authors first utilizes convolutional neural networks and topic modal to obtain a high-level semantic feature of various modalities. Sequentially, they propose a supervised learning algorithm based on a kernel with partial least squares that can capture semantic correlations across modalities. Finally, the joint model of different modalities is learnt by the training set. Extensive experiments are conducted on three benchmark datasets that include Wikipedia, Pascal and MIRFlickr. The results show that the proposed approach achieves better retrieval performance over several state-of-the-art approaches.
APA, Harvard, Vancouver, ISO, and other styles
18

Alti, Adel, Sébastian Laborie, and Philippe Roose. "A Community-Based Semantic Social Context-Aware Driven Adaptation for Multimedia Documents." International Journal of Virtual Communities and Social Networking 7, no. 2 (April 2015): 31–49. http://dx.doi.org/10.4018/ijvcsn.2015040102.

Full text
Abstract:
This paper presents an approach to enhance users experience through the use of recommendations and social networks for on-the-fly (at runtime) adaptation of multimedia documents. This paper presents also CSSAP, a dynamic service selection and assembly tool based on new user profiles and community profiles defined as set of semantic metadata, which context, quality of service and quality of experience parameters. The tool is based on community-aware semantic services and offer architecture, with three layers (semantic query, community management and semantic services). The most innovative characteristic of the tool is that it profits from the potential of semantic representation techniques to express context constraints and community's interests, while they may be useful to generate and manage of complex dynamic adaptation process. This tool improves assembly of relevant adaptation services for communities inferred social influence from a Facebook as virtual P2P environment. The proposed approach has been validated through a prototype for mobiles user of multimedia contents exchanges. The goal is to improve assembly of potential adaptation services and the efficiency and effectiveness of the authors' approach.
APA, Harvard, Vancouver, ISO, and other styles
19

Kollia, Ilianna, Nikolaos Simou, Andreas Stafylopatis, and Stefanos Kollias. "SEMANTIC IMAGE ANALYSIS USING A SYMBOLIC NEURAL ARCHITECTURE." Image Analysis & Stereology 29, no. 3 (November 1, 2010): 159. http://dx.doi.org/10.5566/ias.v29.p159-172.

Full text
Abstract:
Image segmentation and classification are basic operations in image analysis and multimedia search which have gained great attention over the last few years due to the large increase of digital multimedia content. A recent trend in image analysis aims at incorporating symbolic knowledge representation systems and machine learning techniques. In this paper, we examine interweaving of neural network classifiers and fuzzy description logics for the adaptation of a knowledge base for semantic image analysis. The proposed approach includes a formal knowledge component, which, assisted by a reasoning engine, generates the a-priori knowledge for the image analysis problem. This knowledge is transferred to a kernel based connectionist system, which is then adapted to a specific application field through extraction and use of MPEG-7 image descriptors. Adaptation of the knowledge base can be achieved next. Combined segmentation and classification of images, or video frames, of summer holidays, is the field used to illustrate the good performance of the proposed approach.
APA, Harvard, Vancouver, ISO, and other styles
20

ORIA, VINCENT, and M. TAMER ÖZSU. "VIEWS OR POINTS OF VIEW ON IMAGES." International Journal of Image and Graphics 03, no. 01 (January 2003): 55–79. http://dx.doi.org/10.1142/s0219467803000919.

Full text
Abstract:
Images like other multimedia data need to be described as it is difficult to grasp their semantics from the raw data. With the emergence of standards like MPEG-7, multimedia data will be increasingly produced together with some semantic descriptors. But a description of a multimedia data is just an interpretation, a point of view on the data and different interpretations can exist for the same multimedia data. In this paper we explore the use of view techniques to define and manage different points of view on images. Views have been widely used in relational database management systems to extend modeling capabilities, and to provide logical data independence. Since our image model is defined on an object-oriented model, we will first propose a powerful object-oriented mechanism based on the distinction between class and type. The object view is used in the image view definition. The image view mechanism exploits the separation of the physical representation in an image of a real world object from the real object itself to allow different interpretations of an image region. Finally we will discuss the implementation of the image view mechanisms on the existing object models.
APA, Harvard, Vancouver, ISO, and other styles
21

Ha, Hsin-Yu, Fausto C. Fleites, and Shu-Ching Chen. "Content-Based Multimedia Retrieval Using Feature Correlation Clustering and Fusion." International Journal of Multimedia Data Engineering and Management 4, no. 2 (April 2013): 46–64. http://dx.doi.org/10.4018/jmdem.2013040103.

Full text
Abstract:
Nowadays, only processing visual features is not enough for multimedia semantic retrieval due to the complexity of multimedia data, which usually involve a variety of modalities, e.g. graphics, text, speech, video, etc. It becomes crucial to fully utilize the correlation between each feature and the target concept, the feature correlation within modalities, and the feature correlation across modalities. In this paper, the authors propose a Feature Correlation Clustering-based Multi-Modality Fusion Framework (FCC-MMF) for multimedia semantic retrieval. Features from different modalities are combined into one feature set with the same representation via a normalization and discretization process. Within and across modalities, multiple correspondence analysis is utilized to obtain the correlation between feature-value pairs, which are then projected onto the two principal components. K-medoids algorithm, which is a widely used partitioned clustering algorithm, is selected to minimize the Euclidean distance within the resulted clusters and produce high intra-correlated feature-value pair clusters. Majority vote is applied to subsequently decide which cluster each feature belongs to. Once the feature clusters are formed, one classifier is built and trained for each cluster. The correlation and confidence of each classifier are considered while fusing the classification scores, and mean average precision is used to evaluate the final ranked classification scores. Finally, the proposed framework is applied on NUS-wide Lite data set to demonstrate the effectiveness in multimedia semantic retrieval.
APA, Harvard, Vancouver, ISO, and other styles
22

Jiao, Sai-Mei, Hai-feng Wang, Kun Zhang, and Ya-qi Hu. "Neural Linguistic Steganalysis via Multi-Head Self-Attention." Journal of Electrical and Computer Engineering 2021 (April 17, 2021): 1–5. http://dx.doi.org/10.1155/2021/6668369.

Full text
Abstract:
Linguistic steganalysis can indicate the existence of steganographic content in suspicious text carriers. Precise linguistic steganalysis on suspicious carrier is critical for multimedia security. In this paper, we introduced a neural linguistic steganalysis approach based on multi-head self-attention. In the proposed steganalysis approach, words in text are firstly mapped into semantic space with a hidden representation for better modeling the semantic features. Then, we utilize multi-head self-attention to model the interactions between words in carrier. Finally, a softmax layer is utilized to categorize the input text as cover or stego. Extensive experiments validate the effectiveness of our approach.
APA, Harvard, Vancouver, ISO, and other styles
23

Gvishiani, N. B. "A MULTIMODAL ‘TEXT’: THE LINGUOPRAGMATIC PECULIARITIES OF VERBAL AND NON-VERBAL COMPONENTS INTERACTING IN DIFFERENT COMMUNICATIVE TYPES OF DISCOURSE." Voprosy Kognitivnoy Lingvistiki, no. 1 (2023): 15–17. http://dx.doi.org/10.20916/1812-3228-2022-15-17.

Full text
Abstract:
The article dwells on the interaction of verbal and non-verbal components in different communicative media - painting, filmography, and art-reviews discourse. In modern art, we come across various ‘mixed media’ in creating visual or moving images, which may also include the verbal component. The power of linguistic discourse is then applied in the spheres where traditionally other modes were found to prevail. In conceptualism, the word becomes a means of reflection and in neo surrealism - it fills expressive narratives growing into illocutionary speech acts. In the article, text is considered as part of art-multimedia objects and the visual image - as incorporated into a verbal narrative. The dominant role of the verbal component is traced in the conceptual perception of art objects as well as in creating ‘potentially multimodal’ journalistic texts through concrete and abstract linguistic representation. If concrete representation is realized in referential meanings of words, abstract representation hinges on their emotive meanings. It has been observed that whatever the word’s function in art-multimedia may be, it results in broadening the word’s semantic scope and extending its conceptual potential.
APA, Harvard, Vancouver, ISO, and other styles
24

De Masi, A. "DIGITAL DOCUMENTATION’S ONTOLOGY: CONTEMPORARY DIGITAL REPRESENTATIONS AS EXPRESS AND SHARED MODELS OF REGENERATION AND RESILIENCE IN THE PLATFORM BIM/CONTAMINATED HYBRID REPRESENTATION." International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences XLVI-M-1-2021 (August 28, 2021): 189–97. http://dx.doi.org/10.5194/isprs-archives-xlvi-m-1-2021-189-2021.

Full text
Abstract:
Abstract. The study illustrates a university research project of “Digital Documentation’s Ontology”, to be activated with other universities, of an Platform (P) – Building Information Modeling (BIM) articulated on a Contaminated Hybrid Representation (diversification of graphic models); the latter, able to foresee categories of Multi-Representations that interact with each other for to favour several representations, adapted to a different information density in the digital multi-scale production, is intended as platform (grid of data and information at different scales, semantic structure from web content, data and information storage database, archive, model and form of knowledge and ontological representation shared) of: inclusive digital ecosystem development; digital regenerative synergies of representation with adaptable and resilient content in hybrid or semi-hybrid Cloud environments; phenomenological reading of the changing complexity of environmental reality; hub solution of knowledge and simulcast description of information of Cultural Heritage (CH); multimedia itineraries to enhance participatory and attractive processes for the community; factor of cohesion and sociality, an engine of local development. The methodology of P-BIM/CHR is articulated on the following ontologies: Interpretative and Codification, Morphology, Lexicon, Syntax, Metamorphosis, Metadata in the participatory system, Regeneration, Interaction and Sharing. From the point of view the results and conclusion the study allowed to highlight: a) Digital Regenerative synergies of representation; b) Smart CH Model for an interconnection of systems and services within a complex set of relationships.
APA, Harvard, Vancouver, ISO, and other styles
25

Bogdanova, Galina, Todor Todorov, Nikolay Noev, and Stefka Kancheva. "Research on Linguistic Approaches, Used for Semantic Explanation of Bell’s Knowledge." Digital Presentation and Preservation of Cultural and Scientific Heritage 2 (September 30, 2012): 155–60. http://dx.doi.org/10.55630/dipp.2012.2.7.

Full text
Abstract:
This paper presents a research of linguistic structure of Bulgarian bells knowledge. The idea of building semantic structure of Bulgarian bells appeared during the “Multimedia fund – BellKnow” project. In this project was collected a lots of data about bells, their structure, history, technical data, etc. This is the first attempt for computation linguistic explain of bell knowledge and deliver a semantic representation of that knowledge. Based on this research some linguistic components, aiming to realize different types of analysis of text objects are implemented in term dictionaries. Thus, we lay the foundation of the linguistic analysis services in these digital dictionaries aiding the research of kinds, number and frequency of the lexical units that constitute various bell objects.
APA, Harvard, Vancouver, ISO, and other styles
26

Zhu, Xinghui, Liewu Cai, Zhuoyang Zou, and Lei Zhu. "Deep Multi-Semantic Fusion-Based Cross-Modal Hashing." Mathematics 10, no. 3 (January 29, 2022): 430. http://dx.doi.org/10.3390/math10030430.

Full text
Abstract:
Due to the low costs of its storage and search, the cross-modal retrieval hashing method has received much research interest in the big data era. Due to the application of deep learning, the cross-modal representation capabilities have risen markedly. However, the existing deep hashing methods cannot consider multi-label semantic learning and cross-modal similarity learning simultaneously. That means potential semantic correlations among multimedia data are not fully excavated from multi-category labels, which also affects the original similarity preserving of cross-modal hash codes. To this end, this paper proposes deep multi-semantic fusion-based cross-modal hashing (DMSFH), which uses two deep neural networks to extract cross-modal features, and uses a multi-label semantic fusion method to improve cross-modal consistent semantic discrimination learning. Moreover, a graph regularization method is combined with inter-modal and intra-modal pairwise loss to preserve the nearest neighbor relationship between data in Hamming subspace. Thus, DMSFH not only retains semantic similarity between multi-modal data, but integrates multi-label information into modal learning as well. Extensive experimental results on two commonly used benchmark datasets show that our DMSFH is competitive with the state-of-the-art methods.
APA, Harvard, Vancouver, ISO, and other styles
27

Zhang, Ruiping. "A Personalized Course Resource Recommendation Method Based on Deep Learning in an Online Multi-Modal Multimedia Education Cloud Platform." International Journal of Information Technologies and Systems Approach 16, no. 2 (March 2, 2023): 1–14. http://dx.doi.org/10.4018/ijitsa.319344.

Full text
Abstract:
Aiming at the problem that unstructured text in online multi-modal multimedia education is easy to cause error propagation, this paper proposes a personalized course resource recommendation method using deep learning in online multi-modal multimedia education cloud platform. First, the word vector of the text is obtained from the course data set by using the BERT pre-training model, and its semantic information in different contexts is analyzed. Then, the more complex representation of each word is extracted through the long short-term memory network (LSTM), in which the multi-head attention layer adds different weights to different word vector to better capture the key information in the sentence. Finally, the CRF layer is used to identify sentence entities, and the Sigmoid layer is used to extract relations, thus completing personalized course resource recommendation, which is significantly improved compared with other models. Experimental analysis shows that the algorithm is effective in personalized course resource recommendation.
APA, Harvard, Vancouver, ISO, and other styles
28

Wu, Xiao-Ming, Xin Luo, Yu-Wei Zhan, Chen-Lu Ding, Zhen-Duo Chen, and Xin-Shun Xu. "Online Enhanced Semantic Hashing: Towards Effective and Efficient Retrieval for Streaming Multi-Modal Data." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 4 (June 28, 2022): 4263–71. http://dx.doi.org/10.1609/aaai.v36i4.20346.

Full text
Abstract:
With the vigorous development of multimedia equipments and applications, efficient retrieval of large-scale multi-modal data has become a trendy research topic. Thereinto, hashing has become a prevalent choice due to its retrieval efficiency and low storage cost. Although multi-modal hashing has drawn lots of attention in recent years, there still remain some problems. The first point is that existing methods are mainly designed in batch mode and not able to efficiently handle streaming multi-modal data. The second point is that all existing online multi-modal hashing methods fail to effectively handle unseen new classes which come continuously with streaming data chunks. In this paper, we propose a new model, termed Online enhAnced SemantIc haShing (OASIS). We design novel semantic-enhanced representation for data, which could help handle the new coming classes, and thereby construct the enhanced semantic objective function. An efficient and effective discrete online optimization algorithm is further proposed for OASIS. Extensive experiments show that our method can exceed the state-of-the-art models. For good reproducibility and benefiting the community, our code and data are already publicly available.
APA, Harvard, Vancouver, ISO, and other styles
29

Sujatha, Dr K., Koulik Ghsoh, and Aneesh Anand. "Domain Adaptation and Semantic Drawing Driven Sketch-to-Photo Retrieval using Collaborative Generative Representation Learning." International Journal for Research in Applied Science and Engineering Technology 12, no. 5 (May 31, 2024): 1473–80. http://dx.doi.org/10.22214/ijraset.2024.61734.

Full text
Abstract:
Abstract: Sketch-based face recognition is an interesting task in vision and multimedia research yet it is quite challenging due to the great difference between face photos and sketches. In this paper we propose a novel approach for photo-sketch generation aiming to automatically transform face photos into detail-preserving personal sketches. Unlike the traditional models synthesizing sketches based on a dictionary of exemplars we develop a fully convolutional network to learn the end-to-end photosketch mapping. Our approach takes whole face photos as inputs and directly generates the corresponding sketch images with efficient inference and learning in which the architecture is stacked by only convolutional kernels of very small sizes. The exemplar-based method is most frequently used in face sketch synthesis because of its efficiency in representing the nonlinear mapping between face photos and sketches. However, the sketches synthesized by existing exemplar-based methods suffer from block artifacts and blur effects. In addition, most exemplar-based methods ignore the training sketches in the weight representation process. To improve synthesis performance, a novel joint training model is proposed in this paper, taking sketches into consideration. First, we construct the joint training photo and sketch by concatenating the original photo and its sketch with a high-pass filtered image of their corresponding sketch. Then, an offline random sampling strategy is adopted for each test photo patch to select the joint training photo and sketch patches in the neighboring region. Finally, a novel locality constraint is designed to calculate the reconstruction weight, allowing the synthesized sketches to have more detailed information
APA, Harvard, Vancouver, ISO, and other styles
30

Stella, Massimo, Michael S. Vitevitch, and Federico Botta. "Cognitive Networks Extract Insights on COVID-19 Vaccines from English and Italian Popular Tweets: Anticipation, Logistics, Conspiracy and Loss of Trust." Big Data and Cognitive Computing 6, no. 2 (May 12, 2022): 52. http://dx.doi.org/10.3390/bdcc6020052.

Full text
Abstract:
Monitoring social discourse about COVID-19 vaccines is key to understanding how large populations perceive vaccination campaigns. This work reconstructs how popular and trending posts framed semantically and emotionally COVID-19 vaccines on Twitter. We achieve this by merging natural language processing, cognitive network science and AI-based image analysis. We focus on 4765 unique popular tweets in English or Italian about COVID-19 vaccines between December 2020 and March 2021. One popular English tweet contained in our data set was liked around 495,000 times, highlighting how popular tweets could cognitively affect large parts of the population. We investigate both text and multimedia content in tweets and build a cognitive network of syntactic/semantic associations in messages, including emotional cues and pictures. This network representation indicates how online users linked ideas in social discourse and framed vaccines along specific semantic/emotional content. The English semantic frame of “vaccine” was highly polarised between trust/anticipation (towards the vaccine as a scientific asset saving lives) and anger/sadness (mentioning critical issues with dose administering). Semantic associations with “vaccine,” “hoax” and conspiratorial jargon indicated the persistence of conspiracy theories and vaccines in extremely popular English posts. Interestingly, these were absent in Italian messages. Popular tweets with images of people wearing face masks used language that lacked the trust and joy found in tweets showing people with no masks. This difference indicates a negative effect attributed to face-covering in social discourse. Behavioural analysis revealed a tendency for users to share content eliciting joy, sadness and disgust and to like sad messages less. Both patterns indicate an interplay between emotions and content diffusion beyond sentiment. After its suspension in mid-March 2021, “AstraZeneca” was associated with trustful language driven by experts. After the deaths of a small number of vaccinated people in mid-March, popular Italian tweets framed “vaccine” by crucially replacing earlier levels of trust with deep sadness. Our results stress how cognitive networks and innovative multimedia processing open new ways for reconstructing online perceptions about vaccines and trust.
APA, Harvard, Vancouver, ISO, and other styles
31

Zhai, Xiaohua, Yuxin Peng, and Jianguo Xiao. "Heterogeneous Metric Learning with Joint Graph Regularization for Cross-Media Retrieval." Proceedings of the AAAI Conference on Artificial Intelligence 27, no. 1 (June 29, 2013): 1198–204. http://dx.doi.org/10.1609/aaai.v27i1.8464.

Full text
Abstract:
As the major component of big data, unstructured heterogeneous multimedia content such as text, image, audio, video and 3D increasing rapidly on the Internet. User demand a new type of cross-media retrieval where user can search results across various media by submitting query of any media. Since the query and the retrieved results can be of different media, how to learn a heterogeneous metric is the key challenge. Most existing metric learning algorithms only focus on a single media where all of the media objects share the same data representation. In this paper, we propose a joint graph regularized heterogeneous metric learning (JGRHML) algorithm, which integrates the structure of different media into a joint graph regularization. In JGRHML, different media are complementary to each other and optimizing them simultaneously can make the solution smoother for both media and further improve the accuracy of the final metric. Based on the heterogeneous metric, we further learn a high-level semantic metric through label propagation. JGRHML is effective to explore the semantic relationship hidden across different modalities. The experimental results on two datasets with up to five media types show the effectiveness of our proposed approach.
APA, Harvard, Vancouver, ISO, and other styles
32

WANG, ZHIYONG, ZHERU CHI, DAGAN FENG, and AH CHUNG TSOI. "CONTENT-BASED IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK USING ADAPTIVE PROCESSING OF TREE-STRUCTURE IMAGE REPRESENTATION." International Journal of Image and Graphics 03, no. 01 (January 2003): 119–43. http://dx.doi.org/10.1142/s0219467803000944.

Full text
Abstract:
Content-based image retrieval has become an essential technique in multimedia data management. However, due to the difficulties and complications involved in the various image processing tasks, a robust semantic representation of image content is still very difficult (if not impossible) to achieve. In this paper, we propose a novel content-based image retrieval approach with relevance feedback using adaptive processing of tree-structure image representation. In our approach, each image is first represented with a quad-tree, which is segmentation free. Then a neural network model with the Back-Propagation Through Structure (BPTS) learning algorithm is employed to learn the tree-structure representation of the image content. This approach that integrates image representation and similarity measure in a single framework is applied to the relevance feedback of the content-based image retrieval. In our approach, an initial ranking of the database images is first carried out based on the similarity between the query image and each of the database images according to global features. The user is then asked to categorize the top retrieved images into similar and dissimilar groups. Finally, the BPTS neural network model is used to learn the user's intention for a better retrieval result. This process continues until satisfactory retrieval results are achieved. In the refining process, a fine similarity grading scheme can also be adopted to improve the retrieval performance. Simulations on texture images and scenery pictures have demonstrated promising results which compare favorably with the other relevance feedback methods tested.
APA, Harvard, Vancouver, ISO, and other styles
33

Yu, Jing, Zhao Lu, Shoulin Yin, and Mirjana Ivanovic. "News recommendation model based on encoder graph neural network and bat optimization in online social multimedia art education." Computer Science and Information Systems, no. 00 (2024): 25. http://dx.doi.org/10.2298/csis231225025y.

Full text
Abstract:
At present, the existing news recommendation system fails to fully consider the semantic information of news, meanwhile, the uneven popularity of news will also cause the phenomenon of long tail. Therefore, we propose a novel news recommendation model based on encoder graph neural network and Bat optimization in online social networks. Firstly, Bat optimization algorithm is used to improve the effect of news clustering. Secondly, the concept of metadata is introduced into the graph neural network, and the ontology of learning resources based on knowledge points is established to realize the correlation between news resources. Finally, the model combining Convolutional Neural Network (CNN) and attention network is used to learn the representation of news, and Gate Recurrent Unit (GRU) is used to learn the short-term preferences of users from their recent reading history. We carry out experiments on real news datasets, and compared with other advanced methods, the proposed model has better evaluation indexes.
APA, Harvard, Vancouver, ISO, and other styles
34

MIRENKOV, NIKOLAY, ALEXANDER VAZHENIN, RENTARO YOSHIOKA, TSUKASA EBIHARA, TETSUYA HIROTOMI, and TATIANA MIRENKOVA. "SELF-EXPLANATORY COMPONENTS: A NEW PROGRAMMING PARADIGM." International Journal of Software Engineering and Knowledge Engineering 11, no. 01 (February 2001): 5–36. http://dx.doi.org/10.1142/s0218194001000414.

Full text
Abstract:
A new multimedia programming paradigm is presented. It is based on a system of micro- and macro-icons (composite pictures) representing self-explanatory software components in a "film" format. A film is a series of color stills supported, if necessary, by text and sound. Each still is to represent a view of objects or processes. Each film is to represent a multiple view (an extended set of dynamic and/or static features) of objects or processes. A self-explanatory film means that the associated stills are organized and presented in such a way that the semantic richness of a computational scheme is clearly brought out. Icons and films are acquired in a net-accessible database. The user should not study them in advance. The film management system provides simple access to database items and modes to manipulate films. In this paper we explain where the database items are taken from and how the self-explanatory features of items are reached. We also describe how these items can be used for multimedia representation of methods and data and for programming users' algorithmic ideas. In addition, some technical details related to the film management system, rendering engines used for displaying various features of the software components, and the icon language are presented. Special attention is paid to how computational formulas can be attached to a film.
APA, Harvard, Vancouver, ISO, and other styles
35

Lai, Jingjuan, Hanxiong Chen, and Yuzuru Fujiwara. "An information-base system based on the self-organization of concepts represented by terms." Terminology 3, no. 2 (January 1, 1996): 313–34. http://dx.doi.org/10.1075/term.3.2.05lai.

Full text
Abstract:
Since multimedia information is complicated inform and vast in amount, conventional database-management systems or knowledge-base-management systems are hardly appropriate to store, manage, and utilize expertise effectively. A new type of information model is developed according to an analysis of the information used by specialists for research and development, and a prototype information-management system is implemented. The system consists of three parts: (1) flexible storage without special constraints on format and representation; (2) self-organization of terms by extracting semantic relationships among them; and (3) advanced utilization functions such as analogical reasoning, inductive inference, abductive inference, as well as information retrieval, numerical calculation, and deductive inference. Thesauri which are automatically compiled and refined are used as conceptual structures of the information. Thus obtained, conceptual structures can be used for sophisticated applications, including analogical reasoning, induction, and abduction. The principle of open-world reasoning and an algorithm of analogy are developed. An example of practical application to polymer information is presented.
APA, Harvard, Vancouver, ISO, and other styles
36

Rogushina, J. V. "Use of Ontology-based knowledge Organization Sysytems for WIKI Resources." PROBLEMS IN PROGRAMMING, no. 1 (March 2022): 023–33. http://dx.doi.org/10.15407/pp2022.01.023.

Full text
Abstract:
The paper considers the theoretical foundations of knowledge organization systems (KOSs) in intelligent ontology-based applications. The aim of this study is to analyze the use of different types of KOSs to organize and improve the knowledge base of semantic Wiki resources that contain heterogeneous multimedia content of large volume and have a complex structure integrated knowledge from different domains. The dialects of the OWL ontology representation language and their expressiveness for representing special cases of ontologies used in KOSs are considered. The criteria for the classification of KOSs and sphere of their usage are analyzed. Formal model of ontology for semantic Wiki resource is proposed. This model is integrated with various implementing means for different types of relations between objects in the Semantic MediaWiki environment based on templates. Problems of access and retrieval of information in these resources and methods of their solving from the KOSs point of view are considered. The software implementation of the proposed approach with the example of the portal version of the Great Ukrainian Encyclopedia (e-VUE) is realized. The urgency of the problem intensifies by the need for national information resources in martial law situation, for which the determining factors of effective information processing are both the ability to obtain satisfaction of complex information needs and the relevance of the information obtained. This increases the importance of official government portals that integrate reliable data from various fields of knowledge and prevent possible misrepresentation (both accidental and malicious) of information in resources with open content generation.
APA, Harvard, Vancouver, ISO, and other styles
37

Rogushina, J. V., and I. J. Grishanova. "Ontological methods and tools for semantic extension of the media WIKI technology." PROBLEMS IN PROGRAMMING, no. 2-3 (September 2020): 061–73. http://dx.doi.org/10.15407/pp2020.02-03.061.

Full text
Abstract:
Practical aspects of ontological approach to organization of intelligent Wiki-based information resources (IR) are considered. We analyze the main features, capabilities and limitations of MediaWiki as a technological platform for development of the Web-based information resource and suggest main directions of its refinement. We propose an abstract model of MediaWiki architecture that formalizes relations between the main components of this software environment and analyze the ways of its semantic extensions based on ontological representation of domain knowledge. An original algorithm of semantic Wiki pages matching with domain ontology is developed. We propose an ontological model of IR that formalizes its knowledge base structure and explicitly performs main features of typical information objects (TIO) of this IR. Such TIOs depend on domain specifics and purposes of IR, therefore their development has to involve domain experts and knowledge engineers. Use of ontology corresponding to the set of Wiki pages (either with semantic markup or without it) provides new IR functions associated with semantic search and navigation. Other important aspect of intelligent Wiki resource development deals with adaptation of user interface to the specifics of IR: enabling various tools of navigation, visualization and content analysis by processing of TIO features enriches IR functionality, reduces access time to information and makes usage of IR more efficient. Developing additional MediaWiki functionality with new requests to the MediaWiki API using TIO templates, extends data analysis and integration capabilities, and offers different, user-focused, IR content views expands the possibilities of data integration and proposes various user-oriented representations of IR content. Wiki resource semantization allows the use knowledge acquired from such IR by external application, or example, by search engines for intelligent Web retrieval. Domain ontologies based on various subsets of the Wiki pages and generated by them thesauri can be used by various Semantic Web applications, both independently or in general technological chain for personified retrieval focused on individual users and their tasks. Approbation of this approach is demonstrated by MAIPS retrieval system. We consider the use semantic similarity of concepts represented by Wiki-pages of IR as an additional way of intelligent navigation between these pages. Such approach allows to group Wiki pages according to user interests by different aspects of their content and structure. Wiki ontologies are considered as the basis for estimation of semantic similarity between domain concepts pertinent to user task. Such elements of Wiki ontology as classes, property values of class instances and relations between them are used as parameters for the quantitative assessment of semantic similarity of Wiki pages. We propose to use local similarity and generate the sets of semantically similar concepts (SSC) that takes into account some subset of page properties and categories defined by user needs. Such sets of SSCs can be considered as user task thesauri for other applications. In addition, we propose to enrich the basic tools of MediaWiki used for access management to the IR content with specialized software code that performs content classification that take into consideration separate namespaces, categories, templates and semantic properties of TIO acquired from Wiki markup. We demonstrate the software implementation of proposed solutions by developing of portal version of the Great Ukrainian Encyclopedia (e-VUE) that contains heterogeneous multimedia content with complex structure. We analyze the specifics of e-VUE knowledge system and develop its formalized TIO representation based on Semantic Web technologies and ontological analysis. Ontological model of e-VUE and original methods of its processing used for this project extend the functionality of the portal in the area of search, navigation, integration and protection of content based on background domain knowledge. In addition, original user interface of e-VUE is developed with an allowance for Encyclopedia knowledge specifics, substantially differs from the standard Wiki, meets the requirements, goals and objectives of this IR and provides a lot of additional features.
APA, Harvard, Vancouver, ISO, and other styles
38

Strashko, Iryna V. "PHONETIC, LEXICAL, GRAMMATICAL, COGNITIVE, AND PRAGMATIC LEVELS OF THE LINGUISTIC PERSONALITY (BASED ON THE INTERVIEW FROM THE AUTHOR’S MULTIMEDIA CORPUS)." Scientific Journal of National Pedagogical Dragomanov University. Series 9. Current Trends in Language Development, no. 25 (June 30, 2023): 79–89. http://dx.doi.org/10.31392/npu-nc.series9.2023.25.06.

Full text
Abstract:
The paper focuses on the analysis of the means of representation of the informant’s linguistic personality at phonetic, lexical, grammatical, cognitive, and pragmatic levels in the oral discourse. The material of the study is a transcript of an audio recording of one interview from the author’s multimedia corpus “Everyone has their own war”. The interview was recorded in the Ukrainian language in one of the most emotionally, psychologically, and physically difficult moments of the informant’s life. Despite a certain limitation of language material, the peculiarities of the speech manifestations of the linguistic personality of the informant, a twenty-nine age widow (a woman and a mother), are representative since she describes her life and the life of her family after the full-scale invasion on February 24 and until May 2022. The analysis of the informant’s linguistic personality shows that the verbal and semantic specificity is determined by the volume of lexical items, the peculiarities of nominating speech objects and the choice of means for their characteristic, as well as the style of speech. The informant’s speech is characterized by violations of literary norms: it is full of adapted and unadapted lexical and morphological units of the Russian language, and improper pronunciation of words, which in general correlates with her cultural and educational level. The informant’s vocabulary is pragmatically functional and determined by the level of education, social status, type of employment and living conditions. It clearly reflects the essence and content of the linguistic personality. The vocabulary of the everyday sphere prevails, onyms (toponyms, anthroponyms, ergonyms) and a small amount of military lexicon are also registered. Emotional and evaluative interjections with a positive or negative assessment are representatives of the emotional, functional, and semantic sphere of the informant’s speech. The connotative coloration is provided, in particular, by the verbal characterization of the occupiers, which includes ethnonymic nicknames, including those based on appearance, language, and behaviour. In terms of content and values, the discursive activity of the informant, represented by referential semantic elements, is determined by extralinguistic factors and it correlates with universal values. The motivational and pragmatic aspect of linguistic personality is grounded on the desire to speak out, and includes life or situational goals, which are reflected in the discourse. It is manifested, in particular, in the manner of speech, in the choice of markers used to organize and control the discursive coherence. The analysis of the pragmatic markers included their functions, the specifics of their use and frequency.
APA, Harvard, Vancouver, ISO, and other styles
39

Vishniakou, U. A., and A. P. Kovalev. "ONLINE-SERVICES AND INFORMATION TECHNOLOGIES IN DISTANCE LEARNING." «System analysis and applied information science», no. 4 (February 8, 2018): 66–71. http://dx.doi.org/10.21122/2309-4923-2017-4-66-71.

Full text
Abstract:
The article deals with the analysis of distance learning (DL) methods, approaches, technologies, tools, the use as known online services so and developing the new ones. The terminology in area of DL is discussed and differences between correspondence course and DL are done. The development tendencies of distance learning are analyzed. Their technical and organization components are done. The course programs for DL are realizing by software which functions are shown. The typical lines of DL, their advances and lacks are conceded. As DL advances are self activity, individuality, independence and so on. As DL lacks are insufficiently individual, psychological, practical aspects, writing forms of DL and so on.Technologies and organization of DL including IT are discussed. The tutor activity is divided on two stages: decision of methodological, organizational problems and realization of distance courses. The various kind of online services in DL such as chats, web, TV, video conferences multimedia, robot learning, web-services are shown. Such IT for DL as CD, net, TV, satellite, cloud are discussed.The models of integration decisions for DL development such as Remote Procedure Calls (RPS), Enterprise Application Integration (EAL), Web-Services (WS), Enterprise Service Bus (ESB) are proposed. The content of e-learning online services including intellectual technologies and cloud computing are done. As new one integration method for DL is Semantic Web and Web-service (SWWS) with knowledge representation support on ontology base and knowledge processing on agents support are representation.
APA, Harvard, Vancouver, ISO, and other styles
40

Scianna, A., and M. La Guardia. "GLOBE BASED 3D GIS SOLUTIONS FOR VIRTUAL HERITAGE." ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences XLII-4/W10 (September 12, 2018): 171–77. http://dx.doi.org/10.5194/isprs-archives-xlii-4-w10-171-2018.

Full text
Abstract:
<p><strong>Abstract.</strong> During the last years, many solutions have been proposed for 3D Virtual Heritage representations. Recently, also new technologies for online gaming evolved, based on javascript libraries (WebGL), used to create and publish virtual interactive environments. They are based on recent Web browser’s functionalities, surpassing some limitations of VRML technologies. On the side of geospatial information, technology has evolved from desktop GIS to 2D WebGIS and globe applications. The use of globe applications is, today, very diffused due to its immediate and at the same time impressive representation of the earth surface and territories. These technologies have been, also, applied to Virtual Heritage 3D reconstructions, to improve the fruition of Cultural Heritage (CH), with the achievement of interesting results. The topic of this paper is the experimentation on the fusion between globe based and gaming technologies (in our case WebGL) that allow achieving a more user-centric and powerful solution useful for publishing 3D geospatial information of CH on Web. This choice allows obtaining GIS oriented 3D models, typical of globe applications, and, at the same time, a more immersive exploration of CH and its surrounding environment. In particular, it also gives complementary text and multimedia information on the history, architectural features of each cultural good, based on querying of semantic information. The test field of the research is the construction of the 3D GIS virtual globe model of the Manfredonic Castle of Mussomeli (Sicily-Italy), developed for PON-NEPTIS EU Project, to compare open-source technologies and commercial proprietary applications.</p>
APA, Harvard, Vancouver, ISO, and other styles
41

Petrenko, M. G., O. V. Palagin, M. O. Boyko, and S. M. Matveyshyn. "Knowledge-Oriented Tool Complex for Developing Databases of Scientific Publications and Taking into account Semantic Web Technology." Control Systems and Computers, no. 3 (299) (2022): 11–28. http://dx.doi.org/10.15407/csc.2022.03.011.

Full text
Abstract:
Introduction. The development of theories, methods, and algorithms for the discovery and formation of new knowledge has always occupied one of the central places for any researcher, especially if he is actively working on the creation of new scientific publications. It is known that there is no universal language for the formal description of concepts (knowledge) and systemology of transdisciplinary scientific research. And therefore, scientists face a number of priority problems, including the problem of significantly accelerating the receipt by a researcher of the cognitively structured information he needs from his sources. The tool complex for processing databases of scientific publications is oriented in this way to a researcher who has published from several tens to hundreds of scientific papers. We are not aware of search engines that could provide such information to a researcher in the shortest possible time. The toolkit implements Information Retrieval and Knowledge Discovery in Databases technologies with an emphasis on Semantic Web and cognitive graphics technologies and tools. The development of such a tool complex involves three stages: at the first stage, tools for implementing the complex, methods and algorithms for the interaction of the “User – Knowledge Engineer – Remote Endpoint” system and filling it with data are created; the second stage, the tasks of multimedia representation of figurative-conceptual structures are solved, which are described in scientific documents, and at the third stage — the solution of the problem of extracting new knowledge. Purpose. The purpose of our research was to further develop a tool complex for processing databases of scientific publications, which allows a scientist to significantly speed up the receipt of the necessary cognitively structured information from his sources. Methods. The methods and models used in the work are based on the information technologies of the Semantic Web and ontological engineering. Results. A tool complex for processing databases of scientific publications based on a remote endpoint based on the Apachi Jena Fuseki server, basic UML diagrams of functioning and examples of executing user requests have been developed. Conclusion. The article introduced and described the architectural and structural organization of the tool complex, which includes a local network from the user’s PC and the PC of the administrator-knowledge engineer and a remote endpoint based on the Apachi Jena Fuseki server, the main UML diagrams of the tool complex functioning and examples of executing user requests.
APA, Harvard, Vancouver, ISO, and other styles
42

Mongelli, Marialuisa, Giulia Chellini, Silvio Migliori, Antonio Perozziello, Samuele Pierattini, Marco Puccini, and Alessandro Cosma. "Comparison and integration of techniques for the study and valorisation of the Corsini Throne in Corsini Gallery in Roma." ACTA IMEKO 10, no. 1 (March 31, 2021): 40. http://dx.doi.org/10.21014/acta_imeko.v10i1.816.

Full text
Abstract:
<p>In recent years, digital technologies for enhancement and use of cultural heritage items has grown considerably. Multimedia, virtual and augmented reality and 3D reconstructions make it possible to bring the general public closer to an understanding of something that no longer exists or that is from a distant time. But digital tools can serve more than educational purposes.</p><p>To date, digitisation has become above all an essential tool in most cultural heritage projects involving conservation, restoration, documentation and research.</p><p>This article shows a process that integrates photogrammetry and structured light scans to obtain a 3D reconstruction of the Corsini Throne, preserved at the Corsini Gallery in Rome for its exhibition using a web application combined with semantic representation of metadata following FAIR principles. The process began during the development of the WeACT3 Project (Acting Together – Technology for Art, Culture, Tourism and Territory) jointly signed by the CIVITA Association, and the National Barberini and Corsini Galleries, collaborating in a partnership of several national and international enterprises. Within EcoDigit project, financed by Lazio Region, an automated web tool prototype was developed by ENEA. It is able to display 3D models with correlated scientific information to assist research activities and knowledge sharing.</p>
APA, Harvard, Vancouver, ISO, and other styles
43

Wagenpfeil, Stefan, Felix Engel, Paul Mc Kevitt, and Matthias Hemmje. "AI-Based Semantic Multimedia Indexing and Retrieval for Social Media on Smartphones." Information 12, no. 1 (January 19, 2021): 43. http://dx.doi.org/10.3390/info12010043.

Full text
Abstract:
To cope with the growing number of multimedia assets on smartphones and social media, an integrated approach for semantic indexing and retrieval is required. Here, we introduce a generic framework to fuse existing image and video analysis tools and algorithms into a unified semantic annotation, indexing and retrieval model resulting in a multimedia feature vector graph representing various levels of media content, media structures and media features. Utilizing artificial intelligence (AI) and machine learning (ML), these feature representations can provide accurate semantic indexing and retrieval. Here, we provide an overview of the generic multimedia analysis framework (GMAF) and the definition of a multimedia feature vector graph framework (MMFVGF). We also introduce AI4MMRA to detect differences, enhance semantics and refine weights in the feature vector graph. To address particular requirements on smartphones, we introduce an algorithm for fast indexing and retrieval of graph structures. Experiments to prove efficiency, effectiveness and quality of the algorithm are included. All in all, we describe a solution for highly flexible semantic indexing and retrieval that offers unique potential for applications such as social media or local applications on smartphones.
APA, Harvard, Vancouver, ISO, and other styles
44

Ermolaeva, E. N., and N. V. Potapova. "Lingvovisual Pragmatics of Pulled-Out Elements in English-Language Internet Media Texts." Vestnik NSU. Series: History and Philology 20, no. 6 (August 11, 2021): 247–62. http://dx.doi.org/10.25205/1818-7919-2021-20-6-247-262.

Full text
Abstract:
Nowadays the study of media text pragmatics is one of the research priorities in media linguistics. The pragmatic potential of a media text is actualized through the symbiosis of its verbal, nonverbal, and multimedia components, which are equally capable of having a powerful impact on mass consciousness. The article focuses on the linguovisual pragmatics of the so-called “pulled-out” elements in English-language Internet media texts, which have not been studied so far. A pulled-out element is a graphically emphasized construction within a media text, containing a very short summary of the topic covered in the article, or quotations with different references describing the position of the journalist, participants of the event or experts towards the topic, or containing additional information. Following their functional orientation and type of graphical display, the pulled-out elements are divided into three main types: callouts, pull quotes, block quotes. At the graphic level, all three types are represented by a font and font size different from the article itself; they are often located to the left or in the center of the article and can be highlighted with a colored background. The linguistic representation of the pulled-out elements is determined by their functional nature: a simple but pragmatically effective syntactic and semantic structure of the included sentences is used, in most cases implementing the “clickbait” principle. The type, content, and quantity of the pulled-out elements used depend on the genre specifics and linguistic properties of the media text. The pulled-out elements of the media text perform a number of functions, the main of which are informative, attractive, affective, integrative, and ideological. It is stated that the pulled-out elements, being an integral attribute of the modern media text and one of the ways of its creolization, effectively incorporate verbal and nonverbal (graphic) components to have a multi-layered pragmatic impact on the recipient. A comprehensive study of the nature of this phenomenon, regarding its actualization at the structural and semantic levels, is necessary and relevant for media linguistics at the present stage of its development.
APA, Harvard, Vancouver, ISO, and other styles
45

Doroschuk, Elena Sergeevna. "Innovative Potential of Photobook Format in Ethnocultural Communication." Ethnic Culture, no. 2 (3) (June 20, 2020): 68–73. http://dx.doi.org/10.31483/r-74972.

Full text
Abstract:
The features of such a widely used format as a photo book in the context of visual ethnography were reviewed in the article. It is noted that the photobook is studied as a tool for creating visual ethnographic materials that allow to conduct a research on modern cultures and ethnic groups to form a cultural identity. Methods. As the subject of analysis, modern photobooks created by the photographer from Japan Ikuru Kuwajima were selected. Results. The potential of the photobook as an author's work is revealed and its communicative potential in ethnocultural interaction is described. An ethno-photo book is defined as a format of visual communication in which each photograph has an ethnical meaning, which contributes to the creation of author's photo narration, as a specific form of reflection of an ethnos, with a representation of ethnic images. The special functions of the ethno-photo book, which are realized upon activation of the author’s principle, are highlighted: the search for their own identity; pictorial (plot) narrative about an ethnic group; creating the integrity of ethno-narration; increment of information about the ethnic group; ethnos research by means of a photo image; details of the ethnic world view; preservation of ethnic pictures of the world; comprehending the culture of another. It has been determined that a modern photo book is distinguished by documentary content and multimedia features that give its content traits of pragmatism and streaming. An ethno-photo book is manifested as a meaningful substantial work in which the author narrates a pictorial story about an ethnos through photographs, creating a holistic artistic and semantic image of the ethnos. It is concluded that all this contributes to a special emphasis of the reader on certain elements of the ethnographic image and contributes to the creation of new information about the ethnos. It is mentioned that one of the varieties of photobooks is the author's photobook, as an in-depth study of oneself in the context of the ethnicity of the territories reflected in the photo-chronicles of the photographer.
APA, Harvard, Vancouver, ISO, and other styles
46

Wagenpfeil, Stefan, Paul Mc Kevitt, and Matthias Hemmje. "Smart Multimedia Information Retrieval." Analytics 2, no. 1 (February 20, 2023): 198–224. http://dx.doi.org/10.3390/analytics2010011.

Full text
Abstract:
The area of multimedia information retrieval (MMIR) faces two major challenges: the enormously growing number of multimedia objects (i.e., images, videos, audio, and text files), and the fast increasing level of detail of these objects (e.g., the number of pixels in images). Both challenges lead to a high demand of scalability, semantic representations, and explainability of MMIR processes. Smart MMIR solves these challenges by employing graph codes as an indexing structure, attaching semantic annotations for explainability, and employing application profiling for scaling, which results in human-understandable, expressive, and interoperable MMIR. The mathematical foundation, the modeling, implementation detail, and experimental results are shown in this paper, which confirm that Smart MMIR improves MMIR in the area of efficiency, effectiveness, and human understandability.
APA, Harvard, Vancouver, ISO, and other styles
47

Zhang, Hao, Gong Wen Xu, Wan Rong Guo, Ming Hai Liao, Chun Xiu Xu, Qian Zhao, and Hong Luan Zhao. "The Application of Cross-Media Retrieval Technology Based on Ontology." Applied Mechanics and Materials 738-739 (March 2015): 1299–302. http://dx.doi.org/10.4028/www.scientific.net/amm.738-739.1299.

Full text
Abstract:
As a large number of the multimedia information emerges, the cross-media retrieval system becomes an important research focus. The cross-media retrieval system is based on the traditional content retrieval, extracting color, texture, and shape features vector of the images. A new method was carried out in this paper. Firstly, the uniform semantic representational framework was built to organize the different mode media heterogeneous characteristics. Secondly, the Ontology database representing each type of media concepts was set up. The Ontology database organizes the low level features of the multimedia objects to associate multimedia files in the semantic level. Thirdly, the cross-media retrieval algorithm based on ontology was introduced. The results of the experiment showed that this cross-media retrieval method based on the Ontology was more effective and accurate.
APA, Harvard, Vancouver, ISO, and other styles
48

KÜÇÜK, DİLEK, N. BURCUÖZGÜR, ADNAN YAZICI, and MURAT KOYUNCU. "A FUZZY CONCEPTUAL MODEL FOR MULTIMEDIA DATA WITH A TEXT-BASED AUTOMATIC ANNOTATION SCHEME." International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 17, supp01 (August 2009): 135–52. http://dx.doi.org/10.1142/s0218488509006066.

Full text
Abstract:
The size of multimedia data is increasing fast due to the abundance of multimedia applications. Modeling the semantics of the data effectively is crucial for proper management of it. In this paper, we present a fuzzy conceptual data model for multimedia data which is also generic in the sense that it can be adapted to all multimedia domains. The model takes an object-oriented approach and it handles fuzziness at different representation levels where fuzziness is inherent in multimedia applications and should be properly modeled. The proposed model also has the nice feature of representing the structural hierarchy of multimedia data as well as the spatial and temporal relations of the data. The model is applied to the news video domain and implemented as a fuzzy multimedia database system where it turns out to be effective in representing the domain and thereby provides an evidence for the general applicability of the model. The model is accompanied by an automatic multimedia annotation scheme which makes use of information extraction techniques on the corresponding multimedia texts.
APA, Harvard, Vancouver, ISO, and other styles
49

Xing, Ling, Qiang Ma, Honghai Wu, and Ping Xie. "General Multimedia Trust Authentication Framework for 5G Networks." Wireless Communications and Mobile Computing 2018 (June 28, 2018): 1–9. http://dx.doi.org/10.1155/2018/8974802.

Full text
Abstract:
Due to the varieties of services and the openness of network architectures, great challenges for information security of the 5G systems are posed. Although there exist various and heterogeneous security communication mechanisms, it is imperative to develop a more general and more ubiquitous authentication method for data security. In this paper, we propose for the 5G networks a novel multimedia authentication framework, which is based upon the trusted content representation (TCR). The framework is general and suitable for various multimedia contents, e.g., text, audio, and video. The generality of the framework is achieved by the TCR technique, which authenticates the contents’ semantics in both high and low levels. Analysis shows that the authentication framework is able to authenticate multimedia contents effectively in terms of active and passive authenticating ways.
APA, Harvard, Vancouver, ISO, and other styles
50

Straccia, U. "Reasoning within Fuzzy Description Logics." Journal of Artificial Intelligence Research 14 (April 1, 2001): 137–66. http://dx.doi.org/10.1613/jair.813.

Full text
Abstract:
Description Logics (DLs) are suitable, well-known, logics for managing structured knowledge. They allow reasoning about individuals and well defined concepts, i.e., set of individuals with common properties. The experience in using DLs in applications has shown that in many cases we would like to extend their capabilities. In particular, their use in the context of Multimedia Information Retrieval (MIR) leads to the convincement that such DLs should allow the treatment of the inherent imprecision in multimedia object content representation and retrieval. In this paper we will present a fuzzy extension of ALC, combining Zadeh's fuzzy logic with a classical DL. In particular, concepts becomes fuzzy and, thus, reasoning about imprecise concepts is supported. We will define its syntax, its semantics, describe its properties and present a constraint propagation calculus for reasoning in it.
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!

To the bibliography