Auswahl der wissenschaftlichen Literatur zum Thema „Audio content analysis“

Geben Sie eine Quelle nach APA, MLA, Chicago, Harvard und anderen Zitierweisen an

Wählen Sie eine Art der Quelle aus:

Machen Sie sich mit den Listen der aktuellen Artikel, Bücher, Dissertationen, Berichten und anderer wissenschaftlichen Quellen zum Thema "Audio content analysis" bekannt.

Neben jedem Werk im Literaturverzeichnis ist die Option "Zur Bibliographie hinzufügen" verfügbar. Nutzen Sie sie, wird Ihre bibliographische Angabe des gewählten Werkes nach der nötigen Zitierweise (APA, MLA, Harvard, Chicago, Vancouver usw.) automatisch gestaltet.

Sie können auch den vollen Text der wissenschaftlichen Publikation im PDF-Format herunterladen und eine Online-Annotation der Arbeit lesen, wenn die relevanten Parameter in den Metadaten verfügbar sind.

Zeitschriftenartikel zum Thema "Audio content analysis"

1

Raj, Bhiksha, Paris Smaragdis, Malcolm Slaney, Chung-Hsien Wu, Liming Chen und Hyoung-Gook Kim. „Scalable Audio-Content Analysis“. EURASIP Journal on Audio, Speech, and Music Processing 2010 (2010): 1–2. http://dx.doi.org/10.1155/2010/467278.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
2

Lu, Lie, und Alan Hanjalic. „Audio Keywords Discovery for Text-Like Audio Content Analysis and Retrieval“. IEEE Transactions on Multimedia 10, Nr. 1 (Januar 2008): 74–85. http://dx.doi.org/10.1109/tmm.2007.911304.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
3

Lie Lu, Hong-Jiang Zhang und Hao Jiang. „Content analysis for audio classification and segmentation“. IEEE Transactions on Speech and Audio Processing 10, Nr. 7 (Oktober 2002): 504–16. http://dx.doi.org/10.1109/tsa.2002.804546.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
4

Li, Y., und C. Dorai. „Instructional Video Content Analysis Using Audio Information“. IEEE Transactions on Audio, Speech and Language Processing 14, Nr. 6 (November 2006): 2264–74. http://dx.doi.org/10.1109/tasl.2006.872602.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
5

JBARI, ATMAN, ABDELLAH ADIB und DRISS ABOUTAJDINE. „BLIND AUDIO SEPARATION AND CONTENT ANALYSIS IN THE TIME-SCALE DOMAIN“. International Journal of Semantic Computing 01, Nr. 03 (September 2007): 307–18. http://dx.doi.org/10.1142/s1793351x07000184.

Der volle Inhalt der Quelle
Annotation:
In this paper, we address the problem of Blind Audio Separation (BAS) by content evaluation of audio signals in the Time-Scale domain. Most of the proposed techniques rely on independence or at least uncorrelation assumption of the source signals exploiting mutual information or second/high order statistics. Here, we present a new algorithm, for instantaneous mixture, that considers only different time-scale source signature properties. Our approach lies in wavelet transformation advantages and proposes for this a new representation; Spatial Time Scale Distributions (STSD), to characterize energy and interference of the observed data. The BAS will be allowed by joint diagonalization, without a prior orthogonality constraint, of a set of selected diagonal STSD matrices. Several criteria will be proposed, in the transformed time-scale space, to assess the separated audio signal contents. We describe the logistics of the separation and the content rating, thus an exemplary implementation on synthetic signals and real audio recordings show the high efficiency of the proposed technique to restore the audio signal contents.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
6

Magalhaes, Tairone Nunes, Felippe Brandão Barros und Mauricio Alves Loureiro. „Iracema: a Python library for audio content analysis“. Revista de Informática Teórica e Aplicada 27, Nr. 4 (23.12.2020): 127–38. http://dx.doi.org/10.22456/2175-2745.107202.

Der volle Inhalt der Quelle
Annotation:
Iracema is a Python library that aims to provide models for the extraction of meaningful informationfrom recordings of monophonic pieces of music, for purposes of research in music performance. With this objective in mind, we propose an architecture that will provide to users an abstraction level that simplifies the manipulation of different kinds of time series, as well as the extraction of segments from them. In this paper we: (1) introduce some key concepts at the core of the proposed architecture; (2) describe the current functionalities of the package; (3) give some examples of the application programming interface; and (4) give some brief examples of audio analysis using the system.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
7

Tzanetakis, George, und Perry Cook. „MARSYAS: a framework for audio analysis“. Organised Sound 4, Nr. 3 (16.11.2000): 169–75. http://dx.doi.org/10.1017/s1355771800003071.

Der volle Inhalt der Quelle
Annotation:
Existing audio tools handle the increasing amount of computer audio data inadequately. The typical tape-recorder paradigm for audio interfaces is inflexible and time consuming, especially for large data sets. On the other hand, completely automatic audio analysis and annotation is impossible using current techniques. Alternative solutions are semi-automatic user interfaces that let users interact with sound in flexible ways based on content. This approach offers significant advantages over manual browsing, annotation and retrieval. Furthermore, it can be implemented using existing techniques for audio content analysis in restricted domains. This paper describes MARSYAS, a framework for experimenting, evaluating and integrating such techniques. As a test for the architecture, some recently proposed techniques have been implemented and tested. In addition, a new method for temporal segmentation based on audio texture is described. This method is combined with audio analysis techniques and used for hierarchical browsing, classification and annotation of audio files.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
8

Barker, Alexander B., Kathy Whittamore, John Britton, Rachael L. Murray und Jo Cranwell. „A content analysis of alcohol content in UK television“. Journal of Public Health 41, Nr. 3 (14.10.2018): 462–69. http://dx.doi.org/10.1093/pubmed/fdy142.

Der volle Inhalt der Quelle
Annotation:
Abstract Background Exposure to audio-visual alcohol content in media is associated with subsequent alcohol use in young people, but the extent of exposure contained in UK free-to-air prime-time television has not been explored since 2010. We report an analysis of alcohol content in a sample of UK free-to-air prime-time television broadcasts in 2015 and compare this with a similar analysis from 2010. Methods Content analysis of all programmes and advertisement/trailer breaks broadcast on the five national UK free-to-air channels in the UK between 6 and 10 pm during three separate weeks in September, October and November 2015. Results Alcohol content occurred in over 50% of all programmes broadcast and almost 50% of all advert/trailer periods between programmes. The majority of alcohol content occurred before the 9 pm watershed. Branding occurred in 3% of coded intervals and involved 122 brands, though three brands (Heineken, Corona and Fosters) accounted for almost half of all brand appearances. Conclusion Audio-visual alcohol content, including branding, is prevalent in UK television, and is therefore a potential driver of alcohol use in young people. These findings are virtually unchanged from our earlier analysis of programme content from 2010.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
9

Martyniuk, Tetiana, Maksym Mykytiuk und Mykola Zaitsev. „FEATURES OF ANALYSIS OF MULTICHANNEL AUDIO SIGNALSFEATURES OF ANALYSIS OF MULTICHANNEL AUDIO SIGNALS“. ГРААЛЬ НАУКИ, Nr. 2-3 (09.04.2021): 302–5. http://dx.doi.org/10.36074/grail-of-science.02.04.2021.061.

Der volle Inhalt der Quelle
Annotation:
The rapid growth of audio content has led to the need to use tools for analysis and quality control of audio signals using software and hardware and modules. The fastest-growing industry is software and programming languages.The Python programming language today has the most operational and visual capabilities for working with sound. When developing programs for computational signal analysis, it provides the optimal balance of high and low-level programming functions. Compared to Matlab or other similar solutions, Python is free and allows you to create standalone applications without the need for large, permanently installed files and a virtual environment.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
10

Yao Wang, Zhu Liu und Jin-Cheng Huang. „Multimedia content analysis-using both audio and visual clues“. IEEE Signal Processing Magazine 17, Nr. 6 (2000): 12–36. http://dx.doi.org/10.1109/79.888862.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen

Dissertationen zum Thema "Audio content analysis"

1

Yapp, Lawrence. „Content-based indexing of MPEG video through the analysis of the accompanying audio /“. Thesis, Connect to this title online; UW restricted, 1997. http://hdl.handle.net/1773/5835.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
2

Melih, Kathy, und n/a. „Audio Source Separation Using Perceptual Principles for Content-Based Coding and Information Management“. Griffith University. School of Information Technology, 2004. http://www4.gu.edu.au:8080/adt-root/public/adt-QGU20050114.081327.

Der volle Inhalt der Quelle
Annotation:
The information age has brought with it a dual problem. In the first place, the ready access to mechanisms to capture and store vast amounts of data in all forms (text, audio, image and video), has resulted in a continued demand for ever more efficient means to store and transmit this data. In the second, the rapidly increasing store demands effective means to structure and access the data in an efficient and meaningful manner. In terms of audio data, the first challenge has traditionally been the realm of audio compression research that has focused on statistical, unstructured audio representations that obfuscate the inherent structure and semantic content of the underlying data. This has only served to further complicate the resolution of the second challenge resulting in access mechanisms that are either impractical to implement, too inflexible for general application or too low level for the average user. Thus, an artificial dichotomy has been created from what is in essence a dual problem. The founding motivation of this thesis is that, although the hypermedia model has been identified as the ideal, cognitively justified method for organising data, existing audio data representations and coding models provide little, if any, support for, or resemblance to, this model. It is the contention of the author that any successful attempt to create hyperaudio must resolve this schism, addressing both storage and information management issues simultaneously. In order to achieve this aim, an audio representation must be designed that provides compact data storage while, at the same time, revealing the inherent structure of the underlying data. Thus it is the aim of this thesis to present a representation designed with these factors in mind. Perhaps the most difficult hurdle in the way of achieving the aims of content-based audio coding and information management is that of auditory source separation. The MPEG committee has noted this requirement during the development of its MPEG-7 standard, however, the mechanics of "how" to achieve auditory source separation were left as an open research question. This same committee proposed that MPEG-7 would "support descriptors that can act as handles referring directly to the data, to allow manipulation of the multimedia material." While meta-data tags are a part solution to this problem, these cannot allow manipulation of audio material down to the level of individual sources when several simultaneous sources exist in a recording. In order to achieve this aim, the data themselves must be encoded in such a manner that allows these descriptors to be formed. Thus, content-based coding is obviously required. In the case of audio, this is impossible to achieve without effecting auditory source separation. Auditory source separation is the concern of computational auditory scene analysis (CASA). However, the findings of CASA research have traditionally been restricted to a limited domain. To date, the only real application of CASA research to what could loosely be classified as information management has been in the area of signal enhancement for automatic speech recognition systems. In these systems, a CASA front end serves as a means of separating the target speech from the background "noise". As such, the design of a CASA-based approach, as presented in this thesis, to one of the most significant challenges facing audio information management research represents a significant contribution to the field of information management. Thus, this thesis unifies research from three distinct fields in an attempt to resolve some specific and general challenges faced by all three. It describes an audio representation that is based on a sinusoidal model from which low-level auditory primitive elements are extracted. The use of a sinusoidal representation is somewhat contentious with the modern trend in CASA research tending toward more complex approaches in order to resolve issues relating to co-incident partials. However, the choice of a sinusoidal representation has been validated by the demonstration of a method to resolve many of these issues. The majority of the thesis contributes several algorithms to organise the low-level primitives into low-level auditory objects that may form the basis of nodes or link anchor points in a hyperaudio structure. Finally, preliminary investigations in the representation’s suitability for coding and information management tasks are outlined as directions for future research.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
3

Scheuneman, Scott Isabel. „"Deadly Women": Examining (Audio)Visual (Re)Presentations of Violent Women and Girls in Infotainment Media“. Thesis, Université d'Ottawa / University of Ottawa, 2015. http://hdl.handle.net/10393/33453.

Der volle Inhalt der Quelle
Annotation:
Women have historically been the subject of stereotypes – especially criminalized women as they are constructed in the mass media. These stereotypes become particularly problematic when they are invoked in infotainment media – a genre that combines information and entertainment and presents itself as primarily factual. As such, ideological messages delivered through infotainment are also (re)presented as truthful and may be more likely to be taken up by an unquestioning audience. This research aimed to answer the following research question: How does infotainment portray women who commit serious violent crime? In order to answer this question, a qualitative content analysis was employed and “Deadly Women”, a televised infotainment series that narrates and re-enacts true crime stories of women who kill, was selected as a case study. The sample consisted of previously identified typologies: mothers who kill their children, women who kill their partners, adolescent girls who kill, and vigilantes who kill their abusers. Stemming from a critical feminist framework, the analysis revealed that Deadly Women relies on two primary trajectories to explain the violence committed by women and girls. While both trajectories emphasized gendered stereotypes that involved emotionality and mental health issues, they were nonetheless distinct. The first trajectory evoked narratives of the ‘emotionless’ and ‘psychopathic’ perpetrator; while the second trajectory characterized the offender as overly ‘emotional’ and ‘depressed’. These trajectories, along with their related variables, problematically (re)presented violent women and girls in simplistic and dualistic manners that served to obscure rather than to clarify the circumstances surrounding their crimes.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
4

Gómez, Gutiérrez Emilia. „Tonal description of music audio signals“. Doctoral thesis, Universitat Pompeu Fabra, 2006. http://hdl.handle.net/10803/7537.

Der volle Inhalt der Quelle
Annotation:
Aquesta tesi doctoral proposa i avalua un enfocament computacional per a la descripció automàtica dels aspectes tonals de la música a partir de l'anàlisi de senyals d'-audio polifòniques. Aquests mètodes es centren en el càlcul de descriptors de distribucions de notes, en l'estimació de tonalitat d'una peça, en la visualització de l'evolució del centre tonal o en la mesura de la similitud tonal entre dues peces diferents.
Aquesta tesi contribueix substancialment al camp de la descripció tonal mitjançant mètodes computacionals: a) Proporciona una revisió multidisciplinària dels sistemes d'estimació de la tonalitat; b) Defineix una sèrie de requeriments que han de complir els descriptors tonals de baix nivell; c) Proporciona una avaluació quantitativa i modular dels mètodes proposats; d) Justifica la idea de que per a certes aplicacions es poden fer servir mètodes que treballen amb partitures sense la necessitat de realitzar una transcripció automàtica e) Estén la literatura existent que treballa amb música clàssica a altres generes musicals; f) Demostra la utilitat dels descriptors tonals per a comparar peces musicals; g) Proporciona un algoritme optimitzat que es fa servir dins un sistema real per a visualització, cerca i recomanació musical, que treballa amb més d'un milió de obres musicals.
Esta tesis doctoral propone y evalúa un enfoque computacional para la descripción automática de aspectos tonales de la música a partir del análisis de señales de audio polifónicas. Estos métodos se centran en calcular descriptores de distribución de notas, en estimar la tonalidad de una pieza, en visualizar la evolución del centro tonal o en medir la similitud tonal entre dos piezas diferentes.
Esta tesis contribuye sustancialmente al campo de la descripción tonal mediante métodos computacionales: a) Proporciona una revisión multidisciplinar de los sistemas de estimación de la tonalidad; b) Define una serie de requerimientos que deben cumplir los descriptores tonales de bajo nivel; c) Proporciona una evaluación cuantitativa y modular de los métodos propuestos; d) Respalda la idea de que para ciertas aplicaciones no es necesario obtener una transcripción perfecta de la partitura, y que se pueden utilizar métodos que trabajan con partituras sin realizar una transcripción automática; e) Extiende la literatura existente que trabaja con música clásica a otros géneros musicales; f) Demuestra la utilidad de los descriptores tonales para comparar piezas musicales; g) Proporciona un algoritmo optimizado que se utiliza en un sistema real para visualización, búsqueda y recomendación musical, que trabaja con mas de un millón de piezas musicales.
This doctoral dissertation proposes and evaluates a computational approach for the automatic description of tonal aspects of music from the analysis of polyphonic audio signals. These algorithms focus on the computation of pitch class distributions descriptors, the estimation of the key of a piece, the visualization of the evolution of its tonal center or the measurement of the similarity between two different musical pieces.
This dissertation substantially contributes to the field of computational tonal description: a) It provides a multidisciplinary review of tonal induction systems; b) It defines a set of requirements for low-level tonal features; c) It provides a quantitative and modular evaluation of the proposed methods; d) It contributes to bridge the gap between audio and symbolic-oriented methods without the need of a perfect transcription; e) It extents current literature dealing with classical music to other musical genres; f) It shows the usefulness of tonal descriptors for music similarity; g) It provides an optimized method which is used in a real system for music visualization and retrieval, working with over a million of musical pieces.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
5

Walters, Thomas C. „Auditory-based processing of communication sounds“. Thesis, University of Cambridge, 2011. https://www.repository.cam.ac.uk/handle/1810/240577.

Der volle Inhalt der Quelle
Annotation:
This thesis examines the possible benefits of adapting a biologically-inspired model of human auditory processing as part of a machine-hearing system. Features were generated by an auditory model, and used as input to machine learning systems to determine the content of the sound. Features were generated using the auditory image model (AIM) and were used for speech recognition and audio search. AIM comprises processing to simulate the human cochlea, and a 'strobed temporal integration' process which generates a stabilised auditory image (SAI) from the input sound. The communication sounds which are produced by humans, other animals, and many musical instruments take the form of a pulse-resonance signal: pulses excite resonances in the body, and the resonance following each pulse contains information both about the type of object producing the sound and its size. In the case of humans, vocal tract length (VTL) determines the size properties of the resonance. In the speech recognition experiments, an auditory filterbank was combined with a Gaussian fitting procedure to produce features which are invariant to changes in speaker VTL. These features were compared against standard mel-frequency cepstral coefficients (MFCCs) in a size-invariant syllable recognition task. The VTL-invariant representation was found to produce better results than MFCCs when the system was trained on syllables from simulated talkers of one range of VTLs and tested on those from simulated talkers with a different range of VTLs. The image stabilisation process of strobed temporal integration was analysed. Based on the properties of the auditory filterbank being used, theoretical constraints were placed on the properties of the dynamic thresholding function used to perform strobe detection. These constraints were used to specify a simple, yet robust, strobe detection algorithm. The syllable recognition system described above was then extended to produce features from profiles of the SAI and tested with the same syllable database as before. For clean speech, performance of the features was comparable to that of those generated from the filterbank output. However when pink noise was added to the stimuli, performance dropped more slowly as a function of signal-to-noise ratio when using the SAI-based AIM features, than when using either the filterbank-based features or the MFCCs, demonstrating the noise-robustness properties of the SAI representation. The properties of the auditory filterbank in AIM were also analysed. Three models of the cochlea were considered: the static gammatone filterbank, dynamic compressive gammachirp (dcGC) and the pole-zero filter cascade (PZFC). The dcGC and gammatone are standard filterbank models, whereas the PZFC is a filter cascade, which more accurately models signal propagation in the cochlea. However, while the architecture of the filterbanks is different, they have all been successfully fitted to psychophysical masking data from humans. The abilities of the filterbanks to measure pitch strength were assessed, using stimuli which evoke a weak pitch percept in humans, in order to ascertain whether there is any benefit in the use of the more computationally efficient PZFC.Finally, a complete sound effects search system using auditory features was constructed in collaboration with Google research. Features were computed from the SAI by sampling the SAI space with boxes of different scales. Vector quantization (VQ) was used to convert this multi-scale representation to a sparse code. The 'passive-aggressive model for image retrieval' (PAMIR) was used to learn the relationships between dictionary words and these auditory codewords. These auditory sparse codes were compared against sparse codes generated from MFCCs, and the best performance was found when using the auditory features.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
6

Ong, Bee Suan. „Structural analysis and segmentation of music signals“. Doctoral thesis, Universitat Pompeu Fabra, 2007. http://hdl.handle.net/10803/7544.

Der volle Inhalt der Quelle
Annotation:
Con la reciente explosión cuantitativa de bibliotecas y colecciones de música en formato
digital, la descripción del contenido desempeña un papel fundamental para una gestión y
búsqueda eficientes de archivos de audio. La presente tesis doctoral pretende hacer un
análisis automático de la estructura de piezas musicales a partir del análisis de una
grabación, es decir, extraer una descripción estructural a partir de señales musicales
polifónicas. En la medida en que la repetición y transformación de la estructura de la
música genera una identificación única de una obra musical, extraer automáticamente
esta información puede vincular entre sí descripciones de bajo y alto nivel de una señal
musical y puede proporcionar al usuario una manera más efectiva de interactuar con un
contenido de audio. Para algunas aplicaciones basadas en contenido, encontrar los límites
de determinados segmentos de una grabación resulta indispensable. Así pues, también se
investiga la segmentación temporal de audio a nivel semántico, al igual que la
identificación de extractos representativos de una señal musical que pueda servir como
resumen de la misma. Para ello se emplea una técnica de análisis a un nivel de
abstracción más elevado que permite obtener una mejor división en segmentos. Tanto
desde el punto de vista teórico como práctico, esta investigación no sólo ayuda a
incrementar nuestro conocimiento respecto a la estructura musical, sino que también
proporciona una ayuda al examen y a la valoración musical.
With the recent explosion in the quantity of digital audio libraries and databases, content
descriptions play an important role in efficiently managing and retrieving audio files.
This doctoral research aims to discover and extract structural description from
polyphonic music signals. As repetition and transformations of music structure creates a
unique identity of music itself, extracting such information can link low-level and higherlevel
descriptions of music signal and provide better quality access plus powerful way of
interacting with audio content. Finding appropriate boundary truncations is indispensable
in certain content-based applications. Thus, temporal audio segmentation at the semantic
level and the identification of representative excerpts from music audio signal are also
investigated. We make use of higher-level analysis technique for better segment
truncation. From both theoretical and practical points of view, this research not only
helps in increasing our knowledge of music structure but also facilitates in time-saving
browsing and assessing of music.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
7

Nazareno, Claudio. „Regulation and the promotion of national audio-visual content in the era of digital convergence : a comparative analysis of the United Kingdom, France and Brazil“. Thesis, University of Roehampton, 2013. https://pure.roehampton.ac.uk/portal/en/studentthesis/regulation-and-the-promotion-of-national-audiovisual-content-in-the-era-of-digital-convergence(c52f8024-66fd-4525-8258-c78532077836).html.

Der volle Inhalt der Quelle
Annotation:
This research investigates how the production and distribution of nationally produced television programmes, films and other audiovisual content can be promoted by regulation. The study identifies current regulatory tools to promote national content and differences in policy approaches. It also identifies new issues resulting from the substantial transformation the media environment has undergone in recent years. The audiovisual sector today is characterised by an abundance of television channels and telecommunication services and by ongoing digital convergence, all impacting on the effectiveness and rationale of content regulation. Focused on the UK, France and Brazil, this comparative research investigates the political, regulatory, socio-cultural, economic, technological and market changes of the communication services in the recent decades and how this has impacted on the provision of national audiovisual content. The starting point of the analysis are the 1980s, when broadcasting started to be gradually liberalised, and it concludes in the 2000s, when information technologies, telecommunications and broadcasting converge into interconnected, complementary and supplementary services. This research offers a three step method of analysis which contributes to a new understanding of the mechanisms and implications involved in the production and distribution of nationally produced content in digital times. In the first step, the various socio-cultural aims of communication services as enacted in legislation are analysed. The second step explores the consequent regulatory tools for the fulfilment of those objectives. The third step provides a market evaluation of the audiovisual industries, which broadcasters and other distribution platforms provide which kind of content, and also looks at audiences‟ viewing preferences to get a better understanding of what type of content should be supported through legislation. As outcomes, this research proposes to academics, policymakers and regulators a new definition for national content and a series of regulatory actions for fostering national audiovisual industries in times of digital convergence.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
8

Laborová, Anna. „Extrakce parametrů pro výzkum interpretačního výkonu“. Master's thesis, Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií, 2021. http://www.nusl.cz/ntk/nusl-442575.

Der volle Inhalt der Quelle
Annotation:
Different music performances of the same piece may significantly differ from each other. Not only the composer and the score defines the listener’s music experience, but the music performance itself is an integral part of this experience. Four parameter classes can be used to describe a performance objectively: tempo and timing, loudness (dynamics), timbre, and pitch. Each of the individual parameters or their combination can generate a unique characteristic performance. The extraction of such objective parameters is one of the difficulties in the field of Music Performance Analysis and Music Information Retrieval. The submitted work summarizes knowledge and methods from both of the fields. The system is applied to extract data from 31 string quartet performances of 2. movement Lento of String Quartet no. 12 F major (1893) by czech romantic composer Antonín Dvořák (1841–1904).
APA, Harvard, Vancouver, ISO und andere Zitierweisen
9

Nesvadba, Jan. „Segmentation sémantique des contenus audio-visuels“. Bordeaux 1, 2007. http://www.theses.fr/2007BOR13456.

Der volle Inhalt der Quelle
Annotation:
Dans ce travail, nous avons mis au point une méthode de segmentation des contenus audiovisuels applicable aux appareils de stockage domestiques pour cela nous avons expérimenté un système distribué pour l'analyse du contenu composé de modules individuels d'analyse : les service unit. L'un entre eux a été dédié à la caractérisation des éléments hors contenu, i. E. Les publicités, et offre de bonnes perfermances. Parallélement, nous avons testé différents détecteurs de changement de plans afin de retenir le meilleur d'ente eux pour la suite. Puis, nous avons proposé une étude des règles de production des films, i. E. Grammaire de films, qui a permis de définir les séquences de parallel shot. Nous avons, ainsi, testé quatre méthodes de regroupement basées similarité afin de retenir la meilleure d'entre elles pour la suite. Finalement, nous avons recherché différentes méthodes de détection des frontières de scènes et avons obtenu les meilleurs résultats en combinant une méthode basée couleur avec un critère de longueur de plan. Ce dernier offre des performances justifiant son intégration dans les appareils de stockage grand public.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
10

Khaledi, Soheila. „Corporate Risk Disclosure: A Content Analysis of Swedish Interim Reports“. Thesis, Uppsala universitet, Företagsekonomiska institutionen, 2014. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-231965.

Der volle Inhalt der Quelle
Annotation:
The aim of this research is to examine the determinants of the level of corporate risk disclosure (CRD) in the interim reports of Swedish non-financial companies. A quantitative research approach is used, the sample data of which consist of 166 firms with 4,849 interim reports over a 10-year period. By utilizing the notion of risk and its definition, I have distinguished three categories of risk, namely risk as uncertainty, risk as threat and risk as opportunity. A systematic content analysis is conducted with the use of a software program, which is specifically designed for this purpose. The number of sentences that contain keywords related to the three risk categories is counted as the total CRD score, which is transformed to the disclosure index. I have examined the impact of firms’ characteristics and corporate governance mechanisms on the level of CRD based on agency theory. The ordinary least squares regression method with  control for fixed year effects is used to analyse the data, which show that firm size and audit committee have a positive relationship with the level of corporate risk disclosure. The result demonstrates also that there is a negative relationship between family ownership and the level of CRD, and an insignificant relationship between leverage and the level of CRD.
APA, Harvard, Vancouver, ISO und andere Zitierweisen

Bücher zum Thema "Audio content analysis"

1

Lerch, Alexander. Audio content analysis: An introduction. Hoboken, N.J: Wiley, 2012.

Den vollen Inhalt der Quelle finden
APA, Harvard, Vancouver, ISO und andere Zitierweisen
2

Lerch, Alexander. An Introduction to Audio Content Analysis. Hoboken, NJ, USA: John Wiley & Sons, Inc., 2012. http://dx.doi.org/10.1002/9781118393550.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
3

Voronina, Larisa. Financial accounting: theory and practice. ru: INFRA-M Academic Publishing LLC., 2021. http://dx.doi.org/10.12737/1171982.

Der volle Inhalt der Quelle
Annotation:
The textbook is based on the normative acts of the system of regulatory regulation of accounting currently in force in the Russian Federation in accordance with the latest amendments to the Tax Code of the Russian Federation and the Labor Code of the Russian Federation. The basics of the organization of accounting and the principles of its differentiation into financial and managerial accounting are considered. The methodology of accounting for the assets, liabilities and capital of the organization is described, the main aspects of taxation are presented. Numerous practical examples, questions for self-examination and interviews, tests and workshops are given for all chapters. The content of the textbook and the professional competencies formed based on the results of its study meet the requirements of the federal state educational standards of higher education of the latest generation, the Main Professional Educational Program of Higher Education (OPOP HE) "Accounting, analysis and audit" in the direction of training 38.03.01 "Economics" and the working program of the discipline "Accounting financial Accounting" (MFUA). For students of economic universities and faculties, students of the system of advanced training and retraining, for practitioners of accounting services, audit companies and administrative and managerial personnel.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
4

Sil'vestrov, Sergey, Vladimir Starovoytov, Vladimir Bauer, Aleksandr Selivanov, Vladimir Lepskiy, Aleksandr Raykov, Svetlana Lipina et al. Strategic planning in the public sector of the economy. ru: INFRA-M Academic Publishing LLC., 2021. http://dx.doi.org/10.12737/1081855.

Der volle Inhalt der Quelle
Annotation:
This collective monograph continues a series of scientific studies and publications on the problems of strategic planning, which have been carried out for several years at the Financial University under the Government of the Russian Federation with the involvement of specialists from other scientific and educational organizations. A series of research papers in 2017-2019 was devoted to the analysis of strategic development risks and the analysis of global strategic planning practice, the general methodology of strategic planning and forecasting (including in the context of ensuring Russia's economic security), the approach to the formation of life cycles of preparation and revision of strategic planning documents and their comparative analysis, the experience of coordinating budget, project and process types of management and financing, monitoring risks and threats, the use of new information tools in the strategic planning complex, including blockchain, and also naturally develops such aspects of previous research as analysis of world practice, coordination of budget, project and process types of management and financing, the use of information technologies. However, at the same time, a special task was set — to approach a comprehensive analysis of the strategic planning process as a whole, especially to study its documentary support as the core of the organization of this process and the implementation of its results in the practice of public administration, as well as to analyze the scientific support of strategic planning as an essential aspect of all strategic planning and strategic management activities in the entirety of its aspects (goal setting, forecast, design, programming, planning, control and audit). It is intended for specialists from the humanities, natural sciences and technical fields of knowledge focused on management and development problems, for undergraduates and postgraduates, as well as for a wide audience of management practitioners, including those related to strategic planning processes in the public sector.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
5

David, Bers, und Tree Stephen 1949-, Hrsg. The music of Psalms, Proverbs and Job in the Hebrew Bible: A revised theory of musical accents in the Hebrew Bible : general analysis, bibliography, table of contents : CD with complete text and audio files of musically reconstructed Psalms, Proverbs and Job, sung by computer speech synthesis. Wiesbaden: Harrassowitz Verlag, 2011.

Den vollen Inhalt der Quelle finden
APA, Harvard, Vancouver, ISO und andere Zitierweisen
6

Mucchielli, Alex. Étude des communications: Information et communication interne : pour de nouveaux audits. 2. Aufl. Paris: Armand Colin, 2005.

Den vollen Inhalt der Quelle finden
APA, Harvard, Vancouver, ISO und andere Zitierweisen
7

Mucchielli, Alex. Étude des communications: Approche par la modélisation des relations. Paris: A. Colin, 2004.

Den vollen Inhalt der Quelle finden
APA, Harvard, Vancouver, ISO und andere Zitierweisen
8

Mucchielli, Alex. Étude des communications: Nouvelles approches. Paris: Colin, 2006.

Den vollen Inhalt der Quelle finden
APA, Harvard, Vancouver, ISO und andere Zitierweisen
9

Mucchielli, Alex. Étude des communications: Approche systémique dans les organisations. Paris: Colin, 2004.

Den vollen Inhalt der Quelle finden
APA, Harvard, Vancouver, ISO und andere Zitierweisen
10

Mucchielli, Alex. Étude des communications: Approche par la contextualisation. Paris: Colin, 2005.

Den vollen Inhalt der Quelle finden
APA, Harvard, Vancouver, ISO und andere Zitierweisen

Buchteile zum Thema "Audio content analysis"

1

Lu, Lie, und Alan Hanjalic. „Audio Content Analysis“. In Encyclopedia of Database Systems, 198–201. New York, NY: Springer New York, 2018. http://dx.doi.org/10.1007/978-1-4614-8265-9_1528.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
2

Lu, Lie, und Alan Hanjalic. „Audio Content Analysis“. In Encyclopedia of Database Systems, 154–56. Boston, MA: Springer US, 2009. http://dx.doi.org/10.1007/978-0-387-39940-9_1528.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
3

Lu, Lie, und Alan Hanjalic. „Audio Content Analysis“. In Encyclopedia of Database Systems, 1–3. New York, NY: Springer New York, 2016. http://dx.doi.org/10.1007/978-1-4899-7993-3_1528-2.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
4

Smaragdis, Paris, Regunathan Radhakrishnan und Kevin W. Wilson. „Context Extraction Through Audio Signal Analysis“. In Multimedia Content Analysis, 1–34. Boston, MA: Springer US, 2008. http://dx.doi.org/10.1007/978-0-387-76569-3_1.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
5

Lu, Lie, und Alan Hanjalic. „Audio Content Discovery: An Unsupervised Approach“. In Multimedia Content Analysis, 1–39. Boston, MA: Springer US, 2008. http://dx.doi.org/10.1007/978-0-387-76569-3_4.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
6

Zhang, Tong, und C. C. Jay Kuo. „Audio Feature Analysis“. In Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing, 35–54. Boston, MA: Springer US, 2001. http://dx.doi.org/10.1007/978-1-4757-3339-6_3.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
7

De Santo, M., G. Percannella, C. Sansone und M. Vento. „Unsupervised News Video Segmentation by Combined Audio-Video Analysis“. In Multimedia Content Representation, Classification and Security, 273–81. Berlin, Heidelberg: Springer Berlin Heidelberg, 2006. http://dx.doi.org/10.1007/11848035_37.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
8

Zhang, Tong, und C. C. Jay Kuo. „Image Sequence Analysis“. In Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing, 85–104. Boston, MA: Springer US, 2001. http://dx.doi.org/10.1007/978-1-4757-3339-6_6.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
9

Żwan, Paweł, Przemysław Sobala, Piotr Szczuko und Andrzej Czyzewski. „Audio Content Analysis in the Urban Area Telemonitoring System“. In Multimedia Services in Intelligent Environments, 227–39. Berlin, Heidelberg: Springer Berlin Heidelberg, 2010. http://dx.doi.org/10.1007/978-3-642-13396-1_10.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
10

Kang, Chan-Mi, und Joong-Hwan Baek. „Audio Content Analysis for Understanding Structures of Scene in Video“. In Lecture Notes in Computer Science, 1213–18. Berlin, Heidelberg: Springer Berlin Heidelberg, 2006. http://dx.doi.org/10.1007/11816157_151.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen

Konferenzberichte zum Thema "Audio content analysis"

1

Pfeiffer, Silvia, Stephan Fischer und Wolfgang Effelsberg. „Automatic audio content analysis“. In the fourth ACM international conference. New York, New York, USA: ACM Press, 1996. http://dx.doi.org/10.1145/244130.244139.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
2

Kumar, Anurag, und Bhiksha Raj. „Weakly supervised scalable audio content analysis“. In 2016 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 2016. http://dx.doi.org/10.1109/icme.2016.7552989.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
3

Mitral, Vikramjit, Daniel Garcia-Romero und Carol Y. Espy-Wilson. „Language detection in audio content analysis“. In ICASSP 2008 - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2008. http://dx.doi.org/10.1109/icassp.2008.4518058.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
4

Wu, Chung-Ping, Po-Chyi Su und C. C. Jay Kuo. „Robust and efficient digital audio watermarking using audio content analysis“. In Electronic Imaging, herausgegeben von Ping W. Wong und Edward J. Delp III. SPIE, 2000. http://dx.doi.org/10.1117/12.384992.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
5

Arias, Jose Anibal, Regine Andre-Obrecht und Jerome Farinas. „Automatic low-dimensional analysis of audio databases“. In 2008 International Workshop on Content-Based Multimedia Indexing. IEEE, 2008. http://dx.doi.org/10.1109/cbmi.2008.4564996.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
6

Lu, Lie, und Alan Hanjalic. „Towards optimal audio "keywords" detection for audio content analysis and discovery“. In the 14th annual ACM international conference. New York, New York, USA: ACM Press, 2006. http://dx.doi.org/10.1145/1180639.1180825.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
7

Zhu, Rong, Haojun Ai und Ruimin Hu. „Audio Segmentation in AAC Domain for Content Analysis“. In 2009 5th International Conference on Wireless Communications, Networking and Mobile Computing (WiCOM). IEEE, 2009. http://dx.doi.org/10.1109/wicom.2009.5301778.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
8

Magalhaes, Tairone, Felippe Barros und Maurício Loureiro. „Iracema: a Python library for audio content analysis“. In Simpósio Brasileiro de Computação Musical. Sociedade Brasileira de Computação - SBC, 2019. http://dx.doi.org/10.5753/sbcm.2019.10418.

Der volle Inhalt der Quelle
Annotation:
This paper introduces the alpha version of a Python library called Iracema, which aims to provide models for the extraction of meaningful information from recordings of monophonic pieces of music, for purposes of research in music performance. With this objective in mind, we propose an architecture that will provide to users an abstraction level that simplifies the manipulation of different kinds of time series, as well as the extraction of segments from them. In this paper we: (1) introduce some key concepts at the core of the proposed architecture; (2) list the current functionalities of the package; (3) give some examples of the application programming interface; and (4) give some brief examples of audio analysis using the system.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
9

Kereliuk, Corey, Bob L. Sturm und Jan Larsen. „Deep learning, audio adversaries, and music content analysis“. In 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). IEEE, 2015. http://dx.doi.org/10.1109/waspaa.2015.7336950.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen
10

Mitra, Vikramjit, Daniel Garcia-Romero und Carol Y. Espy-Wilson. „Language and genre detection in audio content analysis“. In Interspeech 2008. ISCA: ISCA, 2008. http://dx.doi.org/10.21437/interspeech.2008-621.

Der volle Inhalt der Quelle
APA, Harvard, Vancouver, ISO und andere Zitierweisen

Berichte der Organisationen zum Thema "Audio content analysis"

1

Baluk, Nadia, Natalia Basij, Larysa Buk und Olha Vovchanska. VR/AR-TECHNOLOGIES – NEW CONTENT OF THE NEW MEDIA. Ivan Franko National University of Lviv, Februar 2021. http://dx.doi.org/10.30970/vjo.2021.49.11074.

Der volle Inhalt der Quelle
Annotation:
The article analyzes the peculiarities of the media content shaping and transformation in the convergent dimension of cross-media, taking into account the possibilities of augmented reality. With the help of the principles of objectivity, complexity and reliability in scientific research, a number of general scientific and special methods are used: method of analysis, synthesis, generalization, method of monitoring, observation, problem-thematic, typological and discursive methods. According to the form of information presentation, such types of media content as visual, audio, verbal and combined are defined and characterized. The most important in journalism is verbal content, it is the one that carries the main information load. The dynamic development of converged media leads to the dominance of image and video content; the likelihood of increasing the secondary content of the text increases. Given the market situation, the effective information product is a combined content that combines text with images, spreadsheets with video, animation with infographics, etc. Increasing number of new media are using applications and website platforms to interact with recipients. To proceed, the peculiarities of the new content of new media with the involvement of augmented reality are determined. Examples of successful interactive communication between recipients, the leading news agencies and commercial structures are provided. The conditions for effective use of VR / AR-technologies in the media content of new media, the involvement of viewers in changing stories with augmented reality are determined. The so-called immersive effect with the use of VR / AR-technologies involves complete immersion, immersion of the interested audience in the essence of the event being relayed. This interaction can be achieved through different types of VR video interactivity. One of the most important results of using VR content is the spatio-temporal and emotional immersion of viewers in the plot. The recipient turns from an external observer into an internal one; but his constant participation requires that the user preferences are taken into account. Factors such as satisfaction, positive reinforcement, empathy, and value influence the choice of VR / AR content by viewers.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
2

Phillips, Jake. Understanding the impact of inspection on probation. Sheffield Hallam University, 2021. http://dx.doi.org/10.7190/shu.hkcij.05.2021.

Der volle Inhalt der Quelle
Annotation:
This research sought to understand the impact of probation inspection on probation policy, practice and practitioners. This important but neglected area of study has significant ramifications because the Her Majesty’s Inspectorate of Probation has considerable power to influence policy through its inspection regime and research activities. The study utilised a mixed methodological approach comprising observations of inspections and interviews with people who work in probation, the Inspectorate and external stakeholders. In total, 77 people were interviewed or took part in focus groups. Probation practitioners, managers and leaders were interviewed in the weeks after an inspection to find out how they experienced the process of inspection. Staff at HMI Probation were interviewed to understand what inspection is for and how it works. External stakeholders representing people from the voluntary sector, politics and other non-departmental bodies were interviewed to find out how they used the work of inspection in their own roles. Finally, leaders within the National Probation Service and Her Majesty’s Prisons and Probation Service were interviewed to see how inspection impacts on policy more broadly. The data were analysed thematically with five key themes being identified. Overall, participants were positive about the way inspection is carried out in the field of probation. The main findings are: 1. Inspection places a burden on practitioners and organisations. Practitioners talked about the anxiety that a looming inspection created and how management teams created additional pressures which were hard to cope with on top of already high workloads. Staff responsible for managing the inspection and with leadership positions talked about the amount of time the process of inspection took up. Importantly, inspection was seen to take people away from their day jobs and meant other priorities were side-lined, even if temporarily. However, the case interviews that practitioners take part in were seen as incredibly valuable exercises which gave staff the opportunity to reflect on their practice and receive positive feedback and validation for their work. 2. Providers said that the findings and conclusions from inspections were often accurate and, to some extent, unsurprising. However, they sometimes find it difficult to implement recommendations due to reports failing to take context into account. Negative reports have a serious impact on staff morale, especially for CRCs and there was concern about the impact of negative findings on a provider’s reputation. 3. External stakeholders value the work of the Inspectorate. The Inspectorate is seen to generate highly valid and meaningful data which stakeholders can use in their own roles. This can include pushing for policy reform or holding government to account from different perspectives. In particular, thematic inspections were seen to be useful here. 4. The regulatory landscape in probation is complex with an array of actors working to hold providers to account. When compared to other forms of regulation such as audit or contract management the Inspectorate was perceived positively due to its methodological approach as well as the way it reflects the values of probation itself. 5. Overall, the inspectorate appears to garner considerable legitimacy from those it inspects. This should, in theory, support the way it can impact on policy and practice. There are some areas for development here though such as more engagement with service users. While recognising that the Inspectorate has made a concerted effort to do this in the last two years participants all felt that more needs to be done to increase that trust between the inspectorate and service users. Overall, the Inspectorate was seen to be independent and 3 impartial although this belief was less prevalent amongst people in CRCs who argued that the Inspectorate has been biased towards supporting its own arguments around reversing the now failed policy of Transforming Rehabilitation. There was some debate amongst participants about how the Inspectorate could, or should, enforce compliance with its recommendations although most people were happy with the primarily relational way of encouraging compliance with sanctions for non-compliance being considered relatively unnecessary. To conclude, the work of the Inspectorate has a significant impact on probation policy, practice and practitioners. The majority of participants were positive about the process of inspection and the Inspectorate more broadly, notwithstanding some of the issues raised in the findings. There are some developments which the Inspectorate could consider to reduce the burden inspection places on providers and practitioners and enhance its impact such as amending the frequency of inspection, improving the feedback given to practitioners and providing more localised feedback, and working to reduce or limit perceptions of bias amongst people in CRCs. The Inspectorate could also do more to capture the impact it has on providers and practitioners – both positive and negative - through existing procedures that are in place such as post-case interview surveys and tracking the implementation of recommendations.
APA, Harvard, Vancouver, ISO und andere Zitierweisen
Wir bieten Rabatte auf alle Premium-Pläne für Autoren, deren Werke in thematische Literatursammlungen aufgenommen wurden. Kontaktieren Sie uns, um einen einzigartigen Promo-Code zu erhalten!

Zur Bibliographie