Academic literature on the topic 'Audio content analysis'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'Audio content analysis.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Journal articles on the topic "Audio content analysis"

1

Raj, Bhiksha, Paris Smaragdis, Malcolm Slaney, Chung-Hsien Wu, Liming Chen, and Hyoung-Gook Kim. "Scalable Audio-Content Analysis." EURASIP Journal on Audio, Speech, and Music Processing 2010 (2010): 1–2. http://dx.doi.org/10.1155/2010/467278.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Lu, Lie, and Alan Hanjalic. "Audio Keywords Discovery for Text-Like Audio Content Analysis and Retrieval." IEEE Transactions on Multimedia 10, no. 1 (January 2008): 74–85. http://dx.doi.org/10.1109/tmm.2007.911304.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Lie Lu, Hong-Jiang Zhang, and Hao Jiang. "Content analysis for audio classification and segmentation." IEEE Transactions on Speech and Audio Processing 10, no. 7 (October 2002): 504–16. http://dx.doi.org/10.1109/tsa.2002.804546.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Li, Y., and C. Dorai. "Instructional Video Content Analysis Using Audio Information." IEEE Transactions on Audio, Speech and Language Processing 14, no. 6 (November 2006): 2264–74. http://dx.doi.org/10.1109/tasl.2006.872602.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

JBARI, ATMAN, ABDELLAH ADIB, and DRISS ABOUTAJDINE. "BLIND AUDIO SEPARATION AND CONTENT ANALYSIS IN THE TIME-SCALE DOMAIN." International Journal of Semantic Computing 01, no. 03 (September 2007): 307–18. http://dx.doi.org/10.1142/s1793351x07000184.

Full text
Abstract:
In this paper, we address the problem of Blind Audio Separation (BAS) by content evaluation of audio signals in the Time-Scale domain. Most of the proposed techniques rely on independence or at least uncorrelation assumption of the source signals exploiting mutual information or second/high order statistics. Here, we present a new algorithm, for instantaneous mixture, that considers only different time-scale source signature properties. Our approach lies in wavelet transformation advantages and proposes for this a new representation; Spatial Time Scale Distributions (STSD), to characterize energy and interference of the observed data. The BAS will be allowed by joint diagonalization, without a prior orthogonality constraint, of a set of selected diagonal STSD matrices. Several criteria will be proposed, in the transformed time-scale space, to assess the separated audio signal contents. We describe the logistics of the separation and the content rating, thus an exemplary implementation on synthetic signals and real audio recordings show the high efficiency of the proposed technique to restore the audio signal contents.
APA, Harvard, Vancouver, ISO, and other styles
6

Magalhaes, Tairone Nunes, Felippe Brandão Barros, and Mauricio Alves Loureiro. "Iracema: a Python library for audio content analysis." Revista de Informática Teórica e Aplicada 27, no. 4 (December 23, 2020): 127–38. http://dx.doi.org/10.22456/2175-2745.107202.

Full text
Abstract:
Iracema is a Python library that aims to provide models for the extraction of meaningful informationfrom recordings of monophonic pieces of music, for purposes of research in music performance. With this objective in mind, we propose an architecture that will provide to users an abstraction level that simplifies the manipulation of different kinds of time series, as well as the extraction of segments from them. In this paper we: (1) introduce some key concepts at the core of the proposed architecture; (2) describe the current functionalities of the package; (3) give some examples of the application programming interface; and (4) give some brief examples of audio analysis using the system.
APA, Harvard, Vancouver, ISO, and other styles
7

Tzanetakis, George, and Perry Cook. "MARSYAS: a framework for audio analysis." Organised Sound 4, no. 3 (November 16, 2000): 169–75. http://dx.doi.org/10.1017/s1355771800003071.

Full text
Abstract:
Existing audio tools handle the increasing amount of computer audio data inadequately. The typical tape-recorder paradigm for audio interfaces is inflexible and time consuming, especially for large data sets. On the other hand, completely automatic audio analysis and annotation is impossible using current techniques. Alternative solutions are semi-automatic user interfaces that let users interact with sound in flexible ways based on content. This approach offers significant advantages over manual browsing, annotation and retrieval. Furthermore, it can be implemented using existing techniques for audio content analysis in restricted domains. This paper describes MARSYAS, a framework for experimenting, evaluating and integrating such techniques. As a test for the architecture, some recently proposed techniques have been implemented and tested. In addition, a new method for temporal segmentation based on audio texture is described. This method is combined with audio analysis techniques and used for hierarchical browsing, classification and annotation of audio files.
APA, Harvard, Vancouver, ISO, and other styles
8

Barker, Alexander B., Kathy Whittamore, John Britton, Rachael L. Murray, and Jo Cranwell. "A content analysis of alcohol content in UK television." Journal of Public Health 41, no. 3 (October 14, 2018): 462–69. http://dx.doi.org/10.1093/pubmed/fdy142.

Full text
Abstract:
Abstract Background Exposure to audio-visual alcohol content in media is associated with subsequent alcohol use in young people, but the extent of exposure contained in UK free-to-air prime-time television has not been explored since 2010. We report an analysis of alcohol content in a sample of UK free-to-air prime-time television broadcasts in 2015 and compare this with a similar analysis from 2010. Methods Content analysis of all programmes and advertisement/trailer breaks broadcast on the five national UK free-to-air channels in the UK between 6 and 10 pm during three separate weeks in September, October and November 2015. Results Alcohol content occurred in over 50% of all programmes broadcast and almost 50% of all advert/trailer periods between programmes. The majority of alcohol content occurred before the 9 pm watershed. Branding occurred in 3% of coded intervals and involved 122 brands, though three brands (Heineken, Corona and Fosters) accounted for almost half of all brand appearances. Conclusion Audio-visual alcohol content, including branding, is prevalent in UK television, and is therefore a potential driver of alcohol use in young people. These findings are virtually unchanged from our earlier analysis of programme content from 2010.
APA, Harvard, Vancouver, ISO, and other styles
9

Martyniuk, Tetiana, Maksym Mykytiuk, and Mykola Zaitsev. "FEATURES OF ANALYSIS OF MULTICHANNEL AUDIO SIGNALSFEATURES OF ANALYSIS OF MULTICHANNEL AUDIO SIGNALS." ГРААЛЬ НАУКИ, no. 2-3 (April 9, 2021): 302–5. http://dx.doi.org/10.36074/grail-of-science.02.04.2021.061.

Full text
Abstract:
The rapid growth of audio content has led to the need to use tools for analysis and quality control of audio signals using software and hardware and modules. The fastest-growing industry is software and programming languages.The Python programming language today has the most operational and visual capabilities for working with sound. When developing programs for computational signal analysis, it provides the optimal balance of high and low-level programming functions. Compared to Matlab or other similar solutions, Python is free and allows you to create standalone applications without the need for large, permanently installed files and a virtual environment.
APA, Harvard, Vancouver, ISO, and other styles
10

Yao Wang, Zhu Liu, and Jin-Cheng Huang. "Multimedia content analysis-using both audio and visual clues." IEEE Signal Processing Magazine 17, no. 6 (2000): 12–36. http://dx.doi.org/10.1109/79.888862.

Full text
APA, Harvard, Vancouver, ISO, and other styles

Dissertations / Theses on the topic "Audio content analysis"

1

Yapp, Lawrence. "Content-based indexing of MPEG video through the analysis of the accompanying audio /." Thesis, Connect to this title online; UW restricted, 1997. http://hdl.handle.net/1773/5835.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Melih, Kathy, and n/a. "Audio Source Separation Using Perceptual Principles for Content-Based Coding and Information Management." Griffith University. School of Information Technology, 2004. http://www4.gu.edu.au:8080/adt-root/public/adt-QGU20050114.081327.

Full text
Abstract:
The information age has brought with it a dual problem. In the first place, the ready access to mechanisms to capture and store vast amounts of data in all forms (text, audio, image and video), has resulted in a continued demand for ever more efficient means to store and transmit this data. In the second, the rapidly increasing store demands effective means to structure and access the data in an efficient and meaningful manner. In terms of audio data, the first challenge has traditionally been the realm of audio compression research that has focused on statistical, unstructured audio representations that obfuscate the inherent structure and semantic content of the underlying data. This has only served to further complicate the resolution of the second challenge resulting in access mechanisms that are either impractical to implement, too inflexible for general application or too low level for the average user. Thus, an artificial dichotomy has been created from what is in essence a dual problem. The founding motivation of this thesis is that, although the hypermedia model has been identified as the ideal, cognitively justified method for organising data, existing audio data representations and coding models provide little, if any, support for, or resemblance to, this model. It is the contention of the author that any successful attempt to create hyperaudio must resolve this schism, addressing both storage and information management issues simultaneously. In order to achieve this aim, an audio representation must be designed that provides compact data storage while, at the same time, revealing the inherent structure of the underlying data. Thus it is the aim of this thesis to present a representation designed with these factors in mind. Perhaps the most difficult hurdle in the way of achieving the aims of content-based audio coding and information management is that of auditory source separation. The MPEG committee has noted this requirement during the development of its MPEG-7 standard, however, the mechanics of "how" to achieve auditory source separation were left as an open research question. This same committee proposed that MPEG-7 would "support descriptors that can act as handles referring directly to the data, to allow manipulation of the multimedia material." While meta-data tags are a part solution to this problem, these cannot allow manipulation of audio material down to the level of individual sources when several simultaneous sources exist in a recording. In order to achieve this aim, the data themselves must be encoded in such a manner that allows these descriptors to be formed. Thus, content-based coding is obviously required. In the case of audio, this is impossible to achieve without effecting auditory source separation. Auditory source separation is the concern of computational auditory scene analysis (CASA). However, the findings of CASA research have traditionally been restricted to a limited domain. To date, the only real application of CASA research to what could loosely be classified as information management has been in the area of signal enhancement for automatic speech recognition systems. In these systems, a CASA front end serves as a means of separating the target speech from the background "noise". As such, the design of a CASA-based approach, as presented in this thesis, to one of the most significant challenges facing audio information management research represents a significant contribution to the field of information management. Thus, this thesis unifies research from three distinct fields in an attempt to resolve some specific and general challenges faced by all three. It describes an audio representation that is based on a sinusoidal model from which low-level auditory primitive elements are extracted. The use of a sinusoidal representation is somewhat contentious with the modern trend in CASA research tending toward more complex approaches in order to resolve issues relating to co-incident partials. However, the choice of a sinusoidal representation has been validated by the demonstration of a method to resolve many of these issues. The majority of the thesis contributes several algorithms to organise the low-level primitives into low-level auditory objects that may form the basis of nodes or link anchor points in a hyperaudio structure. Finally, preliminary investigations in the representation’s suitability for coding and information management tasks are outlined as directions for future research.
APA, Harvard, Vancouver, ISO, and other styles
3

Scheuneman, Scott Isabel. ""Deadly Women": Examining (Audio)Visual (Re)Presentations of Violent Women and Girls in Infotainment Media." Thesis, Université d'Ottawa / University of Ottawa, 2015. http://hdl.handle.net/10393/33453.

Full text
Abstract:
Women have historically been the subject of stereotypes – especially criminalized women as they are constructed in the mass media. These stereotypes become particularly problematic when they are invoked in infotainment media – a genre that combines information and entertainment and presents itself as primarily factual. As such, ideological messages delivered through infotainment are also (re)presented as truthful and may be more likely to be taken up by an unquestioning audience. This research aimed to answer the following research question: How does infotainment portray women who commit serious violent crime? In order to answer this question, a qualitative content analysis was employed and “Deadly Women”, a televised infotainment series that narrates and re-enacts true crime stories of women who kill, was selected as a case study. The sample consisted of previously identified typologies: mothers who kill their children, women who kill their partners, adolescent girls who kill, and vigilantes who kill their abusers. Stemming from a critical feminist framework, the analysis revealed that Deadly Women relies on two primary trajectories to explain the violence committed by women and girls. While both trajectories emphasized gendered stereotypes that involved emotionality and mental health issues, they were nonetheless distinct. The first trajectory evoked narratives of the ‘emotionless’ and ‘psychopathic’ perpetrator; while the second trajectory characterized the offender as overly ‘emotional’ and ‘depressed’. These trajectories, along with their related variables, problematically (re)presented violent women and girls in simplistic and dualistic manners that served to obscure rather than to clarify the circumstances surrounding their crimes.
APA, Harvard, Vancouver, ISO, and other styles
4

Gómez, Gutiérrez Emilia. "Tonal description of music audio signals." Doctoral thesis, Universitat Pompeu Fabra, 2006. http://hdl.handle.net/10803/7537.

Full text
Abstract:
Aquesta tesi doctoral proposa i avalua un enfocament computacional per a la descripció automàtica dels aspectes tonals de la música a partir de l'anàlisi de senyals d'-audio polifòniques. Aquests mètodes es centren en el càlcul de descriptors de distribucions de notes, en l'estimació de tonalitat d'una peça, en la visualització de l'evolució del centre tonal o en la mesura de la similitud tonal entre dues peces diferents.
Aquesta tesi contribueix substancialment al camp de la descripció tonal mitjançant mètodes computacionals: a) Proporciona una revisió multidisciplinària dels sistemes d'estimació de la tonalitat; b) Defineix una sèrie de requeriments que han de complir els descriptors tonals de baix nivell; c) Proporciona una avaluació quantitativa i modular dels mètodes proposats; d) Justifica la idea de que per a certes aplicacions es poden fer servir mètodes que treballen amb partitures sense la necessitat de realitzar una transcripció automàtica e) Estén la literatura existent que treballa amb música clàssica a altres generes musicals; f) Demostra la utilitat dels descriptors tonals per a comparar peces musicals; g) Proporciona un algoritme optimitzat que es fa servir dins un sistema real per a visualització, cerca i recomanació musical, que treballa amb més d'un milió de obres musicals.
Esta tesis doctoral propone y evalúa un enfoque computacional para la descripción automática de aspectos tonales de la música a partir del análisis de señales de audio polifónicas. Estos métodos se centran en calcular descriptores de distribución de notas, en estimar la tonalidad de una pieza, en visualizar la evolución del centro tonal o en medir la similitud tonal entre dos piezas diferentes.
Esta tesis contribuye sustancialmente al campo de la descripción tonal mediante métodos computacionales: a) Proporciona una revisión multidisciplinar de los sistemas de estimación de la tonalidad; b) Define una serie de requerimientos que deben cumplir los descriptores tonales de bajo nivel; c) Proporciona una evaluación cuantitativa y modular de los métodos propuestos; d) Respalda la idea de que para ciertas aplicaciones no es necesario obtener una transcripción perfecta de la partitura, y que se pueden utilizar métodos que trabajan con partituras sin realizar una transcripción automática; e) Extiende la literatura existente que trabaja con música clásica a otros géneros musicales; f) Demuestra la utilidad de los descriptores tonales para comparar piezas musicales; g) Proporciona un algoritmo optimizado que se utiliza en un sistema real para visualización, búsqueda y recomendación musical, que trabaja con mas de un millón de piezas musicales.
This doctoral dissertation proposes and evaluates a computational approach for the automatic description of tonal aspects of music from the analysis of polyphonic audio signals. These algorithms focus on the computation of pitch class distributions descriptors, the estimation of the key of a piece, the visualization of the evolution of its tonal center or the measurement of the similarity between two different musical pieces.
This dissertation substantially contributes to the field of computational tonal description: a) It provides a multidisciplinary review of tonal induction systems; b) It defines a set of requirements for low-level tonal features; c) It provides a quantitative and modular evaluation of the proposed methods; d) It contributes to bridge the gap between audio and symbolic-oriented methods without the need of a perfect transcription; e) It extents current literature dealing with classical music to other musical genres; f) It shows the usefulness of tonal descriptors for music similarity; g) It provides an optimized method which is used in a real system for music visualization and retrieval, working with over a million of musical pieces.
APA, Harvard, Vancouver, ISO, and other styles
5

Walters, Thomas C. "Auditory-based processing of communication sounds." Thesis, University of Cambridge, 2011. https://www.repository.cam.ac.uk/handle/1810/240577.

Full text
Abstract:
This thesis examines the possible benefits of adapting a biologically-inspired model of human auditory processing as part of a machine-hearing system. Features were generated by an auditory model, and used as input to machine learning systems to determine the content of the sound. Features were generated using the auditory image model (AIM) and were used for speech recognition and audio search. AIM comprises processing to simulate the human cochlea, and a 'strobed temporal integration' process which generates a stabilised auditory image (SAI) from the input sound. The communication sounds which are produced by humans, other animals, and many musical instruments take the form of a pulse-resonance signal: pulses excite resonances in the body, and the resonance following each pulse contains information both about the type of object producing the sound and its size. In the case of humans, vocal tract length (VTL) determines the size properties of the resonance. In the speech recognition experiments, an auditory filterbank was combined with a Gaussian fitting procedure to produce features which are invariant to changes in speaker VTL. These features were compared against standard mel-frequency cepstral coefficients (MFCCs) in a size-invariant syllable recognition task. The VTL-invariant representation was found to produce better results than MFCCs when the system was trained on syllables from simulated talkers of one range of VTLs and tested on those from simulated talkers with a different range of VTLs. The image stabilisation process of strobed temporal integration was analysed. Based on the properties of the auditory filterbank being used, theoretical constraints were placed on the properties of the dynamic thresholding function used to perform strobe detection. These constraints were used to specify a simple, yet robust, strobe detection algorithm. The syllable recognition system described above was then extended to produce features from profiles of the SAI and tested with the same syllable database as before. For clean speech, performance of the features was comparable to that of those generated from the filterbank output. However when pink noise was added to the stimuli, performance dropped more slowly as a function of signal-to-noise ratio when using the SAI-based AIM features, than when using either the filterbank-based features or the MFCCs, demonstrating the noise-robustness properties of the SAI representation. The properties of the auditory filterbank in AIM were also analysed. Three models of the cochlea were considered: the static gammatone filterbank, dynamic compressive gammachirp (dcGC) and the pole-zero filter cascade (PZFC). The dcGC and gammatone are standard filterbank models, whereas the PZFC is a filter cascade, which more accurately models signal propagation in the cochlea. However, while the architecture of the filterbanks is different, they have all been successfully fitted to psychophysical masking data from humans. The abilities of the filterbanks to measure pitch strength were assessed, using stimuli which evoke a weak pitch percept in humans, in order to ascertain whether there is any benefit in the use of the more computationally efficient PZFC.Finally, a complete sound effects search system using auditory features was constructed in collaboration with Google research. Features were computed from the SAI by sampling the SAI space with boxes of different scales. Vector quantization (VQ) was used to convert this multi-scale representation to a sparse code. The 'passive-aggressive model for image retrieval' (PAMIR) was used to learn the relationships between dictionary words and these auditory codewords. These auditory sparse codes were compared against sparse codes generated from MFCCs, and the best performance was found when using the auditory features.
APA, Harvard, Vancouver, ISO, and other styles
6

Ong, Bee Suan. "Structural analysis and segmentation of music signals." Doctoral thesis, Universitat Pompeu Fabra, 2007. http://hdl.handle.net/10803/7544.

Full text
Abstract:
Con la reciente explosión cuantitativa de bibliotecas y colecciones de música en formato
digital, la descripción del contenido desempeña un papel fundamental para una gestión y
búsqueda eficientes de archivos de audio. La presente tesis doctoral pretende hacer un
análisis automático de la estructura de piezas musicales a partir del análisis de una
grabación, es decir, extraer una descripción estructural a partir de señales musicales
polifónicas. En la medida en que la repetición y transformación de la estructura de la
música genera una identificación única de una obra musical, extraer automáticamente
esta información puede vincular entre sí descripciones de bajo y alto nivel de una señal
musical y puede proporcionar al usuario una manera más efectiva de interactuar con un
contenido de audio. Para algunas aplicaciones basadas en contenido, encontrar los límites
de determinados segmentos de una grabación resulta indispensable. Así pues, también se
investiga la segmentación temporal de audio a nivel semántico, al igual que la
identificación de extractos representativos de una señal musical que pueda servir como
resumen de la misma. Para ello se emplea una técnica de análisis a un nivel de
abstracción más elevado que permite obtener una mejor división en segmentos. Tanto
desde el punto de vista teórico como práctico, esta investigación no sólo ayuda a
incrementar nuestro conocimiento respecto a la estructura musical, sino que también
proporciona una ayuda al examen y a la valoración musical.
With the recent explosion in the quantity of digital audio libraries and databases, content
descriptions play an important role in efficiently managing and retrieving audio files.
This doctoral research aims to discover and extract structural description from
polyphonic music signals. As repetition and transformations of music structure creates a
unique identity of music itself, extracting such information can link low-level and higherlevel
descriptions of music signal and provide better quality access plus powerful way of
interacting with audio content. Finding appropriate boundary truncations is indispensable
in certain content-based applications. Thus, temporal audio segmentation at the semantic
level and the identification of representative excerpts from music audio signal are also
investigated. We make use of higher-level analysis technique for better segment
truncation. From both theoretical and practical points of view, this research not only
helps in increasing our knowledge of music structure but also facilitates in time-saving
browsing and assessing of music.
APA, Harvard, Vancouver, ISO, and other styles
7

Nazareno, Claudio. "Regulation and the promotion of national audio-visual content in the era of digital convergence : a comparative analysis of the United Kingdom, France and Brazil." Thesis, University of Roehampton, 2013. https://pure.roehampton.ac.uk/portal/en/studentthesis/regulation-and-the-promotion-of-national-audiovisual-content-in-the-era-of-digital-convergence(c52f8024-66fd-4525-8258-c78532077836).html.

Full text
Abstract:
This research investigates how the production and distribution of nationally produced television programmes, films and other audiovisual content can be promoted by regulation. The study identifies current regulatory tools to promote national content and differences in policy approaches. It also identifies new issues resulting from the substantial transformation the media environment has undergone in recent years. The audiovisual sector today is characterised by an abundance of television channels and telecommunication services and by ongoing digital convergence, all impacting on the effectiveness and rationale of content regulation. Focused on the UK, France and Brazil, this comparative research investigates the political, regulatory, socio-cultural, economic, technological and market changes of the communication services in the recent decades and how this has impacted on the provision of national audiovisual content. The starting point of the analysis are the 1980s, when broadcasting started to be gradually liberalised, and it concludes in the 2000s, when information technologies, telecommunications and broadcasting converge into interconnected, complementary and supplementary services. This research offers a three step method of analysis which contributes to a new understanding of the mechanisms and implications involved in the production and distribution of nationally produced content in digital times. In the first step, the various socio-cultural aims of communication services as enacted in legislation are analysed. The second step explores the consequent regulatory tools for the fulfilment of those objectives. The third step provides a market evaluation of the audiovisual industries, which broadcasters and other distribution platforms provide which kind of content, and also looks at audiences‟ viewing preferences to get a better understanding of what type of content should be supported through legislation. As outcomes, this research proposes to academics, policymakers and regulators a new definition for national content and a series of regulatory actions for fostering national audiovisual industries in times of digital convergence.
APA, Harvard, Vancouver, ISO, and other styles
8

Laborová, Anna. "Extrakce parametrů pro výzkum interpretačního výkonu." Master's thesis, Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií, 2021. http://www.nusl.cz/ntk/nusl-442575.

Full text
Abstract:
Different music performances of the same piece may significantly differ from each other. Not only the composer and the score defines the listener’s music experience, but the music performance itself is an integral part of this experience. Four parameter classes can be used to describe a performance objectively: tempo and timing, loudness (dynamics), timbre, and pitch. Each of the individual parameters or their combination can generate a unique characteristic performance. The extraction of such objective parameters is one of the difficulties in the field of Music Performance Analysis and Music Information Retrieval. The submitted work summarizes knowledge and methods from both of the fields. The system is applied to extract data from 31 string quartet performances of 2. movement Lento of String Quartet no. 12 F major (1893) by czech romantic composer Antonín Dvořák (1841–1904).
APA, Harvard, Vancouver, ISO, and other styles
9

Nesvadba, Jan. "Segmentation sémantique des contenus audio-visuels." Bordeaux 1, 2007. http://www.theses.fr/2007BOR13456.

Full text
Abstract:
Dans ce travail, nous avons mis au point une méthode de segmentation des contenus audiovisuels applicable aux appareils de stockage domestiques pour cela nous avons expérimenté un système distribué pour l'analyse du contenu composé de modules individuels d'analyse : les service unit. L'un entre eux a été dédié à la caractérisation des éléments hors contenu, i. E. Les publicités, et offre de bonnes perfermances. Parallélement, nous avons testé différents détecteurs de changement de plans afin de retenir le meilleur d'ente eux pour la suite. Puis, nous avons proposé une étude des règles de production des films, i. E. Grammaire de films, qui a permis de définir les séquences de parallel shot. Nous avons, ainsi, testé quatre méthodes de regroupement basées similarité afin de retenir la meilleure d'entre elles pour la suite. Finalement, nous avons recherché différentes méthodes de détection des frontières de scènes et avons obtenu les meilleurs résultats en combinant une méthode basée couleur avec un critère de longueur de plan. Ce dernier offre des performances justifiant son intégration dans les appareils de stockage grand public.
APA, Harvard, Vancouver, ISO, and other styles
10

Khaledi, Soheila. "Corporate Risk Disclosure: A Content Analysis of Swedish Interim Reports." Thesis, Uppsala universitet, Företagsekonomiska institutionen, 2014. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-231965.

Full text
Abstract:
The aim of this research is to examine the determinants of the level of corporate risk disclosure (CRD) in the interim reports of Swedish non-financial companies. A quantitative research approach is used, the sample data of which consist of 166 firms with 4,849 interim reports over a 10-year period. By utilizing the notion of risk and its definition, I have distinguished three categories of risk, namely risk as uncertainty, risk as threat and risk as opportunity. A systematic content analysis is conducted with the use of a software program, which is specifically designed for this purpose. The number of sentences that contain keywords related to the three risk categories is counted as the total CRD score, which is transformed to the disclosure index. I have examined the impact of firms’ characteristics and corporate governance mechanisms on the level of CRD based on agency theory. The ordinary least squares regression method with  control for fixed year effects is used to analyse the data, which show that firm size and audit committee have a positive relationship with the level of corporate risk disclosure. The result demonstrates also that there is a negative relationship between family ownership and the level of CRD, and an insignificant relationship between leverage and the level of CRD.
APA, Harvard, Vancouver, ISO, and other styles

Books on the topic "Audio content analysis"

1

Lerch, Alexander. Audio content analysis: An introduction. Hoboken, N.J: Wiley, 2012.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
2

Lerch, Alexander. An Introduction to Audio Content Analysis. Hoboken, NJ, USA: John Wiley & Sons, Inc., 2012. http://dx.doi.org/10.1002/9781118393550.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Voronina, Larisa. Financial accounting: theory and practice. ru: INFRA-M Academic Publishing LLC., 2021. http://dx.doi.org/10.12737/1171982.

Full text
Abstract:
The textbook is based on the normative acts of the system of regulatory regulation of accounting currently in force in the Russian Federation in accordance with the latest amendments to the Tax Code of the Russian Federation and the Labor Code of the Russian Federation. The basics of the organization of accounting and the principles of its differentiation into financial and managerial accounting are considered. The methodology of accounting for the assets, liabilities and capital of the organization is described, the main aspects of taxation are presented. Numerous practical examples, questions for self-examination and interviews, tests and workshops are given for all chapters. The content of the textbook and the professional competencies formed based on the results of its study meet the requirements of the federal state educational standards of higher education of the latest generation, the Main Professional Educational Program of Higher Education (OPOP HE) "Accounting, analysis and audit" in the direction of training 38.03.01 "Economics" and the working program of the discipline "Accounting financial Accounting" (MFUA). For students of economic universities and faculties, students of the system of advanced training and retraining, for practitioners of accounting services, audit companies and administrative and managerial personnel.
APA, Harvard, Vancouver, ISO, and other styles
4

Sil'vestrov, Sergey, Vladimir Starovoytov, Vladimir Bauer, Aleksandr Selivanov, Vladimir Lepskiy, Aleksandr Raykov, Svetlana Lipina, et al. Strategic planning in the public sector of the economy. ru: INFRA-M Academic Publishing LLC., 2021. http://dx.doi.org/10.12737/1081855.

Full text
Abstract:
This collective monograph continues a series of scientific studies and publications on the problems of strategic planning, which have been carried out for several years at the Financial University under the Government of the Russian Federation with the involvement of specialists from other scientific and educational organizations. A series of research papers in 2017-2019 was devoted to the analysis of strategic development risks and the analysis of global strategic planning practice, the general methodology of strategic planning and forecasting (including in the context of ensuring Russia's economic security), the approach to the formation of life cycles of preparation and revision of strategic planning documents and their comparative analysis, the experience of coordinating budget, project and process types of management and financing, monitoring risks and threats, the use of new information tools in the strategic planning complex, including blockchain, and also naturally develops such aspects of previous research as analysis of world practice, coordination of budget, project and process types of management and financing, the use of information technologies. However, at the same time, a special task was set — to approach a comprehensive analysis of the strategic planning process as a whole, especially to study its documentary support as the core of the organization of this process and the implementation of its results in the practice of public administration, as well as to analyze the scientific support of strategic planning as an essential aspect of all strategic planning and strategic management activities in the entirety of its aspects (goal setting, forecast, design, programming, planning, control and audit). It is intended for specialists from the humanities, natural sciences and technical fields of knowledge focused on management and development problems, for undergraduates and postgraduates, as well as for a wide audience of management practitioners, including those related to strategic planning processes in the public sector.
APA, Harvard, Vancouver, ISO, and other styles
5

David, Bers, and Tree Stephen 1949-, eds. The music of Psalms, Proverbs and Job in the Hebrew Bible: A revised theory of musical accents in the Hebrew Bible : general analysis, bibliography, table of contents : CD with complete text and audio files of musically reconstructed Psalms, Proverbs and Job, sung by computer speech synthesis. Wiesbaden: Harrassowitz Verlag, 2011.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
6

Mucchielli, Alex. Étude des communications: Information et communication interne : pour de nouveaux audits. 2nd ed. Paris: Armand Colin, 2005.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
7

Mucchielli, Alex. Étude des communications: Approche par la modélisation des relations. Paris: A. Colin, 2004.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
8

Mucchielli, Alex. Étude des communications: Nouvelles approches. Paris: Colin, 2006.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
9

Mucchielli, Alex. Étude des communications: Approche systémique dans les organisations. Paris: Colin, 2004.

Find full text
APA, Harvard, Vancouver, ISO, and other styles
10

Mucchielli, Alex. Étude des communications: Approche par la contextualisation. Paris: Colin, 2005.

Find full text
APA, Harvard, Vancouver, ISO, and other styles

Book chapters on the topic "Audio content analysis"

1

Lu, Lie, and Alan Hanjalic. "Audio Content Analysis." In Encyclopedia of Database Systems, 198–201. New York, NY: Springer New York, 2018. http://dx.doi.org/10.1007/978-1-4614-8265-9_1528.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Lu, Lie, and Alan Hanjalic. "Audio Content Analysis." In Encyclopedia of Database Systems, 154–56. Boston, MA: Springer US, 2009. http://dx.doi.org/10.1007/978-0-387-39940-9_1528.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Lu, Lie, and Alan Hanjalic. "Audio Content Analysis." In Encyclopedia of Database Systems, 1–3. New York, NY: Springer New York, 2016. http://dx.doi.org/10.1007/978-1-4899-7993-3_1528-2.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Smaragdis, Paris, Regunathan Radhakrishnan, and Kevin W. Wilson. "Context Extraction Through Audio Signal Analysis." In Multimedia Content Analysis, 1–34. Boston, MA: Springer US, 2008. http://dx.doi.org/10.1007/978-0-387-76569-3_1.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Lu, Lie, and Alan Hanjalic. "Audio Content Discovery: An Unsupervised Approach." In Multimedia Content Analysis, 1–39. Boston, MA: Springer US, 2008. http://dx.doi.org/10.1007/978-0-387-76569-3_4.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

Zhang, Tong, and C. C. Jay Kuo. "Audio Feature Analysis." In Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing, 35–54. Boston, MA: Springer US, 2001. http://dx.doi.org/10.1007/978-1-4757-3339-6_3.

Full text
APA, Harvard, Vancouver, ISO, and other styles
7

De Santo, M., G. Percannella, C. Sansone, and M. Vento. "Unsupervised News Video Segmentation by Combined Audio-Video Analysis." In Multimedia Content Representation, Classification and Security, 273–81. Berlin, Heidelberg: Springer Berlin Heidelberg, 2006. http://dx.doi.org/10.1007/11848035_37.

Full text
APA, Harvard, Vancouver, ISO, and other styles
8

Zhang, Tong, and C. C. Jay Kuo. "Image Sequence Analysis." In Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing, 85–104. Boston, MA: Springer US, 2001. http://dx.doi.org/10.1007/978-1-4757-3339-6_6.

Full text
APA, Harvard, Vancouver, ISO, and other styles
9

Żwan, Paweł, Przemysław Sobala, Piotr Szczuko, and Andrzej Czyzewski. "Audio Content Analysis in the Urban Area Telemonitoring System." In Multimedia Services in Intelligent Environments, 227–39. Berlin, Heidelberg: Springer Berlin Heidelberg, 2010. http://dx.doi.org/10.1007/978-3-642-13396-1_10.

Full text
APA, Harvard, Vancouver, ISO, and other styles
10

Kang, Chan-Mi, and Joong-Hwan Baek. "Audio Content Analysis for Understanding Structures of Scene in Video." In Lecture Notes in Computer Science, 1213–18. Berlin, Heidelberg: Springer Berlin Heidelberg, 2006. http://dx.doi.org/10.1007/11816157_151.

Full text
APA, Harvard, Vancouver, ISO, and other styles

Conference papers on the topic "Audio content analysis"

1

Pfeiffer, Silvia, Stephan Fischer, and Wolfgang Effelsberg. "Automatic audio content analysis." In the fourth ACM international conference. New York, New York, USA: ACM Press, 1996. http://dx.doi.org/10.1145/244130.244139.

Full text
APA, Harvard, Vancouver, ISO, and other styles
2

Kumar, Anurag, and Bhiksha Raj. "Weakly supervised scalable audio content analysis." In 2016 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 2016. http://dx.doi.org/10.1109/icme.2016.7552989.

Full text
APA, Harvard, Vancouver, ISO, and other styles
3

Mitral, Vikramjit, Daniel Garcia-Romero, and Carol Y. Espy-Wilson. "Language detection in audio content analysis." In ICASSP 2008 - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2008. http://dx.doi.org/10.1109/icassp.2008.4518058.

Full text
APA, Harvard, Vancouver, ISO, and other styles
4

Wu, Chung-Ping, Po-Chyi Su, and C. C. Jay Kuo. "Robust and efficient digital audio watermarking using audio content analysis." In Electronic Imaging, edited by Ping W. Wong and Edward J. Delp III. SPIE, 2000. http://dx.doi.org/10.1117/12.384992.

Full text
APA, Harvard, Vancouver, ISO, and other styles
5

Arias, Jose Anibal, Regine Andre-Obrecht, and Jerome Farinas. "Automatic low-dimensional analysis of audio databases." In 2008 International Workshop on Content-Based Multimedia Indexing. IEEE, 2008. http://dx.doi.org/10.1109/cbmi.2008.4564996.

Full text
APA, Harvard, Vancouver, ISO, and other styles
6

Lu, Lie, and Alan Hanjalic. "Towards optimal audio "keywords" detection for audio content analysis and discovery." In the 14th annual ACM international conference. New York, New York, USA: ACM Press, 2006. http://dx.doi.org/10.1145/1180639.1180825.

Full text
APA, Harvard, Vancouver, ISO, and other styles
7

Zhu, Rong, Haojun Ai, and Ruimin Hu. "Audio Segmentation in AAC Domain for Content Analysis." In 2009 5th International Conference on Wireless Communications, Networking and Mobile Computing (WiCOM). IEEE, 2009. http://dx.doi.org/10.1109/wicom.2009.5301778.

Full text
APA, Harvard, Vancouver, ISO, and other styles
8

Magalhaes, Tairone, Felippe Barros, and Maurício Loureiro. "Iracema: a Python library for audio content analysis." In Simpósio Brasileiro de Computação Musical. Sociedade Brasileira de Computação - SBC, 2019. http://dx.doi.org/10.5753/sbcm.2019.10418.

Full text
Abstract:
This paper introduces the alpha version of a Python library called Iracema, which aims to provide models for the extraction of meaningful information from recordings of monophonic pieces of music, for purposes of research in music performance. With this objective in mind, we propose an architecture that will provide to users an abstraction level that simplifies the manipulation of different kinds of time series, as well as the extraction of segments from them. In this paper we: (1) introduce some key concepts at the core of the proposed architecture; (2) list the current functionalities of the package; (3) give some examples of the application programming interface; and (4) give some brief examples of audio analysis using the system.
APA, Harvard, Vancouver, ISO, and other styles
9

Kereliuk, Corey, Bob L. Sturm, and Jan Larsen. "Deep learning, audio adversaries, and music content analysis." In 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). IEEE, 2015. http://dx.doi.org/10.1109/waspaa.2015.7336950.

Full text
APA, Harvard, Vancouver, ISO, and other styles
10

Mitra, Vikramjit, Daniel Garcia-Romero, and Carol Y. Espy-Wilson. "Language and genre detection in audio content analysis." In Interspeech 2008. ISCA: ISCA, 2008. http://dx.doi.org/10.21437/interspeech.2008-621.

Full text
APA, Harvard, Vancouver, ISO, and other styles

Reports on the topic "Audio content analysis"

1

Baluk, Nadia, Natalia Basij, Larysa Buk, and Olha Vovchanska. VR/AR-TECHNOLOGIES – NEW CONTENT OF THE NEW MEDIA. Ivan Franko National University of Lviv, February 2021. http://dx.doi.org/10.30970/vjo.2021.49.11074.

Full text
Abstract:
The article analyzes the peculiarities of the media content shaping and transformation in the convergent dimension of cross-media, taking into account the possibilities of augmented reality. With the help of the principles of objectivity, complexity and reliability in scientific research, a number of general scientific and special methods are used: method of analysis, synthesis, generalization, method of monitoring, observation, problem-thematic, typological and discursive methods. According to the form of information presentation, such types of media content as visual, audio, verbal and combined are defined and characterized. The most important in journalism is verbal content, it is the one that carries the main information load. The dynamic development of converged media leads to the dominance of image and video content; the likelihood of increasing the secondary content of the text increases. Given the market situation, the effective information product is a combined content that combines text with images, spreadsheets with video, animation with infographics, etc. Increasing number of new media are using applications and website platforms to interact with recipients. To proceed, the peculiarities of the new content of new media with the involvement of augmented reality are determined. Examples of successful interactive communication between recipients, the leading news agencies and commercial structures are provided. The conditions for effective use of VR / AR-technologies in the media content of new media, the involvement of viewers in changing stories with augmented reality are determined. The so-called immersive effect with the use of VR / AR-technologies involves complete immersion, immersion of the interested audience in the essence of the event being relayed. This interaction can be achieved through different types of VR video interactivity. One of the most important results of using VR content is the spatio-temporal and emotional immersion of viewers in the plot. The recipient turns from an external observer into an internal one; but his constant participation requires that the user preferences are taken into account. Factors such as satisfaction, positive reinforcement, empathy, and value influence the choice of VR / AR content by viewers.
APA, Harvard, Vancouver, ISO, and other styles
2

Phillips, Jake. Understanding the impact of inspection on probation. Sheffield Hallam University, 2021. http://dx.doi.org/10.7190/shu.hkcij.05.2021.

Full text
Abstract:
This research sought to understand the impact of probation inspection on probation policy, practice and practitioners. This important but neglected area of study has significant ramifications because the Her Majesty’s Inspectorate of Probation has considerable power to influence policy through its inspection regime and research activities. The study utilised a mixed methodological approach comprising observations of inspections and interviews with people who work in probation, the Inspectorate and external stakeholders. In total, 77 people were interviewed or took part in focus groups. Probation practitioners, managers and leaders were interviewed in the weeks after an inspection to find out how they experienced the process of inspection. Staff at HMI Probation were interviewed to understand what inspection is for and how it works. External stakeholders representing people from the voluntary sector, politics and other non-departmental bodies were interviewed to find out how they used the work of inspection in their own roles. Finally, leaders within the National Probation Service and Her Majesty’s Prisons and Probation Service were interviewed to see how inspection impacts on policy more broadly. The data were analysed thematically with five key themes being identified. Overall, participants were positive about the way inspection is carried out in the field of probation. The main findings are: 1. Inspection places a burden on practitioners and organisations. Practitioners talked about the anxiety that a looming inspection created and how management teams created additional pressures which were hard to cope with on top of already high workloads. Staff responsible for managing the inspection and with leadership positions talked about the amount of time the process of inspection took up. Importantly, inspection was seen to take people away from their day jobs and meant other priorities were side-lined, even if temporarily. However, the case interviews that practitioners take part in were seen as incredibly valuable exercises which gave staff the opportunity to reflect on their practice and receive positive feedback and validation for their work. 2. Providers said that the findings and conclusions from inspections were often accurate and, to some extent, unsurprising. However, they sometimes find it difficult to implement recommendations due to reports failing to take context into account. Negative reports have a serious impact on staff morale, especially for CRCs and there was concern about the impact of negative findings on a provider’s reputation. 3. External stakeholders value the work of the Inspectorate. The Inspectorate is seen to generate highly valid and meaningful data which stakeholders can use in their own roles. This can include pushing for policy reform or holding government to account from different perspectives. In particular, thematic inspections were seen to be useful here. 4. The regulatory landscape in probation is complex with an array of actors working to hold providers to account. When compared to other forms of regulation such as audit or contract management the Inspectorate was perceived positively due to its methodological approach as well as the way it reflects the values of probation itself. 5. Overall, the inspectorate appears to garner considerable legitimacy from those it inspects. This should, in theory, support the way it can impact on policy and practice. There are some areas for development here though such as more engagement with service users. While recognising that the Inspectorate has made a concerted effort to do this in the last two years participants all felt that more needs to be done to increase that trust between the inspectorate and service users. Overall, the Inspectorate was seen to be independent and 3 impartial although this belief was less prevalent amongst people in CRCs who argued that the Inspectorate has been biased towards supporting its own arguments around reversing the now failed policy of Transforming Rehabilitation. There was some debate amongst participants about how the Inspectorate could, or should, enforce compliance with its recommendations although most people were happy with the primarily relational way of encouraging compliance with sanctions for non-compliance being considered relatively unnecessary. To conclude, the work of the Inspectorate has a significant impact on probation policy, practice and practitioners. The majority of participants were positive about the process of inspection and the Inspectorate more broadly, notwithstanding some of the issues raised in the findings. There are some developments which the Inspectorate could consider to reduce the burden inspection places on providers and practitioners and enhance its impact such as amending the frequency of inspection, improving the feedback given to practitioners and providing more localised feedback, and working to reduce or limit perceptions of bias amongst people in CRCs. The Inspectorate could also do more to capture the impact it has on providers and practitioners – both positive and negative - through existing procedures that are in place such as post-case interview surveys and tracking the implementation of recommendations.
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!

To the bibliography