Дисертації: "Speech imagery"

1

Scott, Mark. "Speech imagery as corollary discharge." Thesis, University of British Columbia, 2012. http://hdl.handle.net/2429/42231.

Повний текст джерела

Анотація:

This thesis tests the theory that the sensory content of inner speech is constituted by corollary discharge. Corollary discharge is a signal generated by the motor system and is a “prediction” of the sensory consequences of the motor system’s actions. Corollary discharge normally functions in the nervous system to segregate self-caused sensations from externally-caused sensations. It does this, partially, by attenuating the nervous system’s response to self-caused sensations. This thesis argues that corollary discharge has been co-opted in humans to provide the sensory content of speech imagery. The thesis further tests the claim that the sensory detail contained in speech imagery is sufficiently rich and sufficiently similar to the representations of external speech sounds that the perception of external speech sounds can be influenced by inner speech. This thesis claims that the perception of external speech is altered because corollary discharge prepares the auditory system to hear those sensory features which the corollary-discharge signal carries. These claims were tested experimentally by having participants engage in specific forms of speech imagery while categorizing external sounds. In one set of experiments, when external sound and speech imagery were in synchrony and were similar in content, the perception of the external sound was altered — the external sound came to be heard as matching the content of the speech imagery. In a second set of experiments, the presence of corollary discharge in speech imagery was tested. When a sensation matches a corollary discharge signal, the sensation tends to have an attenuated impact. This attenuation is a hallmark of corollary discharge. In this set of experiments, when participants’ speech imagery matched an external sound, the perceptual impact of the external sound was attenuated. Proper controls ensured that it was the degree of match between the speech imagery and the external sound that was responsible for this attenuation, rather than some extraneous factor.

Стилі APA, Harvard, Vancouver, ISO та ін.

2

McCord, Walter White. "The contribution of agricultural imagery to the interpretation of Amos." Theological Research Exchange Network (TREN), 1996. http://www.tren.com.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

3

Peixoto, Michael Viana. "PrÃtica intersemiÃtica no discurso imagÃtico-cancional de Adriana Calcanhotto: uma proposta de anÃlise." Universidade Federal do CearÃ, 2014. http://www.teses.ufc.br/tde_busca/arquivo.php?codArquivo=11914.

Повний текст джерела

Анотація:

FundaÃÃo de Amparo Ã Pesquisa do Estado do CearÃ
A presente tese âPrÃtica intersemiÃtica no Discurso ImagÃtico-Cancional de Adriana Calcanhotto: uma proposta de anÃliseâ estuda e define o discurso imagÃtico-cancional como uma prÃtica discursiva que mobiliza, atravÃs de procedimentos discursivos, linguagens de diferentes modalidades (tanto de natureza verbal quanto nÃo-verbal) para, num processo intersemiÃtico, compatibilizarem-se com a produÃÃo literomusical e, a partir disso, propiciarem a construÃÃo de sentidos. O alicerce teÃrico no qual fincamos esse conceito procede da AnÃlise do Discurso de linha francesa, considerando, sobretudo, as reflexÃes de Maingueneau (1999), Costa (2012), dentre outros. Com base nisso, a questÃo norteadora da pesquisa: Que proposta de abordagem teÃrica e metodolÃgica para uma anÃlise do discurso imagÃtico-cancional pode ser elaborada a partir das categorias discursivas? A operacionalizaÃÃo dessa questÃo e do objetivo se deu por meio da metodologia exploratÃria, em que, a partir de um corpus especÃfico â a produÃÃo literomusical de Adriana Calcanhotto compreendida entre 1990 e 2000, cuja delimitaÃÃo se deu por ordem cronolÃgica, a fim de se perceber como as respectivas produÃÃes se organizam de acordo com o inÃcio e o encerramento da dÃcada; a partir da apropriaÃÃo das categorias discursivas, cancionais e visuais, elaboramos um guia que propÃe um percurso que viabilize a construÃÃo dos sentidos do texto. Ã luz dessa metodologia, procedemos o exercÃcio de anÃlise dos dados o qual nos permitiu a conclusÃo de que a natureza interdiscursiva do discurso imagÃtico-cancional propicia sentidos tais que sÃ os sÃo possÃveis devido ao fenÃmeno da intersemioticidade que se estabelece e que . Essa conclusÃo nos possibilita afirmar que, em virtude disso, hÃ que promover um letramento verbo-visual; tais construÃÃes discursivas requerem do leitor uma aprendizagem acerca do modo de ler determinadas produÃÃes discursivas.
This thesis " intersemiotic Practice in Speech - Cancional imagery of Adriana Calcanhotto: a proposed analysis " studies and sets the image- cancional discourse as a discursive practice that mobilizes through discursive procedures , languages of different modalities (both verbal nature as nonverbal) to, in intersemiotic process compatibilizarem with the literomusical production and , from that , they encourage the construction of meaning. The theoretical foundation on which fincamos this concept comes from the analysis of French Discourse, considering especially the reflections of Maingueneau (1999), Costa (2012), among others. Based on this, the guiding research question: What proposal for theoretical and methodological approach to an analysis of image- cancional speech can be compiled from the discursive categories? The operationalization of this issue and the goal was through the exploratory methodology, in which, from a specific corpus - the literomusical production of Adriana Calcanhotto between 1990 and 2000, whose limits given in chronological order, in order to realize as their productions are organized according to the opening and closure of the decade; from the appropriation of discursive and visual cancionais , categories prepared a guide that offers a path that makes possible the construction of meanings of the text . In light of this methodology , we proceed to the performance analysis of the data which allowed us to conclude that the nature of the image- interdiscursive cancional speech provides such meanings that are only possible due to the phenomenon of intersemioticidade that is established and that. This conclusion allows us to state that, because of this, there is a verb that promote visual literacy; such discursive constructions require the reader learning about the way of reading certain discursive productions.

Стилі APA, Harvard, Vancouver, ISO та ін.

4

Nalborczyk, Ladislas. "Understanding rumination as a form of inner speech : probing the role of motor processes." Thesis, Université Grenoble Alpes (ComUE), 2019. http://www.theses.fr/2019GREAS017/document.

Повний текст джерела

Анотація:

Les ruminations mentales sont majoritairement exprimées sous forme verbale et il a été proposé de les considérer, par conséquent, comme une forme dysfonctionnelle de parole intérieure (i.e., production mentale de parole). D’autre part, les recherches sur la psychophysiologie de la parole intérieure ont révélé que les processus neuronaux impliqués dans la parole à voix haute et dans la parole intérieure sont similaires. Ces observations sont cohérentes avec l’idée que certaines formes de parole intérieure pourraient être considérées comme une forme de simulation de la parole à voix haute, de la même manière que certaines actions imaginées peuvent être considérées comme le résultat d’une simulation de l’action correspondante (par exemple, marcher et s’imaginer en train de marcher). En d’autres termes, l’hypothèse de la simulation motrice suggère que le système moteur de la parole devrait également être impliqué lors de la production de parole intérieure. L’hypothèse corollaire peut être formulée, selon laquelle la production de parole intérieure (et de ruminations) devrait être perturbée par une perturbation du système moteur de la parole. Nous avons mené une série de cinq études visant à sonder le rôle du système moteur de la parole dans les ruminations. Dans l’ensemble, nos résultats soulignent que, bien que la rumination verbale puisse être considérée comme une forme de parole intérieure, elle ne semble pas recruter spécifiquement le système moteur de la parole. Plus précisément, nous soutenons que la rumination peut être considérée comme une forme de parole intérieure particulièrement condensée, qui s’exprimerait sous la forme d’une représentation phonologique, et dont les traits articulatoires ne seraient pas complètements spécifiés. Nous faisons le lien entre ces résultats et l’hypothèse théorique du cadre “habitude-but” de la rumination dépressive et nous discutons de leurs implications pour les théories de la production de parole intérieure
Rumination is known to be a predominantly verbal process and has been proposed to be considered as such as a dysfunctional form of inner speech (i.e., the silent production of words in one’s mind). On the other hand, research on the psychophysiology of inner speech revealed that the neural processes involved in overt and covert speech tend to be very similar. This is coherent with the idea that some forms of inner speech could be considered as a kind of simulation of overt speech, in the same way as imagined actions can be considered as the result of a simulation of the corresponding overt action (e.g., walking and imagined walking). In other words, the motor simulation hypothesis suggests that the speech motor system should be involved as well during inner speech production. The corollary hypothesis might be drawn, according to which the production of inner speech (and rumination) should be disrupted by a disruption of the speech motor system. We conducted a series of five studies aiming to probe the role of the speech motor system in rumination. Overall, our results highlight that although verbal rumination may be considered as a form of inner speech, it might not specifically involve the speech motor system. More precisely, we argue that rumination might be considered as a particularly strongly condensed form of inner speech that does not systematically involve fully specified articulatory features. We discuss these findings in relation to the habit-goal framework of depressive rumination and we discuss the implications of these findings for theories of inner speech production

Стилі APA, Harvard, Vancouver, ISO та ін.

5

Hofmann, Petra. "Infernal imagery in Anglo-Saxon charters." Thesis, St Andrews, 2008. http://hdl.handle.net/10023/498.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

6

Wendel, Sue M. "Insights into the Mental Imagery and Gestural Awareness of Representational Gestures Produced in Everyday Talk: An Exploratory Study of Using Participants' Comments as Data." PDXScholar, 2015. https://pdxscholar.library.pdx.edu/open_access_etds/2646.

Повний текст джерела

Анотація:

To better understand representational gestures used in everyday talk, this study explores the ways participants talk about their own mental imagery and gestural awareness, and how their comments affect analysis. Literature pertaining to representational gestures, mental imagery, gestural awareness, and self-report data provide the theoretical framework for the study's design and implementation. Data is drawn from observations of two video recorded dyads engaged in everyday conversation, and four audio recorded interviews with each participant individually as they viewed and commented on selected video segments in which they had produced a representational gesture. Findings indicate that participants talked about mental imagery and gestural awareness in ways that were descriptive, explanatory, and self-reflective. They described their mental imagery in i) visual and motor terms, ii) as mental simulations, iii) as textural sensations, and iv) in linguistic metaphors. Participants talked about gestural awareness in terms of i) spontaneity, ii) intentionality, and iii) affective states. Taken altogether, participant comments suggest embodied cognition as a useful framework for analyzing and understanding representational gestures. Further, findings indicate that participant comments served to i) confirm, ii) clarify, and/or iii) expand my analysis, suggesting that participant comments can enhance understanding of mental imagery and gestural awareness in ways that could not be achieved by a researcher's observations and analysis alone.

Стилі APA, Harvard, Vancouver, ISO та ін.

7

Runnals, Jennifer Jane. "Exploring the Cardiovascular Response to Anger Imagery and Speech in Vietnam Veterans With and Without Posttraumatic Stress Disorder." Also available to VCU users online at:, 2007. http://hdl.handle.net/10156/1882.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

8

Hung, Pei-Fang. "Mental imagery and idiom understanding in adults: Examining dual coding theory." Thesis, University of Oregon, 2010. http://hdl.handle.net/1794/10878.

Повний текст джерела

Анотація:

xiii, 205 p. : ill. A print copy of this thesis is available through the UO Libraries. Search the library catalog for the location and call number.
This study examined idiom understanding in 120 neurologically healthy adults, ages 20-29 (20s Group), 40-49 (40s Group), 60-69 (60s Group), and 80-89 (80s Group) years old. Each participant was administered a familiarity task, definition explanation task, mental imagery task, and forced-choice comprehension task. Twenty idioms, 10 transparent and 10 opaque, were used with no supporting contexts. Participants were asked to rate the familiarity of each idiom, to provide a definition of each, to generate a mental image of each, and to select the best definition of each from among four options. It was predicted that younger and older adults would perform equally well on the comprehension task but that older adults would perform poorer than younger adults on the explanation task. Additionally, mental imagery of idioms was expected to become more figurative with advancing age, and participants were expected to perform better on highly familiar and transparent idioms than on less familiar and opaque ones. Participants rated all 20 idioms as highly familiar, with the lowest familiarity rating for participants in the 20s Group. No significant differences were found on the forced-choice comprehension task across the four age groups although the 20s Group scored the lowest among all age groups. The 60s Group performed significantly better than the 20s Group on the definition explanation task, but no significant differences were found between the other age groups. No significant differences were found in generating mental images between transparent and opaque idioms, and mental images tended to be figurative rather than literal for both types of idioms. The present study adds to our knowledge of idiom understanding across adulthood. Familiarity seemed to play a stronger role than transparency in idiom understanding in adults. Once an idiom was learned and stored as a lexical unit, people used the idiomatic meaning and generated figurative mental imagery immediately without accessing the literal meaning or the literal mental image.
Committee in charge: Marilyn Nippold, Chairperson, Special Education and Clinical Sciences; Roland Good, Member, Special Education and Clinical Sciences; Deborah Olson, Member, Special Education and Clinical Sciences; Nathaniel Teich, Outside Member, English

Стилі APA, Harvard, Vancouver, ISO та ін.

9

Diedrichs, Victoria Anne. "Leveraging Pupillometry and Luminance-Based Mental Imagery for a Novel Mode of Communication." Master's thesis, Temple University Libraries, 2015. http://cdm16002.contentdm.oclc.org/cdm/ref/collection/p245801coll10/id/352749.

Повний текст джерела

Анотація:

Communication Sciences
M.A.
The aim of the present study was to characterize participants’ abilities to answer binary yes/no questions by mentally manipulating imagery to produce imagined changes in luminance, which would in turn cause reflexive perturbations in pupil diameter. First, a paired association was established with participants, linking “yes” responses with imagining a “sunny sky” and “no” responses with imagining a “dark room”. Participants (N=20) then answered 16 yes/no questions using this response method, in place of providing verbal or gestural (e.g., head nod) answers. Pupil diameters were recorded for a period of 8000 ms following each stimulus question while participants maintained the mental image that corresponded with their answer. We hypothesized that on average, “no” responses would yield a pupil dilation and increased diameter relative to baseline, while “yes” responses would instead result in constrictions and smaller pupil diameters compared to baseline. A 2-factor repeated measures analysis of variance (ANOVA), where time was one factor and response type (i.e., yes or no) was the other, revealed a statistically significant interaction of time and response type, a significant main effect of time, and a trend toward significance for response type in aggregated group data. Item level discrimination consisted of comparing the mean pupil diameter in response to a single item for a single participant (e.g., “yes” response on one trial) to the mean pupil diameter of all contrasting responses for that same participant (e.g., all “no” response trials). This method achieved a 64.5% discrimination accuracy. This investigation affirmed the plausibility of leveraging pupillometry and luminance-based mental imagery in favor of an alternative communication system for individuals who are locked-in, as well as its potential as a screening tool. However, further investigation is warranted prior to its implementation.
Temple University--Theses

Стилі APA, Harvard, Vancouver, ISO та ін.

10

Major, Mary Elizabeth. "War's Visual Discourse| A Content Analysis of Iraq War Imagery." Thesis, Portland State University, 2013. http://pqdtopen.proquest.com/#viewpdf?dispub=1535957.

Повний текст джерела

Анотація:

This study reports the findings of a systematic visual content analysis of 356 randomly sampled images published about the Iraq War in Time, Newsweek, and U.S. News and World Report from 2003-2009. In comparison to a 1995 Gulf War study, published images in all three newsmagazines continued to be U.S.-centric, with the highest content frequencies reflected in the categories U.S. troops on combat patrol, Iraqi civilians, and U.S. political leaders respectively. These content categories do not resemble the results of the Gulf War study in which armaments garnered the largest share of the images with 23%.

This study concludes that embedding photojournalists, in addition to media economics, governance, and the media-organizational culture, restricted an accurate representation of the Iraq War and its consequences. Embedding allowed more access to both troops and civilians than the journalistic pool system of the Gulf War, which stationed the majority of journalists in Saudi Arabia and allowed only a few journalists into Iraq with the understanding they would share information. However, the perceived opportunity by journalists to more thoroughly cover the war through the policy of embedding was not realized to the extent they had hoped for. The embed protocols acted more as an indirect form of censorship.

Стилі APA, Harvard, Vancouver, ISO та ін.

11

Wang, Kang. "Image Transfer Between Magnetic Resonance Images and Speech Diagrams." Thesis, Université d'Ottawa / University of Ottawa, 2020. http://hdl.handle.net/10393/41533.

Повний текст джерела

Анотація:

Realtime Magnetic Resonance Imaging (MRI) is a method used for human anatomical study. MRIs give exceptionally detailed information about soft-tissue structures, such as tongues, that other current imaging techniques cannot achieve. However, the process requires special equipment and is expensive. Hence, it is not quite suitable for all patients. Speech diagrams show the side view positions of organs like the tongue, throat, and lip of a speaking or singing person. The process of making a speech diagram is like the semantic segmentation of an MRI, which focuses on the selected edge structure. Speech diagrams are easy to understand with a clear speech diagram of the tongue and inside mouth structure. However, it often requires manual annotation on the MRI machine by an expert in the field. By using machine learning methods, we achieved transferring images between MRI and speech diagrams in two directions. We first matched videos of speech diagram and tongue MRIs. Then we used various image processing methods and data augmentation methods to make the paired images easy to train. We built our network model inspired by different cross-domain image transfer methods and applied reference-based super-resolution methods—to generate high-resolution images. Thus, we can do the transferring work through our network instead of manually. Also, generated speech diagram can work as an intermediary part to be transferred to other medical images like computerized tomography (CT), since it is simpler in structure compared to an MRI. We conducted experiments using both the data from our database and other MRI video sources. We use multiple methods to do the evaluation and comparisons with several related methods show the superiority of our approach.

Стилі APA, Harvard, Vancouver, ISO та ін.

12

Louridas, Efstathios. "Image processing and analysis of videofluoroscopy images in cleft palate patients." Thesis, University of Kent, 1998. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.267392.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

13

Napper, Nick Louis. "An investigation into the impact of visual aids in post-compulsory education." Thesis, University of Exeter, 2014. http://hdl.handle.net/10871/16561.

Повний текст джерела

Анотація:

This study seeks insight into the use of visual aids in contemporary post-compulsory teaching. The importance of the study is linked to the large number of students who enter Higher Education; many of whom regularly receive lectures supported by visual displays which comprise textual summaries of a lecturerâs speech. This thesis comprises a two-part study and employs a mixed methods approach. The first part inquires into teachersâ and lecturersâ practice with regard to their visual aids, and the second compares the effectiveness of text, images and imagery displayed in support of a lecture. The investigation into lecturersâ practice found many post-compulsory teachers and lecturers had received no training in the design and use of visual aids during their initial teacher training. It is suggested this privation may underpin a de facto choice of projected text as a visual aid, the use of which is not clearly supported by contemporary models of memory and mental processing. In a comparison of visual modalities, an increase in learner engagement was recorded for the display of carefully designed images, and also for directed imagery. No positive impact was recorded for text summaries of 50-64 words displayed concurrently with speech, although recall was improved when text was restricted to five words or fewer. The conclusion is drawn that the display of this modality without temporal pauses may offer limited educational advantage to students, and a method of planned apportionment of speech and text is proposed in which contemporary theories of memory and processing are taken into account. These observations have significant implications for a lecture environment in which such text summaries are often relied upon for visual support. The findings of the thesis are combined to propose a principle of Visual Working Memory Utilisation (VWMU), upon which future research into visual aid design and use in post compulsory education might be based.

Стилі APA, Harvard, Vancouver, ISO та ін.

14

Hart, Lauren Alexandra. "Development of Thai Speech Audiometry Materials for Measuring Speech Recognition Thresholds." Diss., CLICK HERE for online access, 2008. http://contentdm.lib.byu.edu/ETD/image/etd2546.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

15

Peixoto, Michael Viana. "Prática intersemiótica no discurso imagético-cancional de Adriana Calcanhotto: uma proposta de análise." www.teses.ufc.br, 2013. http://www.repositorio.ufc.br/handle/riufc/8929.

Повний текст джерела

Анотація:

PEIXOTO, Michael Viana. Prática intersemiótica no discurso imagético-cancional de Adriana Calcanhotto: uma proposta de análise. 2013. 170f. – Tese (Doutorado) – Universidade Federal do Ceará, Departamento de Letras Vernáculas, Programa de Pós-graduação em Linguística, Fortaleza (CE), 2013.
Submitted by Márcia Araújo (marcia_m_bezerra@yahoo.com.br) on 2014-08-27T13:22:20Z No. of bitstreams: 1 2013_tese_mvpeixoto.pdf: 4610205 bytes, checksum: 62ae1cf598cca221bb3117394bf86faf (MD5)
Approved for entry into archive by Márcia Araújo(marcia_m_bezerra@yahoo.com.br) on 2014-08-27T16:35:08Z (GMT) No. of bitstreams: 1 2013_tese_mvpeixoto.pdf: 4610205 bytes, checksum: 62ae1cf598cca221bb3117394bf86faf (MD5)
Made available in DSpace on 2014-08-27T16:35:08Z (GMT). No. of bitstreams: 1 2013_tese_mvpeixoto.pdf: 4610205 bytes, checksum: 62ae1cf598cca221bb3117394bf86faf (MD5) Previous issue date: 2013
This thesis " intersemiotic Practice in Speech - Cancional imagery of Adriana Calcanhotto: a proposed analysis " studies and sets the image- cancional discourse as a discursive practice that mobilizes through discursive procedures , languages of different modalities (both verbal nature as nonverbal) to, in intersemiotic process compatibilizarem with the literomusical production and , from that , they encourage the construction of meaning. The theoretical foundation on which fincamos this concept comes from the analysis of French Discourse, considering especially the reflections of Maingueneau (1999), Costa (2012), among others. Based on this, the guiding research question: What proposal for theoretical and methodological approach to an analysis of image- cancional speech can be compiled from the discursive categories? The operationalization of this issue and the goal was through the exploratory methodology, in which, from a specific corpus - the literomusical production of Adriana Calcanhotto between 1990 and 2000, whose limits given in chronological order, in order to realize as their productions are organized according to the opening and closure of the decade; from the appropriation of discursive and visual cancionais , categories prepared a guide that offers a path that makes possible the construction of meanings of the text . In light of this methodology , we proceed to the performance analysis of the data which allowed us to conclude that the nature of the image- interdiscursive cancional speech provides such meanings that are only possible due to the phenomenon of intersemioticidade that is established and that. This conclusion allows us to state that, because of this, there is a verb that promote visual literacy; such discursive constructions require the reader learning about the way of reading certain discursive productions.
A presente tese “Prática intersemiótica no Discurso Imagético-Cancional de Adriana Calcanhotto: uma proposta de análise” estuda e define o discurso imagético-cancional como uma prática discursiva que mobiliza, através de procedimentos discursivos, linguagens de diferentes modalidades (tanto de natureza verbal quanto não-verbal) para, num processo intersemiótico, compatibilizarem-se com a produção literomusical e, a partir disso, propiciarem a construção de sentidos. O alicerce teórico no qual fincamos esse conceito procede da Análise do Discurso de linha francesa, considerando, sobretudo, as reflexões de Maingueneau (1999), Costa (2012), dentre outros. Com base nisso, a questão norteadora da pesquisa: Que proposta de abordagem teórica e metodológica para uma análise do discurso imagético-cancional pode ser elaborada a partir das categorias discursivas? A operacionalização dessa questão e do objetivo se deu por meio da metodologia exploratória, em que, a partir de um corpus específico – a produção literomusical de Adriana Calcanhotto compreendida entre 1990 e 2000, cuja delimitação se deu por ordem cronológica, a fim de se perceber como as respectivas produções se organizam de acordo com o início e o encerramento da década; a partir da apropriação das categorias discursivas, cancionais e visuais, elaboramos um guia que propõe um percurso que viabilize a construção dos sentidos do texto. À luz dessa metodologia, procedemos o exercício de análise dos dados o qual nos permitiu a conclusão de que a natureza interdiscursiva do discurso imagético-cancional propicia sentidos tais que só os são possíveis devido ao fenômeno da intersemioticidade que se estabelece e que . Essa conclusão nos possibilita afirmar que, em virtude disso, há que promover um letramento verbo-visual; tais construções discursivas requerem do leitor uma aprendizagem acerca do modo de ler determinadas produções discursivas.

Стилі APA, Harvard, Vancouver, ISO та ін.

16

Slade, Katie Bedke. "Speech Reception Threshold Materials for Taiwan Mandarin." Diss., CLICK HERE for online access, 2006. http://contentdm.lib.byu.edu/ETD/image/etd1379.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

17

Milner, Benjamin Peter. "Speech recognition in adverse environments." Thesis, University of East Anglia, 1994. https://ueaeprints.uea.ac.uk/2907/.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

18

Khurshid, Azar. "Pitch estimation for noisy speech." Thesis, University of Plymouth, 2002. http://hdl.handle.net/10026.1/1692.

Повний текст джерела

Анотація:

In this dissertation a biologically plausible system of pitch estimation is proposed. The system is designed from the bottom up to be robust to challenging noise conditions. This robustness to the presence of noise in the signal is achieved by developing a new representation of the speech signal, based on the operation of damped harmonic oscillators, and temporal mode analysis of their output. This resulting representation is shown to possess qualities which are not degraded in presence of noise. A harmonic grouping based system is used to estimate the pitch frequency. A detailed statistical analysis is performed on the system, and performance compared with some of the most established and recent pitch estimation and tracking systems. The detailed analysis includes results of experiments with a variety of noises with a large range of signal to noise ratios, under different signal conditions. Situations where the interfering "noise" is speech from another speaker are also considered. The proposed system is able to estimate the pitch of both the main speaker, and the interfering speaker, thus emulating the phenomena of auditory streaming and "cocktail party effect" in terms of pitch perception. The results of the extensive statistical analysis show that the proposed system exhibits some very interesting properties in its ability of handling noise. The results also show that the proposed system’s overall performance is much better than any of the other systems tested, especially in presence of very large amounts of noise. The system is also shown to successfully simulate some very interesting psychoacoustical pitch perception phenomena. Through a detailed and comparative computational requirements analysis, it is also demonstrated that the proposed system is comparatively inexpensive in terms of processing and memory requirements.

Стилі APA, Harvard, Vancouver, ISO та ін.

19

Long, Christopher J. "Wavelet methods in speech recognition." Thesis, Loughborough University, 1999. https://dspace.lboro.ac.uk/2134/14108.

Повний текст джерела

Анотація:

In this thesis, novel wavelet techniques are developed to improve parametrization of speech signals prior to classification. It is shown that non-linear operations carried out in the wavelet domain improve the performance of a speech classifier and consistently outperform classical Fourier methods. This is because of the localised nature of the wavelet, which captures correspondingly well-localised time-frequency features within the speech signal. Furthermore, by taking advantage of the approximation ability of wavelets, efficient representation of the non-stationarity inherent in speech can be achieved in a relatively small number of expansion coefficients. This is an attractive option when faced with the so-called 'Curse of Dimensionality' problem of multivariate classifiers such as Linear Discriminant Analysis (LDA) or Artificial Neural Networks (ANNs). Conventional time-frequency analysis methods such as the Discrete Fourier Transform either miss irregular signal structures and transients due to spectral smearing or require a large number of coefficients to represent such characteristics efficiently. Wavelet theory offers an alternative insight in the representation of these types of signals. As an extension to the standard wavelet transform, adaptive libraries of wavelet and cosine packets are introduced which increase the flexibility of the transform. This approach is observed to be yet more suitable for the highly variable nature of speech signals in that it results in a time-frequency sampled grid that is well adapted to irregularities and transients. They result in a corresponding reduction in the misclassification rate of the recognition system. However, this is necessarily at the expense of added computing time. Finally, a framework based on adaptive time-frequency libraries is developed which invokes the final classifier to choose the nature of the resolution for a given classification problem. The classifier then performs dimensionaIity reduction on the transformed signal by choosing the top few features based on their discriminant power. This approach is compared and contrasted to an existing discriminant wavelet feature extractor. The overall conclusions of the thesis are that wavelets and their relatives are capable of extracting useful features for speech classification problems. The use of adaptive wavelet transforms provides the flexibility within which powerful feature extractors can be designed for these types of application.

Стилі APA, Harvard, Vancouver, ISO та ін.

20

Stewart, Darryl William. "Syllable based continuous speech recognition." Thesis, Queen's University Belfast, 2000. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.325993.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

21

Luettin, Juergen. "Visual speech and speaker recognition." Thesis, University of Sheffield, 1997. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.264432.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

22

Costa, Paula Dornhofer Paro 1978. "Two-dimensional expressive speech animation = Animação 2D de fala expressiva." [s.n.], 2015. http://repositorio.unicamp.br/jspui/handle/REPOSIP/260903.

Повний текст джерела

Анотація:

Orientador: José Mario De Martino
Tese (doutorado) - Universidade Estadual de Campinas, Faculdade de Engenharia Elétrica e de Computação
Made available in DSpace on 2018-08-26T21:43:57Z (GMT). No. of bitstreams: 1 Costa_PaulaDornhoferParo_D.pdf: 15894797 bytes, checksum: 194a20ae502dfc7198a008d576e23e4c (MD5) Previous issue date: 2015
Resumo: O desenvolvimento da tecnologia de animação facial busca atender uma demanda crescente por aplicações envolvendo assistentes, vendedores, tutores e apresentadores de notícias virtuais; personagens realistas de videogames, agentes sociais e ferramentas para experimentos científicos em psicologia e ciências comportamentais. Um aspecto relevante e desafiador no desenvolvimento de cabeças falantes, ou "talking heads", é a reprodução realista dos movimentos articulatórios da fala combinados aos elementos de comunicação não-verbal e de expressão de emoções. Este trabalho presenta uma metodologia de síntese de animação facial baseada em imagens, ou animação facial 2D, que permite a reprodução de uma ampla gama de estados emocionais de fala expressiva, além de suportar a modulação de movimentos da cabeça e o controle de elementos faciais tais como o piscar de olhos e o arqueamento de sobrancelhas. A síntese da animação utiliza uma base de imagens-protótipo que são processadas para obtenção dos quadros-chave da animação. Os pesos utilizados para a combinação das imagens-protótipo são derivados de um modelo estatístico de aparência e formas, construído a partir de um conjunto de imagens de treinamento extraídas de um corpus audiovisual de uma face real. A síntese das poses-chave é guiada pela transcrição fonética temporizada da fala a ser animada e pela informação do estado emocional almejado. As poses-chave representam visemas dependentes de contexto fonético que implicitamente modelam os efeitos da coarticulação na fala visual. A transição entre poses-chave adjacentes é realizada por um algoritmo de metamorfose não-linear entre imagens. As animações sintetizadas aplicando-se a metodologia proposta foram avaliadas por meio de avaliação perceptual de reconhecimento de emoções. Dentre as contribuições deste trabalho encontra-se a construção de uma base de dados de vídeo e captura de movimento para fala expressiva em português do Brasil
Abstract: The facial animation technology experiences an increasing demand for applications involving virtual assistants, sellers, tutors and newscasters; lifelike game characters, social agents, and tools for scientific experiments in psychology and behavioral sciences. A relevant and challenging aspect of the development of talking heads is the realistic reproduction of the speech articulatory movements combined with the elements of non-verbal communication and the expression of emotions. This work presents an image-based, or 2D, facial animation synthesis methodology that allows the reproduction of a wide range of expressive speech emotional states and also supports the modulation of head movements and the control of face elements, like the blinking of the eyes and the raising of the eyebrows. The synthesis of the animation uses a database of prototype images which are combined to produce animation keyframes. The weights used for combining the prototype images are derived from a statistical active appearance model (AAM), which is built from a set of sample images extracted from an audio-visual corpus of a real face. The generation of the animation keyframes is driven by the timed phonetic transcription of the speech to be animated and the desired emotional state. The keyposes consist of expressive context-dependent visemes that implicitly model the speech coarticulation effects. The transition between adjacent keyposes is performed through a non-linear image morphing algorithm. To evaluate the synthesized animations, a perceptual evaluation based on the recognition of emotions was performed. Among the contributions of the work is also the building of a database of expressive speech video and motion capture data for Brazilian Portuguese
Doutorado
Engenharia de Computação
Doutora em Engenharia Elétrica

Стилі APA, Harvard, Vancouver, ISO та ін.

23

Käsemodel, Thays Berretta. "Avaliação da correção de atenuação e espalhamento em imagens SPECT em protocolo cerebral." Universidade de São Paulo, 2014. http://www.teses.usp.br/teses/disponiveis/59/59135/tde-24112014-141242/.

Повний текст джерела

Анотація:

A tomografia computadorizada por emissão de fóton único (SPECT) é uma das modalidades de diagnóstico na Medicina Nuclear em que se detecta a radiação emitida por um radiofármaco previamente administrado ao paciente. Visto que osfótons emitidos sofrem interações com o corpo do paciente, fazem-se necessárias as correções de atenuação e de espalhamento a fim de melhor representar a distribuição do radiofármaco, e assimresultar em imagens mais precisas. O objetivo deste trabalho é avaliar os parâmetros anotados como padrão para reconstruções de imagens tomográficas e as correções de atenuação e de espalhamento em imagens SPECT do Hospital de Clínicas da Faculdade de Medicina da Universidade de São Paulo, por meio de análises qualitativas e quantitativas das imagens reconstruída a partir das aquisições tomográficas. Sob um protocolo de SPECT-CT cerebral modificado para duas janelas de aquisição, foram adquiridas imagens SPECT e SPECT-CT (BrightView XCT, Philips) utilizando fantomaJaszczak e reconstruídas pelos métodos FBP, MLEM e OSEM. Os resultados mostram que o método FBP apresenta imagens de baixa precisão devido à baixa SNR. A avaliação sugere o uso dos métodos iterativos MLEM e OSEM com correção de atenuação como método padrão de reconstrução de imagens de perfusão cerebral. De acordo com a avaliação de imagens do fantomaJaszczak e análise do contraste entre esfera fria ebackground,propõe-se análise observacional e avaliação das imagens clínicas reconstruídas pelo método OSEM com os parâmetros 3 iterações, 16 subsets, filtro Butterworth com frequência de corte 0,34 e potencia 1, como novos parâmetros padrão de reconstrução de imagens.
Single Photon Emission Computed Tomography (SPECT) is one of the diagnostic modalities in nuclear medicine, it detects the radiation emitted by a radioisotope previously administered to the patient. Since the photons undergo interactions with the patient\'s body,attenuationand scatteringcorrections are necessary in order to best represent the distribution of the radiopharmaceutical, and thus result in more accurate images. The aim of this study is to evaluate the standard parameters for tomographic imagesreconstruction, and attenuation and scatter corrections ofSPECT images, from Hospital das Clínicas da Faculdade de Medicina de RibeirãoPreto, Universidade de São Paulo, through qualitativeand quantitative analysis of the reconstructed image obtained from SPECT aquisitions. Though a modified to two windows of acquisition protocol for cerebral SPECT-CT, we acquired SPECT and SPECT-CT images (BrightView XCT, Philips) using phantom Jaszczak and the ones were reconstructed by FBP, MLEM and OSEM methods. The results show that the FBP method has poor image precision due to low SNR. The review suggests the use of iterative methods MLEM and OSEM with attenuation correction as a standard method of image reconstruction of cerebral perfusion. According to the images the phantom Jaszczak and contrast analysis between cold sphere and background, we propose observational analysis and evaluations of clinical images reconstructed by OSEM method with parameters 3 iterations, 16 subsets, Butterworth filter with cutoff frequency 0.34 and order 1, as newstandard parameters for image reconstruction parameters.

Стилі APA, Harvard, Vancouver, ISO та ін.

24

Wilson, Shawn C. "Voice recognition systems : assessment of implementation aboard U.S. naval ships." Thesis, Monterey, Calif. : Springfield, Va. : Naval Postgraduate School ; Available from National Technical Information Service, 2003. http://library.nps.navy.mil/uhtbin/hyperion-image/03Mar%5FWilson.pdf.

Повний текст джерела

Анотація:

Thesis (M.S. in Information Systems and Operations)--Naval Postgraduate School, March 2003.
Thesis advisor(s): Michael T. McMaster, Kenneth J. Hagan. Includes bibliographical references (p. 47-49). Also available online.

Стилі APA, Harvard, Vancouver, ISO та ін.

25

Yan, Ming. "VLSI architectures for speech and image coding applications." Thesis, Queen's University Belfast, 1989. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.356855.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

26

Arnab, Sylvester. "Python for speech/image processing and computer vision /." Leeds, 2001. http://www.leeds.ac.uk/library/counter2/compstmsc/20002001/arnab.doc.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

27

So, Stephen. "Efficient Block Quantisation for Image and Speech Coding." Thesis, Griffith University, 2005. http://hdl.handle.net/10072/366625.

Повний текст джерела

Анотація:

Signal coding or compression has played a significant role in the success of digital communications and multimedia. The use of signal coding pervades many aspects of our digital lifestyle-a lifestyle that has seen widespread demand for applications like third generation mobile telephony, portable music players, Internet-based video conferencing, digital television, etc. The issues that arise, when dealing with the transmission and storage of digital media, are the limited bandwidth of communication channels, the limited capacity of storage devices, and the limited processing ability of the encoding and decoding devices. The aim of signal coding is therefore to represent digital media, such as speech, music, images, and video, as efficiently as possible. Coding efficiency encompasses rate-distortion (for lossy coding), computational complexity, and static memory requirements. The fundamental operation in lossy signal coding is quantisation. Its rate-distortion efficiency is influenced by the properties of the signal source, such as statistical dependencies and its probability density function. Vector quantisers are known to theoretically achieve the lowest distortion, at a given rate and dimension, of any quantisation scheme, though their computational complexity and memory requirements grow exponentially with rate and dimension. Structurally constrained vector quantisers, such as product code vector quantisers, alleviate these complexity issues, though this is achieved at the cost of degraded rate-distortion performance. Block quantisers or transform coders, which are a special case of product code vector quantisation, possess both low computational and memory requirements, as well as the ability to scale to any bitrate, which is termed as bitrate scalability. However, the prerequisite for optimal block quantisation, namely a purely Gaussian data source with uniform correlation, is rarely ever met with real-world signals. The Gaussian mixture model-based block quantiser, which was originally developed for line spectral frequency (LSF) quantisation for speech coding, overcomes these problems of source mismatch and non-stationarity by estimating the source using a GMM. The split vector quantiser, which was also successfully applied to LSF quantisation in the speech coding literature, is a product code vector quantiser that overcomes the complexity problem of unconstrained vector quantisers, by partitioning vectors into sub-vectors and quantising each one independently. The complexity can be significant reduced via more vector splitting, though this inevitably leads to an accompanying degradation in the rate-distortion efficiency. This is because the structural constraint of vector splitting causes losses in several properties of vector quantisers, which are termed as 'advantages'. This dissertation makes several contributions to the area of block and vector quantisation, more specifically to the GMM-based block quantiser and split vector quantiser, which aim to improve their rate-distortion and computational efficiency. These new quantisation schemes are evaluated and compared with existing and popular schemes in the areas of lossy image coding, LSF quantisation in narrowband speech coding, LSF and immittance spectral pair (ISP) quantisation in wideband speech coding, and Mel frequency-warped cepstral coefficient (MFCC) quantisation in distributed speech recognition. These contributions are summarised below. A novel technique for encoding fractional bits in a fixed-rate 0MM-based block quantiser scheme is presented. In the 0MM-based block quantiser, fractional bitrates are often assigned to each of the cluster block quantisers. This new encoding technique leads to better utilisation of the bit budget by allowing the use of, and providing for the encoding of, quantiser levels in a fixed-rate framework. The algorithm is based on a generalised positional number system and has a low complexity. A lower complexity 0MM-based block quantiser, that replaces the KLT with the discrete cosine transform (DOT), is proposed for image coding. Due to its source independent nature and amenability to efficient implementation, the DOT allows a fast 0MM-based block quantiser to be realised that achieves comparable rate-distortion performance as the KLT-based scheme in the block quantisation of images. Transform image coding often suffers from block artifacts at relatively low bitrates. We propose a scheme that minimises the block artifacts of block quantisation by pre-processing the image using the discrete wavelet transform, extracting vectors via a tree structure that exploits spatial self-similarity, and quantising these vectors using the 0MM-based block quantiser. Visual examination shows that block artifacts are considerably reduced by the wavelet pre-processing step. The multi-frame 0MM-based block quantiser is a modified scheme that exploits memory across successive frames or vectors. Its main advantages over the memoryless scheme in the application of LSF and ISP quantisation, are better rate-distortion and computational efficiency, through the exploitation of correlation across multiple frames and mean squared error selection criterion, respectively. The multi-frame 0MM-based block quantiser is also evaluated for the quantisation of MFCC feature vectors for distributed speech recognition and is shown to be superior to all quantisation schemes considered. A new product code vector quantiser, called the switched split vector quantiser (SSVQ), is proposed for speech LSF and ISP quantisation. SSVQ is a hybrid scheme, combining a switch vector quantiser with several split vector quantisers. It aims to overcome the losses of rate-distortion efficiency in split vector quantisers, by exploiting full vector dependencies before the vector splitting. It is shown that the SSVQ alleviates the losses in two of the three vector quantiser 'advantages'. The SSVQ also has a remarkably low computational complexity, though this is achieved at the cost of an increase in memory requirements.
Thesis (PhD Doctorate)
Doctor of Philosophy (PhD)
School of Microelectronic Engineering
Full Text

Стилі APA, Harvard, Vancouver, ISO та ін.

28

Raeesy, Zeynabalsadat. "Automatic analysis of magnetic resonance images of speech articulation." Thesis, University of Oxford, 2013. http://ora.ox.ac.uk/objects/uuid:ffa6d290-6920-4204-8d65-e4f2f09278c5.

Повний текст джерела

Анотація:

Magnetic resonance imaging (MRI) technology has facilitated capturing the dynamics of speech production at fine temporal and spatial resolutions, thus generating substantial quantities of images to be analysed. Manual processing of large MRI databases is labour intensive and time consuming. Hence, to study articulation on large scale, techniques for automatic feature extraction are needed. This thesis investigates approaches for automatic information extraction from an MRI database of dynamic articulation. We first study the articulation by observing the pixel intensity variations in image sequences. The correspondence between acoustic segments and images is established by forced alignment of speech signals recorded during the articulation. We obtain speaker-specific typical phoneme articulations that represent general articulatory configurations in running speech. Articulation dynamics are parametrised by measuring the magnitude of change in intensities over time. We demonstrate a direct correlation between the dynamics of articulation thus measured and the energy of the generated acoustic signals. For more sophisticated applications, a parametric description of vocal tract shape is desired. We investigate different shape extraction techniques and present a framework that can automatically identify and extract the vocal tract shapes. The framework incorporates shape prior information and intensity features in recognising and delineating the shape. The new framework is a promising new tool for automatic identification of vocal tract boundaries in large MRI databases, as demonstrated through extensive assessments. The segmentation framework proposed in this thesis is, to the best of our knowledge, novel in the field of speech production. The methods investigated in this thesis facilitate automatic information extraction from images, either for studying the dynamics of articulation or for vocal tract shape modelling. This thesis advances the state-of-the-art by bringing new perspectives to studying articulation, and introducing a segmentation framework that is automatic, does not require extensive initialisation, and reports a minimum number of failures.

Стилі APA, Harvard, Vancouver, ISO та ін.

29

Cahill, Louise Margaret. "Motor speech function following childhood traumatic brain injury /." [St. Lucia, Qld.], 2002. http://www.library.uq.edu.au/pdfserve.php?image=thesisabs/absthe16948.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

30

Bunker, Lisa Dawn. "Development of Tongan Materials for Determining Speech Recognition Thresholds." Diss., CLICK HERE for online access, 2008. http://contentdm.lib.byu.edu/ETD/image/etd2419.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

31

Ratclilff, Rose. "Psychometrically equivalent bisyllabic words for speech reception threshold testing in Arabic /." Diss., CLICK HERE for online access, 2006. http://contentdm.lib.byu.edu/ETD/image/etd1418.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

32

Forsingdal, Shareen Lisa. "Factors relating to outcomes in late talkers following an early language intervention programme /." [St. Lucia, Qld.], 2001. http://www.library.uq.edu.au/pdfserve.php?image=thesisabs/absthe16338.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

33

Prager, Richard William. "Parallel processing networks for automatic speech recognition." Thesis, University of Cambridge, 1987. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.238443.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

34

Baothman, Fatmah bint Abdul Rahman. "Phonology-based automatic speech recognition for Arabic." Thesis, University of Huddersfield, 2002. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.273720.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

35

Holmes, Wendy Jane. "Modelling segmental variability for automatic speech recognition." Thesis, University College London (University of London), 1997. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.267859.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

36

Keller, Laurel Anne. "Psychometrically Equivalent Trisyllabic Words for Speech Reception Threshold Testing in Spanish." Diss., CLICK HERE for online access, 2009. http://contentdm.lib.byu.edu/ETD/image/etd2776.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

37

Jennings, Lara-Jill. "Psychometrically Equivalent Digital Recordings for Speech Audiometry Testing in Mandarin Chinese: Standard Mandarin Dialect." Diss., CLICK HERE for online access, 2005. http://contentdm.lib.byu.edu/ETD/image/etd1124.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

38

Tsukanova, Anastasiia. "Articulatory speech synthesis." Electronic Thesis or Diss., Université de Lorraine, 2019. http://www.theses.fr/2019LORR0166.

Повний текст джерела

Анотація:

Cette thèse se situe dans le domaine de la synthèse articulatoire de la parole et est organisé en trois grandes parties : les deux premières sont consacrées au développement de deux synthétiseurs articulatoires de la parole ; la troisième traite des liens que l'on peut établir entre les deux approches utilisées. Le premier synthétiseur est issu d'une approche à base de règles. Celle-ci visait à obtenir le contrôle complet sur les articulateurs (mâchoire, langue, lèvres, vélum, larynx et épiglotte). Elle s'appuyait sur des données statiques du plan sagittal médian obtenues par IRM (Imagerie par Résonance Magnétique) correspondant à des articulations bloquées de voyelles du français, ainsi que des syllabes de type consonne-voyelle, et était composée de plusieurs étapes : l'encodage de l'ensemble des données grâce à un modèle du conduit vocal basé sur l'ACP (analyse en composantes principales) ; l'utilisation des configurations articulatoires obtenues comme sources de positions à atteindre et destinées à piloter le synthétiseur à base de règles qui est la contribution principale de cette première partie ; l'ajustement des conduits vocaux obtenus selon une perspective phonétique ; la simulation acoustique permettant d'obtenir un signal acoustique. Les résultats de cette synthèse ont été évalués de manière visuelle, acoustique et perceptuelle, et les problèmes rencontrés ont été identifiés et classés selon leurs origines, qui pouvaient être : les données, leur modélisation, l'algorithme contrôlant la forme du conduit vocal, la traduction de cette forme en fonctions d'aire, ou encore la simulation acoustique. Ces analyses nous permettent de conclure que, parmi les tests effectués, les stratégies articulatoires des voyelles et des occlusives sont les plus correctes, suivies par celles des nasales et des fricatives. La seconde approche a été développée en s'appuyant sur un synthétiseur de référence constitué d'un réseau de neurones feed-forward entraîné à l'aide de la méthode standard du système Merlin sur des données audio composées de parole en langue française enregistrée par IRM en temps réel. Ces données ont été segmentées phonétiquement et linguistiquement. Ces données audio, malgré un débruitage, étaient fortement parasitées par le son de la machine à IRM. Nous avons complété le synthétiseur de référence en ajoutant huit paramètres représentant de l'information articulatoire : l'ouverture des lèvres et leur protrusion, la distance entre la langue et le vélum, entre le vélum et la paroi pharyngale, et enfin entre la langue et la paroi pharyngale. Ces paramètres ont été extraits automatiquement à partir des images et alignés au signal et aux spécifications linguistiques. Les séquences articulatoires et les séquences de parole, générées conjointement, ont été évaluées à l'aide de différentes mesures : distance de déformation temporelle dynamique, la distortion mel-cepstrum moyenne, l'erreur de prédiction de l'apériodicité, et trois mesures pour F0 : RMSE (root mean square error), CORR (coéfficient de corrélation) and V/UV (frame-level voiced/unvoiced error). Une analyse de la pertinence des paramètres articulatoires par rapport aux labels phonétiques a également été réalisée. Elle permet de conclure que les paramètres articulatoires générés s'approchent de manière acceptable des paramètres originaux, et que l'ajout des paramètres articulatoires n'a pas dégradé le modèle acoustique original. Les deux approches présentées ci-dessus ont en commun l'utilisation de deux types de données IRM. Ce point commun a motivé la recherche, dans les données temps réel, des images clés, c'est-à-dire les configurations statiques IRM, utilisées pour modéliser la coarticulation. Afin de comparer les images IRM statiques avec les images dynamiques en temps réel, nous avons utilisé plusieurs mesures : [...]
The thesis is set in the domain of articulatory speech synthesis and consists of three major parts: the first two are dedicated to the development of two articulatory speech synthesizers and the third addresses how we can relate them to each other. The first approach results from a rule-based approach to articulatory speech synthesis that aimed to have a comprehensive control over the articulators (the jaw, the tongue, the lips, the velum, the larynx and the epiglottis). This approach used a dataset of static mid-sagittal magnetic resonance imaging (MRI) captures showing blocked articulation of French vowels and a set of consonant-vowel syllables; that dataset was encoded with a PCA-based vocal tract model. Then the system comprised several components: using the recorded articulatory configurations to drive a rule-based articulatory speech synthesizer as a source of target positions to attain (which is the main contribution of this first part); adjusting the obtained vocal tract shapes from the phonetic perspective; running an acoustic simulation unit to obtain the sound. The results of this synthesis were evaluated visually, acoustically and perceptually, and the problems encountered were broken down by their origin: the dataset, its modeling, the algorithm for managing the vocal tract shapes, their translation to the area functions, and the acoustic simulation. We concluded that, among our test examples, the articulatory strategies for vowels and stops are most correct, followed by those of nasals and fricatives. The second explored approach started off a baseline deep feed-forward neural network-based speech synthesizer trained with the standard recipe of Merlin on the audio recorded during real-time MRI (RT-MRI) acquisitions: denoised (and yet containing a considerable amount of noise of the MRI machine) speech in French and force-aligned state labels encoding phonetic and linguistic information. This synthesizer was augmented with eight parameters representing articulatory information---the lips opening and protrusion, the distance between the tongue and the velum, the velum and the pharyngeal wall and the tongue and the pharyngeal wall---that were automatically extracted from the captures and aligned with the audio signal and the linguistic specification. The jointly synthesized speech and articulatory sequences were evaluated objectively with dynamic time warping (DTW) distance, mean mel-cepstrum distortion (MCD), BAP (band aperiodicity prediction error), and three measures for F0: RMSE (root mean square error), CORR (correlation coefficient) and V/UV (frame-level voiced/unvoiced error). The consistency of articulatory parameters with the phonetic label was analyzed as well. I concluded that the generated articulatory parameter sequences matched the original ones acceptably closely, despite struggling more at attaining a contact between the articulators, and that the addition of articulatory parameters did not hinder the original acoustic model. The two approaches above are linked through the use of two different kinds of MRI speech data. This motivated a search for such coarticulation-aware targets as those that we had in the static case to be present or absent in the real-time data. To compare static and real-time MRI captures, the measures of structural similarity, Earth mover's distance, and SIFT were utilized; having analyzed these measures for validity and consistency, I qualitatively and quantitatively studied their temporal behavior, interpreted it and analyzed the identified similarities. I concluded that SIFT and structural similarity did capture some articulatory information and that their behavior, overall, validated the static MRI dataset. [...]

Стилі APA, Harvard, Vancouver, ISO та ін.

39

Mangum, Tanya Crawford. "Performance Intensity Functions for Digitally Recorded Japanese Speech Audiometry Materials." Diss., CLICK HERE for online access, 2005. http://contentdm.lib.byu.edu/ETD/image/etd838.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

40

RAMOS, Paulo Roberto. "A percepção dos riscos ambientais: Do discurso midiático a vida cotidiana de populações da cidade de Campina Grande - PB." Universidade Federal de Campina Grande, 2003. http://dspace.sti.ufcg.edu.br:8080/jspui/handle/riufcg/1871.

Повний текст джерела

Анотація:

Submitted by Johnny Rodrigues (johnnyrodrigues@ufcg.edu.br) on 2018-10-01T20:14:39Z No. of bitstreams: 1 PAULO ROBERTO RAMOS - DISSERTAÇÃO PPGCS 2003..pdf: 21868091 bytes, checksum: d17f83a2d24ad1cfd4eb56b9723ebdc7 (MD5)
Made available in DSpace on 2018-10-01T20:14:39Z (GMT). No. of bitstreams: 1 PAULO ROBERTO RAMOS - DISSERTAÇÃO PPGCS 2003..pdf: 21868091 bytes, checksum: d17f83a2d24ad1cfd4eb56b9723ebdc7 (MD5) Previous issue date: 2003-03
CNPq
Capes
A importância da mídia (mass media) na constituição do imaginário e das práticas sociais tem despertado a atenção de muitos pesquisadores para o estudo do chamado poder simbólico que engendra e, ainda, dos seus desdobramentos na vida cotidiana das populações. Em regra, a mídia tem tratado a problemática ambiental de maneira limitada, ou como preservacionismo de áreas naturais, ou promovendo o ecoturismo, ou assumindo um discurso tecnicista de um conhecimento especialista e distante da vida cotidiana das populações locais. O objetivo central deste trabalho é contribuir com a compreensão dos impactos do discurso midiático na percepção dos riscos dos problemas ambientais. Procuramos observar como tais fenômenos (mídia e degradação ambiental) se intercomunicam para a formação das práticas sociais e discursivas, no estabelecimento do status quo, na formação das hierarquias sociais e das estruturas de poder, que se manifestam numa sociedade com índices crescentes de degradação social e ambiental. Através da análise da programação da TV, principalmente a Rede Globo, e de outros meios de comunicação social de massa, bem como do acompanhamento da vida cotidiana das populações pesquisadas, com o auxilio das técnicas da Observação Participante e da Análise de Discurso, foi possível perceber como ocorre a influência do discurso midiático na formação de uma racionalidade e de uma lógica discursiva que acabam justificando e estimulando os desequilíbrios e desigualdades existentes. Observamos que prevalece na mídia uma lógica pautada nas hierarquias e diferenças sociais e nos processos de degradação ambiental, enquanto aspectos invariáveis e permanentes do desenvolvimento humano. Identificamos a existência de uma contradição formada entre os apelos ecológicos e a institucionalização dos problemas ambientais que se manifestam pulverizados e isolados na mídia, ao mesmo tempo em que ocorre a reprodução da racionalidade e das práticas de degradação ambiental por toda programação, e de diferentes formas na vida cotidiana das populações pesquisadas.
The importance of the media (mass media) in the constitution of the imaginary and of the social practices it has been waking up the many researchers' attention for the srudy of the call to can symbolic that engenders and, still, of your unfoldings in the daily life of the populations. In ruie, the media has agreement the environmental problem in a limited way, or as preservationism of natural áreas, or promoting the echo-tourism, or assuming a speech technicism ofa specialist and distant knovvledge ofthe daily life ofthe local populations. The objective headquarters o!this work is to contribute with the understanding of the impacts of the speech mediatic in the perception of the risks of the environmental problems. We tried to observe as such phenomena (media and environmental degradation) they intercommunicate for the formation of the you practice social and discursive, in the establishment of the status quo, in the formation of the social hierarchies and of the structures of power, that show in a society with growing indexes of social and environmental degradation. Through the analysis of the programming of the TV, mainly the Rede Globo, and of other means of social communication of mass, as well as of the accompaniment of the daily life of the researched populations, with I aid him of the techniques of the Participant Observation and ofthe Analysis of Speech, it was possible to notice how it happens the influence ofthe speech mediatic in the formation of a rationality and of a discursive logic that end up justifying and stimulating the unbalances and existent inequalities. We observed that prevails in the media a ruled logic in the hierarchies and social differences and in the processes of environmental degradation, while invariable and permanent aspects of the human development. We identified the existence of a contradiction formed between the ecological appeals and the institutions of the environmental problems that show powdered and isolated in the media, at the same time in that happens the reproduction of the rationality and of the practices of environmental degradation for ali programming, and in different ways in the daily life of the researched populations.

Стилі APA, Harvard, Vancouver, ISO та ін.

41

Chan, Dominic Sai Fan. "Speech production modelling based on glottal inverse filtering." Thesis, Imperial College London, 1994. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.307161.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

42

Kounoudes, Anastasis. "Epoch estmimation for closed-phase analysis of speech." Thesis, Imperial College London, 2001. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.248213.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

43

Scott, Simon David. "A data-driven approach to visual speech synthesis." Thesis, University of Bath, 1996. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.307116.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

44

Harte, Naomi Antonia. "Segmental phonetic features and models for speech recognition." Thesis, Queen's University Belfast, 1999. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.287466.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

45

Devaney, Jason Wayne. "A study of articulatory gestures for speech synthesis." Thesis, University of Liverpool, 1995. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.284254.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

46

Savvides, Vasos E. "Perceptual models in speech quality assessment and coding." Thesis, Loughborough University, 1988. https://dspace.lboro.ac.uk/2134/36273.

Повний текст джерела

Анотація:

The ever-increasing demand for good communications/toll quality speech has created a renewed interest into the perceptual impact of rate compression. Two general areas are investigated in this work, namely speech quality assessment and speech coding. In the field of speech quality assessment, a model is developed which simulates the processing stages of the peripheral auditory system. At the output of the model a "running" auditory spectrum is obtained. This represents the auditory (spectral) equivalent of any acoustic sound such as speech. Auditory spectra from coded speech segments serve as inputs to a second model. This model simulates the information centre in the brain which performs the speech quality assessment.

Стилі APA, Harvard, Vancouver, ISO та ін.

47

Kim, Misty Noelani. "Psychometrically Equivalent Trisyllabic Words for Speech Reception Threshold Testing in Cantonese." Diss., CLICK HERE for online access, 2007. http://contentdm.lib.byu.edu/ETD/image/etd1824.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

48

Gilbert, Kristen Nicole. "Psychometrically Equivalent Bisyllabic Words for Speech Reception Threshold Testing in Mongolian." Diss., CLICK HERE for online access, 2009. http://contentdm.lib.byu.edu/ETD/image/etd2877.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

49

McAuliffe, Megan Jane. "An investigation of articulation and speech rate in Parkinson's disease /." [St. Lucia, Qld.], 2003. http://www.library.uq.edu.au/pdfserve.php?image=thesisabs/absthe17543.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

50

Sanders, Marybeth Corey. "Inter- and intra-speaker variability : a palatometric study /." Diss., CLICK HERE for online access, 2007. http://contentdm.lib.byu.edu/ETD/image/etd2113.pdf.

Повний текст джерела

Стилі APA, Harvard, Vancouver, ISO та ін.

Дисертації з теми "Speech imagery"

Оформте джерело за APA, MLA, Chicago, Harvard та іншими стилями