Dissertations on the topic "FEATURE ENCODING"
Browse the top 24 dissertations for research on the topic "FEATURE ENCODING".
Gollnick, Clare Ann. "Probabilistic encoding and feature selectivity in the somatosensory pathway." Diss., Georgia Institute of Technology, 2014. http://hdl.handle.net/1853/54025.
Seger, Cedric. "An investigation of categorical variable encoding techniques in machine learning: binary versus one-hot and feature hashing." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-237426.
Machine learning methods can be used to solve important binary classification tasks in domains such as display advertising and recommender systems. In many of these domains, categorical variables are common and often of high cardinality. Using one-hot encoding under such circumstances leads to very high-dimensional vector representations, which causes memory and computational problems for machine learning models. This thesis investigated the usefulness of a binary encoding scheme in which categorical values were mapped to integer values that were then encoded in a binary format. This binary scheme made it possible to represent categorical values using log2(d)-dimensional vectors, where d is the dimension associated with a one-hot encoding. To evaluate the performance of the binary encoding, it was compared against one-hot and a hash-based encoding. A linear logistic regression and a neural network were trained on data from two publicly available datasets, Criteo and Census, and the final performance was compared. The results showed that one-hot encoding with a linear logistic regression model gave the best performance according to the PR-AUC metric. However, this method used 118- and 65,953-dimensional vector representations for Census and Criteo, respectively. Binary encoding led to lower performance overall but used only 35 and 316 dimensions, respectively. Binary encoding performed substantially worse for the Criteo data in particular; there, the hash-based encoding was a more attractive solution. The degradation in performance associated with binary and hash-based encoding could be mitigated by using a neural network.
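The contrast between one-hot and binary encoding described in the abstract above can be illustrated with a minimal sketch. It is not taken from the thesis: the function names (`fit_categories`, `one_hot`, `binary_encode`) and the toy data are hypothetical, and the code width is taken as the number of bits needed to write the largest category index, which corresponds to rounding log2(d) up for a single variable with d categories.

```python
def fit_categories(values):
    """Map each distinct category to an integer index, in order of first appearance."""
    index = {}
    for value in values:
        if value not in index:
            index[value] = len(index)
    return index


def one_hot(value, index):
    """Return a d-dimensional vector with a single 1 at the category's index."""
    vec = [0] * len(index)
    vec[index[value]] = 1
    return vec


def binary_encode(value, index):
    """Return the category's integer index written as a fixed-width list of bits."""
    # Bits needed for the largest index (d - 1); at least one bit.
    width = max(1, (len(index) - 1).bit_length())
    code = index[value]
    # Most significant bit first.
    return [(code >> bit) & 1 for bit in reversed(range(width))]


colors = ["red", "green", "blue", "green", "black", "white"]
index = fit_categories(colors)          # 5 distinct categories
print(one_hot("blue", index))           # [0, 0, 1, 0, 0]
print(binary_encode("blue", index))     # [0, 1, 0]
```

With five categories the one-hot vector has five dimensions while the binary code has three. The trade-off noted in the abstract still applies: categories that share bits in the binary code are no longer orthogonal, which may be one reason a linear model performs worse on it.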
Zhang, Cuicui. "Quad-Tree based Image Encoding Methods for Data-Adaptive Visual Feature Learning." 京都大学 (Kyoto University), 2015. http://hdl.handle.net/2433/199435.
Chambers, Anna. "Progressive Recovery of Cortical and Midbrain Sound Feature Encoding Following Profound Cochlear Neuropathy." Thesis, Harvard University, 2015. http://nrs.harvard.edu/urn-3:HUL.InstRepos:14226064.
Moallem, Theodore M. 1976. "Articulatory feature encoding and sensorimotor training for tactually supplemented speech reception by the hearing-impaired." Thesis, Massachusetts Institute of Technology, 2011. http://hdl.handle.net/1721.1/68454.
Cataloged from PDF version of thesis.
Includes bibliographical references (p. 150-159).
This thesis builds on previous efforts to develop tactile speech-reception aids for the hearing-impaired. Whereas conventional hearing aids mainly amplify acoustic signals, tactile speech aids convert acoustic information into a form perceptible via the sense of touch. By facilitating visual speechreading and providing sensory feedback for vocal control, tactile speech aids may substantially enhance speech communication abilities in the absence of useful hearing. Research for this thesis consisted of several lines of work. First, tactual detection and temporal order discrimination by congenitally deaf adults were examined, in order to assess the practicability of encoding acoustic speech information as temporal relationships among tactual stimuli. Temporal resolution among most congenitally deaf subjects was deemed adequate for reception of tactually-encoded speech cues. Tactual offset-order discrimination thresholds substantially exceeded those measured for onset-order, underscoring fundamental differences between stimulus masking dynamics in the somatosensory and auditory systems. Next, a tactual speech transduction scheme was designed with the aim of extending the amount of articulatory information conveyed by an earlier vocoder-type tactile speech display strategy. The novel transduction scheme derives relative amplitude cues from three frequency-filtered speech bands, preserving the cross-channel timing information required for consonant voicing discriminations, while retaining low-frequency modulations that distinguish voiced and aperiodic signal components. Additionally, a sensorimotor training approach ("directed babbling") was developed with the goal of facilitating tactile speech acquisition through frequent vocal imitation of visuo-tactile speech stimuli and attention to tactual feedback from one's own vocalizations. 
A final study evaluated the utility of the tactile speech display in resolving ambiguities among visually presented consonants, following either standard or enhanced sensorimotor training. Profoundly deaf and normal-hearing participants trained to exploit tactually-presented acoustic information in conjunction with visual speechreading to facilitate consonant identification in the absence of semantic context. Results indicate that the present transduction scheme can enhance reception of consonant manner and voicing information and facilitate identification of syllable-initial and syllable-final consonants. The sensorimotor training strategy proved selectively advantageous for subjects demonstrating more gradual tactual speech acquisition. Simple, low-cost tactile devices may prove suitable for widespread distribution in developing countries, where hearing aids and cochlear implants remain unaffordable for most severely and profoundly deaf individuals. They have the potential to enhance verbal communication with minimal need for clinical intervention.
by Theodore M. Moallem.
Ph.D.
Laczik, Tamás. "Encoding Temporal Healthcare Data for Machine Learning." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-299433.
This thesis contains a review of previous work on encoding sequential healthcare data and on predicting graft-versus-host disease, a medical condition, from patient history using machine learning. A new encoding of such data is proposed for machine learning purposes. The proposed encoding, called bag of binned weighted events, combines two strategies proposed in previous work, bag of binned events and bag of weighted events. An empirical experiment is designed to evaluate the performance of the proposed encoding over different binning windows compared with the earlier encodings, based on the AUC metric. The experiment is carried out on real healthcare data obtained from Swedish registries, using random forest and logistic regression. After filtering the data, resolving quality issues, and tuning the models' hyperparameters, final results are obtained. These results indicate that the proposed encoding strategy performs on par with, or slightly better than, bag of weighted events, and in most cases outperforms bag of binned events. Differences in the metric are small, however. It is also observed that the proposed encoding usually works better with longer binning windows, which may be attributed to noise in the data. Future work is proposed in the form of repeating the experiment with different datasets and models, and changing the length of the binning window for the baseline algorithms.
Dulas, Michael Robert. "The effect of explicitly directing attention toward item-feature relationships on source memory and aging: an erp study." Thesis, Georgia Institute of Technology, 2011. http://hdl.handle.net/1853/41187.
Pauzin, François Philippe [Verfasser], Patrick [Gutachter] Krieger, and Stephan [Gutachter] Herlitze. "A corticothalamic circuit for refining tactile encoding : a switch between feature detection and discrimination / François Philippe Pauzin ; Gutachter: Patrick Krieger, Stephan Herlitze ; International Graduate School of Neuroscience." Bochum : Ruhr-Universität Bochum, 2018. http://d-nb.info/117520496X/34.
Bisso, Paul W. (Paul William). "Leveraging features of nanoscale particulates for information encoding, quantitative sensing and drug delivery." Thesis, Massachusetts Institute of Technology, 2017. http://hdl.handle.net/1721.1/115691.
Cataloged from PDF version of thesis. "February 2017." Handwritten on title page "February 2018."
Includes bibliographical references.
It is both uncontroversial and unassailable to assert that small things can often go where big things cannot. It is similarly prosaic to note that at smaller length scales, matter behaves differently than at larger length scales. This thesis exploits these intuitive and simple axioms to yield advances in three independent lines of enquiry: (i) robust and practically accessible encoding of information within microparticles, (ii) rapid, quantitative sensing of hydrophobic colloids and (iii) immunologically-focused drug delivery. Specifically, upconversion nanocrystals are used as the foundation of a novel spatial-spectral patterning motif to produce polymer microparticles with unique, decodable identities. With large single-particle encoding capacities (>10⁶), an ultralow decoding false alarm rate (<10⁻⁹), and pronounced insensitivity to both particle chemistry and harsh processing conditions, this architecture enables practical deployment of encoded microparticles in applications with orthogonal requirements, including multiplexed bioassays and covert labeling of objects and packaging for anti-counterfeiting. Next, the large specific surface area of nanoscale objects is exploited by a family of zwitterionic, surfactant-like molecular rotors to develop a broadly applicable tool for sensitive, quantitative, and accessible nanoscale metrology. This tool is shown to address multiple challenges in nanometrology of self-assembled structures, including (i) quantification of surfactant adsorption isotherms on metal oxide surfaces, (ii) determination of self-assembly onset concentration, and (iii) high-throughput readout of drug delivery nanoparticle mass concentration. Finally, the combination of small size and large interfacial area was exploited to design nanoscale formulations for (i) ex vivo delivery to human neutrophils, a significant element of the innate immune system and (ii) targeted delivery of therapeutics to the asthmatic lung.
by Paul W. Bisso.
Ph. D.
Kundu, Benjamin Ina Annesha. "Imaging platforms for detecting and analyzing skin features and Its stability : with applications in skin health and in using the skin as a body-relative position-encoding system." Thesis, Massachusetts Institute of Technology, 2015. http://hdl.handle.net/1721.1/100114.
Cataloged from PDF version of thesis.
Includes bibliographical references (pages 121-124).
Skin imaging is a powerful, noninvasive method with the potential to aid in the diagnosis of various dermatological diseases and to assess overall skin health. This thesis discusses imaging platforms that were developed to aid in studying skin features and characteristics at different time and length scales to characterize and monitor skin. Two applications are considered: (1) using natural skin features as a position encoding system and an aid for volume reconstruction of ultrasound imaging and (2) studying natural skin feature evolution or stability over time to aid in assessing skin health. A 5-axis, rigid translational scanning system was developed to capture images at specific locations and to validate skin-based body registration algorithms. We show that natural skin features could be used to perform ultrasound-based reconstruction accurate to 0.06 mm. A portable, handheld scanning device was designed to study skin characteristics at different time and length scales. With this imaging platform, we analyze skin features at different length scales: μm (for microreliefs), mm (for moles and pores), and cm (for distances between microreliefs and other features). Preliminary algorithms are used to automatically identify microreliefs. Further work in image processing is required to assess skin variation using these images.
by Ina Annesha Kundu.
S.M.
Kempf, Alexandre. "Nonlinear encoding of sounds in the auditory cortex Temporal asymmetries in auditory coding and perception reflect multi-layered nonlinearities Cortical recruitment determines learning dynamics and strategy Interactions between nonlinear features in the mouse auditory cortex Context-dependent signaling of coincident auditory and visual events in primary visual cortex." Thesis, Sorbonne Paris Cité, 2018. http://www.theses.fr/2018USPCB085.
Perceptual objects are the elementary units used by the brain to construct an inner world representation of the environment from multiple physical sources, like light or sound waves. While the physical signals are first encoded by receptors in peripheral organs into neuroelectric signals, the emergence of perceptual objects requires extensive processing in the central nervous system which is not yet fully characterized. Interestingly, recent advances in deep learning show that implementing series of nonlinear and linear operations is a very efficient way to create models that categorize visual and auditory perceptual objects similarly to humans. In contrast, most of the current knowledge about the auditory system concentrates on linear transformations. In order to establish a clear example of the contribution of auditory system nonlinearities to perception, we studied the encoding of sounds with an increasing intensity (up ramps) and a decreasing intensity (down ramps) in the mouse auditory cortex. Two behavioral tasks showed evidence that these two sounds are perceived with unequal salience despite carrying the same physical energy and spectral content, a phenomenon incompatible with linear processing. Recording the activity of large cortical populations for up- and down-ramping sounds, we found that cortex encodes them into distinct sets of nonlinear features, and that asymmetric feature selection explained the perceptual asymmetry. To complement these results, we also showed that, in reinforcement learning models, the amount of neural activity triggered by a stimulus (e.g. a sound) impacts learning speed and strategy. Interestingly, very similar effects were observed in sound discrimination behavior and could be explained by the amount of cortical activity triggered by the discriminated sounds. This altogether establishes that auditory system nonlinearities have an impact on perception and behavior.
To more extensively identify the nonlinearities that influence sound encoding, we then recorded the activity of around 60,000 neurons sampling the entire horizontal extent of auditory cortex. Beyond the fine-scale tonotopic organization uncovered with this dataset, we identified and quantified 7 nonlinearities. Interestingly, we found that different nonlinearities can interact with each other in a non-trivial manner. Knowledge of these interactions holds promise for refining auditory processing models. Finally, we wondered if the nonlinear processes are also important for multisensory integration. We measured how visual inputs and sounds combine in the visual and auditory cortex using calcium imaging in mice. We found no modulation of supragranular auditory cortex in response to visual stimuli, as observed in other previous studies. We observed that auditory cortex inputs to visual cortex affect visual responses concomitant to a sound. Interestingly, we found that auditory cortex projections to visual cortex preferentially channel activity from neurons encoding a particular nonlinear feature: the loud onset of sudden sounds. As a result, visual cortex activity for an image combined with a loud sound is higher than for the image alone or combined with a quiet sound. Moreover, this boosting effect is highly nonlinear. This result suggests that loud sound onsets are behaviorally relevant in the visual system, possibly to indicate the presence of new perceptual objects in the visual field, which could represent potential threats. In conclusion, our results show that nonlinearities are ubiquitous in sound processing by the brain and also play a role in the integration of auditory information with visual information. In addition, it is crucial to account for these nonlinearities not only to understand how perceptual representations are formed but also to predict how these representations impact behavior.
Лавриненко, Олександр Юрійович, Александр Юрьевич Лавриненко, and Oleksandr Lavrynenko. "Методи підвищення ефективності семантичного кодування мовних сигналів" [Methods for increasing the efficiency of semantic coding of speech signals]. Thesis, Національний авіаційний університет (National Aviation University), 2021. https://er.nau.edu.ua/handle/NAU/52212.
The thesis addresses a pressing scientific and practical problem in telecommunication systems: increasing the throughput of a semantic speech data transmission channel through efficient coding. The question of semantic coding efficiency is formulated as follows: at what minimum rate can the semantic features of speech signals be encoded for a given probability of their error-free recognition? This research answers that question, which is an urgent scientific and technical task given the growing trend toward remote interaction between people and robotic technology through speech, where the accuracy of such systems depends directly on the effectiveness of semantic coding of speech signals. The thesis investigates the well-known method of semantic coding of speech signals based on mel-frequency cepstral coefficients, which consists in finding the average values of the coefficients of the discrete cosine transform of the logarithm of the energy of the discrete Fourier transform spectrum processed by triangular filters on the mel scale. The problem is that this method does not meet the condition of adaptability. The main scientific hypothesis of the study was therefore formulated: the efficiency of semantic coding of speech signals can be increased by using the adaptive empirical wavelet transform followed by Hilbert spectral analysis. Coding efficiency here means a decrease in the information transmission rate for a given probability of error-free recognition of the semantic features of speech signals, which significantly reduces the required passband and thereby increases the throughput of the communication channel.
In the course of proving the formulated hypothesis, the following results were obtained: 1) for the first time, a method of semantic coding of speech signals based on the empirical wavelet transform was developed, which differs from existing methods by constructing a set of adaptive bandpass Meyer wavelet filters followed by Hilbert spectral analysis to find the instantaneous amplitudes and frequencies of the intrinsic empirical mode functions, which determine the semantic features of speech signals and increase the efficiency of their coding; 2) for the first time, the adaptive empirical wavelet transform was proposed for problems of multiscale analysis and semantic coding of speech signals, which increases the efficiency of spectral analysis by decomposing high-frequency speech oscillations into their low-frequency components, namely the intrinsic empirical modes; 3) the method of semantic coding of speech signals based on mel-frequency cepstral coefficients was developed further by applying the basic principles of adaptive spectral analysis with the empirical wavelet transform, which increases the efficiency of this method. Experimental research in the MATLAB R2020b software environment showed that the developed method of semantic coding of speech signals based on the empirical wavelet transform reduces the encoding rate from 320 to 192 bit/s and the required passband from 40 to 24 Hz with a probability of error-free recognition of about 0.96 (96%) and a signal-to-noise ratio of 48 dB, so that its efficiency increases by a factor of 1.6 compared with the existing method.
The results obtained in the thesis can be used to build systems for remote interaction of people and robotic equipment using speech technologies, such as speech recognition and synthesis, voice control of technical objects, low-speed encoding of speech information, voice translation from foreign languages, etc.
"Feature topography and sound intensity level encoding in primary auditory cortex." WASHINGTON UNIVERSITY IN ST. LOUIS, 2010. http://pqdtopen.proquest.com/#viewpdf?dispub=3387456.
KAO, HUI-TZU, and 高惠慈. "SURF Feature Encoding for Quick Indexing of Finger Vein Recognition System." Thesis, 2015. http://ndltd.ncl.edu.tw/handle/64072663018729452442.
國立臺灣科技大學
資訊工程系
103
With the development of technology, information security has become an important issue. Biometric recognition, which uses the human face, iris, voice, or fingerprint, has attracted considerable research interest in recent years, and vein recognition has become a mainstream biometric approach. This thesis builds a recognition system based on finger veins. Finger vein recognition devices are small and portable. The issue addressed here is immediate recognition against a large database. Although many recognition algorithms achieve a high recognition rate, they usually take too much time. Our proposed method encodes a new feature using SURF information. It reduces execution time, decreases the dimensionality of the feature vector, and preserves the features of the vein. We compute two distances to match two images. First, the second- and third-level features are compared by Hamming distance, which quickly filters out false candidates. Second, the SURF features are compared by Euclidean distance to decide whether each pattern vector matches. This method retains the original characteristics and achieves efficient recognition in large databases.
Botly, Leigh Cortland Perry. "Cholinergic influences on the encoding, but not retrieval, of crossmodal sensory feature binding in rats." 2005. http://link.library.utoronto.ca/eir/EIRdetail.cfm?Resources__ID=370475&T=F.
Huynh, D. L., Srimant P. Tripathy, H. E. Bedell, and Haluk Ogmen. "Stream specificity and asymmetries in feature binding and content-addressable access in visual encoding and memory." 2015. http://hdl.handle.net/10454/10479.
Human memory is content addressable, i.e., contents of the memory can be accessed using partial information about the bound features of a stored item. In this study, we used a cross-feature cuing technique to examine how the human visual system encodes, binds, and retains information about multiple stimulus features within a set of moving objects. We sought to characterize the roles of three different features (position, color, and direction of motion, the latter two of which are processed preferentially within the ventral and dorsal visual streams, respectively) in the construction and maintenance of object representations. We investigated the extent to which these features are bound together across the following processing stages: during stimulus encoding, sensory (iconic) memory, and visual short-term memory. Whereas all features examined here can serve as cues for addressing content, their effectiveness shows asymmetries and varies according to cue–report pairings and the stage of information processing and storage. Position-based indexing theories predict that position should be more effective as a cue compared to other features. While we found a privileged role for position as a cue at the stimulus-encoding stage, position was not the privileged cue at the sensory and visual short-term memory stages. Instead, the pattern that emerged from our findings is one that mirrors the parallel processing streams in the visual system. This stream-specific binding and cuing effectiveness manifests itself in all three stages of information processing examined here. Finally, we find that the Leaky Flask model proposed in our previous study is applicable to all three features.
Liu, Yu-Nan, and 劉猷楠. "Prediction of protein quaternary structural attributes through hybrid feature encoding method by using machine learning approach." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/6b665e.
國立中興大學
生物科技學研究所
106
Predicting the attributes of protein quaternary structure is an essential task in computational biology for the advancement of proteomics. However, existing methods did not consider the integration of heterogeneous encodings or the accuracy for subunit categories with few data. To this end, we propose QUATgo, a predictive tool that can also predict protein oligomers with more than 12 subunits. Three kinds of sequence encoding are used: dipeptide composition, used here for the first time to predict protein quaternary structural attributes; protein half-life characteristics; and a modified version of the previously proposed Functional Domain Composition encoding, which solves the problem of large feature vectors. QUATgo addresses the problem of insufficient data for a single subunit category with a two-stage architecture and uses 10-fold cross-validation to test the predictive accuracy of the classifiers. The first-stage prediction model uses a random forest algorithm to distinguish sixteen categories covering homo-oligomers, hetero-oligomers, and monomers. The accuracy of the first-stage classifier is 63.4%. However, the training data for the hetero-10mer are insufficient, so the training data for the hetero-10mer and the hetero-oligomer with more than 12 subunits are treated as a single category X. If the first-stage classifier outputs class X, the sequence is sent to a second-stage classifier built with support vector machines, which separates the hetero-10mer from the hetero-oligomer with more than 12 subunits with an accuracy of 97.5%. Overall, QUATgo achieves 61.4% cross-validation accuracy and 63.4% independent test accuracy. In a case study, QUATgo accurately predicts the variable complex structure of the MERS-CoV ectodomains.
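Dipeptide composition, one of the three encodings named in the abstract above, is commonly defined as the relative frequency of each ordered pair of adjacent amino acids, giving a 400-dimensional vector. The sketch below follows that common definition; it is not the thesis's implementation, the function name `dipeptide_composition` is hypothetical, and skipping pairs that contain non-standard residues is an assumption made here for robustness.

```python
from itertools import product

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered pairs, in a fixed order that defines the feature layout.
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def dipeptide_composition(sequence):
    """Return the 400-dimensional vector of relative dipeptide frequencies."""
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    for i in range(len(sequence) - 1):
        pair = sequence[i:i + 2]
        # Assumption: pairs containing non-standard residues are skipped.
        if pair in counts:
            counts[pair] += 1
            total += 1
    if total == 0:
        return [0.0] * len(DIPEPTIDES)
    return [counts[d] / total for d in DIPEPTIDES]


features = dipeptide_composition("ACAC")   # adjacent pairs: AC, CA, AC
print(len(features))                        # 400
print(features[DIPEPTIDES.index("AC")])     # 2/3 of the pairs are "AC"
```

Because the vector length is fixed at 400 regardless of sequence length, sequences of different sizes can be fed to the same classifier, at the cost of discarding all positional information beyond adjacent pairs.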
JAIN, AKSHAT. "IRIS RECOGNITION SYSTEM." Thesis, 2014. http://dspace.dtu.ac.in:8080/jspui/handle/repository/15609.
Qiu, Jun-Wei, and 邱俊瑋. "Encoding of Speech Feature Using Principal Components Analysis and Singular Value Decomposition in Distributed Speech Recognition Systems." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/b9u3t8.
國立臺北科技大學
電機工程系
106
In this thesis, we apply principal components analysis (PCA) to two well-known speech features simultaneously at the front-end of distributed speech recognition (DSR) systems to discard insignificant feature dimensions. One feature is the Mel-frequency cepstral coefficients (MFCC) after cepstral mean and variance normalization (CMVN) preprocessing; the other is the line spectral frequencies (LSF). After removing the less important features, we apply singular value decomposition (SVD) to encode the significant speech features and further reduce the transmission bandwidth. At the back-end, we apply the histogram equalization (HEQ) method to the decoded speech features and their first and second delta counterparts. In the experiments, we use the Aurora-2 database and a basic front-end of the European Telecommunications Standards Institute (ETSI) distributed speech recognition system for evaluation. The experimental results show that the proposed SVD method (9 frames/group) improves word accuracy by 14.8% in the clean condition and 2.65% in the multi condition, compared with the ETSI baseline system. In terms of transmission bitrate, the proposed SVD method (9 frames/group) achieves an average reduction of 12.4 bits/frame (approximately 31.79%) relative to the full frame rate (FFR) ETSI baseline system.
Li, Yi-Hsun, and 李宜勳. "A Fast Mode Decision Based on Adjacent Feature and Rate-Distortion Cost Analysis for the Encoding of HEVC Inter-Prediction." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/tmsty7.
國立臺北科技大學
電子工程系
106
High Efficiency Video Coding (HEVC) is a novel video coding standard. It inherits and improves on the video coding architecture of the previous standard, H.264/AVC, and achieves better compression efficiency at the same video quality. HEVC applies many architectures and technologies, such as a recursive structure built from the Coding Unit (CU), Prediction Unit (PU), and Transformation Unit (TU), as well as block merging. It adopts technologies such as Sample Adaptive Offset (SAO) and supports higher-resolution video. Although performance is improved, each unit in both intra-frame and inter-frame prediction must undergo a large number of Rate-Distortion Cost (RDC) computations to find the best encoding mode. The complexity of prediction is therefore rather high, and the time cost is also higher than in H.264/AVC. To reduce the computational complexity required for inter-frame prediction, we propose a fast mode decision algorithm for inter-picture prediction. By analyzing the prediction modes of neighboring blocks, the complexity of the current block is predicted, and the less likely prediction modes are pruned. Experimental results show that the proposed algorithm achieves a very good trade-off between image quality and coding time.
Machireddy, Amrutha. "Learning Non-linear Mappings from Data with Applications to Priority-based Clustering, Prediction, and Detection." Thesis, 2021. https://etd.iisc.ac.in/handle/2005/5670.
Lu, Li-wei, and 盧立偉. "Genetic-based optimal encoding for image clustering with texture-based features." Thesis, 2013. http://ndltd.ncl.edu.tw/handle/76177398596588526204.
國立高雄大學
電機工程學系碩士班
101
For image clustering, homogeneous and meaningful image pixels with specific features are clustered. The homogeneity of various features is usually calculated by the Euclidean distances among features. For features that have continuous variations, such as color, luminance, saturation, gradient of intensity, distance-based clustering can give effective results. When textures are used as features for clustering, an encoding scheme that describes the variations of textures in terms of distances can produce effective clustering results. This study proposes a genetic-based encoding method to deal with the abovementioned problem where the local binary pattern (LBP) is employed as the texture for clustering. The genetic algorithm (GA) is used to implement the optimal encoding scheme of LBP-based textures. In the encoding scheme, similar LBP-textures are required to have shorter distances, and vice versa. The GA process is separated into two stages. The first stage arranges the locations for all LBP patterns so that they can have continuous variations. The second stage assigns each LBP pattern a unique integer in a manner that similar (dissimilar) patterns have short (long) distances in the same Euclidean scale. A fitness function describing these requirements is defined. In this study, fuzzy c-means is used as the clustering method. Various encoding methods are compared with the proposed method. From the experimental results, the genetic-based encoding method finds a feasible set of encodes for LBP-based textures and improves the quality of image clustering. Some images are tested and the results are analyzed.
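The motivation for re-encoding LBP codes before distance-based clustering can be seen in a small sketch. The code below computes the basic 8-neighbour LBP code of a 3x3 patch and compares the integer distance between two codes with the number of bits in which they differ. The helper names (`lbp_code`, `hamming`), the neighbour ordering, and the use of bit differences as a stand-in for pattern similarity are assumptions for illustration; the thesis's genetic algorithm and fitness function are not reproduced here.

```python
def lbp_code(patch):
    """Return the 8-bit local binary pattern code of a 3x3 grey-level patch."""
    center = patch[1][1]
    # Neighbours visited clockwise from the top-left corner (an assumed convention).
    offsets = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]
    code = 0
    for bit, (row, col) in enumerate(offsets):
        # A neighbour at least as bright as the centre sets its bit.
        if patch[row][col] >= center:
            code |= 1 << bit
    return code


def hamming(a, b):
    """Number of bit positions in which two LBP codes differ."""
    return bin(a ^ b).count("1")


patch = [[5, 9, 1],
         [4, 6, 7],
         [2, 3, 8]]
print(lbp_code(patch))      # 26: bits 1, 3 and 4 are set

# Two patterns that differ in a single neighbour can be far apart as integers.
print(hamming(0, 128))      # 1
print(abs(0 - 128))         # 128
```

Raw LBP integers therefore give distances that do not track pattern similarity, which is the kind of mismatch an optimized assignment of integers to patterns, such as the one searched for by the genetic algorithm in the abstract, is meant to reduce.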
Ambosta, Althea Hyacinth. "Reorienting in virtual environments: examining the influence of the number of discrete features on the encoding of geometry by humans." 2013. http://hdl.handle.net/1993/22068.
Sharma, Neeraj Kumar. "Information-rich Sampling of Time-varying Signals." Thesis, 2018. https://etd.iisc.ac.in/handle/2005/4126.