Dissertations / Theses on the topic 'Reconnaissance de textes manuscrits'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Reconnaissance de textes manuscrits.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Nosary, Ali. "Reconnaissance automatique de textes manuscrits par adaptation au scripteur." Rouen, 2002. http://www.theses.fr/2002ROUES007.
Full textThis thesis deals with the problem of off-line handwritten text recognition. It describes a system of text recognition which exploits an original principle of adaptation to the handwriting to be recognized. The adaptation principle, inspired by contextual effects observed from a human reader, is based on the automatic learning, during the recognition, of the graphical characteristics of the handwriting (writer invariants). The word recognition proceeds according to an analytical approach based on a segmentation-recognition principle. The on-line adaptation of the recognition system relies on the iteration of two steps : a word recognition step which allows to label the writer's representations (allographes) on the whole text and a revaluation step of character models. The implementation of our adaptation strategy requires an interactive recognition scheme able to make interact treatments at various contextual levels. The interaction model retained is based on the multi-agent paradigm
Vincent, Nicole. "Contribution à la reconnaissance de textes multipolices." lyon, INSA, 1988. http://www.theses.fr/1988ISAL0011.
Full textPaquet, Thierry. "Segmentation et classification de mots en reconnaissance optique de textes manuscrits." Rouen, 1992. http://www.theses.fr/1992ROUES007.
Full textOudot, Loïc. "Fusion d'informations et adaptation pour la reconnaissance de textes manuscrits dynamiques." Paris 6, 2003. http://www.theses.fr/2003PA066469.
Full textBarrère, Killian. "Architectures de Transformer légères pour la reconnaissance de textes manuscrits anciens." Electronic Thesis or Diss., Rennes, INSA, 2023. http://www.theses.fr/2023ISAR0017.
Full textTransformer architectures deliver low error rates but are challenging to train due to limited annotated data in handwritten text recognition. We propose lightweight Transformer architectures to adapt to the limited amounts of annotated handwritten text available. We introduce a fast Transformer architecture with an encoder, processing up to 60 pages per second. We also present architectures using a Transformer decoder to incorporate language modeling into character recognition. To effectively train our architectures, we offer algorithms for generating synthetic data adapted to the visual style of modern and historical documents. Finally, we propose strategies for learning with limited data and reducing prediction errors. Our architectures, combined with synthetic data and these strategies, achieve competitive error rates on lines of text from modern documents. For historical documents, they train effectively with minimal annotated data, surpassing state-ofthe- art approaches. Remarkably, just 500 annotated lines are sufficient for character error rates close to 5%
Quiniou, Solen. "Intégration de connaissances linguistiques pour la reconnaissance de textes manuscrits en-ligne." Phd thesis, Rennes, INSA, 2007. ftp://ftp.irisa.fr/techreports/theses/2007/quiniou.pdf.
Full textThe aim of this thesis is to build a sentence regognition system based on an existing word regognition system. Two research axes are considered: the sentence segmentation int words as well as the integration of linguistic knowledge to take into account the context of the sentences. We studied several types of statistic language models by comparing their respective impact on the recognition system performances. We also tried to find the best strategy to introduce them efficiently into the whole recognition system. One of the originality of this study is the integration of a representation of the different sentence hypotheses in the form of a confusion network; which is then used to detect and correct the remaining regognition errors. Using the aforementioned techniques allowed us to considerably reduce the number of recognition errors among the words of the sentences
Quiniou, Solen. "Intégration de connaissances linguistiques pour la reconnaissance de textes manuscrits en-ligne." Phd thesis, INSA de Rennes, 2007. http://tel.archives-ouvertes.fr/tel-00580623.
Full textLeroux, Manuel. "Reconnaissance de textes manuscrits à vocabulaire limité avec application à la lecture automatique des chèques." Rouen, 1991. http://www.theses.fr/1991ROUES045.
Full textMorillot, Olivier. "Reconnaissance de textes manuscrits par modèles de Markov cachés et réseaux de neurones récurrents : application à l'écriture latine et arabe." Electronic Thesis or Diss., Paris, ENST, 2014. http://www.theses.fr/2014ENST0002.
Full textHandwriting recognition is an essential component of document analysis. One of the popular trends is to go from isolated word to word sequence recognition. Our work aims to propose a text-line recognition system without explicit word segmentation. In order to build an efficient model, we intervene at different levels of the recognition system. First of all, we introduce two new preprocessing techniques : a cleaning and a local baseline correction for text-lines. Then, a language model is built and optimized for handwritten mails. Afterwards, we propose two state-of-the-art recognition systems based on contextual HMMs (Hidden Markov Models) and recurrent neural networks BLSTM (Bi-directional Long Short-Term Memory). We optimize our systems in order to give a comparison of those two approaches. Our systems are evaluated on arabic and latin cursive handwritings and have been submitted to two international handwriting recognition competitions. At last, we introduce a strategy for some out-of-vocabulary character strings recognition, as a prospect of future work
Bilane, P. "Contributions a l'indexation et a la reconnaissance des manuscrits Syriaques." Phd thesis, INSA de Lyon, 2010. http://tel.archives-ouvertes.fr/tel-00499537.
Full textDelaye, Adrien. "Méta-modèles de positionnement spatial pour la reconnaissance de tracés manuscrits." Phd thesis, INSA de Rennes, 2011. http://tel.archives-ouvertes.fr/tel-00651446.
Full textThomas, S. "Extraction d'information dans des documents manuscrits non contraints : application au traitement automatique des courriers entrants manuscrits." Phd thesis, Université de Rouen, 2012. http://tel.archives-ouvertes.fr/tel-00863502.
Full textMontreuil, Florent. "Extraction de structures de documents par champs aléatoires conditionnels : application aux traitements des courriers manuscrits." Phd thesis, Rouen, 2011. http://www.theses.fr/2011ROUES047.
Full textThe automatic processing of written documents is a very active field in the industry. Indeed, due to the mass of written documents to process, the automatic analysis becomes a necessity, but the performance of current systems is highly variable according to the types of documents processed. For example, treatment of unconstrained handwritten documents remains an unsolved issue because two technological obstacles that hinder the development of reliable automatic processing of handwritten documents : - the first is the recognition of handwritten in those documents - the second is related to the existence of widely variability in the document structures. This thesis focuses on solving the second bolt in the case of unconstrained handwritten documents. For this, we have developed reliable and robust methods to analyze document structures based on the use of Conditional Random Fields. The choice of Conditional Random Fields is motivated by the ability of these graphical models to take into account the relationships between the various entities of the document (words, phrases, blocks,. . . ) and integrate contextual knowledge. In addition, the use of probabilistic modeling gifted learning overcomes the inherent variability of the documents to be processed. The originality of the thesis also addresses the proposal of a hierarchical approach for extracting joint physical (segmentation of the document into blocks, lines, ldots) and logical (functional interpretation of the physical structure) structures by combining low-level physical features (position, graphic,. . . ) and high-level logical (keyword spotting). The experiments carried out on handwritten letters show that the proposed model represents an interesting solution because of its discriminatory character and his natural ability to integrate and contextualize the characteristics of different kinds
Montreuil, Florent. "Extraction de structures de documents par champs aléatoires conditionnels : application aux traitements des courriers manuscrits." Phd thesis, Université de Rouen, 2011. http://tel.archives-ouvertes.fr/tel-00652301.
Full textGhanmi, Nabil. "Segmentation d'images de documents manuscrits composites : application aux documents de chimie." Electronic Thesis or Diss., Université de Lorraine, 2016. http://www.theses.fr/2016LORR0109.
Full textThis thesis deals with chemistry document segmentation and structure analysis. This work aims to help chemists by providing the information on the experiments which have already been carried out. The documents are handwritten, heterogeneous and multi-writers. Although their physical structure is relatively simple, since it consists of a succession of three regions representing: the chemical formula of the experiment, a table of the used products and one or more text blocks describing the experimental procedure, several difficulties are encountered. In fact, the lines located at the region boundaries and the imperfections of the table layout make the separation task a real challenge. The proposed methodology takes into account these difficulties by performing segmentation at several levels and treating the region separation as a classification problem. First, the document image is segmented into linear structures using an appropriate horizontal smoothing. The horizontal threshold combined with a vertical overlapping tolerance favor the consolidation of fragmented elements of the formula without too merge the text. These linear structures are classified in text or graphic based on discriminant structural features. Then, the segmentation is continued on text lines to separate the rows of the table from the lines of the raw text locks. We proposed for this classification, a CRF model for determining the optimal labelling of the line sequence. The choice of this kind of model has been motivated by its ability to absorb the variability of lines and to exploit contextual information. For the segmentation of table into cells, we proposed a hybrid method that includes two levels of analysis: structural and syntactic. The first relies on the presence of graphic lines and the alignment of both text and spaces. The second tends to exploit the coherence of the cell content syntax. We proposed, in this context, a Recognition-based approach using contextual knowledge to detect the numeric fields present in the table. The thesis was carried out in the framework of CIFRE, in collaboration with the eNovalys campany.We have implemented and tested all the steps of the proposed system on a consequent dataset of chemistry documents
Ghanmi, Nabil. "Segmentation d'images de documents manuscrits composites : application aux documents de chimie." Thesis, Université de Lorraine, 2016. http://www.theses.fr/2016LORR0109/document.
Full textThis thesis deals with chemistry document segmentation and structure analysis. This work aims to help chemists by providing the information on the experiments which have already been carried out. The documents are handwritten, heterogeneous and multi-writers. Although their physical structure is relatively simple, since it consists of a succession of three regions representing: the chemical formula of the experiment, a table of the used products and one or more text blocks describing the experimental procedure, several difficulties are encountered. In fact, the lines located at the region boundaries and the imperfections of the table layout make the separation task a real challenge. The proposed methodology takes into account these difficulties by performing segmentation at several levels and treating the region separation as a classification problem. First, the document image is segmented into linear structures using an appropriate horizontal smoothing. The horizontal threshold combined with a vertical overlapping tolerance favor the consolidation of fragmented elements of the formula without too merge the text. These linear structures are classified in text or graphic based on discriminant structural features. Then, the segmentation is continued on text lines to separate the rows of the table from the lines of the raw text locks. We proposed for this classification, a CRF model for determining the optimal labelling of the line sequence. The choice of this kind of model has been motivated by its ability to absorb the variability of lines and to exploit contextual information. For the segmentation of table into cells, we proposed a hybrid method that includes two levels of analysis: structural and syntactic. The first relies on the presence of graphic lines and the alignment of both text and spaces. The second tends to exploit the coherence of the cell content syntax. We proposed, in this context, a Recognition-based approach using contextual knowledge to detect the numeric fields present in the table. The thesis was carried out in the framework of CIFRE, in collaboration with the eNovalys campany.We have implemented and tested all the steps of the proposed system on a consequent dataset of chemistry documents
Kesiman, Made Windu Antara. "Document image analysis of Balinese palm leaf manuscripts." Thesis, La Rochelle, 2018. http://www.theses.fr/2018LAROS013/document.
Full textThe collection of palm leaf manuscripts is an important part of Southeast Asian people’s culture and life. Following the increasing of the digitization projects of heritage documents around the world, the collection of palm leaf manuscripts in Southeast Asia finally attracted the attention of researchers in document image analysis (DIA). The research work conducted for this dissertation focused on the heritage documents of the collection of palm leaf manuscripts from Indonesia, especially the palm leaf manuscripts from Bali. This dissertation took part in exploring DIA researches for palm leaf manuscripts collection. This collection offers new challenges for DIA researches because it uses palm leaf as writing media and also with a language and script that have never been analyzed before. Motivated by the contextual situations and real conditions of the palm leaf manuscript collections in Bali, this research tried to bring added value to digitized palm leaf manuscripts by developing tools to analyze, to transliterate and to index the content of palm leaf manuscripts. These systems aim at making palm leaf manuscripts more accessible, readable and understandable to a wider audience and, to scholars and students all over the world. This research developed a DIA system for document images of palm leaf manuscripts, that includes several image processing tasks, beginning with digitization of the document, ground truth construction, binarization, text line and glyph segmentation, ending with glyph and word recognition, transliteration and document indexing and retrieval. In this research, we created the first corpus and dataset of the Balinese palm leaf manuscripts for the DIA research community. We also developed the glyph recognition system and the automatic transliteration system for the Balinese palm leaf manuscripts. This dissertation proposed a complete scheme of spatially categorized glyph recognition for the transliteration of Balinese palm leaf manuscripts. The proposed scheme consists of six tasks: the text line and glyph segmentation, the glyph ordering process, the detection of the spatial position for glyph category, the global and categorized glyph recognition, the option selection for glyph recognition and the transliteration with phonological rules-based machine. An implementation of knowledge representation and phonological rules for the automatic transliteration of Balinese script on palm leaf manuscript is proposed. The adaptation of a segmentation-free LSTM-based transliteration system with the generated synthetic dataset and the training schemes at two different levels (word level and text line level) is also proposed
Constum, Thomas. "Extractiοn d'infοrmatiοn dans des dοcuments histοriques à l'aide de grands mοdèles multimοdaux." Electronic Thesis or Diss., Normandie, 2024. http://www.theses.fr/2024NORMR083.
Full textThis thesis focuses on automatic information extraction from historical handwritten documents, within the framework of the POPP and EXO-POPP projects. The POPP project focuses on handwritten census tables from Paris (1921-1946), while EXO-POPP deals with marriage records from the Seine department (1880-1940). The main objective is to develop an end-to-end architecture for information extraction from complete documents, avoiding explicit segmentation steps.Initially, a sequential processing pipeline was developed for the POPP project, enabling the automatic extraction of information for 9 million individuals across 300,000 pages. Then, an end-to-end architecture for information extraction was implemented for EXO-POPP, based on a convolutional encoder and a Transformer decoder, with the insertion of special symbols encoding the information to be extracted.Subsequently, the integration of large language models based on the Transformer architecture led to the creation of the DANIEL model, which achieved a new state-of-the-art on several public datasets (RIMES 2009 and M-POPP for handwriting recognition, IAM NER for information extraction), while offering faster inference compared to existing approaches. Finally, two public datasets from the POPP and EXO-POPP projects were made available, along with the code and weights of the DANIEL model
Ghorbel, Adam. "Generalized Haar-like filters for document analysis : application to word spotting and text extraction from comics." Thesis, La Rochelle, 2016. http://www.theses.fr/2016LAROS008/document.
Full textThe presented thesis follows two directions. The first one disposes a technique for text and graphic separation in comics. The second one points out a learning free segmentation free word spotting framework based on the query-by-string problem for manuscript documents. The two approaches are based on human perception characteristics. Indeed, they were inspired by several characteristics of human vision such as the Preattentive processing. These characteristics guide us to introduce two multi scale approaches for two different document analysis tasks which are text extraction from comics and word spotting in manuscript document. These two approaches are based on applying generalized Haar-like filters globally on each document image whatever its type. Describing and detailing the use of such features throughout this thesis, we offer the researches of document image analysis field a new line of research that has to be more explored in future. The two approaches are layout segmentation free and the generalized Haar-like filters are applied globally on the image. Moreover, no binarization step of the processed document is done in order to avoid losing data that may influence the accuracy of the two frameworks. Indeed, any learning step is performed. Thus, we avoid the process of extraction features a priori which will be performed automatically, taking into consideration the different characteristics of the documents
LETURCQ, ALEXIS. "La reconnaissance des caracteres manuscrits sans apprentissage." Paris 6, 1990. http://www.theses.fr/1990PA066209.
Full textFeray, Nicolas. "Reconnaissance de formes : application aux chiffres manuscrits." Rouen, 1995. http://www.theses.fr/1995ROUES028.
Full textLoy, Wee Wang Landau I. D. "Reconnaissance en ligne de caractères alphanumériques manuscrits." S. l. : Université Grenoble 1, 2008. http://tel.archives-ouvertes.fr/tel-00297291.
Full textVincent, Nicole. "Contribution à la reconnaissance de textes multipolices." Grenoble 2 : ANRT, 1988. http://catalogue.bnf.fr/ark:/12148/cb37619142z.
Full textLamy, Bertrand. "Reconnaissance de caracteres manuscrits par combinaison de modeles connexionnistes." Paris 6, 1995. http://www.theses.fr/1995PA066365.
Full textMarti, Roland. "Handschrift, Text, Textgruppe, Literatur : Untersuchungen zur inneren Gliederung der frühen Literatur aus dem ostslavischen Sprachbereich in den Handschriften des 11. bis 14. Jahrhunderts /." Wiesbaden : O. Harrassowitz, 1989. http://catalogue.bnf.fr/ark:/12148/cb35541795f.
Full textLemaitre, Melanie. "Approche markovienne bidimensionnelle d'analyse et de reconnaissance de documents manuscrits." Phd thesis, Université René Descartes - Paris V, 2007. http://tel.archives-ouvertes.fr/tel-00273255.
Full textAMBRES a été appliquée avec succès à des tâches aussi diverses que la reconnaissance de caractères et de mots manuscrits isolés, la structuration de documents manuscrits et la reconnaissance de logos et pourrait être étendue à d'autres problématiques du domaine de la vision.
Des protocoles rigoureux ont été utilisés pour l'étude du système et de ses paramètres ainsi que pour l'évaluation des performances. En particulier, AMBRES a pu être validée au sein de la campagne d'évaluation RIMES (Reconnaissance et Indexation de données Manuscrites et de fac similES).
Ronnet, Jean-Christophe. "Etude et réalisations d'un dispositif de reconnaissance de chiffres manuscrits." Paris 6, 1987. http://www.theses.fr/1987PA066204.
Full textBilane, Pétra. "Contributions à l'indexation et à la reconnaissance des manuscrits syriaques." Lyon, INSA, 2010. http://theses.insa-lyon.fr/publication/2010ISAL0029/these.pdf.
Full textThis thesis is dedicated to the computed exploration of Syriac manuscripts; it is the first study of the sort. Syriac is a language that developed in the eastern region of the Mediterranean coast, about twenty centuries ago, and is still in practice, today. The history as well as the development of the language is presented in the first chapter. Syriac is written from right to left with a distinct feature which is a tilt of about 45Ê which renders classical signal and document analysis algorithms which were developed for other languages rather useless. In the second chapter, after describing and extracting the documents structure, we developed a word segmentation method that takes this tilt into consideration, this lead us to about thirty stable shapes which are vertical letters and n-grammes made out of titled letters. In the second part of this thesis, we were interested in the content of the documents for indexation purposes. We developed a word spotting method that allowed us to find all the occurrences of a word in a document using several word query approaches (word spotting, word retrieval). It is based on shape similarity evaluated after a thorough analysis of the orientations of the handwriting. The last chapter consists of a first contribution to assisted transcription of Syriac manuscripts which relies on the above described segmentation. We showed that transcription based on interaction, is in conflict with the traditional approaches of OCR recognition
Silbermann, Martine. "Reconnaissance de schémas manuscrits : contribution à l'étude d'objets en mouvement." Grenoble INPG, 1986. http://www.theses.fr/1986INPG0134.
Full textSilbermann, Martine. "Reconnaissance de schémas manuscrits contribution à l'étude d'objets en mouvement /." Grenoble 2 : ANRT, 1986. http://catalogue.bnf.fr/ark:/12148/cb37601154t.
Full textRonnet, Jean-Christophe. "Etude et réalisation d'un dispositif de reconnaissance de chiffres manuscrits." Grenoble 2 : ANRT, 1987. http://catalogue.bnf.fr/ark:/12148/cb37609460h.
Full textHERVIEU, MARC. "Etude architecturale d'un systeme de reconnaissance en ligne de caracteres manuscrits." Paris 6, 1996. http://www.theses.fr/1996PA066585.
Full textAUGER, JEAN-MARIE. "Methodes neuronales pour la reconnaissance de caracteres manuscrits : comparaison et cooperation." Paris 6, 1993. http://www.theses.fr/1993PA066296.
Full textDelaye, Adrien. "Méta-modèles de positionnement spatial pour la reconnaissance de tracé manuscrits." Rennes, INSA, 2011. https://tel.archives-ouvertes.fr/tel-00651446.
Full textSpatial meta-models are introduced for a generic description of spatial relations between handwritten objetcs. The models are trainable from data and offer an increased precision thanks to the consideration of objetcs shapes. The fuzzy definition allows to deal with imprecision and provides an intuitive modeling. Meta-models have a predictive ability which enables the description of the modeled relation in the image space, relatively to a reference object. It allows the model visualization and provide a tool for context-driven object segmentation. We proporse a resentation of structured handwritten symbols based only on the description of their spatial information with meta-models. Segmentation of objetcs into primitives is driven by the models prediction ability. Results confirme the importance of spatial information and validate the quality of the proposed models
Kinder, Alain Bouvier Gérard. "Procédé de reconnaissance syntaxique des caractères alphanumériques manuscrits réalisation micro-informatique /." S. l. : Université Grenoble 1, 2008. http://tel.archives-ouvertes.fr/tel-00298254.
Full textMoradkhan, Romel. "Détection des points critiques d'une forme : application à la reconnaissance de caractères manuscrits." Paris 9, 1993. https://portail.bu.dauphine.fr/fileviewer/index.php?doc=1993PA090012.
Full textThe représentation of two-dimensional patterns by their contours is of great importance since many patterns, such as hand-written or printed characters, can be recognized by their contours. Because of its complexity the détection of dominant points of digitalized contours continues to be an important area of research. The first part of our work covers dominant point détection methods of digitalized curves (contours). After a survey of existing techniques we propose two new and efficient methods: the first is based on the notion of "co-angularity"; the second on the notion of "axis of symmetry". In the second part we focus on the problem of hand-written character récognition. We have proposed a hierarchical algorithm based on ctural matching which is both flexible and continuous
Houssein, Ali. "Reconnaissance globale de mots manuscrits arabes basée sur une recherche avec un dictionnaire." Rouen, 1998. http://www.theses.fr/1998ROUES049.
Full textThomas, Simon. "Extraction d'information dans des documents manuscrits non contraints : application au traitement automatique des courriers entrants manuscrits." Rouen, 2012. http://www.theses.fr/2012ROUES048.
Full textDespite the avenment of our world into the digital era, a large amount of handwritten documents continue to be exchanged, forcing our companies and administrations to cope with the processing of masses of documents. Automatic processing of these documents requires access to an unknown but relevant part of their content, and implies taking into account three key points : the document segmentation into relevant entities, their recognition and the rejection of irrelevant entities. Contrary to traditional approaches (full documents reading or keyword detection), all processes are parallelized leading to an information extraction approach. The first contribution of the present work is the design of a generic text line model for information extraction purpose and the implementation of a complete system based on Hidden Markov Models (HMM) constrained by this model. In one pass, the recognition module seeks to discriminate relevant information, characterized by a set of alphabetic, numeric or alphanumeric queries, with the irrelevant information, characterized by a filler model. A second contribution concerns the improvement of the local frame discrimination by using a deep neural network. This allows one to infer high-level representation for the frames and thus automate the feature extraction process. These result is a complete, generic and industrially system, responding to emerging needs in the field of handwritten document automatic reading : the extraction of complex information in unconstrained documents
Humbert, Geneviève. "Premières recherches sur le Kitâb de Si͏̈bawayhi." Paris 8, 1992. http://www.theses.fr/1992PA080702.
Full textThis thesis studies the history of the text of the earliest extant work on arabic grammar : al-kotab by sibawayhi (died ca. 180 796). The sources used are the manuscripts of the text : out of 77 known manuscripts, 58 have been examined and 47 described (elven manuscripts, flawed and late, have been left aside for the moment). The analysis of the chains of transmitters cited in the manuscripts, as well as of other evidence (glosses, colophons and other scribal notes), points to two main stages in the text's history : 1) edition of kitab by al-mubarrad (d. In 285 898), who introduced numerous glosses into the text, 2) and wide diffusion of this edition in both the east, notably by abu 'ali l-farisi and al-zamakhshari, and in the west, where the text was introduced in al-andalus by al-rabahi. Al-mubarrad's corpus became the standard edition of kitab, which had all but eliminated the others. One exception should, however, be noted : the manuscript la (5th xith century), which preserves an older state of the text, without the glosses, might be related to the "kufian" rival of al-mubarrad : tha'lab. Many passages of kitab can be restored with the help of the newly discovered manuscript 1a
Arrivault, Denis. "Apport des Graphes dans la Reconnaissance Non-Contrainte de Caractères Manuscrits Anciens." Phd thesis, Université de Poitiers, 2006. http://tel.archives-ouvertes.fr/tel-00267232.
Full textNamane, Abderrahmane. "Degraded printed text and handwritten recognition methods : Application to automatic bank check recognition." Université Louis Pasteur (Strasbourg) (1971-2008), 2007. http://www.theses.fr/2007STR13048.
Full textCharacter recognition is a significant stage in all document recognition systems. Character recognition is considered as an assignment problem and decision of a given character, and is an active research subject in many disciplines. This thesis is mainly related to the recognition of degraded printed and handwritten characters. New solutions were brought to the field of document image analysis (DIA). The first solution concerns the development of two recognition methods for handwritten numeral character, namely, the method based on the use of Fourier-Mellin transform (FMT) and the self-organization map (SOM), and the parallel combination of HMM-based classifiers using as parameter extraction a new projection technique. In the second solution, one finds a new holistic recognition method of handwritten words applied to French legal amount. The third solution presents two recognition methods based on neural networks for the degraded printed character applied to the Algerian postal check. The first work is based on sequential combination and the second used a serial combination based mainly on the introduction of a relative distance for the quality measurement of the degraded character. During the development of this thesis, methods of preprocessing were also developed, in particular, the handwritten numeral slant correction, the handwritten word central zone detection and its slope
Chatelain, Clément. "Extraction de séquences numériques dans des documents manuscrits quelconques." Phd thesis, Rouen, 2006. http://www.theses.fr/2006ROUES056.
Full textWithin the framework of the automatic processing of incoming mail documents, we present in this thesis the conception and development of a numerical field extraction system in weakly constrained handwritten documents. Although the recognition of isolated handwritten entities can be considered as a partially solved problem, the extraction of information in images of complex and free-layout documents is still a challenge. This problem requires the implementation of both handwriting recognition and information extraction methods inspired by approaches developed within the field of information extraction in electronic documents. Our contribution consists in the conception and the implementation of two different strategies: the first extends classical handwriting recognition methods, while the second is inspired from approaches used within the field of information extraction in electronic documents. The results obtained on a real handwritten mail database show that our second approach is significantly better. Finally, a complete, generic and efficient system is produced, answering one of the emergent perspectives in the field of the automatic reading of handwritten documents: the extraction of complex information in images of documents
Chatelain, Clément. "Extraction de séquences numériques dans des documents manuscrits quelconques." Phd thesis, Université de Rouen, 2006. http://tel.archives-ouvertes.fr/tel-00143090.
Full textKoch, Guillaume. "Catégorisation automatique de documents manuscrits : Application aux courriers entrants." Rouen, 2006. http://www.theses.fr/2006ROUES033.
Full textThe work described in this document deals with the design, the development and the evaluation of a system of categorization of unconstrained handwritten documents such as handwritten incoming mails. We demonstrate that a system developed for electronic document categorization can be adapted for handwritten document categorization
Duneau, Laurent. "Etude et réalisation d'un système adaptatif pour la reconnaissance en ligne de mots manuscrits." Compiègne, 1994. http://www.theses.fr/1994COMP7665.
Full textFarouz, Cherki. "Reconnaissance hors-ligne par modélisation markovienne de mots manuscrits dans un vocabulaire ouvert." Nantes, 1999. http://www.theses.fr/1999NANT2061.
Full textNous decrivons dans ce memoire un systeme de reconnaissance de mots manuscrits hors-ligne sans contrainte dans un grand vocabulaire. La realisation d'un tel systeme a necessite de repondre aux deux problemes suivants: le premier probleme porte sur la diminution du taux de reconnaissance, observe quand il s'agit de rechercher le mot correspondant a une image dans un grand lexique. La reponse apportee a ce probleme passe par une meilleure caracterisation de l'image pour mieux renseigner le module de reconnaissance. Nous avons developpe dans le cadre de cette these une methode d'extraction de caracteristiques, qui a partir d'une image de mot segmentee, determine une description pour representer le mot. Cette representation est deduite de l'etiquetage des segments (entites resultant de la phase de segmentation) a partir d'un alphabet appele ensemble de graphemes. Un grapheme est une entite pouvant correspondre a un morceau de lettre, une lettre complete ou des lettres jointes. L'alphabet des graphemes est construit dans une phase precedente a partir d'un echantillon d'apprentissage. L'etiquetage des differents segments de l'image tient compte du contexte local dans lequel les segments s'inscrivent. Cette idee nous a permis de construire une description pertinente de l'image qui est fournie par la suite aux modeles de markov caches pour estimer la probabilite de vraisemblance de l'image avec chaque entree du lexique. L'application d'une telle methode a donne des taux de reconnaissance tres interessants et a montre une certaine robustesse face a l'augmentation de la taille du lexique. Le deuxieme probleme souleve quand il s'agit de realiser une reconnaissance dans un grand vocabulaire porte sur le temps d'execution qui peut devenir redhibitoire surtout s'il y a des contraintes de temps reel. En reponse a ce probleme, nous avons developpe une methode de reduction de lexique pour la reconnaissance de mots manuscrits hors-ligne. Cette methode se base sur l'estimation d'une borne superieure, a partir d'heuristiques, pour la probabilite de vraisemblance de l'image avec une entree du lexique ou plusieurs. La technique de reduction consiste a eliminer des entrees du lexique, supposes eloignees, au fur et a mesure que nous trouvons dans le dictionnaire des mots proches de l'image en entree. Les tests ont montre que le taux de reconnaissance de la methode approche celui de la methode exhaustive mais le taux moyen d'elagage (69%) reste modeste
Joutel, Guillaume. "Analyse multirésolution des images de documents manuscrits : application à l'analyse de l'écriture." Lyon, INSA, 2009. http://theses.insa-lyon.fr/publication/2009ISAL0045/these.pdf.
Full textImages of strokes, especially handwriting documents images, are often composed of heterogeneous contents and require specific methods of analysis to be exploited. In this thesis, we propose the development of a characterization of handwritings based on a non-adaptive geometrical wavelet transform which is the Curvelets transform. We validate this characterization in different application frameworks of document images analysis. The Curvelets were chosen for their property of good localization of anisotropic objects and their directional multi-scale analysis. We can extract from those two essential primitives which are orientation and curvature at varying levels of scales. These primitives are gathered in a matrix of occurrences to form the signature of a handwriting. The latter is used as a features' vector in an content based image retrieval application. We also propose an evaluation of local similarity between the smaller forms, whatever they are, where the only primitive used is orientation. Finally, as part of a specialization of our method to the corpus of images provided by the ANR project Graphem, we propose the definition of a similarity index, generated from trade between partners in this project, which attempts to use common properties of handwritings while trying to address their differences and their characteristics. This consideration, weighted by the user, allows an advanced exploitation of medieval handwritings
Almaksour, Abdullah. "Apprentissage incrémental de systèmes d'inférence floue : application à la reconnaissance de gestes manuscrits." Phd thesis, INSA de Rennes, 2011. http://tel.archives-ouvertes.fr/tel-00741574.
Full textThépaut, André. "Contribution à l'étude des machines hybrides : application à la reconnaissance des chiffres manuscrits." Montpellier 2, 1995. http://www.theses.fr/1995MON20096.
Full textAlmousa, Almaksour Abdullah. "Apprentissage incrémental de systèmes d'inférence floue : Application à la reconnaissance de gestes manuscrits." Rennes, INSA, 2011. http://www.theses.fr/2011ISAR0015.
Full textWe present in a new method for the conception of evolving and customizable classification systems. We propose an incremental approach for the learning of classification models based on first-order Takagi-Sugeno (TS) fuzzy inference systems. This approach includes, on the one hand, the adaptation of linear consequences of the fuzzy rules using the recursive least-squares method, and, on the other hand, an incremental learning of the antecedent of these rules in order to modify the membership functions according to the evolution of data density in the input space. The proposed method resolves the instability problems in the incremental learning of TS models thanks to a global learning paradigm in which antecedent and consequents are learned in synergy, contrary to the existing approaches where they are learned separately. The performance of our system had been demonstrated on different well-known benchmarks, with a special focus on its capacity of learning new classes