Dissertations / Theses on the topic 'Vision stéréo'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 43 dissertations / theses for your research on the topic 'Vision stéréo.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Zhao, Jian. "Extraction d'information tri-dimensionnelle par stéréo vision." Toulouse 3, 1989. http://www.theses.fr/1989TOU30207.
Full textGyory, Georges. "Détection d'obstacles pour conduite assistée par vision stéréo." Paris 11, 2003. http://www.theses.fr/2003PA112107.
Full textFor a multisensor car with data fusion the stereo vision must deliver independent data from the other sensors. We tested two groups of methods to do that, one predicting the dense disparity maps to save CPU time and the other eliminating road surface points and looking for obstacles in the contour points left. To predict the disparity maps we need the position of the expansion center and the expansion coefficient, the latter depending on speed. We tested uniform and depth-dependent expansion, depth being estimated from the disparity values. Parameter estimations were made comparing several estimated disparity maps' values. The prediction methods can extend the considered disparity values' range without loss of image size and can follow correct matches over time. On the other hand, they don't follow the objects' own movement and in lack of everywhere dense contrast points in road situations the estimations cannot be corrected. Road surface elimination needs matching several road surface points. Supposing some are on an arc of circle more or less tangent to the car, we apply a Hough transform to find them. On planar road the matching parameters vary dependently with the car's pitch movement and we exploit that to use just one road point matched. On the contour points left we look for quasi-vertical scending sequences with quasi-constant disparity values close to that of the road surface at the beginning of the sequence. This gives a very fast algorithm yielding positions and confidence values (length of the ascending sequence) of obstacle contours
Laïeb, Noureddine. "Contribution à la vision par ordinateur : une méthode de stéréo vision passive." Le Havre, 1987. http://www.theses.fr/1987LEHA0005.
Full textZhu, Ying. "Etude psychophysique de la vision en relief humaine en télévision stéréo." Lyon, INSA, 1992. http://www.theses.fr/1992ISAL0106.
Full textThis thesis is concerned with the stud of certain hurnan stereo vision cues. Particularly relevant ta depth perception in a dynamic 3d (three-dimensional) stereoscopic TV and with t he understanding of human stereoscopic visual processes using local spectral models. We have studied different types of stereo acuity all the retinal meridian in both central and eccentric eye positions, in physiological and pathological subjects. Accomdation and convergence, the two essentiel depth perception cues in 3D Tvs, have benn investigated in detail, allowing ta get insight into the way the human visual system behaves in arder to resolve the conflict between accommodation and convergence. The quality problem of depth perception in the presence of geometrical distortions was also addressed. Within this framework. Two types of geornetrical distortions have been distinguished, and their affect the visual comfort and binocular disparity sensitivity bas been detailed in various situations. Finally, in arder ta get an insight into the stereo informatian processing and coding processes, we have studied the influence of the monocular images' local spectral content on human stereo vision by employing a 2D (two-dimensional) space/spatial-frequency model based on using the 2D Wigner-Vill distribution, and the 2D discrets cosine transformation
Bak, Adrien. "Coopération stéréo mouvement pour la détection des objets dynamiques." Phd thesis, Université Paris Sud - Paris XI, 2011. http://tel.archives-ouvertes.fr/tel-00673364.
Full textSIMOND, Nicolas. "Localisation robuste d'un véhicule en environnement urbain à partir d'un système de stéréo-vision." Phd thesis, Université de Nice Sophia-Antipolis, 2005. http://tel.archives-ouvertes.fr/tel-00009694.
Full textUne modélisation polyhédrique des rues qui forment l'environnement urbain de cette étude permet de segmenter trois plans principaux : la route et les façades verticales des bâtiments qui délimitent l'espace de navigation. La segmentation du plan de la route est facilitée par la matérialisation des voies de navigation et autres trottoirs ou terre-pleins, parallèles entre eux au premier plan, qui sont à l'origine dans les images de contours qui convergent vers un point de fuite dominant.
Les conditions d'illumination et la présence d'obstacles dynamiques qui obstruent le champ de vision compliquent la tâche de suivi des plans segmentés. L'emploi d'une paire stéréo permet à chaque nouvelle pose de déterminer quelles sont les régions des images qui correspondent aux plans segmentés en calculant l'homographie induite par chaque plan entre les deux images de la paire. Le déplacement du véhicule étant supposé plan, ses mouvements d'inclinaison et de lacet influent peu sur chacune des homographies estimées entre les images de la paire stéréo. Nous possédons ainsi un outil fiable de segmentation des plans principaux qui forment l'environnement.
L'estimation du déplacement entre deux poses de chaque caméra consiste à estimer l'homographie induite par le mouvement relatif du plan considéré. Le calcul est entrepris grâce à la mise en correspondance des primitives (points, droites) détectées dans les régions planaires segmentées. La précision et la fiabilité des résultats dépendent essentiellement de la répartition spatiale des primitives mises en correspondance.
L'introduction de la notion de super-homographie permet de robustifier les estimations des homographies en calculant de manière simultanée toutes les homographies induites par un même plan, observé dans plusieurs images. La redondance des contraintes imposées par le mouvement relatif des primitives coplanaires mises en correspondance améliore l'estimation du mouvement de la projection du plan dans les différentes images. L'apport de la super-homographie est double : elle permet de détecter d'éventuelles erreurs de mise en correspondance et fournit une estimée des coordonnées des primitives coplanaires avec une précision sub-pixellique. Lorsque la calibration de la paire stéréo est connue, la trajectographie du véhicule peut être estimée en décomposant l'homographie relative au mouvement d'une des caméras, extraite de la super-homographie.
Simond, Nicolas. "Localisation robuste d'un véhicule en environnement urbain à l'aide d'un système de stéréo-vision." Nice, 2005. http://www.theses.fr/2005NICE4018.
Full textThe thesis deals with mobile robots in the specific context of urban environment. We want to extract the maximum of data on the motion of an uncalibrated stereo-rig embedded in a vehicle with in a structured environment. The aim of this study is to estimate the motion of a vehicle when the localization is impossible with a satellite position system due to the occlusion of the free space. A polyhedrical model of the streets, which is the considered environment in this study, allows the segmentation of three main plane : the road and the vertical facades which represent the frontier of the navigation area. We assume that the while lanes which split the different ways and the kerbs which delimit the road are all parallel in the scene, at least on the foreground. Therefore the detection of these induced edges in image is easy because they all converge in a dominant vanishing point. The lighting conditions and the dynamical obstacles which occlude the vision field make the tracking task of the segmented planes difficult. The use of a stereo-vision allows the segmentation in image of the coplanar area of the scene assuming that each plane induces a homography between both images of the stereo-rig. The vehicle motion is assumed to be plane, the pan and tilt rotation motions have low contributions on the computed homographies. We hence possess a reliable tool to segment the main planes which structure the environment. The motion of the cameras between two poses is obtained with the homography induced by the relative motion of one plane. The homography computation requires the matching of features (lines, points) detected in the coplanar regions of images. The precision and the reliability of the result highly depends on the spatial distribution of the matched features. The introduction of the motion of super-homography improves the robustness of the homography computation by computing in one shot all the homographies induced by the same plane in several images. The redundancy of the constraints imposed by the relative motion of the matched coplanar features increases the constraints of the plane projection in each image. The super-homography has two advantages : it allows the detection of mismatched features and supplies coordinates with a sub-pixellic precision. When the calibration of the stereo-rig is available, the extraction of the homography induced by the relative motion of a camera from the super-homography supplies the trajectography of the vehicle
Lefaudeux, Benjamin. "Détection, localisation et suivi des obstacles et objets mobiles à partir d'une plate forme de stéréo-vision." Thesis, Paris, ENMP, 2013. http://www.theses.fr/2013ENMP0091/document.
Full textThis PhD work is to be seen within the context of autonomous vehicle perception, in which the detection and localisation of elements of the surroundings in real time is an obvious requirement. Subsequent perception needs are manyfold, from localisation to obstacle detection, and are the subject of a continued research interest. The goal of this work is to build, in real time and from stereovision acquisition, a 3D map of the surroundings ; while detecting and tracking moving objects.Interest point selection and tracking on picture space are a first step, which we initiate by a thorough comparison of detectors from the literature. As regards tracking, we propose a massively parallel implementation of the standard KLT algorithm, using redundant tracking to provide reliable quality estimation. This allows us to track thousands of points in real-time, which compares favourably to the state of the art.Next step is the ego-motion estimation, along with the positioning of tracked points in 3D space. We first propose an iterative variant of the well known “SVD” process followed by UKF filtering, which allows for a very fast and reliable estimation. Then the position of every followed interest point is filtered on the fly over time, in contrast to most dense approaches from the literature.We finally propose a segmentation of moving objects in the augmented position-speed space, which is made possible by our continuous estimation of feature points position. Target tracking and filtering finally use a GM-PHD approach
Petitpas, Benoit. "Extraction de paramètres bio-geo-physiques de surfaces 3D reconstruites par multi-stéréo-restitution d'images prises sans contraintes." Phd thesis, Université Paris-Est, 2011. http://tel.archives-ouvertes.fr/tel-00693326.
Full textAmur, Khua Bux. "Contrôle adaptatif, techniques de régularisation et applications en analyse d'images." Thesis, Metz, 2011. http://www.theses.fr/2011METZ011S/document.
Full textAn adaptive control and regularization techniques are studied for some linear and nonlinear ill-posed problems in image processing and computer vision. These methods are based on variational approach which constitutes the minimization of a suitable energy functional made up of two parts data term obtained from constancy of grey value assumption and regularization term which cope with ill-posed problem by its filling in effect property. We have provided a novel adaptive approach for optic flow and stero vision problems. Which is based on adaptive finite element method using unstructutred grid as the domain of computation, which allow the automatic choice of local optimal regularization parameters. Various linear and nonlinear variational models have been considered in this thesis for scientific computations of optic flow and stereo vision problems and an efficient adaptive control is obtained. This novel regularization strategy is controlled by the regularization parameter [alpha] which depends on space and a posteriori error estimator called residual error indicator. This local adptive behavior has encouraged us to experiment with other problems in image analysis such as denoising, we add a preliminary chapter on hte model of Perona-Malik. This work falls in to the category novel and advanced numerical strategies for scientific computations specifically for image motion problems
Douret, Jérôme. "Conception et réalisation d'un capteur stéréoscopique pour la détection en condition d'éclairement naturel : application à l'analyse automatique du trafic routier." Paris 6, 2004. http://www.theses.fr/2004PA066097.
Full textMassone, Quentin. "Numérisation 3D en environnement aquatique : application aux réseaux karstiques." Electronic Thesis or Diss., Université de Montpellier (2022-....), 2023. http://www.theses.fr/2023UMONS032.
Full textKarst aquifers, mainly limestone geological formations, represent a considerable challenge in the current context where many parts of the world are under water stress due to climate change. Although these aquifers are one of the world’s main freshwater sources, they are still underexploited due to a lack of precise knowledge of their geomorphology. Exploring them to collect data is a complex task because the networks formed in this type of environment are very extensive. In addition, sending divers is not ideal as they are limited in depth and the risks are significant due to the confinement and unpredictable structure. Eventually, the shipment of autonomous underwater vehicles would allow further exploration of the tunnels safely while acquiring data from sensors. However, using such vehicles in such an unpredictable environment without human supervision still presents many technical and scientific challenges.The objective of this thesis is therefore to propose 3D vision solutions adapted to gallery networks present in this type of environment. We thus propose two approaches based on the use of camera and a projector of conical shape.The first, prospective, is an active stereovision method using a system composed of two cameras and the projector, has been validated in simulation via the use of a gallery model.The second, which is the heart of this memory, is a structured light 3D reconstruction method using the combination of a camera and a cone-shaped projector.After extracting the contours, the points of these contours are reconstructed in 3D using the geometric constraint between the cone and the camera. This geometric constraint is expressed via the parameters of the headlamp cone, the estimation of which is made via a specific calibration.Various experiments have been carried out to validate this original approach. The first is carried out in a laboratory where a luminous contour projected on two orthogonal walls is reconstructed in 3D and then verified via tests of coplanarity and angulation between these two control walls. The second is carried out in an abandoned aqueduct (without water) having the advantage of reproducing an environment close to that of a karst aquifer: narrow, without light and especially without water to continue our assessment. The validation of the results is done on tests of coplanarity of the walls of the aqueduct and on the estimation of the known distance between these walls. The third is done in the pool, the first step before the final test in karstic environment. The validation of the results is also carried out by tests of the coplanarity of the points on the ground and the estimation of the cylindrical shape of the pool and its radius
De, Franchis Carlo. "Earth Observation and Stereo Vision." Thesis, Université Paris-Saclay (ComUE), 2015. http://www.theses.fr/2015SACLN002/document.
Full textThis thesis deals with the problem of computing accurate digital elevationmodels of the Earth's surface from optical images taken by pushbroomobservation satellites. It takes advantage of the collaboration of thedefendant with CNES (the French Space Agency) on the development ofstereo vision tools for Pléiades, the first Earth observation satelliteproducing quasi simultaneous stereo pairs or triplets with small baseline.The first chapter describes a simple pushbroom camera model for observationsatellites orbiting around the Earth and addresses the correction of theacquisition geometry by involving extrinsic information. This chapter proposesa new algorithm to refine the orientation parameters from a set of groundcontrol points, applicable to all pushbroom satellites.With the goal of testing for satellite imaging the thriving exploration ofstereo matching by the computer vision community, the second chapter exploresthe adaptation of the theory of epipolar resampling to pushbroom images.Epipolar resampling is traditionally used in stereo to reduce the matchingcomputational cost, and permits to test for satellite imaging the mostcompetitive computer vision algorithms. The third chapter discusses the effectsof geometric calibration inaccuracies and proposes a method to cancel itsimpact on stereo matching.The fourth chapter analyzes and describes a detailed implementation of theSemi-Global Matching (SGM) algorithm, which is currently among the top-rankedstereo vision algorithms. Based on a recently proposed interpretation of SGM asa min-sum Belief Propagation algorithm, a variant is proposed that allows toreduce by a factor five the energy gap of SGM with respect to referencealgorithms for Markov Random Fields with truncated smoothness terms.By wrapping together the algorithmic blocks described in the previous chapters,the fifth chapter describes S2P, a complete stereo pipeline for producingdigital elevation models from satellite images. As an application, a landscapeevolution model is presented in the sixth chapter. The model is used tosimulate numerically the fine structure of the river networks on digitalelevation models obtained from Pléiades Earth observation images.The source code of the S2P stereo pipeline is distributed as open source. Toensure reproducibility, the algorithms implemented in each step of the S2Ppipeline are submitted to the IPOL journal, with detailed descriptions of thealgorithms, documented source codes and online demonstrations for each block ofthe pipeline
Naoulou, Abdelelah. "Architectures pour la stéréovision passive dense temps réel : application à la stéréo-endoscopie." Phd thesis, Université Paul Sabatier - Toulouse III, 2006. http://tel.archives-ouvertes.fr/tel-00110093.
Full textSkordas, Thomas. "Mise en correspondance et reconstruction stéréo utilisant une description structurelle des images." Phd thesis, Grenoble INPG, 1988. http://tel.archives-ouvertes.fr/tel-00329721.
Full textDurand-Texte, Thomas. "Mesure de vibrations par vision 3D." Thesis, Le Mans, 2019. http://www.theses.fr/2019LEMA1003/document.
Full textThe objective of this Ph.D is to study the relevance and limits of 3D vision methods coupled to high-speed cameras and applied to non-contact synchronous vibration measurement, in the vibro-acoustic range of frequencies. A first pseudo stereoscopic set-up, taken from robotics, using a four-mirror adapter in order to generate two virtual viewpoints from a single real camera, has been tested on a plate and a loudspeaker. The results, validated by comparison with those obtained with a laser vibrometre, prove the relevance of the approach, despite some constraints related to the optical elements. In a logic of simplification, three other set-ups have then been proposed and tested, allowing designing two full-field vibration measurement techniques and a method for the Iterative Rectification of Images (IRIs), adapted to the context. The no-mirror method uses a mathematical line to triangulate positions and is basically suited to measure the single-axis vibrations of globally plane objects, displaying non-repeatable displacements along the normal of the surface or along a known axis. The asynchronous cameras technique requires a high-speed and an industrial camera used simultaneously to measure the multi-axis displacements of 3D vibratory phenomena. The results obtained on a car bonnet and a loudspeaker prove its potential to characterise large panels or to carry out end-of-line testing of loud-speakers for example. In conclusion, the three measurement protocols and the associated results are compared in order to assess their respective potentialities and limits in the context of vibration measurement
Bony, Alexandre. "Modélisation de l'interaction lumière/matière pour l'analyse de surfaces rugueuses texturées par stéréo photométrie." Thesis, Poitiers, 2013. http://www.theses.fr/2013POIT2301/document.
Full textTridimensional reconstruction method has become essential for applications suchas the characterization and analysis of surfaces. In this thesis, aims are to increase the qualityof 3d reconstructions by photometric stereo. This method is based on two principles, reversinglight-matter interaction model and configuration of a lighting system. With diffuse surfaces,the photometric stereo use three captured images from a fixed point of view for differentillumination directions. Its main advantage is to extract the color and geometric propertiesfor the textured rough surfaces. However, its application requires to make assumptions thatare not credible in real cases. This problem generates significant errors in the reconstructions.To reduce them, we offer various solutions around the overall consideration of the acquisitionchain. Our contribution focuses on the characterization and modeling of the lighting system,the acquisition sensor and improved image quality. We are also interested to optimize acquisitionprotocol in the case of specular surface or shading due to the surface geometry. Ourresults show that the inclusion of these features in the inversion of a BRDF model allowsan improvement of 3d reconstructions as well as the possibility of reducing the size of theacquisition systems
El, Kabir Taoufik. "Analyse d'objets par vision par ordinateur : géométrie et colorimétrie." Thesis, Poitiers, 2020. http://www.theses.fr/2020POIT2262.
Full textDigital image processing allows to perform non-destructive control in order to determine accurately the state of the analysed surfaces. In the medical field, it can improve patient care and assist clinicians in diagnosis and post-treatment follow-up. The work realised in this thesis focuses on Photometric-Stereo, one of the most important methods for information extraction by digital image processing. The Photometric-Stereo relies on the acquisition of the analysed surface from a fixed point of view but under different lighting positions. It enables the simultaneous extraction of the geometry and colorimetry of the analysed surfaces. It enables a simultaneous extraction of the geometry and colorimetry of the analysed surfaces. However, its application requires the formulation of several hypotheses that are not always valid under real conditions. Thus, we propose a solution to one of the most reductive hypotheses related to the photometric nature of the analysed surfaces. On the other hand, a new mathematical formulation of Photometric-Stereo is proposed to avoid the numerical integration step for the 3D reconstruction estimation. This original approach based on discrete geometry offers an inclusive numerical framework with a spatial character. As an application for this work, we propose a diagnostic aid system for psoriasis severity evaluation from the study of Photometric-Stereo previously carried out. The results obtained through these different contributions extend the use of Photometry-Stereo and demonstrate its potential in a rigorous application framework
Dubreuil, Lorène. "Mesure In-situ par moyens optiques." Thesis, Université Paris-Saclay (ComUE), 2017. http://www.theses.fr/2017SACLN018/document.
Full textIntegrating inspection procedures in the machining process contributes to process optimization. The use of in-situ measurement allows a betterreactivity for corrective actions. However, to be highly efficient, Machining and Inspection Process Planning must reach a high level of integration. It is hereessential to focus on the compromise measurement time vs precision: the time dedicated to inspection must be limited, but not to the detriment ofmeasurement quality. A measurement process for in-situ machining defect detection is proposed based on a stereo-DIC
Mennillo, Laurent. "Reconstruction 3D de l'environnement dynamique d'un véhicule à l'aide d'un système multi-caméras hétérogène en stéréo wide-baseline." Thesis, Université Clermont Auvergne (2017-2020), 2019. http://www.theses.fr/2019CLFAC022/document.
Full textThis Ph.D. thesis, which has been carried out in the automotive industry in association with Renault Group, mainly focuses on the development of advanced driver-assistance systems and autonomous vehicles. The progress made by the scientific community during the last decades in the fields of computer science and robotics has been so important that it now enables the implementation of complex embedded systems in vehicles. These systems, primarily designed to provide assistance in simple driving scenarios and emergencies, now aim to offer fully autonomous transport. Multibody SLAM methods currently used in autonomous vehicles often rely on high-performance and expensive onboard sensors such as LIDAR systems. On the other hand, digital video cameras are much cheaper, which has led to their increased use in newer vehicles to provide driving assistance functions, such as parking assistance or emergency braking. Furthermore, this relatively common implementation now allows to consider their use in order to reconstruct the dynamic environment surrounding a vehicle in three dimensions. From a scientific point of view, existing multibody visual SLAM techniques can be divided into two categories of methods. The first and oldest category concerns stereo methods, which use several cameras with overlapping fields of view in order to reconstruct the observed dynamic scene. Most of these methods use identical stereo pairs in short baseline, which allows for the dense matching of feature points to estimate disparity maps that are then used to compute the motions of the scene. The other category concerns monocular methods, which only use one camera during the reconstruction process, meaning that they have to compensate for the ego-motion of the acquisition system in order to estimate the motion of other objects. These methods are more difficult in that they have to address several additional problems, such as motion segmentation, which consists in clustering the initial data into separate subspaces representing the individual movement of each object, but also the problem of the relative scale estimation of these objects before their aggregation within the static scene. The industrial motive for this work lies in the use of existing multi-camera systems already present in actual vehicles to perform dynamic scene reconstruction. These systems, being mostly composed of a front camera accompanied by several surround fisheye cameras in wide-baseline stereo, has led to the development of a multibody reconstruction method dedicated to such heterogeneous systems. The proposed method is incremental and allows for the reconstruction of sparse mobile points as well as their trajectory using several geometric constraints. Finally, a quantitative and qualitative evaluation conducted on two separate datasets, one of which was developed during this thesis in order to present characteristics similar to existing multi-camera systems, is provided
Tannouri, Anthony. "Using Wireless multimedia sensor networks for 3D scene asquisition and reconstruction." Thesis, Bourgogne Franche-Comté, 2018. http://www.theses.fr/2018UBFCD053/document.
Full textNowadays, the WMSNs are promising for different applications and fields, specially with the development of the IoT and cheap efficient camera sensors. The stereo vision is also very important for multiple purposes like Cinematography, games, Virtual Reality, Augmented Reality, etc. This thesis aim to develop a 3D scene reconstruction system that proves the concept of using multiple view stereo disparity maps in the context of WMSNs. Our work can be divided in three parts. The first one concentrates on studying all WMSNs applications, components, topologies, constraints and limitations. Adding to this stereo vision disparity map calculations methods in order to choose the best method(s) to make a 3d reconstruction on WMSNs with low cost in terms of complexity and power consumption. In the second part, we experiment and simulate different disparity map calculations on a couple of nodes by changing scenarios (indoor and outdoor), coverage distances, angles, number of nodes and algorithms. In the third part, we propose a tree-based network model to compute accurate disparity maps on multi-layer camera sensor nodes that meets the server needs to make a 3d scene reconstruction of the scene or object of interest. The results are acceptable and ensure the proof of the concept to use disparity maps in the context of WMSNs
Iskef, Alaa Eddin. "Technologies informatiques pour l'étude du comportement expérimental et numérique d'un assemblage poutre-poteau en béton armé." Thesis, Université Paris-Saclay (ComUE), 2016. http://www.theses.fr/2016SACLN015/document.
Full textThe behavior of reinforced concrete beam-column joints and their influence on the strength of the structures under cyclic or seismic loadings has been the subject of several investigations in recent years. However, the behavior of that part of the structure remains far from being mastered due to the complexity of the assembly involving several physical phenomena and due to the lack of exhaustive experimental data. This work aims to implement and provide a reliable and dense experimental database whose vocation is to provide access to an experimental benchmark to enable the modeling and validation of the behavior of these assemblies
Delaunoy, Amael. "Modélisation 3D à partir d'images : contributions en reconstruction photométrique à l'aide de maillages déformables." Phd thesis, Université de Grenoble, 2011. http://tel.archives-ouvertes.fr/tel-00716407.
Full textNarasimha, Ramya. "Méthodes dʼestimation de la profondeur par mise en correspondance stéréoscopique à lʼaide de champs aléatoires couplés." Grenoble, 2010. http://www.theses.fr/2010GRENM053.
Full textThe depth of objects in 3-D scene can be recovered from a stereo image-pair by finding correspondences between the two views. This stereo matching task involves identifying the corresponding points in the left and the right images, which are the projections of the same scene point. The difference between the locations of the two corresponding points is the disparity, which is inversely related to the 3-D depth. In this thesis, we focus on Bayesian techniques that constrain the disparity estimates. In particular, these constraints involve explicit smoothness assumptions. However, there are further constraints that should be included, for example, the disparities should not be smoothed across object boundaries, the disparities should be consistent with geometric properties of the surface, and regions with similar colour should have similar disparities. The goal of this thesis is to incorporate such constraints using monocular cues and differential geometric information about the surface. To this end, this thesis considers two important problems associated with stereo matching; the first is localizing disparity discontinuities and second aims at recovering binocular disparities in accordance with the surface properties of the scene under consideration. We present a possible solution for each these problems. In order to deal with disparity discontinuities, we propose to cooperatively estimating disparities and object boundaries. This is motivated by the fact that the disparity discontinuities occur near object boundaries. The second one deals with recovering surface consistent disparities and surface normals by estimating the two simultaneously
Kmiotek, Pawel. "Fusion multi-capteurs pour la représentation et le suivi des objets dynamiques." Phd thesis, Belfort-Montbéliard, 2009. http://tel.archives-ouvertes.fr/tel-00608155.
Full textPelcat, Jimmy. "Reconstruction 3D et production de carte dense de disparité en stéréovision non-alignée pour des applications industrielles de localisation 3D et d'analyse de surface." Thesis, Rouen, INSA, 2012. http://www.theses.fr/2012ISAM0025/document.
Full textIn industrial vision, many applications for measuring and quality control are moving to three-dimensional problems. Stereovision systems are technological solutions that attract industry by their mechanical simplicity. Two static cameras placed at strategic locations may be sufficient to address this problem although the industrial constraints imposed to respect a short processing time and precise measurements. The diversity of applications lead us to consider two approaches to resolve the two types of application. The first technique consists in the 3D reconstruction from pairs of image points which correspond in both images. It is intended to address the problem of 3D measurement. The methods of monocular calibration and 3D triangulation are the basis of 3D reconstruction. We study the accuracy and its evolution according to the capture system pose compared to the observed scene. The second technique is to construct disparity maps to address problems of building profile and default analysis. The alignment constraint of cameras needed to accelerate the process of matching involves the use of methods of stereoscopic calibration and image rectification. We study the impact of alignment on the quality of the rectification. The production of dense disparity map is based on the stereo-correlation techniques. We show the limits of the use of a squared correlation kernel and propose an alternative production of two dense disparity maps from two mono-directional kernels, improving the measurement of disparity around edges and occlusions
Vu, Hiep. "Stéreo multi-vues à grande échelleet de haute qualité." Phd thesis, Ecole Nationale des Ponts et Chaussées, 2011. http://tel.archives-ouvertes.fr/tel-00743289.
Full textDelaunoy, Amaël. "Modélisation 3D à partir d'images : contributions en reconstruction photométrique à l'aide de maillages déformables." Thesis, Grenoble, 2011. http://www.theses.fr/2011GRENM069/document.
Full textUnderstanding, analyzing and modeling the 3D world from 2D pictures and videos is probably one of the most exciting and challenging problem of computer vision. In this thesis, we address several geometric and photometric aspects to 3D surface reconstruction from multi-view calibrated images. We first formulate multi-view shape reconstruction as an inverse rendering problem. Using generative models, we formulate the problem as an energy minimization method that leads to the non-linear surface optimization of a deformable mesh. A particular attention is addressed to the computation of the discrete gradient flow, which leads to coherent vertices displacements. We particularly focus on models and energy functionals that depend on visibility and photometry. The same framework can then be equally used to perform multi-view stereo, multi-view shape from shading or multi-view photometric stereo. Then, we propose to exploit different additional information to constraint the problem in the non-Lambertian case, where the appearance of the scene depends on the view-point direction. Segmentation for instance can be used to segment surface regions sharing similar appearance or reflectance. Helmholtz reciprocity can also be applied to reconstruct 3D shapes of objects of any arbitrary reflectance properties. By taking multiple image-light pairs around an object, multi-view Helmholtz stereo can be performed. Using this constrained acquisition scenario and our deformable mesh framework, it is possible to reconstruct high quality 3D models
JUNG, Il Kyun. "Simultaneous localization and mapping in 3D environments with stereovision." Phd thesis, Institut National Polytechnique de Toulouse - INPT, 2004. http://tel.archives-ouvertes.fr/tel-00010250.
Full textLefaudeux, Benjamin. "Détection, localisation et suivi des obstacles et objets mobiles à partir d'une plate-forme de stéréovision." Phd thesis, Ecole Nationale Supérieure des Mines de Paris, 2013. http://tel.archives-ouvertes.fr/tel-00918508.
Full textNarasimha, Ramya. "Méthodes dʼestimation de la profondeur par mise en correspondance stéréoscopique à lʼaide de champs aléatoires couplés." Phd thesis, Grenoble, 2010. http://www.theses.fr/2010GRENM056.
Full textThe depth of objects in 3-D scene can be recovered from a stereo image-pair by finding correspondences between the two views. This stereo matching task involves identifying the corresponding points in the left and the right images, which are the projections of the same scene point. The difference between the locations of the two corresponding points is the disparity, which is inversely related to the 3-D depth. In this thesis, we focus on Bayesian techniques that constrain the disparity estimates. In particular, these constraints involve explicit smoothness assumptions. However, there are further constraints that should be included, for example, the disparities should not be smoothed across object boundaries, the disparities should be consistent with geometric properties of the surface, and regions with similar colour should have similar disparities. The goal of this thesis is to incorporate such constraints using monocular cues and differential geometric information about the surface. To this end, this thesis considers two important problems associated with stereo matching; the first is localizing disparity discontinuities and second aims at recovering binocular disparities in accordance with the surface properties of the scene under consideration. We present a possible solution for each these problems. In order to deal with disparity discontinuities, we propose to cooperatively estimating disparities and object boundaries. This is motivated by the fact that the disparity discontinuities occur near object boundaries. The second one deals with recovering surface consistent disparities and surface normals by estimating the two simultaneously
Demirdjian, David. "Le mouvement projectif : théorie et applications pour l'autocalibrage et la segmentation du mouvement." Phd thesis, Grenoble INPG, 2000. http://tel.archives-ouvertes.fr/tel-00590318.
Full textProvencher, David. "Imagerie 3D de l'anatomie interne d'une souris par dynamique de fluorescence." Mémoire, Université de Sherbrooke, 2012. http://hdl.handle.net/11143/6205.
Full textSheikh, Faridul Hasan. "Analysis of 3D color matches for the creation and consumption of video content." Thesis, Saint-Etienne, 2014. http://www.theses.fr/2014STET4001.
Full textThe objective of this thesis is to propose a solution to the problem of color consistency between images originate from the same scene irrespective of acquisition conditions. Therefore, we present a new color mapping framework that is able to compensate color differences and achieve color consistency between views of the same scene. Our proposed, new framework works in two phases. In the first phase, we propose a new method that can robustly collect color correspondences from the neighborhood of sparse feature correspondences, despite the low accuracy of feature correspondences. In the second phase, from these color correspondences, we introduce a new, two-step, robust estimation of the color mapping model: first, nonlinear channel-wise estimation; second, linear cross-channel estimation. For experimental assessment, we propose two new image datasets: one with ground truth for quantitative assessment; another, without the ground truth for qualitative assessment. We have demonstrated a series of experiments in order to investigate the robustness of our proposed framework as well as its comparison with the state of the art. We have also provided brief overview, sample results, and future perspectives of various applications of color mapping. In experimental results, we have demonstrated that, unlike many methods of the state of the art, our proposed color mapping is robust to changes of: illumination spectrum, illumination intensity, imaging devices (sensor, optic), imaging device settings (exposure, white balance), viewing conditions (viewing angle, viewing distance)
Alkhalil, Fadi. "Stereo visual servoing from straight lines." Phd thesis, Université de Strasbourg, 2012. http://tel.archives-ouvertes.fr/tel-00750946.
Full textBen, Hamadou Achraf. "Contribution à la cartographie 3D des parois internes de la vessie par cystoscopie à vision active." Phd thesis, Institut National Polytechnique de Lorraine - INPL, 2011. http://tel.archives-ouvertes.fr/tel-00628292.
Full textZhang, Naiyu. "Cellular GPU Models to Euclidean Optimization Problems : Applications from Stereo Matching to Structured Adaptive Meshing and Traveling Salesman Problem." Thesis, Belfort-Montbéliard, 2013. http://www.theses.fr/2013BELF0215/document.
Full textThe work presented in this PhD studies and proposes cellular computation parallel models able to address different types of NP-hard optimization problems defined in the Euclidean space, and their implementation on the Graphics Processing Unit (GPU) platform. The goal is to allow both dealing with large size problems and provide substantial acceleration factors by massive parallelism. The field of applications concerns vehicle embedded systems for stereovision as well as transportation problems in the plane, as vehicle routing problems. The main characteristic of the cellular model is that it decomposes the plane into an appropriate number of cellular units, each responsible of a constant part of the input data, and such that each cell corresponds to a single processing unit. Hence, the number of processing units and required memory are with linear increasing relationship to the optimization problem size, which makes the model able to deal with very large size problems.The effectiveness of the proposed cellular models has been tested on the GPU parallel platform on four applications. The first application is a stereo-matching problem. It concerns color stereovision. The problem input is a stereo image pair, and the output a disparity map that represents depths in the 3D scene. The goal is to implement and compare GPU/CPU winner-takes-all local dense stereo-matching methods dealing with CFA (color filter array) image pairs. The second application focuses on the possible GPU improvements able to reach near real-time stereo-matching computation. The third and fourth applications deal with a cellular GPU implementation of the self-organizing map neural network in the plane. The third application concerns structured mesh generation according to the disparity map to allow 3D surface compressed representation. Then, the fourth application is to address large size Euclidean traveling salesman problems (TSP) with up to 33708 cities.In all applications, GPU implementations allow substantial acceleration factors over CPU versions, as the problem size increases and for similar or higher quality results. The GPU speedup factor over CPU was of 20 times faster for the CFA image pairs, but GPU computation time is about 0.2s for a small image pair from Middlebury database. The near real-time stereovision algorithm takes about 0.017s for a small image pair, which is one of the fastest records in the Middlebury benchmark with moderate quality. The structured mesh generation is evaluated on Middlebury data set to gauge the GPU acceleration factor and quality obtained. The acceleration factor for the GPU parallel self-organizing map over the CPU version, on the largest TSP problem with 33708 cities, is of 30 times faster
Dufournaud, Yves. "Navigation aérienne et guidage terminal à partir de données bidimensionnelles." Grenoble INPG, 2001. http://www.theses.fr/2001INPG0040.
Full textBraux-Zin, Jim. "Contributions aux problèmes de l'étalonnage extrinsèque d'affichages semi-transparents pour la réalité augmentée et de la mise en correspondance dense d'images." Thesis, Clermont-Ferrand 1, 2014. http://www.theses.fr/2014CLF1MM13/document.
Full textAugmented reality is the process of inserting virtual elements into a real scene, observed through a screen. Augmented Reality systems can take different forms to get the desired balance between three criteria: accuracy, latency and robustness. Three main components can be identified: localization, reconstruction and display. The contributions of this thesis are focused on display and reconstruction. Most augmented reality systems use non-transparent screens as they are widely available. However, for critical applications such as surgery or driving assistance, the user cannot be ever isolated from reality. We answer this problem by proposing a new “augmented tablet” system with a semi-transparent screen. Such a system needs a suitable calibration scheme:to correctly align the displayed augmentations and reality, one need to know at every moment the poses of the user and the observed scene with regard to the screen. Two tracking devices (user and scene) are thus necessary, and the system calibration aims to compute the pose of those devices with regard to the screen. The calibration process set up in this thesis is as follows: the user indicates the apparent projections in the screen of reference points from a known 3D object ; then the poses to estimate should minimize the 2D on-screen distance between those projections and the ones computed by the system. This is a non-convex problem difficult to solve without a sane initialization. We develop a direct estimation method by computing the extrinsic parameters of virtual cameras. Those are defined by their optical centers which coincide with user positions, and their common focal plane consisting of the screen plane. The user-entered projections are then the 2D observations of the reference points in those virtual cameras. A symmetrical thinking allows one to define virtual cameras centered on the reference points, and “looking at” the user positions. Those initial estimations can then be refined with a bundle adjustment. Meanwhile, 3D reconstruction is based on the triangulation of matches between images. Those matches can be sparse when computed by detection and description of image features or dense when computed through the minimization of a cost function of the whole image. A dense correspondence field is better because it makes it possible to reconstruct a 3D surface, useful especially for realistic handling of occlusions for augmented reality. However, such a field is usually estimated thanks to variational methods, minimizing a convex cost function using local information. Those methods are accurate but subject to local minima, thus limited to small deformations. In contrast, sparse matches can be made very robust by using adequately discriminative descriptors. We propose to combine the advantages of those two approaches by adding a feature-based term into a dense variational method. It helps prevent the optimization from falling into local minima without degrading the end accuracy. Our feature-based term is suited to feature with non-integer coordinates and can handle point or line segment matches while implicitly filtering false matches. We also introduce comprehensive handling of occlusions so as to support large deformations. In particular, we have adapted and generalized a local method for detecting selfocclusions. Results on 2D optical flow and wide-baseline stereo disparity estimation are competitive with the state of the art, with a simpler and most of the time faster method. This proves that our contributions enables new applications of variational methods without degrading their accuracy. Moreover, the weak coupling between the components allows great flexibility and genericness. This is the reason we were able to also transpose the proposed method to the problem of non-rigid surface registration and outperforms the state of the art methods
Zhou, Shuting. "Navigation of a quad-rotor to access the interior of a building." Thesis, Compiègne, 2015. http://www.theses.fr/2015COMP2237.
Full textThis research work is dedicated to the development of an autonomous navigation strategy which includes generating an optimal trajectory with obstacles avoiding capabilities, detecting specific object of interest (i.e. a window) and then conducting the subsequent maneuver to approach the window and finally access into the building. The vehicle is navigated by a vision system and a combination of inertial and altitude sensors, which achieve a relative localization of the quad-rotor with respect to its surrounding environment. A MPC-based path planning method using the information provided by the GPS and the visual sensor has been developed to generate an optimal real-time trajectory with collision avoidance capabilities, which starts from an initial point given by the user and guides the vehicle to achieve the final point outside the target building. With the aim of detecting and locating the object of interest, two different vision-based object detection strategies are proposed and are applied respectively in the stereo vision system and the vision system using the Kinect. After estimating the target window model, a motion estimation framework is developed to estimate the vehicle’s ego-motion from the images provided by the visual sensor. There have been two versions of the motion estimation frameworks for both vision systems. A quad-rotor experimental platform is developed. For estimating the translational dynamic of the vehicle, a Kalman filter is implemented to combine the imaging, inertial and altitude sensors. A hierarchical sensing and control system is designed to perform the navigation and control of the quad-rotor helicopter, which allows the vehicle to estimate the state without artificial marks or other external positioning systems
Ruf, Andreas. "Couplage mouvements articulées - vision stéréoscopique : une approche projective." Phd thesis, 2001. http://tel.archives-ouvertes.fr/tel-00584109.
Full textNarasimha, Ramya. "Méthodes dʼestimation de la profondeur par mise en correspondance stéréoscopique à lʼaide de champs aléatoires couplés." Phd thesis, 2010. http://tel.archives-ouvertes.fr/tel-00543238.
Full textChapdelaine-Couture, Vincent. "Le cinéma omnistéréo ou l'art d'avoir des yeux tout le tour de la tête." Thèse, 2011. http://hdl.handle.net/1866/8382.
Full textThis thesis deals with aspects of shooting, projection and perception of stereo panoramic cinema, also called omnistereo cinema. It falls largely in the field of computer vision, but it also in the areas of computer graphics and human visual perception. Omnistereo cinema uses immersive screens to project videos that provide depth information of a scene all around the spectators. Many challenges remain in omnistereo cinema, in particular shooting omnistereo videos for dynamic scenes, polarized projection on highly reflective screens making difficult the process to recover their shape by active reconstruction, and perception of depth distortions introduced by omnistereo images. Our thesis addressed these challenges by making three major contributions. First, we developed the first mosaicing method of omnistereo videos for stochastic and localized motions. We developed a psychophysical experiment that shows the effectiveness of the method for scenes without isolated structure, such as water flows. We also propose a shooting method that adds to these videos foreground motions that are not as constrained, like a moving actor. Second, we introduced new light patterns that allow a camera and a projector to recover the shape of objects likely to produce interreflections. These patterns are general enough to not only recover the shape of omnistereo screens, but also very complex objects that have depth discontinuities from the viewpoint of the camera. Third, we showed that omnistereo distortions are negligible for a viewer located at the center of a cylindrical screen, as they are in the periphery of the visual field where the human visual system becomes less accurate.