Monaco, Joseph W. "Generalized motion models for video applications." Diss., Georgia Institute of Technology, 1997. http://hdl.handle.net/1853/14926.
Owen, Michael (Information Technology & Electrical Engineering, Australian Defence Force Academy, UNSW). "Temporal motion models for video mosaicing and synthesis." Awarded by: University of New South Wales - Australian Defence Force Academy, 2008. http://handle.unsw.edu.au/1959.4/39028.
Jónsson, Ragner H. "Adaptive subband coding of video using probability distribution models." Diss., Georgia Institute of Technology, 1994. http://hdl.handle.net/1853/14453.
Zapata, Iván R. "Detecting humans in video sequences using statistical color and shape models." [Gainesville, Fla.] : University of Florida, 2001. http://etd.fcla.edu/etd/uf/2001/anp1058/ivan%5Fthesis2.pdf.
Title from first page of PDF file. Document formatted into pages; contains viii, 49 p.; also contains graphics. Vita. Includes bibliographical references (p. 47-48).
FOTIO, TIOTSOP LOHIC. "Optimizing Perceptual Quality Prediction Models for Multimedia Processing Systems." Doctoral thesis, Politecnico di Torino, 2022. http://hdl.handle.net/11583/2970982.
Lee, Sangkeun. "Video analysis and abstraction in the compressed domain." Diss., Available online, Georgia Institute of Technology, 2004:, 2003. http://etd.gatech.edu/theses/available/etd-04072004-180041/unrestricted/lee%5fsangkeun%5f200312%5fphd.pdf.
Lazcano, Vanel. "Some problems in depth enhanced video processing." Doctoral thesis, Universitat Pompeu Fabra, 2016. http://hdl.handle.net/10803/373917.
This thesis addresses two problems: data interpolation in the context of disparity computation for both images and video, and the estimation of the apparent motion of objects in an image sequence. The first problem concerns the completion of depth data in a region of an image or video where data are missing because of occlusions, unreliable data, damaged data, or data lost during acquisition. The thesis tackles this problem in two ways. First, it proposes an energy based on non-local gradients, which can (locally) complete planes. This model is regarded as an extension of the bilateral filter to the gradient domain. The model has been evaluated successfully for completing synthetic data and also incomplete depth maps from a Kinect sensor. The second approach is an experimental study of the biased AMLE (biased Absolutely Minimizing Lipschitz Extension) for anisotropic interpolation of depth data in large regions with no information. The AMLE operator is a cone interpolator, whereas the biased AMLE operator is an exponential cone interpolator, which makes it better adapted to depth maps of real scenes (which commonly exhibit convex, concave, and smooth surfaces). In addition, the biased AMLE operator can expand depth data to large regions. By considering the image domain endowed with an anisotropic metric, the proposed method can take underlying geometric information into account so as not to interpolate across the boundaries of objects at different depths. A numerical model based on the eikonal operator has been proposed to compute the solution of the biased AMLE. The numerical model has additionally been extended to video sequences.
Optical flow computation is one of the most challenging problems in computer vision. Traditional models fail to estimate optical flow in the presence of occlusions or non-uniform illumination. To address this problem, a variational model is proposed to jointly estimate optical flow and occlusions. Moreover, the proposed model can tolerate fast object displacements that are larger than the size of the object in the scene, a traditional limitation of variational methods. Adding a term that balances gradients and intensities increases the robustness of the proposed model to illumination changes. Including additional correspondences (obtained by exhaustive search at specific locations) helps to estimate large displacements.
Hautala, I. (Ilkka). "From dataflow models to energy efficient application specific processors." Doctoral thesis, Oulun yliopisto, 2019. http://urn.fi/urn:isbn:9789526223681.
Abstract: The development of wireless networks has created the conditions for several new applications. Among others, social media, streaming services, virtual reality, and the Internet of Things place diverse requirements on portable and wearable devices with respect to functionality, performance, energy consumption, and physical form. One of the biggest challenges is the energy consumption of embedded devices. Efforts to improve the energy efficiency of devices have relied on parallel computing and customized computing resources. This in turn has made both hardware and application development more difficult, because widely used development tools are based on low-level abstractions and use programming languages originally designed for single-core processors. The adoption of high-level and automated development methods has been slowed by the insufficient performance of the resulting systems and their inefficient use of hardware resources. The dissertation presents a toolchain based on dataflow design, intended for implementing energy-efficient signal processing systems. On the hardware side, the design flow presented in the work builds on a customizable and programmable transport-triggered processor template. The proposed design flow allows several heterogeneous processor cores and their interconnections to be customized as the needs of the applications require. In the design flow, software is described using high-level dataflow models. This makes it possible to map software, especially software containing parallel computation, automatically onto different multiprocessor systems, and it speeds up the exploration of different system-level solutions. The usability of the design flow is demonstrated using three different signal processing applications as examples. The results show that the abstraction level of design methods can be raised without a significant loss of performance.
The main application area of the dissertation is video coding. The work presents energy-efficient and reprogrammable processor cores designed for video coding. The solutions are based on using multiple processor cores, exploiting in particular the pipeline parallelism characteristic of video processing algorithms. The power consumption, performance, and area of the processors have been analyzed using simulation models that take the placement and routing of logic cells into account. The proposed application-specific processor solutions offer a new energy-efficient trade-off between conventional programmable processors and hardwired video accelerators.
KABRA, PRATEEK. "IMPLEMENTATION OF REAL TIME OBJECT DETECTION & TRACKING." Thesis, 2012. http://dspace.dtu.ac.in:8080/jspui/handle/repository/13908.
In this project we present an approach to developing a real-time object tracking system that uses a static camera to grab the video frames and track an object. The work presents the concepts of histogram matching and absolute frame subtraction to implement a robust automated object tracking system. Once the object is detected, it is tracked using a discrete Kalman filter. The histogram matching algorithm helps to identify when the object enters the viewing range of the camera, and the absolute frame subtraction gives better results even with low-quality videos. Such a tracking system can be used in surveillance applications and proves to be cost effective. A Simulink model is also developed for object tracking in real-time video.
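The absolute frame subtraction step mentioned in this abstract can be illustrated with a minimal sketch. It is not the thesis implementation: the function names `motion_mask` and `centroid`, the threshold of 25, and the synthetic 8x8 frames are assumptions chosen for illustration, and the Kalman filtering stage is not shown.

```python
import numpy as np

def motion_mask(prev_frame, curr_frame, threshold=25):
    """Absolute frame subtraction: flag pixels whose intensity changed by more than `threshold`.

    The threshold value is an illustrative assumption, not taken from the thesis.
    """
    # Cast to a signed type so the subtraction of uint8 frames cannot wrap around.
    diff = np.abs(curr_frame.astype(np.int16) - prev_frame.astype(np.int16))
    return diff > threshold

def centroid(mask):
    """Return the (row, col) centroid of the flagged pixels, or None if nothing changed."""
    rows, cols = np.nonzero(mask)
    if rows.size == 0:
        return None
    return float(rows.mean()), float(cols.mean())

# Synthetic grayscale frames: an empty scene, then a bright 2x2 object appears.
prev_frame = np.zeros((8, 8), dtype=np.uint8)
curr_frame = prev_frame.copy()
curr_frame[2:4, 5:7] = 200

mask = motion_mask(prev_frame, curr_frame)
position = centroid(mask)
```

In a tracker of the kind described, a position measured this way would typically serve as the observation passed to the Kalman filter on each frame.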
"Markov random fields based image and video processing." Thesis, 2010. http://library.cuhk.edu.hk/record=b6074890.
Many problems in computer vision involve assigning each pixel a label, which represents some spatially varying quantity such as image intensity in image denoising or object index label in image segmentation. In general, such quantities in image processing tend to be spatially piecewise smooth, since they vary smoothly in the object surface and change dramatically at object boundaries, while in video processing, additional temporal smoothness is satisfied as the corresponding pixels in different frames should have similar labels. Markov random field (MRF) models provide a robust and unified framework for many image and video applications. The framework can be elegantly expressed as an MRF-based energy minimization problem, where two penalty terms are defined with different forms. Many approaches have been proposed to solve the MRF-based energy optimization problem, such as simulated annealing, iterated conditional modes, graph cuts, and belief propagation.
Promising results obtained by the proposed algorithms, with both quantitative and qualitative comparisons to the state-of-the-art methods, demonstrate the effectiveness of our algorithms in these image and video processing applications.
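The MRF energy minimization framing in this abstract can be made concrete with a small sketch. It uses iterated conditional modes, one of the solvers the abstract lists, on a toy labelling problem. The quadratic data term, the Potts smoothness term, the 4-neighbourhood, and the names `mrf_energy` and `icm` are assumptions for illustration and are not the algorithms proposed in the thesis.

```python
import numpy as np

def mrf_energy(labels, observed, smooth_weight=1.0):
    """Energy = data penalty + smoothness penalty (both forms are illustrative assumptions)."""
    data = np.sum((labels - observed) ** 2)
    # Potts penalty: count horizontally and vertically adjacent pixels with different labels.
    smooth = np.sum(labels[:, 1:] != labels[:, :-1]) + np.sum(labels[1:, :] != labels[:-1, :])
    return float(data + smooth_weight * smooth)

def icm(observed, num_labels=2, smooth_weight=1.0, sweeps=5):
    """Iterated conditional modes: greedily pick, pixel by pixel, the label with lowest local cost."""
    labels = observed.copy()
    height, width = labels.shape
    for _ in range(sweeps):
        for r in range(height):
            for c in range(width):
                best_label, best_cost = labels[r, c], np.inf
                for k in range(num_labels):
                    cost = (k - observed[r, c]) ** 2
                    for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                        rr, cc = r + dr, c + dc
                        if 0 <= rr < height and 0 <= cc < width:
                            cost += smooth_weight * (k != labels[rr, cc])
                    if cost < best_cost:
                        best_label, best_cost = k, cost
                labels[r, c] = best_label
    return labels

# Toy binary image: a flat background with one isolated noisy pixel.
observed = np.zeros((5, 5), dtype=int)
observed[2, 2] = 1

denoised = icm(observed)
energy_before = mrf_energy(observed, observed)
energy_after = mrf_energy(denoised, observed)
```

ICM only reaches a local minimum of the energy, which is one reason methods such as graph cuts and belief propagation are often preferred for harder labelling problems.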
Liu, Ming.
Adviser: Xiaoou Tang.
Source: Dissertation Abstracts International, Volume: 72-04, Section: B, page: .
Thesis (Ph.D.)--Chinese University of Hong Kong, 2010.
Includes bibliographical references (leaves 79-89).
Electronic reproduction. Hong Kong : Chinese University of Hong Kong, [2012] System requirements: Adobe Acrobat Reader. Available via World Wide Web.
Abstract also in Chinese.
"Video based dynamic scene analysis and multi-style abstraction." 2008. http://library.cuhk.edu.hk/record=b5893627.
Thesis (M.Phil.)--Chinese University of Hong Kong, 2008.
Includes bibliographical references (leaves 89-97).
Abstracts in English and Chinese.
Abstract
Acknowledgements
Chapter 1 --- Introduction
Chapter 1.1 --- Window-oriented Retargeting
Chapter 1.2 --- Abstraction Rendering
Chapter 1.3 --- Thesis Outline
Chapter 2 --- Related Work
Chapter 2.1 --- Video Migration
Chapter 2.2 --- Video Synopsis
Chapter 2.3 --- Periodic Motion
Chapter 2.4 --- Video Tracking
Chapter 2.5 --- Video Stabilization
Chapter 2.6 --- Video Completion
Chapter 3 --- Active Window Oriented Video Retargeting
Chapter 3.1 --- System Model
Chapter 3.1.1 --- Foreground Extraction
Chapter 3.1.2 --- Optimizing Active Windows
Chapter 3.1.3 --- Initialization
Chapter 3.2 --- Experiments
Chapter 3.3 --- Summary
Chapter 4 --- Multi-Style Abstract Image Rendering
Chapter 4.1 --- Abstract Images
Chapter 4.2 --- Multi-Style Abstract Image Rendering
Chapter 4.2.1 --- Multi-style Processing
Chapter 4.2.2 --- Layer-based Rendering
Chapter 4.2.3 --- Abstraction
Chapter 4.3 --- Experimental Results
Chapter 4.4 --- Summary
Chapter 5 --- Interactive Abstract Videos
Chapter 5.1 --- Abstract Videos
Chapter 5.2 --- Multi-Style Abstract Video
Chapter 5.2.1 --- Abstract Images
Chapter 5.2.2 --- Video Morphing
Chapter 5.2.3 --- Interactive System
Chapter 5.3 --- Interactive Videos
Chapter 5.4 --- Summary
Chapter 6 --- Conclusions
Chapter A --- List of Publications
Chapter B --- Optical flow
Chapter C --- Belief Propagation
Bibliography
"Compressive Sensing for 3D Data Processing Tasks: Applications, Models and Algorithms." Thesis, 2012. http://hdl.handle.net/1911/70314.
Kelly, Brian J. "Processing spatial information from photographs, video, and scale models: Complex mental representation in children (Homo sapiens) and monkeys (Macaca mulatta)." 2008. https://scholarworks.umass.edu/dissertations/AAI3337018.
Amiri, Delaram. "Bilateral and adaptive loop filter implementations in 3D-high efficiency video coding standard." Thesis, 2015. http://hdl.handle.net/1805/7983.
In this thesis, we describe a different implementation of in-loop filtering for 3D-HEVC. First, we propose the use of the adaptive loop filtering (ALF) technique for in-loop filtering in the 3D-HEVC standard. This filter uses a Wiener-based method to minimize the mean squared error between the filtered pixels and the original pixels. The performance of the adaptive loop filter at the picture level is evaluated. Results show up to 0.2 dB PSNR improvement in the luminance component for the texture and 2.1 dB for the depth. In addition, we obtain up to 0.1 dB improvement in the chrominance component for the texture view after applying this filter in picture-based filtering. Moreover, a design of in-loop filtering with a Fast Bilateral Filter for the 3D-HEVC standard is proposed. A bilateral filter smooths an image while preserving strong edges, and it can remove artifacts in an image. The performance of the bilateral filter at the picture level for 3D-HEVC is evaluated. Test model HTM-6.2 is used to demonstrate the results. Results show up to a 20 percent reduction in the processing time of 3D-HEVC with less than affecting PSNR of the encoded 3D video using Fast Bilateral Filter.
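The PSNR and mean squared error figures quoted in this abstract follow the standard definitions, which can be sketched as below. The peak value of 255 assumes 8-bit samples, and the function names and the synthetic 4x4 pictures are illustrative assumptions, not code or data from the thesis or from the HTM test model.

```python
import numpy as np

def mse(reference, test):
    """Mean squared error between two pictures of the same shape."""
    diff = reference.astype(np.float64) - test.astype(np.float64)
    return float(np.mean(diff ** 2))

def psnr(reference, test, peak=255.0):
    """Peak signal-to-noise ratio in dB; `peak` assumes 8-bit samples."""
    err = mse(reference, test)
    if err == 0:
        return float("inf")  # identical pictures
    return float(10.0 * np.log10(peak ** 2 / err))

# Synthetic 8-bit luminance pictures: the "decoded" one is offset by 5 everywhere.
original = np.zeros((4, 4), dtype=np.uint8)
decoded = np.full((4, 4), 5, dtype=np.uint8)

error = mse(original, decoded)
quality_db = psnr(original, decoded)
```

A gain such as the reported 0.2 dB would correspond to the difference between two such PSNR values, one measured with the filter enabled and one without.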
Mantiuk, Rafał [author]. "High fidelity imaging : the computational models of the human visual system in high dynamic range video compression, visible difference prediction and image processing / submitted by Rafał Mantiuk." 2007. http://d-nb.info/985159871/34.