Dissertations / Theses on the topic 'CUDA FRAMEWORK'
Consult the top 16 dissertations / theses for your research on the topic 'CUDA FRAMEWORK.'
Dworaczyk, Wiltshire Austin Aaron. "CUDA ENHANCED FILTERING IN A PIPELINED VIDEO PROCESSING FRAMEWORK." DigitalCommons@CalPoly, 2013. https://digitalcommons.calpoly.edu/theses/1072.
Karlsson, Per. "A GPU-based framework for efficient image processing." Thesis, Linköpings universitet, Medie- och Informationsteknik, 2014. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-112093.
Giordano, Andrea. "Sviluppo di una simulazione ad agenti di un modello di infezione virale tramite il framework FLAME GPU." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2018. http://amslaurea.unibo.it/15755/.
Fabian, Xavier. "Precision measurements in the weak interaction framework: development of realistic simulations for the LPCTrap device installed at GANIL." Caen, 2015. http://hal.in2p3.fr/tel-01288412.
This work belongs to the effort presently deployed to measure the angular correlation parameter aβν in three nuclear beta decays (6He+, 35Ar+ and 19Ne+). The V-A structure of the weak interaction implies that aβν = +1 for a pure Fermi transition and aβν = -1/3 for a pure Gamow-Teller transition. A precise measurement of this parameter, checking for any deviation from these values, may lead to the discovery of possible exotic currents. Furthermore, the measurement of aβν in mirror transitions allows the extraction of Vud, the first element of the Cabibbo-Kobayashi-Maskawa (CKM) matrix. The LPCTrap apparatus, installed at GANIL, is designed to prepare a continuous ion beam for injection into a dedicated Paul trap. This trap provides a quasi-point-like source from which the decay products are detected in coincidence. The parameter aβν is extracted from the recoil-ion time-of-flight (TOF) distribution, as are, since 2010, the associated shake-off (SO) probabilities. This study requires a complete simulation of the LPCTrap experiments. The major part of this work is dedicated to such simulations, especially to modeling the dynamics of the trapped ion cloud. The CLOUDA program, which takes advantage of graphics processing units (GPUs), was developed in this context, and its full characterization is presented here. Three important aspects are addressed: the electromagnetic trapping field, realistic collisions between the ions and the buffer gas atoms, and the space charge effect. The present work shows the importance of these simulations for improving the control of the systematic errors on aβν.
Badalov, Alexey Pavlovich. "Coprocessor integration for real-time event processing in particle physics detectors." Doctoral thesis, Universitat Ramon Llull, 2016. http://hdl.handle.net/10803/396128.
High-energy physics experiments today have higher energies, more accurate sensors, and more flexible means of data collection than ever before. Their rapid progress requires ever more computational power, and massively parallel hardware, such as graphics cards, promises to provide this power at a much lower cost than traditional CPUs. Yet using this hardware requires new algorithms and new approaches to organizing data that can be difficult to integrate with existing software. In this work, I explore the problem of using parallel algorithms within existing CPU-oriented frameworks and propose a compromise between the different trade-offs. The solution is a service that communicates with multiple event-processing pipelines, gathers data into batches, and submits them to hardware-accelerated parallel algorithms. I integrate this service with Gaudi, a framework underlying the software environments of two of the four major experiments at the Large Hadron Collider. I examine the overhead the service adds to parallel algorithms. I perform a case study of using the service to run a parallel track reconstruction algorithm for the LHCb experiment's prospective VELO Pixel subdetector (the subdetector responsible for vertex location in the LHCb upgrade) and look at the performance characteristics of different data batch sizes. Finally, I put the findings into perspective within the context of the LHCb trigger's requirements.
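The batching idea in the abstract above can be illustrated with a minimal sketch. It is not the thesis's Gaudi service: the class name `BatchingService`, its methods, and the `run_batch` callback (a stand-in for launching a hardware-accelerated algorithm) are hypothetical names chosen for this example.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Sequence


@dataclass
class BatchingService:
    """Hypothetical sketch: gathers events from pipelines into fixed-size batches."""

    batch_size: int
    # Stand-in for a hardware-accelerated algorithm that processes a whole batch.
    run_batch: Callable[[Sequence[float]], List[float]]
    _pending: List[float] = field(default_factory=list)
    _results: List[float] = field(default_factory=list)

    def submit(self, event: float) -> None:
        """Queue one event; process the batch as soon as it is full."""
        self._pending.append(event)
        if len(self._pending) >= self.batch_size:
            self.flush()

    def flush(self) -> None:
        """Process whatever is pending, even if the batch is not full."""
        if self._pending:
            self._results.extend(self.run_batch(self._pending))
            self._pending = []

    def results(self) -> List[float]:
        return list(self._results)


# Usage: two "pipelines" submit events; the batch algorithm doubles each value.
service = BatchingService(batch_size=3, run_batch=lambda batch: [2 * x for x in batch])
for event in (1.0, 2.0, 3.0, 4.0):
    service.submit(event)
service.flush()  # drain the final partial batch
```

Larger batches amortize the fixed cost of each launch but make early events wait longer, which is the trade-off the abstract studies through different batch sizes.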
Varadarajan, Aravind Krishnan. "Improving Bio-Inspired Frameworks." Thesis, Virginia Tech, 2018. http://hdl.handle.net/10919/97506.
Odeh, Khuloud, Annita Seckinger, and Carina Forsman-Knecht. "Connected Urban Development (CUD) Initiative as an Approach towards Sustainability in Urban Areas." Thesis, Blekinge Tekniska Högskola, Avdelningen för maskinteknik, 2009. http://urn.kb.se/resolve?urn=urn:nbn:se:bth-3127.
With the growing number of information and communication technology (ICT)-based initiatives aimed at urban sustainability, it is important to examine what these initiatives can contribute when they support the sustainable development of communities. This thesis examines the potential of Connected Urban Development (CUD) as an initiative and its supporting strategy for the sustainable development of cities. Proposed adjustments to CUD's strategy are studied, and recommendations are given for better aligning the current strategies with CUD's new goal. The original strategies were written for the original goal, the reduction of carbon dioxide emissions, and needed to be adjusted to better target the new goal, global sustainability. This was achieved by studying Connected Urban Development as an organisation and by interviewing representatives of the CUD organisation, representatives of the pilot-project cities, and various experts in urban planning, IT, and sustainable development (both researchers and practitioners). A scientific approach to knowledge of sustainability concepts forms the basis for this evaluation of the CUD initiative, its benefits and challenges, including the role of broadband connectivity and the application of information and communication technology. In this context, recommendations are given to further improve the CUD initiative's effectiveness with respect to the sustainable development of cities. The group's research started from an envisioned ideal initiative in terms of system boundaries and components, strategic guidelines, actions, and tools, called "CUD Gold", and actions are proposed to make CUD more strategic in its pioneering effort to support sustainable cities at the global level.
Awan, Ammar Ahmad. "Co-designing Communication Middleware and Deep Learning Frameworks for High-Performance DNN Training on HPC Systems." The Ohio State University, 2020. http://rave.ohiolink.edu/etdc/view?acc_num=osu1587433770960088.
Oliveira, Danilo Senen Cavallieri de. "Fintechs e inclusão financeira: o caso da implementação de uma plataforma digital de pagamentos em favelas do Rio de Janeiro e São Paulo." reponame:Repositório Institucional do FGV, 2018. http://hdl.handle.net/10438/23940.
This study aims to answer the research question: 'How does the process of implementing a digital payments platform, developed by a fintech, that aims to promote financial inclusion occur?' To that end, it analyses the case of CUFA Card, a digital payments platform implemented in the Parque União favela, part of the Complexo da Maré in Rio de Janeiro, and in the community of Heliópolis, in São Paulo. The work comprises a literature review, which shows how fintechs can be an opportunity to promote financial inclusion, and a case study in which interviews were conducted with the social groups involved in creating or implementing the platform; the interviews were transcribed, coded, and analysed using the Atlas TI© software. The main contribution is an analysis of how such an implementation occurs, in light of the CUFA Card case, together with a description of how different social groups coordinated to make the project viable and an account of the content resulting from the implementation of this technology, which derives from the partnership between the fintech Conta Um and the FHolding/CUFA organization. The study uses the multilevel framework of Pozzebon, Diniz and Jayo (2009), a theory native to the information systems field, which enabled a better understanding of the case by simultaneously analysing technological and social aspects of the platform's implementation in the context where it is being inserted. The study also generates insights for future research and practice by examining the relationship between fintechs and financial inclusion, a still incipient topic in the literature, and by analysing the implementation process of a payment platform, highlighting the crucial issues in this process.
Chen, Yu-Wen, and 陳郁文. "Online Derivatives Arbitrage Trading Mechanism Based on CUDA Framework." Thesis, 2016. http://ndltd.ncl.edu.tw/handle/35874995565142123742.
National Chiao Tung University (國立交通大學), Institute of Computer Science and Engineering (資訊科學與工程研究所), academic year 104.
Parallel computing denotes a technique in which multiple processing units simultaneously process a huge amount of data with low dependency. In other words, a complex problem or a huge data set can be divided into many small independent problems or small data chunks, and the overall computation time is reduced by allocating these problems to different processing units at the same time. High-frequency trading is becoming important in financial markets, so the ability to handle a huge amount of financial trading data in real time is critical. This thesis applies parallel computing to search for arbitrage opportunities and to design trading strategies for TAIEX options and futures. Arbitrage opportunities usually come from occasionally irrational price quotes. In highly competitive and mature markets, they are not only extremely rare but also fleeting. A technique that can process a large amount of data rapidly, such as parallel computing, is therefore well suited to finding them. This research revises the framework of [5]. I implement two arbitrage strategies, the convexity strategy and the put-call-future parity strategy, and introduce a spread strategy into the framework to seek arbitrage opportunities on the TAIEX futures exchange. In addition to the offline framework, which uses a virtual exchange to simulate trades, I add an online real-time trading mode that receives price quotes from a remote server and sends back the encoded strategies through a TCP channel.
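The two named arbitrage checks can be sketched as follows. This is a minimal illustration, not the thesis's implementation: it assumes European-style options and a futures contract with the same expiry, ignores bid/ask spreads and transaction costs, and all function names, parameters, and sample quotes are hypothetical. Each strike is checked independently of the others, which is what makes this kind of scan a natural fit for one GPU thread per quote combination.

```python
import math


def parity_gap(call, put, future, strike, rate, years):
    """Return C - P - (F - K) * exp(-r * T); zero when put-call-futures parity holds."""
    return call - put - (future - strike) * math.exp(-rate * years)


def convexity_violation(c_low, c_mid, c_high, k_low, k_mid, k_high):
    """Amount by which the middle call exceeds the convex bound of its neighbours."""
    weight = (k_high - k_mid) / (k_high - k_low)
    bound = weight * c_low + (1 - weight) * c_high
    return max(0.0, c_mid - bound)


def scan_parity(quotes, future, rate, years, threshold):
    """Return the strikes whose parity gap exceeds the threshold.

    quotes is a list of (strike, call_price, put_price) tuples.
    """
    return [
        strike
        for strike, call, put in quotes
        if abs(parity_gap(call, put, future, strike, rate, years)) > threshold
    ]


# Usage with made-up quotes: the second strike is mispriced by 30 index points.
quotes = [(10000.0, 120.0, 100.0), (10100.0, 90.0, 140.0)]
flagged = scan_parity(quotes, future=10020.0, rate=0.0, years=0.1, threshold=5.0)
```

In practice the threshold would have to cover trading costs before a flagged strike counts as a real opportunity.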
SUN, PO-YUAN, and 孫伯元. "A Nonlinear Dynamic Analysis Acceleration Framework Using CUDA And OpenMP." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/d2cvf8.
National Taipei University of Technology (國立臺北科技大學), Department of Civil Engineering, Master's Program in Civil and Disaster Prevention Engineering (土木工程系土木與防災碩士班), academic year 107.
Verifying the seismic capacity of a structure requires many types of performance tests. One of them, real-time hybrid testing, requires a real-time numerical analysis (based on finite element analysis) to run concurrently with the test. However, nonlinear dynamic structural analysis involves a large amount of computation and takes a long time. This work aims to accelerate real-time numerical analysis. Built on the open-source structural analysis program OpenSees, the framework can use CUDA and OpenMP for parallel computation and is suitable for multicore or SIMD hardware. Expensive runtime routines, such as forming the stiffness matrix and updating the numerical model, are distributed to multicore CPUs and GPUs to reduce the calculation time in each time step.
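Why stiffness-matrix formation parallelizes well can be seen in a small sketch. It does not use or represent the OpenSees API: it assumes a hypothetical one-dimensional chain of two-node spring elements, and a Python thread pool stands in for OpenMP threads or CUDA blocks. The per-element matrices are independent of one another, so they can be computed concurrently; only the final scatter-add into the global matrix touches shared entries.

```python
from concurrent.futures import ThreadPoolExecutor


def element_stiffness(k):
    """Local 2x2 stiffness matrix of a two-node spring with stiffness k."""
    return [[k, -k], [-k, k]]


def assemble(stiffnesses, workers=4):
    """Assemble the global stiffness matrix of a chain of springs.

    Element e connects node e to node e + 1 (hypothetical layout for this sketch).
    """
    n_nodes = len(stiffnesses) + 1
    global_k = [[0.0] * n_nodes for _ in range(n_nodes)]

    # Independent per-element work: the part handed to parallel workers.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        local_matrices = list(pool.map(element_stiffness, stiffnesses))

    # Scatter-add: neighbouring elements share a node, so this step is serial here.
    for e, local in enumerate(local_matrices):
        for a in range(2):
            for b in range(2):
                global_k[e + a][e + b] += local[a][b]
    return global_k


# Usage: two springs in series with stiffness 1.0 and 2.0.
K = assemble([1.0, 2.0])
```

On a GPU the scatter-add would need atomic updates or element colouring so that two elements never write to the same entry at once; the serial loop above sidesteps that.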
Stinson, Derek L. "Deep Learning with Go." Thesis, 2020. http://hdl.handle.net/1805/22729.
Current research in deep learning is primarily focused on using Python as a support language. Go, an emerging language with many benefits, including native support for concurrency, has seen a rise in adoption over the past few years. However, it is not widely used to develop learning models because of the lack of supporting libraries and frameworks for model development. This thesis explores the use of Go for developing neural network models in general and convolutional neural networks in particular. The proposed study is based on a Go-CUDA implementation of neural network models called GoCuNets. This implementation is compared with ConvNetGo, a Go-CPU deep learning implementation that takes advantage of Go's built-in concurrency. The comparison shows a significant performance gain when using GoCuNets instead of ConvNetGo.
ARUN. "HUMAN EMOTION RECOGNITION USING DEEO LEARNING TECHNIQUES." Thesis, 2017. http://dspace.dtu.ac.in:8080/jspui/handle/repository/16003.
(8812109), Derek Leigh Stinson. "Deep Learning with Go." Thesis, 2020.
Fraga, António Fernando Crisóstomo. "Parallel Face Detection." Master's thesis, 2020. http://hdl.handle.net/10316/94026.
Face detection is typically used millions of times per day in many different contexts, and the resolution of the images involved has increased significantly. High-resolution images are a serious challenge for sequential architectures: as the number of pixels rises, the overall performance of such implementations decreases drastically. This thesis describes the implementation of a framework based on the Viola-Jones paper "Rapid Object Detection using a Boosted Cascade of Simple Features" [2] on parallel architectures such as GPUs and low-power GPUs. These emerge as natural candidates for the desired acceleration, offering very high computational power and core counts that enable large amounts of data to be processed in parallel. The thesis also presents the parallelization and optimization of the implementation, using the advantages of these architectures to achieve a speedup on high-resolution images compared with sequential architectures. An analysis of the results shows that the implementation succeeds on several GPUs and shows the influence that the available GPU resources (power, CUDA cores, etc.) have on the overall GPU speedup and performance. This parallel face detector obtained a global speedup of up to 33 times on 8k images in comparison with the sequential version.
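The Viola-Jones detector cited above evaluates rectangular features through an integral image, which lets the sum over any rectangle be read with four lookups. The sketch below shows only that data structure in plain sequential Python; it is not the thesis's GPU code, and the function names and the sample image are made up for illustration.

```python
def integral_image(image):
    """Return a (h + 1) x (w + 1) table where ii[y][x] is the sum of image[:y][:x]."""
    height, width = len(image), len(image[0])
    ii = [[0] * (width + 1) for _ in range(height + 1)]
    for y in range(height):
        row_sum = 0
        for x in range(width):
            row_sum += image[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, top, left, height, width):
    """Sum of the pixels in a rectangle, using four lookups in the integral image."""
    bottom, right = top + height, left + width
    return ii[bottom][right] - ii[top][right] - ii[bottom][left] + ii[top][left]


def two_rect_feature(ii, top, left, height, width):
    """Horizontal two-rectangle feature: left half minus right half (width must be even)."""
    half = width // 2
    return rect_sum(ii, top, left, height, half) - rect_sum(ii, top, left + half, height, half)


# Usage on a tiny 3x3 "image".
image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
ii = integral_image(image)
lower_right = rect_sum(ii, 1, 1, 2, 2)  # 5 + 6 + 8 + 9
```

Because every detection window reads the same precomputed table and never writes to it, the windows can be evaluated independently, which is what makes the cascade a candidate for GPU parallelization.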
Park, Yubin. "CUDIA : a probabilistic cross-level imputation framework using individual auxiliary information." Thesis, 2011. http://hdl.handle.net/2152/ETD-UT-2011-12-4746.