Dissertations / Theses on the topic 'CNN'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'CNN.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Garbay, Thomas. "Zip-CNN." Electronic Thesis or Diss., Sorbonne université, 2023. https://accesdistant.sorbonne-universite.fr/login?url=https://theses-intra.sorbonne-universite.fr/2023SORUS210.pdf.
Full textDigital systems used for the Internet of Things (IoT) and Embedded Systems have seen an increasing use in recent decades. Embedded systems based on Microcontroller Unit (MCU) solve various problems by collecting a lot of data. Today, about 250 billion MCU are in use. Projections in the coming years point to very strong growth. Artificial intelligence has seen a resurgence of interest in 2012. The use of Convolutional Neural Networks (CNN) has helped to solve many problems in computer vision or natural language processing. The implementation of CNN within embedded systems would greatly improve the exploitation of the collected data. However, the inference cost of a CNN makes their implementation within embedded systems challenging. This thesis focuses on exploring the solution space, in order to assist the implementation of CNN within embedded systems based on microcontrollers. For this purpose, the ZIP-CNN methodology is defined. It takes into account the embedded system and the CNN to be implemented. It provides an embedded designer with information regarding the impact of the CNN inference on the system. A designer can explore the impact of design choices, with the objective of respecting the constraints of the targeted application. A model is defined to quantitatively provide an estimation of the latency, the energy consumption and the memory space required to infer a CNN within an embedded target, whatever the topology of the CNN is. This model takes into account algorithmic reductions such as knowledge distillation, pruning or quantization. The implementation of state-of-the-art CNN within MCU verified the accuracy of the different estimations through an experimental process. This thesis democratize the implementation of CNN within MCU, assisting the designers of embedded systems. Moreover, the results open a way of exploration to apply the developed models to other target hardware, such as multi-core architectures or FPGA. The estimation results are also exploitable in the Neural Architecture Search (NAS)
Carpani, Valerio. "CNN-based video analytics." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2018.
Find full textLara, Teodoro. "Controllability and applications of CNN." Diss., Georgia Institute of Technology, 1997. http://hdl.handle.net/1853/28921.
Full textSamal, Kruttidipta. "FPGA acceleration of CNN training." Thesis, Georgia Institute of Technology, 2015. http://hdl.handle.net/1853/54467.
Full textMohamed, Moussa Elmokhtar. "Conversion d’écriture hors-ligne en écriture en-ligne et réseaux de neurones profonds." Electronic Thesis or Diss., Nantes Université, 2024. http://www.theses.fr/2024NANU4001.
Full textThis thesis focuses on the conversion of static images of offline handwriting into temporal signals of online handwriting. Our goal is to extend neural networks beyond the scale of images of isolated letters and as well to generalize to other complex types of content. The thesis explores two distinct neural network-based approaches, the first approach is a fully convolutional multitask UNet-based network, inspired by the method of [ZYT18]. This approach demonstrated good results for skeletonization but suboptimal stroke extrac- tion. Partly due to the inherent temporal mod- eling limitations of CNN architecture. The second approach builds on the pre- vious skeletonization model to extract sub- strokes and proposes a sub-stroke level modeling with Transformers, consisting of a sub- stroke embedding transformer (SET) and a sub-stroke ordering transformer (SORT) to or- der the different sub-strokes as well as pen up predictions. This approach outperformed the state of the art on text lines and mathematical equations databases and addressed several limitations identified in the literature. These advancements have expanded the scope of offline-to-online conversion to include entire text lines and generalize to bidimensional content, such as mathematical equations
Rossetto, Andrea. "CNN per view synthesis da mappe depth." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2018. http://amslaurea.unibo.it/16570/.
Full textCastelli, Filippo Maria. "3D CNN methods in biomedical image segmentation." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2019. http://amslaurea.unibo.it/18796/.
Full textRingenson, Josefin. "Efficiency of CNN on Heterogeneous Processing Devices." Thesis, Linköpings universitet, Programvara och system, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-155034.
Full textKristin, Hallberg. "Islam, BBC och CNN : Palestinska inbördeskriget 2006-2007." Thesis, Uppsala universitet, Teologiska institutionen, 2016. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-295888.
Full textEklund, Anton. "Cascade Mask R-CNN and Keypoint Detection used in Floorplan Parsing." Thesis, Uppsala universitet, Institutionen för informationsteknologi, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-415371.
Full textGu, Dongfeng. "3D Densely Connected Convolutional Network for the Recognition of Human Shopping Actions." Thesis, Université d'Ottawa / University of Ottawa, 2017. http://hdl.handle.net/10393/36739.
Full textChen, Tairui. "Going Deeper with Convolutional Neural Network for Intelligent Transportation." Digital WPI, 2016. https://digitalcommons.wpi.edu/etd-theses/144.
Full textMukhtar, Hind. "Machine Learning Enabled-Localization in 5G and LTE Using Image Classification and Deep Learning." Thesis, Université d'Ottawa / University of Ottawa, 2021. http://hdl.handle.net/10393/42449.
Full textHossain, Md Tahmid. "Towards robust convolutional neural networks in challenging environments." Thesis, Federation University Australia, 2021. http://researchonline.federation.edu.au/vital/access/HandleResolver/1959.17/181882.
Full textDoctor of Philosophy
Bark, Filip. "Embedded Implementation of Lane Keeping Functionality Using CNN." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-230193.
Full textPå senare tid så har intresset angående självkörande bilar ökat. Detta har lett till att många företag och forskare har börjat jobbat på sina egna lösningar till den myriad av problem som upstår när en bil behöver ta komplicerade beslut på egen hand. Detta projekt undersöker möjligheten att lämna så många av dessa beslut som möjligt till en enda sensor och processor. I detta fall så blir det en Raspberry Pi (RPI) och en kamera som sätts på en radiostyrd bil och skall följa en väg. För att implementera detta så används bildbehandling, eller mer specifikt, convolutional neural networks (CNN) från maskininlärning för att styra bilen med en enda kamera. Det utvecklade nätverket är designat och implementerat med ett bibliotek för maskininlärning i Python som kallas för Keras. Nätverkets design är baserat på det berömda Lenet men den har skalats ner för att öka prestandan och minska storleken som nätverket tar men fortfarande uppnå en anständing träffsäkerhet. Nätverket körs på RPIn, vilken i sin tur är fastsatt på en radiostyrd bil tillsammans med kameran. Kablar har kopplats och blivit lödda mellan RPIn och handkontrollen till radiostyrda bilen så att RPIn kan styra bilen. Själva styrningen lämnats åt en simpel "Bang Bang controller". Utvärdering av nätvärket och prototypen utfördes löpande under projektets gång, enhetstester gjordes enligt glasboxmetoden för att testa och verifiera olika delar av koden. Större experiment gjordes för att säkerställa att nätverket presterar som förväntat i olika situationer. Det slutgiltiga experimentet fastställde att nätverket uppfyller en acceptabel träffsäkerhet och kan styra prototypen utan problem när denne följer olika vägar samt att den kan stanna i de fall den behöver. Detta visar att trots den begränsade storleken på nätverket så kunde det styra en bil baserat på datan från endast en sensor. Detta var dessutom möjligt när man körde nätverket på en liten och svag dator som en RPI, detta visar att CNN var kraftfulla nog i det här fallet.
Fernandez, Brillet Lucas. "Réseaux de neurones CNN pour la vision embarquée." Thesis, Université Grenoble Alpes, 2020. http://www.theses.fr/2020GRALM043.
Full textRecently, Convolutional Neural Networks have become the state-of-the-art soluion(SOA) to most computer vision problems. In order to achieve high accuracy rates, CNNs require a high parameter count, as well as a high number of operations. This greatly complicates the deployment of such solutions in embedded systems, which strive to reduce memory size. Indeed, while most embedded systems are typically in the range of a few KBytes of memory, CNN models from the SOA usually account for multiple MBytes, or even GBytes in model size. Throughout this thesis, multiple novel ideas allowing to ease this issue are proposed. This requires to jointly design the solution across three main axes: Application, Algorithm and Hardware.In this manuscript, the main levers allowing to tailor computational complexity of a generic CNN-based object detector are identified and studied. Since object detection requires scanning every possible location and scale across an image through a fixed-input CNN classifier, the number of operations quickly grows for high-resolution images. In order to perform object detection in an efficient way, the detection process is divided into two stages. The first stage involves a region proposal network which allows to trade-off recall for the number of operations required to perform the search, as well as the number of regions passed on to the next stage. Techniques such as bounding box regression also greatly help reduce the dimension of the search space. This in turn simplifies the second stage, since it allows to reduce the task’s complexity to the set of possible proposals. Therefore, parameter counts can greatly be reduced.Furthermore, CNNs also exhibit properties that confirm their over-dimensionment. This over-dimensionement is one of the key success factors of CNNs in practice, since it eases the optimization process by allowing a large set of equivalent solutions. However, this also greatly increases computational complexity, and therefore complicates deploying the inference stage of these algorithms on embedded systems. In order to ease this problem, we propose a CNN compression method which is based on Principal Component Analysis (PCA). PCA allows to find, for each layer of the network independently, a new representation of the set of learned filters by expressing them in a more appropriate PCA basis. This PCA basis is hierarchical, meaning that basis terms are ordered by importance, and by removing the least important basis terms, it is possible to optimally trade-off approximation error for parameter count. Through this method, it is possible to compress, for example, a ResNet-32 network by a factor of ×2 both in the number of parameters and operations with a loss of accuracy <2%. It is also shown that the proposed method is compatible with other SOA methods which exploit other CNN properties in order to reduce computational complexity, mainly pruning, winograd and quantization. Through this method, we have been able to reduce the size of a ResNet-110 from 6.88Mbytes to 370kbytes, i.e. a x19 memory gain with a 3.9 % accuracy loss.All this knowledge, is applied in order to achieve an efficient CNN-based solution for a consumer face detection scenario. The proposed solution consists of just 29.3kBytes model size. This is x65 smaller than other SOA CNN face detectors, while providing equal detection performance and lower number of operations. Our face detector is also compared to a more traditional Viola-Jones face detector, exhibiting approximately an order of magnitude faster computation, as well as the ability to scale to higher detection rates by slightly increasing computational complexity.Both networks are finally implemented in a custom embedded multiprocessor, verifying that theorical and measured gains from PCA are consistent. Furthermore, parallelizing the PCA compressed network over 8 PEs achieves a x11.68 speed-up with respect to the original network running on a single PE
Lind, Johan. "Evaluating CNN-based models for unsupervised image denoising." Thesis, Linköpings universitet, Institutionen för datavetenskap, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-176092.
Full textSöderström, Douglas. "Comparing pre-trained CNN models on agricultural machines." Thesis, Umeå universitet, Institutionen för fysik, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-185333.
Full textLi, Xile. "Real-time Multi-face Tracking with Labels based on Convolutional Neural Networks." Thesis, Université d'Ottawa / University of Ottawa, 2017. http://hdl.handle.net/10393/36707.
Full textEl-Shafei, Ahmed. "Time multiplexing of cellular neural networks." Thesis, University of Kent, 2001. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.365221.
Full textКириченко, І. О. "Інтелектуальна технологія детектування стану трубопроводів з аугментацією даних в режимі екзамену." Master's thesis, Сумський державний університет, 2021. https://essuir.sumdu.edu.ua/handle/123456789/86859.
Full textGustafsson, Magnus, and Niclas Hagel. "Al-Jazeera och CNN - En jämförande fallstudie i krigsjournalistik." Thesis, Halmstad University, School of Social and Health Sciences (HOS), 2009. http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-2234.
Full textFörfattare: Magnus Gustafsson Niclas Hagel
Handledare: Thomas Knoll
Examinator: Martin Danielsson
Titel: Al-Jazeera och CNN - En jämförande fallstudie i krigsjournalistik
Typ av rapport: C - uppsats
Ämne: Medie- och Kommunikationsvetenskap
År: Höstterminen 2008
Sektion: Sektionen för Hälsa och Samhälle
Syfte: Vårt syfte är att studera och jämföra al-Jazeeras och CNN:s
bevakning av en händelse i Afghanistankonflikten för att kunna
redogöra för eventuella skillnader. Vi vill se hur olika faktorer
påverkar journalistiken. En analys ur ett genusperspektiv
kommer också att göras.
Metod: Fallstudie har tillämpats som huvudsaklig metod och vid analys
av material har innehållsanalys och kritisk diskursanalys använts.
Slutsatser: Efter att ha jämfört de två nyhetskanalerna kan vi tydligt se att
det finns stora skillnader i rapporteringen av ett amerikanskt
flyganfall mot en afghansk by. CNN som amerikansk
nyhetskanal visar att deras rapportering påverkas av det
amerikanska medieklimatet där en neutral krigsrapportering kan
ses som stötande och journalister ständigt utsätts för
påtryckningar. Ur ett genusperspektiv ser vi dock tydliga
likheter mellan kanalerna.
Berg, Albin. "Jämförelse av CNN modeller för objektidentifiering och automatisk markering." Thesis, Högskolan i Skövde, Institutionen för informationsteknologi, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-18637.
Full textEl, Ahmar Wassim. "Head and Shoulder Detection using CNN and RGBD Data." Thesis, Université d'Ottawa / University of Ottawa, 2019. http://hdl.handle.net/10393/39448.
Full textGrogan, Andree Marie. "Observations on the news factory a case study of CNN /." restricted, 2005. http://etd.gsu.edu/theses/available/etd-11172005-173426/.
Full textTitle from title screen. Merrill Morris, committee chair; Marian Meyers, Douglas Barthlow, committee members. Electronic text (98 p.) : digital, PDF file. Description based on contents viewed June 21, 2007. Includes bibliographical references (p. 89-96).
Grogan, Andree Marie. "Observations on the News Factory: A Case Study of CNN." Digital Archive @ GSU, 2006. http://digitalarchive.gsu.edu/communication_theses/6.
Full textHiselius, Leo. "Igenkänning av musikalisk genre med CNN-nätverk och transfer learning." Thesis, KTH, Skolan för teknikvetenskap (SCI), 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-254764.
Full textThis project studies the effects of transfer learning on music information retrieval tasks of CNN-based audio data representations. Several neural networks are fed melspectrogram matrices and trained with random initial weights on three different classification tasks including ’genre’, ’region’ and ’year’ and classification performance is measured, after which transfer learning is utilized and classification performance is measured again. F1-score for individual classes within the different tasks is also measured. Comparing the results shows that transfer learning is applicable in this task domain.
Lee, Yi-Jou, and 李依柔. "A Reconfigurable CNN Accelerator Design." Thesis, 2017. http://ndltd.ncl.edu.tw/handle/15122663000772368149.
Full text國立臺灣大學
資訊工程學研究所
105
With the large size of the convolutional neural network (CNN), performance and energy efficiency of CNN accelerator become an important problem. From previous works, we can find that DRAM accesses took a large part in energy consumption. To reduce DRAM accesses, we observe the computation behavior of convolutional layer, and many parameters are shared between computation. Those data may be loaded on-chip repeatedly with the limitation of on-chip buffer size in an accelerator. We would like to capture data reuse via the on-chip buffer to reduce DRAM accesses of CNN computation. There are three kinds of data reuse can be captured, and those data will be kept by on-chip buffer and be evicted when not needed. The first kind of data reuse is input feature map reuse, the next is filter reuse and the other is intermediate feature map reuse. Each layer in a CNN model may favor different data reuse policy based on the size of its input, output, and filters. But existing CNN accelerators only focus on one type of data reuse through CNN processing. To have flexibility using different data reuse policy for each layer in CNN processing, we would like to propose a reconfigurable CNN accelerator design, which can be configured to capture different types of reuse with the objective of minimizing off-chip memory accesses. With separating the CNN processing into several computation primitives which are units of convolution with different inputs and filters, we can reuse different data by arranging the computation ordering of those computation primitives in our accelerator. And our accelerator will execute based on the instructions generated by off-line generator considering the optimal reuse policy and hardware constraints. Our work shows that with our reconfigurable design, DRAM accesses can be reduced, and compare the execution time and the energy when using different data reuse policy. We also analyze the effect of the different configuration in our CNN accelerator design.
Lopez, Paola Denisse Gomez, and 鮑樂. "Face Keypoint Recognition with CNN." Thesis, 2016. http://ndltd.ncl.edu.tw/handle/87259949052618555601.
Full text元智大學
通訊工程學系
104
Purpose This is an attempt to unravel the problem of human face keypoints recognition. In the new area of machine learning research called deep learning. Different approaches to this problem were evaluated and proposed one system to implement using python libraries for computational skills. Methodology Face keypoints detection was achieved by using a template algorithm. Using GPU instances and convolutional networks consisting of multiple levels. The key idea is to pre-train models in completely unsupervised way and finally they can be fine-tuned for the task at hand using supervised learning using Nesterov Gradient Algorithm to create the perceptron units. Manual detection was used to test implemented face keypoints recognition system. Findings Successful results were obtained for automated face keypoints recognition under robust and controlled conditions. The experimental results show that the model provides better results than publicly available benchmarks for the dataset. Originality/Value Discuss different machine learning techniques used for face keypoints detection and provide a description why most algorithms are based in neural networks. Keywords Convolution Neural network, face keypoints recognition.
CHEN, CHUN-LIN, and 陳俊霖. "CNN-based identity recognition system." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/drtnp8.
Full text國立中央大學
資訊管理學系在職專班
107
This paper proposes a set of "CNN-based identity recognition system" for identity recognition using a computer vision library OpenCV and deep learning technology and webcam. It is expected to be applied to access control and regional security. Monitoring, advertising, or other related systems that need to be enhanced by confirming their identity. This thesis is based on Python and TensorFlow's built-in GoogLeNet CNN model. Supervised learning is used to obtain facial image features and classified by identity. This paper uses self-organizing face image data and compares GoogLeNet. The identification rate of the three versions of the model, in the neural network architecture with the highest recognition rate in the experiment, can increase the recognition rate by adding the residual network experiment. Using the neural network model of the above-mentioned best recognition rate, the OpenCV is used to load the movie to instantly recognize the character in the film to verify the practicability of the neural network model of the research training. In the verification part of the results, the paper has self-organized 14 public figures, and each public figure has at least 130 face images as training and test and verification samples, among which the best recognition rate of the neural network is in 1260 images. The recognition rate of the training sample is 100%, and the image recognition rate of the 450 images is 99.11%. The time of the instant image recognition from the face image in the film to the completion identity is about 0.1 second.
Chen, Zih-Jie, and 陳子傑. "CNN-based Gaze Block Estimation." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/3mzyzg.
Full text國立中央大學
資訊工程學系
107
The visual is one of the most important senses that a human receives outside information. The visual helps us explore the world, receive new knowledge, and communicate with computer. As contactless human-computer interaction (HCI) model continues to develop, the technology of communicating with gaze behavior has become a highlight in this field. There have been many applications in the fields of education, advertising, nursing, entertainment or virtual reality. In general, most of the eye tracking devices need calibration in advance or fixing head. There are still many restrictions on usage specification. To solve the above problems, this study uses the ResNet model as the core of classification to construct Gaze Block Estimation Model (GBE Model). It can estimate the gaze block of the user without calibration process. Moreover, only an RGB camera device without depth information is used to capture the image, such as a webcam, a built-in camera on a laptop, or front-facing camera of a smartphone. The deep learning approach is data-driven. It needs a large amount of correctly labeled training data to train a stable and compliant model. However, the existing public dataset of visual behavior has different application scenarios. Resulting in images of the dataset does not apply to all application domains. Therefore, this study collects and builds up to a dataset of eye images of up to 300000 images. According to the experimental results, the GBE Model can estimate gaze block of the user without calibration process and allow the head moving. Even in the real-life testing, it can reach 85.1% accuracy. The experimental results prove the proposed method can let user use gaze block to control the screen, and achieve the goal of HCI application scenario.
Rebelo, José Soares. "CNN-Based Refinement for Image Segmentation." Master's thesis, 2018. https://repositorio-aberto.up.pt/handle/10216/114115.
Full textLIAO, PEN-MIN, and 廖本閔. "Streamflow Forecasting by CNN-GRU Model." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/8rs76r.
Full text逢甲大學
水利工程與資源保育學系
107
During the last two decades, the application of artificial intelligence in the field of flood forecasting has increased noticeably. Since the information of flood forecasting is the most important part of disaster management, also the emergency response and the mechanism of Recurrent Neural Network (RNN) include the behavior of the time series, this study attempt to adopt the Gated Recurrent Unit (GRU) which is a type of RNN used to develop a rainfall-runoff model for the mentioned purpose above. In this research RNN is using Gated Recurrent Unit (GRU). In each field, applicability of GRU is still in researching. Thereby, this paper will discuss the application GRU in the flood forecast. In order to improve the prediction accuracy of the GRU, the data is processed by using the Convolutional Neural Network (CNN) and then input into the GRU for prediction, called CNN-GRU. In the past, most studies used to extract every rainfall from the data before learning artificial neural networks for flood flow prediction. However this study will use a different approach, because GRU cell can remember the status from past. In addition, optimal hyperparameters setting for artificial neural networks will be found by genetic algorithm (GA) to modeling Dali River hourly rainfall-runoff model. Evaluation indicators show that CNN-GRU is better than GRU, the evaluation indicators show that CNN-GRU is better than GRU, because CNN-GRU uses CNN to extract eigenvalues from input data before using GRU for prediction.
Rebelo, José Soares. "CNN-Based Refinement for Image Segmentation." Dissertação, 2018. https://repositorio-aberto.up.pt/handle/10216/114115.
Full textChen, Shih-Che, and 陳釋澈. "Mandarin Tone Classification Using CNN/DNN." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/6ptt3a.
Full text國立臺灣大學
資訊工程學研究所
106
In Mandarin Chinese system, the tone plays an important role. Different tone patterns of the same syllable may result in different meanings. People whose native language aren’t Mandarin can be distinguished by their tone patterns. Therefore, we propose a method for tone classification. First, we convert the audio signal into the spectrogram. We treat the spectrogram as images, apply them as the image inputs for image recognition convolutional neural networks, and create tone classification models. We compare different image recognition models for tone classification. This approach can achieve good accuracy without too many processes on the audio signal. The tone classification architecture can be applied to Chinese teaching methods which will lead to educational success.
Yang, Hsin-Wei, and 楊馨媁. "CNN-based Handwritten Invoice Recognition System." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/c5zem6.
Full text國立臺灣海洋大學
資訊工程學系
107
This paper proposes a method that uses deep learning and convolution neural network (CNN) for handwritten invoice recognition, this method can help enterprises solve that enterprises use only handwritten invoices and reduce labor costs of sorting invoices. Invoice recognition contains invoice number, buyer's government uniform invoice number, seller's government uniform invoice number, digital total amount, and Chinese total amount. Models train by different content, analyze and calculate the best results based on the labels, coordinates and scores of the model detection results. Besides, total amount result use digital total amount and Chinese total amount to correct, which increase 3% accuracy of total amount. The experiment use about 500 labeled invoices to train models, use models to recognize that randomly selected 1000 invoices, according to research results, the overall recognition accuracy over 95%.
Shih, Yi-Hao, and 史鎰豪. "CNN-Based Distorted Barcode Number Recognition." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/bnzv68.
Full text國立臺灣海洋大學
資訊工程學系
107
The rapid development of deep learning in recent years saw breakthroughs after breakthroughs. AlphaGo’s victory against the world’s top-ranked professional GO player took only two years of learning. Then, Alpha Zero took only 21 days of self-learning to beat AlphaGo. We are now fully aware of the fast progress in deep learning, which uses Artificial Neural Network modeled upon the neuron transmission in the human brain to solve problems. This thesis uses a convolutional neural network Yolov3 to capture the feature of distorted barcode number images and made variance-weighted decisions based on the verification results. The value parameters are further changes to achieve the experimental goal of recognizing barcode label number. We use 350 photos of barcode numbers of training in deep learning,And the results show that the accuracy is up to 93%.
Shen, Yu-Ru, and 沈渝茹. "Hands-on Image Recognition with CNN." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/f7b527.
Full text元智大學
資訊管理學系
107
Human beings are visualizers. The amount of information received from the visuals accounts for about 60% of all our senses. In the process of developing artificial intelligence, we train that machines what see the world, understand the world and use images recognition as a source of data for making decision and judgment. Deep learning is the mainstream of artificial intelligence, which a class of machine learning algorithms that use multiple layers to progressively extract higher level features from raw input. Artificial Neural Networks (ANNs) were inspired by information processing and distributed communication nodes in biological systems. Convolutional neural network is a class of deep neural networks, most commonly applied to analyzing visual imagery. The key to affecting the convolutional neural network is the architecture, the depth and the weight of the convolution kernel. The study compares these three factors and compares their impact differences.
Tsao, Po-Ho, and 曹博賀. "Boat License Number Recognition Using CNN." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/gu58wv.
Full text國立臺灣海洋大學
資訊工程學系
106
This paper proposes the use of convolutional neural networks (CNNs) for real-time recognition of numbers on fishing vessels entering and leaving their ports. First, video cameras were mounted at the access of fishing ports to capture images of entries into and exits from these ports. Then, fishing vessels in the images were detected and positions of license plates on the vessels were located. After cutting and trimming, numbers on the fishing vessels were recognized. The recognized fishing vessel numbers then underwent rearrangement of their positions and trust scores of fishing vessel numbers that were recognized in their positions were organized in descending order. Finally, fishing vessel numbers that complied with the fishing vessel numbering rules and had the highest trust scores were regarded as the detected numbers of the fishing vessels. The recognition results were then shown in the video images. After experimentation in multiple networks, the results show that the accuracy is up to 58.3%
Ming-Wei-Huang and 黃銘偉. "CNN-based gender and age classification." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/a7wytt.
Full text中原大學
資訊工程研究所
106
In this paper, we propose a method for classifying gender and age of pedestrian that can be applied to CCTV. With the development of science and technology, identifying the gender and age of pedestrians/face images becomes a popular and important task in social network and surveillance domain. We first perform face detection and extract facial landmarks from each image. Face alignment is then applied to gain aligned face images as training data. We use “GoogLeNet” which is one of the framework of Convolutional Neural Network (CNN) to train the models for gender and age classification. The experimental results show that our method achieves over 60% for all the male and female in our test video set.
GUPTA, RASHI. "IMAGE FORGERY DETECTION USING CNN MODEL." Thesis, 2022. http://dspace.dtu.ac.in:8080/jspui/handle/repository/19175.
Full textSoldátová, Jana. "Hybridizace konceptu TV stanic na příkladu CNN Prima News." Master's thesis, 2021. http://www.nusl.cz/ntk/nusl-448007.
Full textHsiao, Chiao-Wei, and 蕭喬蔚. "A New CMOS Large-Neighborhood Cellular-Neural-Network (CNN) Cell Structure For Large-Neighborhood CNN Universal Machine (CNNUM)." Thesis, 2001. http://ndltd.ncl.edu.tw/handle/17286284714479061976.
Full text國立交通大學
電子工程系
89
In this thesis, a new structure for the VLSI implementation of large- neighborhood cellular neural network (LN-CNN) is proposed and analyzed. In the proposed LN-CNN structure, the parasitic lateral bipolar junction transistor (BJT) in the CMOS process is used to implement both the neuron and synaptic path. Based on the basic device physics of the neuron-BJT (νBJT), a new compact neuron structure is proposed and analyzed. Besides, because of using NPN and PNP BJTs together, a low-power structure of synaptic path is designed and verified. The new low power is composed of dual path for positive or negative current flow in. There is no DC standby current, and it consumes no DC power. Basing on the concept of LN-CNN, the templates with more than two neighborhood layers can be realized without extra complex connections in VLSI implementation. So the chip area for interconnections is reduced and the array size could be increased. The above mentioned low power synaptic path circuit is used, so the LN-CNN is a lower power design. Using the proposed LN-CNN structure, the LN-CNN functions such as Muller-lyer arrowhead illusion and connected component detector, have been successfully realized and verified in HSPICE simulation. Both the negative and the asymmetrical template can be realized. A 16 X 16 LN-CNN with chip area 1200μm X 2580μm is designed and fabricated by 0.25μm TSMC 1P5M CMOS process. The total power consumption is lower than 50mW. Finally, the large neighborhood universal machine is proposed. By using the large neighborhood cellular neural network as the core-computing unit, the analog array processor is realized. Some local memories are added into the single LN-CNN cell. There are also local communication and control unit inside the cell. Many complicated tasks that cannot be compute at one time by LN-CNN can be solve by the universal machine. From the above results, the proposed LN-CNN has great potential in the implementation of the CNN universal machine for various signal-processing applications. Further researches in this field will be conducted in the future.
Wei, Cian-Pin, and 魏千評. "Signal Reconstruction-LMI, GA and CNN Approaches." Thesis, 2006. http://ndltd.ncl.edu.tw/handle/07302740737170892869.
Full text國立高雄應用科技大學
電子與資訊工程研究所碩士班
94
In this thesis, first, the design of FIR and IIR equalizers for the communication channels via the genetic algorithm (GA) and linear matrix inequality (LMI) approaches from an H-inf perspective is presented, which the communication channels are considered as linear time-invariant and nonlinear time-invariant models, respectively. In general, the equalizer plays an important role in modern digital communication systems that can be used to recover the corrupted signal. For the linear time-invariant channel, the problem of IIR equalizer design can be transformed into a nonlinear matrix inequality. In order to eliminate the nonlinear element from the inequality, GA technique is employed. Moreover, in the nonlinear time-invariant channel, the GA technique is utilized to linearize the nonlinear channel model and the approximate errors can be viewed as state uncertainties. Second, the technique of image noise cancellation is presented by employing cellular neural network (CNN) and LMI. The main objective is to train the templates of CNN by a corrupted image corresponding a desired image. A criterion for the uniqueness and global asymptotic stability of the equilibrium point of CNN is obtained based on the Lyapunov stability theorem. Finally, all illustrative examples are presented to demonstrate the effectiveness of the proposed methodologies.
Chou, Hung-Chun, and 周宏春. "Discriminatively-learned CNN Features for Image Retrieval." Thesis, 2015. http://ndltd.ncl.edu.tw/handle/21952307307565825693.
Full text國立交通大學
資訊科學與工程研究所
103
The thesis aims to learn discriminative features for image retrieval tasks based on using deep convolutional neural networks (CNN). Motivated by the great success of CNN in recognition tasks, one may be tempted to simply adopt the output of CNN for retrieval. However, CNN pre-trained model for classification tasks may not optimized for retrieval tasks. To address this issue, the CNN’s weight parameters are specifically adapted by a contrastive loss function to suit retrieval tasks. Extensive experiments conducted on typical retrieval datasets confirm the superiority of the proposed scheme over the state-of-the-art methods.
黃園芳. "Two-Dimensional CNN with L-shaped Template." Thesis, 2007. http://ndltd.ncl.edu.tw/handle/96492000877316767105.
Full text國立交通大學
應用數學系所
95
In this paper, we consider the simplest two-dimensional CNN template, L-shaped template. This work had investigated on [Lin&Yang, 2001] before. They use the building block to discuss the spatial entropy. In this paper, we reappraise the spatial entropy by pattern generation method which could refer to [Ban&Lin, 2005]. When we could not evaluate the spatial entropy, we use connecting operator referred to [Ban, Lin&Lin, 2006] to evaluate the lower bounded of spatial entropy. Finally, we compare the result with [Lin&Yang].
Chen, Wei-Cheng, and 陳威成. "A Hrbrid Method for CNN Template Design." Thesis, 2002. http://ndltd.ncl.edu.tw/handle/82335919178615802468.
Full text國立中正大學
電機工程研究所
90
In this study, a hybrid method for CNN (Cellar Neural Networks) template design is proposed. The objective is to efficiently find robust template for CNN with non-zero boundary consideration. In the proposed method, we analyzed the dynamic transient of the CNN and found the influence of non-zero boundary on the analytic method of CNN. This discovery can provide a limitation in the searching of robust template using GA (Genetic Algorithm). Incorporating the limitation in the procedure of GA can decrease the searching space and thus decrease the useless search. This study depicted that the number of generation in the GA procedure of the proposed method is smaller than several existing methods. Moreover, the robust templates found using this method are superior to other methods.
Hua, Chen Bo, and 陳柏樺. "Handwritten Character Recognition using GA-based CNN." Thesis, 2011. http://ndltd.ncl.edu.tw/handle/69041942101896744315.
Full text國立高雄應用科技大學
電機工程系
99
In this paper, the static simulation method is used for handwritten characters recognition. Since a lot of noise and some non-character traces would occur while scanning image of writting characters, it evoked an inevitable noise-elimination problems in character recognition. This paper simulates the image preprocessing by adding several types of noise, and then filter it out by using conventional and gene-based CNN methods. The results demonstrate the superiority of CNN optimization method. In dealing with salt-pepper noise and Gaussian noise, CNN algorithm results with a better image clearness. Even for the shaking blurred image, its restoration effect is better than that of least square filter. After preprocessing and normalized to the same size, the features of handwritten characters are extracted and put into back-propagate neural network for parameter training. In this stage, an innovative horizontal feature extraction method is adoped besides the traditional ones. It is easier to distinct those mixed-strokes. Meanwhile, the amount of data is much less than that in literatures of other researchers. The main contributions of this thesis are: the simplicity of system algorithm, the effectiveness of noise elimination, and a high recognition rate up to 98%.
Lu, Pei-Hsuan, and 呂姵萱. "L1-Norm Based Adversarial Example against CNN." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/ua49z8.
Full text國立中興大學
資訊科學與工程學系
106
In recent years, defending adversarial perturbations to natural examples in order to build robust machine learning models trained by deep neural networks (DNNs) has become an emerging research field in the conjunction of deep learning and security. In particular, MagNet consisting of an adversary detector and a data reformer is by far one of the strongest defenses in the black-box setting, where the attacker aims to craft transferable adversarial examples from an undefended DNN model to bypass a defense module without knowing its existence. MagNet can successfully defend a variety of attacks in DNNs, including the Carlini and Wagner''s transfer attack based on the L2 distortion metric. However, in this thesis, under the black-box transfer attack setting we show that adversarial examples crafted based on the L1 distortion metric can easily bypass MagNet and fool the target DNN image classifiers on MNIST and CIFAR-10. We also provide theoretical justification on why the considered approach can yield adversarial examples with superior attack transferability and conduct extensive experiments on variants of MagNet to verify its lack of robustness to L1 distortion based transfer attacks. Notably, our results substantially weaken the existing transfer attack assumption of knowing the deployed defense technique when attacking defended DNNs (i.e., the gray-box setting).
CHANG, CHANG, and 張競. "Fast Gender Detection System based on CNN." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/jwvbz6.
Full text輔仁大學
資訊工程學系碩士班
107
Artificial Intelligence(AI), Machine Learning, Deep Learning, have always been very popular topic. As the time moved forward, there are more and more open source tools appeared e.g. OpenAI, TensorFlow, char-RNN that people can get the hang of Artificial Intelligence and Machine Learning more easily and more quickly. As the technology arising, Artificial Intelligence can apply to many fields such as simple AI can apply to refrigerator, sweeper, air conditioner and so on. They can detect external signal e.g. humidity, temperature, brightness, image, horizontal, vibration, distance then to achieve machine’s automatic control. And further more AI can apply to medical related and industrial related application. In medical related application can use Expert System, branch of Artificial Intelligence, which are solving expert levels ability problems in some specific fields. Through expert’s rich experience and expertise, simulating expert’s mode of thinking to solve that only can solve by experts. In industrial related fields, then utilize and train a large amount of information cause by the production and estimate the problems in production line then improve the product’s yield rate, production efficiency and increase the gross output value. The essay is about the female passenger safety as propose then create the related application. In recent years, Taiwan’s public transportation had been thrive and flourish and the problems are still endless because the passenger couldn’t be filtered then the female passenger had to be more careful of the other passenger and guarded and more attention should be paid at night. This essay is using the surveillance camera on the platform and inside the carriage and instant identification the gender of people and provide the identification result through the cloud to passenger’s mobile device so that can get some information about male and female on the platform and carriage.