Dissertations / Theses on the topic 'DEEP LEARNING MODEL'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'DEEP LEARNING MODEL.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Meng, Zhaoxin. "A deep learning model for scene recognition." Thesis, Mittuniversitetet, Institutionen för informationssystem och –teknologi, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-36491.
Full textZeledon, Lostalo Emilia Maria. "FMRI IMAGE REGISTRATION USING DEEP LEARNING." OpenSIUC, 2019. https://opensiuc.lib.siu.edu/theses/2641.
Full textGiovanelli, Francesco. "Model Agnostic solution of CSPs with Deep Learning." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2019. http://amslaurea.unibo.it/18633/.
Full textMatsoukas, Christos. "Model Distillation for Deep-Learning-Based Gaze Estimation." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-261412.
Full textDen senaste utvecklingen inom djupinlärning har hjälp till att förbättra precisionen hos gaze estimation-modeller till nivåer som inte tidigare varit möjliga. Dock kräver djupinlärningsmetoder oftast både stora mängder beräkningar och minne som därmed begränsar dess användning i inbyggda system med små minnes- och beräkningsresurser. Det här arbetet syftar till att kringgå detta problem genom att öka prediktiv kraft i små nätverk som kan användas i inbyggda system, med hjälp av en modellkomprimeringsmetod som kallas distillation". Under begreppet destillation introducerar vi ytterligare en term till den komprimerade modellens totala optimeringsfunktion som är en avgränsande term mellan en komprimerad modell och en kraftfull modell. Vi visar att destillationsmetoden inför mer än bara brus i den komprimerade modellen. Det vill säga lärarens induktiva bias som hjälper studenten att nå ett bättre optimum tack vare adaptive error deduction. Utöver detta visar vi att MobileNet-familjen uppvisar instabila träningsfaser och vi rapporterar att den destillerade MobileNet25 överträffade sin lärare MobileNet50 något. Dessutom undersöker vi nyligen föreslagna träningsmetoder för att förbättra prediktionen hos små och tunna nätverk och vi konstaterar att extremt tunna arkitekturer är svåra att träna. Slutligen föreslår vi en ny träningsmetod baserad på hint-learning och visar att denna teknik hjälper de tunna MobileNets att stabiliseras under träning och ökar dess prediktiva effektivitet.
Lim, Steven. "Recommending TEE-based Functions Using a Deep Learning Model." Thesis, Virginia Tech, 2021. http://hdl.handle.net/10919/104999.
Full textMaster of Science
Improving the security of software systems has become critically important. A trusted execution environment (TEE) is an emerging technology that can help secure software that uses or stores confidential information. To make use of this technology, developers need to identify which pieces of code handle confidential information and should thus be placed in a TEE. However, this process is costly and laborious because it requires the developers to understand the code well enough to make the appropriate changes in order to incorporate a TEE. This process can become challenging for large software that contains millions of lines of code. To help reduce the cost incurred in the process of identifying which pieces of code should be placed within a TEE, this thesis presents ML-TEE, a recommendation system that uses a deep learning model to help reduce the number of lines of code a developer needs to inspect. Our results show that the recommendation system achieves high accuracy as well as a good balance between precision and recall. In addition, we conducted a pilot study and found that participants from the intervention group who used the output from the recommendation system managed to achieve a higher average accuracy and perform the assigned task faster than the participants in the control group.
Del, Vecchio Matteo. "Improving Deep Question Answering: The ALBERT Model." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2020. http://amslaurea.unibo.it/20414/.
Full textWu, Xinheng. "A Deep Unsupervised Anomaly Detection Model for Automated Tumor Segmentation." Thesis, The University of Sydney, 2020. https://hdl.handle.net/2123/22502.
Full textKayesh, Humayun. "Deep Learning for Causal Discovery in Texts." Thesis, Griffith University, 2022. http://hdl.handle.net/10072/415822.
Full textThesis (PhD Doctorate)
Doctor of Philosophy (PhD)
School of Info & Comm Tech
Science, Environment, Engineering and Technology
Full Text
Зайяд, Абдаллах Мухаммед. "Ecrypted Network Classification With Deep Learning." Master's thesis, КПІ ім. Ігоря Сікорського, 2020. https://ela.kpi.ua/handle/123456789/34069.
Full textThis dissertation consists of 84 pages, 59 Figures and 29 sources in the reference list. Problem: As the world becomes more security conscious, more encryption protocols have been employed in ensuring suecure data transmission between communicating parties. Network classification has become more of a hassle with the use of some techniques as inspecting encrypted traffic can pose to be illegal in some countries. This has hindered network engineers to be able to classify traffic to differentiate encrypted from unencrypted traffic. Purpose of work: This paper aims at the problem caused by previous techniques used in encrypted network classification. Some of which are limited to data size and computational power. This paper employs the use of deep learning algorithm to solve this problem. The main tasks of the research: 1. Compare previous traditional techniques and compare their advantages and disadvantages 2. Study previous related works in the current field of research. 3. Propose a more modern and efficient method and algorithm for encrypted network traffic classification The object of research: Simple artificial neural network algorithm for accurate and reliable network traffic classification that is independent of data size and computational power. The subject of research: Based on data collected from private traffic flow in our own network simulation tool. We use our proposed method to identify the differences in network traffic payloads and classify network traffic. It helped to separate or classify encrypted from unencrypted traffic. 6 Research methods: Experimental method. We have carried out our experiment with network simulation and gathering traffic of different unencrypted protocols and encrypted protocols. Using python programming language and the Keras library we developed a convolutional neural network that was able to take in the payload of the traffic gathered, train the model and classify the traffic in our test set with high accuracy without the requirement of high computational power.
Zhao, Yajing. "Chaotic Model Prediction with Machine Learning." BYU ScholarsArchive, 2020. https://scholarsarchive.byu.edu/etd/8419.
Full textSaitas-Zarkias, Konstantinos. "Insights into Model-Agnostic Meta-Learning on Reinforcement Learning Tasks." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-290903.
Full textMeta-Learning har fått dragkraft inom Deep Learning fältet som ett tillvägagångssätt för att bygga modeller som effektivt kan anpassa sig till nya uppgifter efter distribution. I motsats till konventionella maskininlärnings metoder som är tränade för en specifik uppgift (t.ex. bild klassificering på en uppsättning klasser), så metatränas meta-learning metoder över flera uppgifter (t.ex. bild klassificering över flera uppsättningar av klasser). Deras slutmål är att lära sig att lösa osedda uppgifter med bara några få prover. En av de mest kända metoderna inom området är Model-Agnostic Meta-Learning (MAML). Syftet med denna avhandling är att komplettera den senaste relevanta forskningen med nya observationer avseende MAML: s kapacitet, begränsningar och nätverksdynamik. För detta ändamål utfördes experiment på metaförstärkningslärande riktmärke Meta-World. Dessutom gjordes en jämförelse med en ny variant av MAML, kallad Almost No Inner Loop (ANIL), som gav insikter om förändringarna i nätverkets representation under anpassning (metatestning). Resultaten av denna studie indikerar att MAML kan överträffa baslinjerna för det utmanande Meta-Worldriktmärket men visar små tecken på faktisk ”snabb inlärning” under metatestning, vilket stödjer hypotesen att den återanvänder funktioner som den lärt sig under metaträning.
Lindespång, Victor. "Bildklassificering av bilar med hjälp av deep learning." Thesis, Örebro universitet, Institutionen för naturvetenskap och teknik, 2017. http://urn.kb.se/resolve?urn=urn:nbn:se:oru:diva-58361.
Full textThis report describes how an image classifier was created with the ability to identify car makeand model from a given picture of a car. The classifier was developed using pictures that the company CAB had saved from insurance errands that was managed through their current products. First of all the report begins with a brief theoretical introduction to machine learning and deep learning to guide the reader in to the subject of the report, and then continues with problemspecific methods that were of good use for the project. The report brings up methods for how the data was processed before training took place, how the training process went with the chosen tools for this project and also discussion about the result and what effected it – with comments about what can be done in the future to improve the end product.
Hellström, Terese. "Deep-learning based prediction model for dose distributions in lung cancer patients." Thesis, Stockholms universitet, Fysikum, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:su:diva-196891.
Full textViebke, André. "Accelerated Deep Learning using Intel Xeon Phi." Thesis, Linnéuniversitetet, Institutionen för datavetenskap (DV), 2015. http://urn.kb.se/resolve?urn=urn:nbn:se:lnu:diva-45491.
Full textIannello, Michele. "Deep Learning and Constrained Optimization for Epidemic Control." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2022. http://amslaurea.unibo.it/25815/.
Full textWang, Junpeng. "Interpreting and Diagnosing Deep Learning Models: A Visual Analytics Approach." The Ohio State University, 2019. http://rave.ohiolink.edu/etdc/view?acc_num=osu1555499299957829.
Full textWang, Wei. "Image Segmentation Using Deep Learning Regulated by Shape Context." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-227261.
Full textUnder de senaste åren har bildsegmentering med hjälp av djupa neurala nätverk gjort stora framsteg. Att nå ett bra resultat med träning med en liten mängd data kvarstår emellertid som en utmaning. För att hitta ett bra sätt att förbättra noggrannheten i segmenteringen med begränsade datamängder så implementerade vi en ny segmentering för automatiska röntgenbilder av bröstkorgsdiagram baserat på tidigare forskning av Chunliang. Detta tillvägagångssätt använder djupt lärande neurala nätverk kombinerat med "shape context" information. I detta experiment skapade vi en ny nätverkstruktur genom omkonfiguration av U-nätverket till en 2-inputstruktur och förfinade pipeline processeringssteget där bilden och "shape contexten" var tränade tillsammans genom den nya nätverksmodellen genom iteration.Den föreslagna metoden utvärderades på dataset med 247 bröströntgenfotografier, och n-faldig korsvalidering användes för utvärdering. Resultatet visar att den föreslagna pipelinen jämfört med ursprungs U-nätverket når högre noggrannhet när de tränas med begränsade datamängder. De "begränsade" dataseten här hänvisar till 1-20 bilder inom det medicinska fältet. Ett bättre resultat med högre noggrannhet kan nås om den andra strukturen förfinas ytterligare och "shape context-generatorns" parameter finjusteras.
Di, Giacomo Emanuele. "A Deep Learning approach for predicting COSMO-Model's execution time." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2021.
Find full textBeaudoin, Jean-Michel. "Growing deep roots : learning from the Essipit's culturally adapted model of Aboriginal forestry." Thesis, University of British Columbia, 2014. http://hdl.handle.net/2429/46590.
Full textLi, Mengtong. "An intelligent flood evacuation model based on deep learning of various flood scenarios." Doctoral thesis, Kyoto University, 2021. http://hdl.handle.net/2433/263634.
Full textZarrinkoub, Sahand. "Transfer Learning in Deep Structured Semantic Models for Information Retrieval." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-286310.
Full textNya modeller inom informationssökning inkluderar neurala nät som genererar vektorrepresentationer för sökfrågor och dokument. Dessa vektorrepresentationer används tillsammans med ett likhetsmått för att avgöra relevansen för ett givet dokument med avseende på en sökfråga. Semantiska särdrag i sökfrågor och dokument kan kodas in i vektorrepresentationerna. Detta möjliggör informationssökning baserat på semantiska enheter, vilket ej är möjligt genom de klassiska metoderna inom informationssökning, som istället förlitar sig på den ömsesidiga förekomsten av nyckelord i sökfrågor och dokument. För att träna neurala sökmodeller krävs stora datamängder. De flesta av dagens söktjänster används i för liten utsträckning för att möjliggöra framställande av datamängder som är stora nog att träna en neural sökmodell. Därför är det önskvärt att hitta metoder som möjliggör användadet av neurala sökmodeller i domäner med små tillgängliga datamängder. I detta examensarbete har en neural sökmodell implementerats och använts i en metod avsedd att förbättra dess prestanda på en måldatamängd genom att förträna den på externa datamängder. Måldatamängden som används är WikiQA, och de externa datamängderna är Quoras Question Pairs, Reuters RCV1 samt SquAD. I experimenten erhålls de bästa enskilda modellerna genom att föträna på Question Pairs och finjustera på WikiQA. Den genomsnittliga prestandan över ett flertal tränade modeller påverkas negativt av vår metod. Detta äller både när samtliga externa datamänder används tillsammans, samt när de används enskilt, med varierande prestanda beroende på vilken datamängd som används. Att förträna på RCV1 och Question Pairs ger den största respektive minsta negativa påverkan på den genomsnittliga prestandan. Prestandan hos en slumpmässigt genererad, otränad modell är förvånansvärt hög, i genomsnitt högre än samtliga förtränade modeller, och i nivå med BM25. Den bästa genomsnittliga prestandan erhålls genom att träna på måldatamängden WikiQA utan tidigare förträning.
Miao, Yishu. "Deep generative models for natural language processing." Thesis, University of Oxford, 2017. http://ora.ox.ac.uk/objects/uuid:e4e1f1f9-e507-4754-a0ab-0246f1e1e258.
Full textMiserocchi, Andrea. "The Fokker-Planck equation as model for the stochastic gradient descent in deep learning." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2019. http://amslaurea.unibo.it/18290/.
Full textSievert, Rolf. "Instance Segmentation of Multiclass Litter and Imbalanced Dataset Handling : A Deep Learning Model Comparison." Thesis, Linköpings universitet, Datorseende, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-175173.
Full textChen, Kuang-Yu, and 陳廣瑜. "Deep Learning Model Compression with Information Guide." Thesis, 2017. http://ndltd.ncl.edu.tw/handle/w26d3z.
Full textWU, GUAN-WEI, and 吳冠瑋. "Applying Deep Learning Model to Medicine Discrimination." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/sz7t8v.
Full text逢甲大學
資訊工程學系
107
Medicines dispensing verification means that whether the medicines in the patient’s medicine bag are the same as the prescription prescribed by the doctor. The correct of medicines dispensing can be said to be the most basic safe drug condition for the patient. However, there are various kinds medicines but they may have a similar appearance and packaging. Therefore, the possibility of human error is greatly increased. Although many medical units have set up an online medicines discriminator system and provide people with using text or photos to get information about medicines, they are not often used. In order to implement the patients medication safety, this study proposes a new medicines discrimination system based on computer vision. When the pharmacist is dispensing medicines, we use the image recognition model with a high recognition ability to monitor the dispensing table to assist the pharmacist in dispensing the medicine, thereby ensuring the contents of the patients medicine bag is correct. This system is an object detection model based on YOLOv3. It has three major features: First, the recognition speed is fast, it can react to the situation of the dispensing table in real time. Second, it is easy to use and easy to deploy to the current dispensing units. Third, this system is proactive and requires no frequent operation, does not impose an additional burden on the pharmacist. In this study, a deep learning-based image recognition technology was used, and 380 medicines images were used as training data to propose a medicine discrimination system based on deep learning.
Hsu, Chun-Wei, and 徐莙惟. "Deep Learning Enabled Process Independent Lithographic Model." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/utrw8z.
Full textLiu, Zheng-Wei, and 劉政威. "Waterfall Model for Deep Reinforcement Learning Based Scheduling." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/a3yn5q.
Full text國立中央大學
通訊工程學系在職專班
107
The fourth generation of communication systems has been able to meet the multimedia application needs of mobile devices. Through the scheduling service provided by the base station, the user equipment can obtain the data packets required by the downlink of the communication system to meet and obtain better application services, so the channel resources are allocated and the calculation of the user group scheduling service is provided. The law is quite critical. This paper implements a mobile communication scheduling learning platform, and proposes a Deep Deterministic Policy Gradient model. The waterfall model concept is used to analyze the scheduling algorithm flow into three stages: sorting selection, resource evaluation and channel allocation. A waterfall scheduling method that enables more data throughput per unit time and meets more user needs in the current communication environment. The mobile communication scheduling learning platform is composed of six modular components: base station and channel resources, enhanced learning neural network, user equipment attributes, application service types, environmental information and reward functions, and phase micro-algorithms and dependency injection. . Using inversion control and dependency injection to reduce platform software coupling, it is quite easy to maintain the stage micro-algorithm and the six module components.
Yen, Yi-Tung, and 顏逸東. "Enhanced Car-Following Model with Deep Reinforcement Learning." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/499erb.
Full text國立臺灣大學
資訊工程學研究所
107
With the rapid evolution of artificial intelligence and technology, autonomous vehicle is regarded as the future of transportation. One of the important functions that autonomous vehicle should be equipped is a well-designed car-following model. With a well-designed car-following model, autonomous vehicle can drive in a safe, comfortable and efficient manner. This will increase driving safety, passenger comfort and improve road efficiency. This thesis designs and implements an enhanced car-following model. According to the laws, regulations and standards, we modeled the safety, comfort and efficiency into quantified reward functions. Using reinforcement learning, the network agent learns the best policy to achieve the maximum reward by repeated the learning process. The evaluation results show that our model not only reduces the number of inefficient and dangerous headways but also eliminates the jerk to achieve more efficient and comfortable driving than human drivers. The model outperformed 79% human drivers in public dataset. The achieved efficiency is 98% of the optimal bound. Furthermore, compared to the SUMO’s ACC model, given the same number of departed vehicles, our model enables more arrived vehicles and higher average speed to improve the overall road capacity.
Shukla, Aditya. "Model Extraction and Active Learning." Thesis, 2020. https://etd.iisc.ac.in/handle/2005/4420.
Full textCHEN, CHIA-HSI, and 陳家羲. "Develop Forecasting Model for Financial Crisis with Deep Learning." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/3kp4yb.
Full text東吳大學
會計學系
106
Financial crisis forecasting of a company is extremely important for both investigators and company manager. For investigators, they are able to take actions before the crisis happens and therefore prevent assets loss. For company managers, they can adjust the management policy or direction according to the forecasting results so that the crisis would not happen.In this paper, we develop a deep neural network and train it using TEJ Financial database to obtain a forecasting model for financial crisis. Our model outperforms traditional shallow neural network by 10% in terms of both test and prediction accuracy. The prediction accuracy of our model is up to 70%. On the other hand, we also find that the data and model can be further improved to train more complex model and deal with time-series data, which may result in more accurate model.
Liu, Wen-Cheng, and 劉文誠. "A Web Service for Automatic Deep Learning Model Generation." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/utec82.
Full text國立中央大學
資訊工程學系
107
As technology advances, deep learning has changed the way many industries produce, such as detecting defects, identifying objects, and so on. The core network model is the core of the algorithm and the essence of the training after big data. However, for most operators, how to train a usable model from scratch is a major difficulty in introducing artificial intelligence on the production line. How to quickly and easily complete a usable deep learning model becomes an issue that most non-employed workers want to know. Usually, training a highly accurate deep learning model requires a complex network architecture in addition to a large amount of data, and can be completed after numerous fine-tuning. The acquisition of data is relatively easy for the production line operators, and the network architecture needs to take time to understand the details. It is not completed in a moment and a half, and the threshold for entry is relatively improved, which is not conducive to industrial upgrading of various factories. This study combines the deep learning model suite Keras with the web language of the client and server side to provide a web-based artificial intelligence system that can quickly train deep learning models. The system allows users to set parameters and upload training data through the graphical interface, so that users in non-employed fields can quickly train the required models without spending too much time on the details of deep learning.
Pradhan, Sipun Kumar, and 施庫瑪. "A Rapid Deep Learning Model for Goal-Oriented Dialog." Thesis, 2016. http://ndltd.ncl.edu.tw/handle/6y6zpd.
Full text國立中央大學
資訊工程學系
104
Open-domain Question Answering (QA) systems aim at providing the exact answer(s) to questions formulated in natural language, without restriction of domain. My research goal in this thesis is to develop learning models that can automatically induce new facts without having to be re-trained, in particular its structure and meaning in order to solve multiple Open-domain QA tasks. The main advantage of this framework is that it requires little feature engineering and domain specificity whilst matching or surpassing state-of-the-art results. Furthermore, it can easily be trained to be used with any kind of Open-domain QA. I investigate a new class of learning models called memory neural networks. Memory neural networks reason with inference components combined with a long-term memory component; they learn how to use these jointly. The long-term memory can be read and written to, with the goal of using it for prediction. I investigate these models in the context of question answering (QA) where the long-term memory effectively acts as a (dynamic) knowledge base, and the output is a textual response. Finally, I show that an end-to-end dialog system based on memory neural networks can reach promising and learn to perform non-trivial operations. I confirm those results by comparing my system to various well-crafted baseline Datasets and future work is discussed.
Tsao, Yeh-Wen, and 曹爗文. "A Fast Deep Learning Model for Time Series Prediction." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/94h345.
Full text國立臺灣大學
資訊工程學研究所
107
Time series forecasting is an important research area across many domains, such as predictions of financial market, weather, electricity consumption, and traffic jam situation. However, most of recent works are usually time-consuming and complex. In this paper, we propose a deep learning model to tackle this issue, and deliver efficient performance. Our model uses purely Convolutional Neural Network (CNN) structure to capture both long-term and short-term features. Thorough empirical studies based upon the total seven different dataset demonstrate that the our model can outperform state-of-the-art methods over training time with comparable performance.
Chang, Chao-Mei, and 張昭美. "Taiwanese speech commands recognition model based on deep learning." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/knb7ws.
Full text國立交通大學
資訊學院資訊學程
107
Most of the recent machine learning papers are aimed at images and videos, such as face recognition, large image database identification, unmanned autopilot, object recognition, AlphaGo, object movement trajectory prediction, changing image style and creating virtual portrait style-based GAN. However, due to the development trend of voice assistants, it is necessary to closely cooperate with local language materials and culture habits. Therefore, the focus is on local language audio processing and machine learning . Benefiting from the prosperous deep learning progress, diverse languages are no longer an obstacle to communication, but rather a manifestation of diverse cultures. It is time to pay attention to regional languages such as Taiwanese. The paper uses a variety of audio pre-processing and CNN, LSTM, GRU and other attempts to use the depth model. 1. Taiwanese speech command recognition. 2. Specific Taiwanese speech keyword trigger. 3. Identify the audio segment between Mandarin and Taiwanese. 4. Use AI to write Taiwanese local drama. Finally, it is applied to the Android mobile app, so that the user can use the voice "góabeh" (I want to) "khuànn-siòng-phìnn" (See photo), "thian-im-ga̍k" (Listen to music), "khà-tiān-uē" (Make a phone call), "hip-siàng" (Photograph) evokes the corresponding application.
Lin, Yi-Hsiu, and 林怡秀. "Question Generation from Knowledge Base Using Deep Learning Model." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/mkj9vx.
Full text國立交通大學
資訊科學與工程研究所
106
With the advancement of data-driven approach, the lack of corpora has become the main obstacle of the natural language processing research. Compared with English corpora, publicly available Mandarin corpora is even more lacking. Our paper purposes to solve this problem by using existing question answering dataset and knowledge base to create a new Mandarin question answering dataset. In this study, we first collect the data from CN-DBpedia and question answering dataset from WebQA and web crawler, and propose a method to combine them in the form of pairs as our training data, and then using sequence-to-sequence model to generate questions from knowledge base. The generated questions then incorporate with entities in knowledge base as the answers to create a new Mandarin question answering dataset. In our experiment, we develop a template-based question generation baseline in order to evaluate our model by human evaluation. Our model achieves an acceptable performance compare to the template-based baseline.
BAINS, INDERPREET SINGH. "WEB SECURITY IN IoT NETWORKS USING DEEP LEARNING MODEL." Thesis, 2020. http://dspace.dtu.ac.in:8080/jspui/handle/repository/18061.
Full text"Efficient and Online Deep Learning through Model Plasticity and Stability." Doctoral diss., 2020. http://hdl.handle.net/2286/R.I.62959.
Full textDissertation/Thesis
Doctoral Dissertation Electrical Engineering 2020
Frazão, Xavier Marques. "Deep learning model combination and regularization using convolutional neural networks." Master's thesis, 2014. http://hdl.handle.net/10400.6/5605.
Full textHUANG, PO-YU, and 黃柏毓. "Predicting Social Insurance Payment Behavior Based on Deep Learning Model." Thesis, 2017. http://ndltd.ncl.edu.tw/handle/04210692742079168717.
Full text逢甲大學
資訊工程學系
105
The social insurance is an important part of the social security system. In Taiwan, the social insurance system is classified by occupational groups and managed by different government agencies. According to the Executive Yuan of Taiwan, this pension system includes five separate social insurance programs covering public servants and teachers, laborers, military personnel, farmers, and a national pension insurance program for those not covered by the above four employment-based categories. Ministry of Health and Welfare in Taiwan is responsible for many types of social insurances like National Pension Insurance, National Health Insurance and Long-term Care Services Program. In addition, Ministry of Health and Welfare provides subsidized health insurance coverage for the underprivileged and ensures that senior citizens with no employment-based retirement benefits will still have the basic economic necessities in their elderly life. Unfortunately, most social insurances are impacted by various problems and have been facing the crisis of pension bankruptcy. Althougth the traditional actuarial methods use many hypotheses to analyze cash flow, they mostly focus on trend analysis with a macro view of the participants. Due to a large number of the insured, it is very hard to predict the payment behavior of each individual. To make better prediction, we propose to build payment behavior models based on machine learning technology to predict personal payment behavior accurately. Using the number of the participants for each personal payment behavior and corresponding insurance premiums, we can make better cash flow prediction in order to help the social insurance operations become sustainable. This research uses the seven year's data from Taiwan's National Pension Insurance as the source of experimental data. With the implementation of deep learning model, we could analyze and predict the future payment behaviors of the insured.
Lee, Yi-Nan, and 李奕男. "Deep Visual Semantic Transform Model Learning from Multi-Label Images." Thesis, 2017. http://ndltd.ncl.edu.tw/handle/48kv54.
Full text國立臺灣師範大學
資訊工程學系
105
Learning the relation between images and text semantics has been an important problem in the field of machine learning and computer vision. This paper addresses this problem. We observe that there is a semantic relation between texts, for example, “sky” and “cloud” have a close semantic relation, and “sky” and “car” have a weak semantic relation. We suppose the semantic relation between texts can be different depending on images. For example, an image contains both sky and car. The word “sky” and “car” are initially semantically irrelevant, but may have a connection because of the image containing these concepts. Therefore, we propose a Convolutional Neural Network based model to link the semantic relation between an image and its text labels. The main difference between our work and existing visual semantic embedding models is that the output of our model is a linear transformation function. In other words, each input image is treated as a function to determine the relation between each word and the image, and to predict the possible labels for the image. Finally, this model is validated on the NUS-WIDE dataset and the experimental results show that the model has a great performance on predicting labels for images.
LEE, JHONG-TING, and 李仲庭. "Apply TensorFlow deep learning model for time series forecasting problem." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/y6cses.
Full text開南大學
資訊學院碩士在職專班
106
This study takes the TensorFlow as a backend engine for deep learning. The Multi-Layer Perceptron (MLP) is built to solve the time series forecasting problems. The case study are daily stock closing prices in Taiwan, i.e. the Taiwan Semiconductor Manufacturing Company Limited (TSMC), Uni-President Enterprises Corporation (Uni-President), and Largan Precision Company Limited (LARGAN Precision). We collect 120 daily records from 2017/01/03 to 2017/07/04. Around 20 input features we used are: the Trade Volume, the Trade Value, the Opening Price, the Highest Price, the Lowest Price, the Closing Price, the Delta Price, the Transaction amount, and other Technical Analysis Indicators. Then, the Stepwise Regression Analysis is adopted as a filter for screening out some input features really correlative to the Label (the predict closing price). The numerical results are summarized as follows: the Mean Absolute Percentage Error (MAPE) and Standard Deviation (SD) in training and predicting stages for the TSMC are (0.17%, 0.06) and (0.33%, 0.05); (0.15%, 0.06) and (0.20%, 0.04) for the Uni-President; (0.35%, 0.10) and (0.37%, 0.05) for the LARGAN Precision. Keywords: Deep learning, Time series, TensorFlow, Stock price prediction
LAI, HUNG-JU, and 賴泓儒. "STT-MRAM Co-design Deep Learning Model for IoT Applications." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/c2a8z5.
Full text逢甲大學
通訊工程學系
107
At present, there are fewer STT-MRAM applications. The mian memories on the current market are SRAM, DRAM, and Flash Memory. However, these memories consume more power than STT-MRAM, which are not suitable for resource-limited IoT devices. The memories equipped with IoT devices must be low energy consumption, rapidly operatrion , access endurace, and samll area. STT-MARM just meets these requirements. In particular, STT-MARM is non-volatile. After the power turns off, the data are still reserved. Therefore, it is an emergent memory for IoT applications. In this thesis, we propose an application architecture of STT-MRAM as a deep learning model, which is implemeted on FPGA, and then use Qsys (SOPC) architecture to implement a CNN neural network. Finally, we use the MNIST data set to evaluate the performance and accuracy of STT-MRAM co-design deep learning model for IoT applications, compared with other memories SRAM, SDRAM, and MRAM.
Chung, Hao-Ting, and 鐘皓廷. "Building Student Course Performance Prediction Model Based on Deep Learning." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/m2z8n3.
Full text國立臺北科技大學
資訊工程系
106
The deferral of graduation rate in Taiwan’s universities is estimated 16%, which will affect the scheduling of school resources. Therefore, if we can expect to take notice of students’ academic performance and provide guidance to students who cannot pass the threshold as expected, we can effectively reduce the waste of school resources. In this research, we use recent years’ student data attributes and course results as training data to construct student performance prediction model. The K-Means algorithm was used to classify all courses from the freshman to the senior. The related courses will be grouped in the same cluster, which will more likely to find similar features and improve the accuracy of the prediction. Then, this research constructs independent neural networks for each course according to the different academic year. Each model will be pre-trained by using De-noising Autoencoder. After pre-training, the corresponding structure and weights are taken as the initial value of the neural network model. Each neural network is treated as a base predictor. All predictors will be integrated into an Ensemble predictor according to different years’ weights to predict the current student’s course performance. As the students finish the course at the end of each semester, the prediction model will continue track and update to enhance model accuracy through online learning.
Teng, Yu-Han, and 鄧鈺翰. "Using a multimodal architecture Research on Deep Learning Model Analysis." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/7het95.
Full text國立中央大學
資訊管理學系
107
With the popularity of social networks and e-commerce sites, users have switched from passively receiving messages to actively disseminating messages. The value of comments and online messages is also becoming more and more important. Analysis and research over the past few years. Trying to analyze trends about specific product products, topics, reviews, and tweets. Play an important role in all aspects. This study uses different vectorization processes to verify the multimodal analysis model and confirm that the model can effectively improve the accuracy. This study proposes a combination of two models. This feature is combined with deep learning neural network construction to build a multimodal analysis model. Model 1 is a deep learning model based on Glove vector, attention mechanism and GRU neural network architecture. Model 2 is a deep learning model based on Word2Vec vector, attention mechanism and CNN neural network architecture. Multimodal analysis model is validated by K-fold cross validation and F1 measurement method. The experimental results prove that the multimodal analysis model proposed in this study has higher accuracy than related research. Using the high-level multi-modal combination method, the features of multiple models are extracted and combined to form a combined feature, and this feature is trained in neural network. The feature set can be mutually assisted, and the accuracy can be 91.56% through the two vectors and the optimal neural network architecture combined with the multi-modal method. And the model verification shows 93% verification value, which proves that the multimodal analysis model proposed in this study is used in the field of comment texts, which can effectively improve the accuracy of model prediction and improve its accuracy.
Huang, Chu-Chih, and 黃炬智. "Classification of Chinese Articulation Disorder based on Deep Learning Model." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/3v7km6.
Full text國立臺灣科技大學
電子工程系
107
Articulation disorder means having difficulties during pronunciations, leading to incorrect articulations and unclear sentences. Articulation disorder has been a common child language issue. Currently, there is no any unified sayings for articulation disorder's classification in the Taiwan's medical field. Thus, a speech therapist is required for analysis and treatment in hospitals. After a series of pronunciations, a speech therapist will make an analysis based on children's pronunciations. Children will return to the hospitals for months continuously to improve their conditions. Nevertheless, the treatment can only benefit children with articulation disorder by receiving treatments in hospitals, slowing down the treatment cycle. The purpose of this work is to automate the diagnosis for articulation disorder by combining the latest AI's convolutional neural network (CNN). Results show that LeNet-5 which achieved 94.56 Top-1 accuracy and 0.995 avg F1-score with the smallest model size is more suitable to apply articulations disorder application on mobile devices.
LIN, HAN-LONG, and 林翰隆. "Building Graduate Salary Grading Prediction Model Based on Deep Learning." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/z4hkqx.
Full text國立臺北科技大學
資訊工程系
107
This paper used deep learning to build a salary grading prediction model. Due to the order relationship between each grading of salary grading, this paper regards this kind of problem as an ordinal regression problem. This paper used multiple output deep neural network to solve the ordinal regression problem so that the network learns the correlation between these salary grading during training. This model is pre-trained using Stacked De-noising Autoencoder. After pre-training, the corresponding weights are taken as the initial weights of neural network. During training, this paper used the Dropout and Bootstrap Aggregating to improve model performance. This model used the graduates’ personal information, grades, and family data as input feature, and predict salary grading of graduating or graduated students. This result will be provided to the school researchers to grasp the salary trend.
Lin, Yung-Chien, and 林詠謙. "Predicting the Computation Time of Deep Learning Model on Accelerators." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/w2wncd.
Full text國立臺灣大學
資訊工程學研究所
107
With the rapid development of deep learning, in order to improve the efficiency of implementation, the hardware for deep learning is becoming more and more important, but the platform with higher performance is often accompanied by high prices. Therefore, the goal of this research is to let users can quickly calculate the performance of a system, and even can easily analyze its performance before getting the target hardware. There are a lot of related researches at present, but most of them use formulas to make performance predictions, and this method often uses linear methods to do calculations, so many details are ignored. The method used in this study is to collect enough relevant data and use deep learning to learn the computation time under different configurations. This study also predicts different neural networks, even for unavailable hardware.
Gutta, Sreedevi. "Improving photoacoustic imaging with model compensating and deep learning methods." Thesis, 2018. https://etd.iisc.ac.in/handle/2005/4390.
Full textPereira, Carlos Manuel Silva. "Deep learning techniques for grapevine variety classification using natural images." Doctoral thesis, 2020. http://hdl.handle.net/10348/9969.
Full textThis thesis proposes a computer vision system for automatic classification of six endogenous grapevine varieties of the Portuguese Douro Demarcated Region from natural images. For a full understanding about the applied methodologies and the developed experiments in this research work, we structured this document into six sections. The first ones are reserved for the revision of the literature about image processing in agriculture, such as, image processing techniques (enhancement and color model conversion) and image segmentation methods that inspired us to develop the proposed leaf segmentation algorithm. The theoretical background about the machine learning process, namely, deep learning and convolutional neural networks were presented for an easier understanding of the methodologies applied on our research proposal. The remaining ones are reserved for the presentation of the materials and methods, the major conclusions and possible future work developments. Our proposed system is hard to develop because it presents many constraints. First, the presence in natural vineyard images of savage foliage, weed, multiple leaves with overlapping, occlusion, and obstruction by objects due to the shadows, dust, insects and other adverse climatic conditions that occur in natural environment at the moment of image capturing; second, high similarity of the images among different grapevine varieties; third, leaf senescence and significant changes on the grapevine leaf and bunches images in the harvest seasons, namely, due to adverse climatic conditions, diseases and presence of pesticides; fourth, the low volume of images available. In addition, the vineyards of the Douro Region are also characterized for having more than one grapevine variety per parcel and even for row. Knowing the susceptibility of a particular variety to a specific disease, its identification using this automatic system, will help, for example, in a more specific and targeted treatment. Besides that, many wine producers are entitled to this large number of grapevine varieties to produce their most expensive wines. As the title of this thesis highlights, the deep learning techniques were used to solve the presented constratints. With this advanced neural technologies, the performance of transfer learning schemes based on AlexNet architecture was evaluated for classification of grapevine varieties using diverse pre-processed datasets. Thus, two natural vineyard image datasets were constructed from which different pre-processed datasets are generated with the application of some image processing methods, including a proposed four-corners-in-one image warping algorithm for deep training purposes. After detailing some network schemes, we present and discuss some of the experimental results obtained by the proposed approach, which we judge promising and encouraging to help Douro wine growers in the automatic grapevine varieties classification for future implementation of a robotic grape harvest.
Esta tese propõe um sistema de visão computacional para classificação automática de seis variedades de videira endógenas da Região Demarcada do Douro a partir de imagens naturais. Para um completo entendimento sobre as metodologias aplicadas e as experièncias desenvolvidas neste trabalho de investigação, estruturamos este documento em seis capítulos. Os primeiros são reservados à revisão da literatura sobre processamento de imagens na agricultura, como técnicas de processamento de imagens (realce da imagem e conversão de modelos de cores) e métodos de segmentação de imagens que nos inspiraram a desenvolver um algoritmo de segmentação de folhas. Os fundamentos teóricos sobre o processo de aprendizagem de máquina, a saber, aprendizagem profunda e redes neuronais convolucionais, são apresentados para facilitar a compreensão das metodologias aplicadas na nossa proposta de trabalho. Os demais capítulos ficam reservados para a apresentação dos materiais e métodos, as principais conclusões e possíveis desenvolvimentos futuros do trabalho. Torna-se difícil desenvolver o sistema que se propõe porque apresenta muitos constrangimentos. Primeiro, a presença em imagens naturais de vinhas de folhagem selvagem, erva daninha, várias folhas com sobreposição, oclusão e obstrução por objetos devido às sombras, poeira, insetos e outras condições climáticas adversas que ocorrem no ambiente natural no momento da captação de imagem; segundo, a alta similaridade das imagens entre diferentes variedades de videira; terceiro, senescência foliar e mudanças significativas nas imagens de folhas e cachos de videira nas safras, devido a condições climáticas adversas, doenças e presença de pesticidas; quarto, o baixo volume de imagens disponíveis. Além disso, as vinhas da região do Douro também se caracterizam por possuir mais de uma variedade de videira por parcela e até por bardo. Conhecendo a suscetibilidade de uma variedade específica a uma doença específica, usando este sistema automático, a sua identificação ajudará, por exemplo, num tratamento mais específico e direcionado. Além disso, muitos produtores de vinho têm utilizado um grande número de variedades de videira para produzir os seus vinhos de referência e, portanto, mais caros. Como o título desta tese destaca, as técnicas de aprendizagem profunda foram usadas para resolver os constrangimentos apresentados. Com estas tecnologias neuronais avançadas, o desempenho dos esquemas de aprendizagem de transferência baseados na arquitetura AlexNet foi avaliado na classificação de variedades de videira usando diversos conjuntos de dados pré-processados. Assim, foram construídos dois conjuntos de dados de imagem de vinhas naturais a partir dos quais foram gerados diferentes conjuntos de dados pré-processados com a aplicação de alguns métodos de processamento de imagem, incluindo um algoritmo de distorção de imagem chamado four-corners-in-one para fins de treino. Depois de detalharmos alguns esquemas de rede, apresentamos e discutimos alguns dos resultados experimentais obtidos pela abordagem proposta, que julgamos promissores e encorajadores para ajudar os viticultores do Douro na classificação automática das variedades de videira para futura implementação de um robot para colheita de uvas.