Journal articles on the topic 'Machine Learning, Artificial Intelligence, Regularization Methods'

Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles

Select a source type:

Consult the top 50 journal articles for your research on the topic 'Machine Learning, Artificial Intelligence, Regularization Methods.'

Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.

You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.

Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.

1

Abidine, M’hamed Bilal, and Belkacem Fergani. "Activity recognition from smartphone data using weighted learning methods." Intelligenza Artificiale 15, no. 1 (July 28, 2021): 1–15. http://dx.doi.org/10.3233/ia-200059.

Full text
Abstract:
Mobile phone based activity recognition uses data obtained from embedded sensors to infer user’s physical activities. The traditional approach for activity recognition employs machine learning algorithms to learn from collected labeled data and induce a model. To enhance the accuracy and hence to improve the overall efficiency of the system, the good classifiers can be combined together. Fusion can be done at the feature level and also at the decision level. In this work, we propose a new hybrid classification model Weighted SVM-KNN to perform automatic recognition of activities that combines a Weighted Support Vector Machines (WSVM) to learn a model with a Weighted K-Nearest Neighbors (WKNN), to classify and identify the ongoing activity. The sensory inputs to the classifier are reduced with the Linear Discriminant Analysis (LDA). We demonstrate how to train the hybrid approach in this setting, introduce an adaptive regularization parameter for WSVM approach, and illustrate how our method outperforms the state-of-the-art on a large benchmark datasets.
APA, Harvard, Vancouver, ISO, and other styles
2

Fokkema, Marjolein, Dragos Iliescu, Samuel Greiff, and Matthias Ziegler. "Machine Learning and Prediction in Psychological Assessment." European Journal of Psychological Assessment 38, no. 3 (May 2022): 165–75. http://dx.doi.org/10.1027/1015-5759/a000714.

Full text
Abstract:
Abstract. Modern prediction methods from machine learning (ML) and artificial intelligence (AI) are becoming increasingly popular, also in the field of psychological assessment. These methods provide unprecedented flexibility for modeling large numbers of predictor variables and non-linear associations between predictors and responses. In this paper, we aim to look at what these methods may contribute to the assessment of criterion validity and their possible drawbacks. We apply a range of modern statistical prediction methods to a dataset for predicting the university major completed, based on the subscales and items of a scale for vocational preferences. The results indicate that logistic regression combined with regularization performs strikingly well already in terms of predictive accuracy. More sophisticated techniques for incorporating non-linearities can further contribute to predictive accuracy and validity, but often marginally.
APA, Harvard, Vancouver, ISO, and other styles
3

Кабанихин, С. И. "Inverse Problems and Artificial Intelligence." Успехи кибернетики / Russian Journal of Cybernetics, no. 3 (October 11, 2021): 33–43. http://dx.doi.org/10.51790/2712-9942-2021-2-3-5.

Full text
Abstract:
В данной работе приведен анализ взаимосвязей теории обратных и некорректных задач и математических аспектов искусственного интеллекта. Показано, что при анализе вычислительных алгоритмов, которые условно можно отнести к вычислительному искусственному интеллекту (машинное обучение, природоподобные алгоритмы, методы анализа и обработки данных), возможно, а подчас и необходимо, использовать результаты и подходы, развитые в теории и численных методах решения обратных и некорректных задач, такие как регуляризация, условная устойчивость и сходимость, использование априорной информации, идентифицируемость, чувствительность, усвоение данных. This paper analyzes the relationship between the theory of inverse and incorrect problems and the mathematical aspects of artificial intelligence. It is shown that computational algorithms that can be categorized as computational artificial intelligence (machine learning, nature-like algorithms, data analysis and processing) can or should be analyzed with the approaches developed for the theory and numerical methods for solving inverse and incorrect problems. They are regularization, conditional stability and convergence, the use of a priori information, identifiability, sensitivity, data assimilation.
APA, Harvard, Vancouver, ISO, and other styles
4

Mohammad-Djafari, Ali. "Interaction between Model Based Signal and Image Processing, Machine Learning and Artificial Intelligence." Proceedings 33, no. 1 (November 28, 2019): 16. http://dx.doi.org/10.3390/proceedings2019033016.

Full text
Abstract:
Signale and image processing has always been the main tools in many area and in particular in Medical and Biomedical applications. Nowadays, there are great number of toolboxes, general purpose and very specialized, in which classical techniques are implemented and can be used: all the transformation based methods (Fourier, Wavelets, ...) as well as model based and iterative regularization methods. Statistical methods have also shown their success in some area when parametric models are available. Bayesian inference based methods had great success, in particular, when the data are noisy, uncertain, incomplete (missing values) or with outliers and where there is a need to quantify uncertainties. In some applications, nowadays, we have more and more data. To use these “Big Data” to extract more knowledge, the Machine Learning and Artificial Intelligence tools have shown success and became mandatory. However, even if in many domains of Machine Learning such as classification and clustering these methods have shown success, their use in real scientific problems are limited. The main reasons are twofold: First, the users of these tools cannot explain the reasons when the are successful and when they are not. The second is that, in general, these tools can not quantify the remaining uncertainties. Model based and Bayesian inference approach have been very successful in linear inverse problems. However, adjusting the hyper parameters is complex and the cost of the computation is high. The Convolutional Neural Networks (CNN) and Deep Learning (DL) tools can be useful for pushing farther these limits. At the other side, the Model based methods can be helpful for the selection of the structure of CNN and DL which are crucial in ML success. In this work, I first provide an overview and then a survey of the aforementioned methods and explore the possible interactions between them.
APA, Harvard, Vancouver, ISO, and other styles
5

Dif, Nassima, and Zakaria Elberrichi. "Efficient Regularization Framework for Histopathological Image Classification Using Convolutional Neural Networks." International Journal of Cognitive Informatics and Natural Intelligence 14, no. 4 (October 2020): 62–81. http://dx.doi.org/10.4018/ijcini.2020100104.

Full text
Abstract:
Deep learning methods are characterized by their capacity to learn data representation compared to the traditional machine learning algorithms. However, these methods are prone to overfitting on small volumes of data. The objective of this research is to overcome this limitation by improving the generalization in the proposed deep learning framework based on various techniques: data augmentation, small models, optimizer selection, and ensemble learning. For ensembling, the authors used selected models from different checkpoints and both voting and unweighted average methods for combination. The experimental study on the lymphomas histopathological dataset highlights the efficiency of the MobileNet2 network combined with the stochastic gradient descent (SGD) optimizer in terms of generalization. The best results have been achieved by the combination of the best three checkpoint models (98.67% of accuracy). These findings provide important insights into the efficiency of the checkpoint ensemble learning method for histopathological image classification.
APA, Harvard, Vancouver, ISO, and other styles
6

Luo, Yong, Liancheng Yin, Wenchao Bai, and Keming Mao. "An Appraisal of Incremental Learning Methods." Entropy 22, no. 11 (October 22, 2020): 1190. http://dx.doi.org/10.3390/e22111190.

Full text
Abstract:
As a special case of machine learning, incremental learning can acquire useful knowledge from incoming data continuously while it does not need to access the original data. It is expected to have the ability of memorization and it is regarded as one of the ultimate goals of artificial intelligence technology. However, incremental learning remains a long term challenge. Modern deep neural network models achieve outstanding performance on stationary data distributions with batch training. This restriction leads to catastrophic forgetting for incremental learning scenarios since the distribution of incoming data is unknown and has a highly different probability from the old data. Therefore, a model must be both plastic to acquire new knowledge and stable to consolidate existing knowledge. This review aims to draw a systematic review of the state of the art of incremental learning methods. Published reports are selected from Web of Science, IEEEXplore, and DBLP databases up to May 2020. Each paper is reviewed according to the types: architectural strategy, regularization strategy and rehearsal and pseudo-rehearsal strategy. We compare and discuss different methods. Moreover, the development trend and research focus are given. It is concluded that incremental learning is still a hot research area and will be for a long period. More attention should be paid to the exploration of both biological systems and computational models.
APA, Harvard, Vancouver, ISO, and other styles
7

Alcin, Omer F., Abdulkadir Sengur, Jiang Qian, and Melih C. Ince. "OMP-ELM: Orthogonal Matching Pursuit-Based Extreme Learning Machine for Regression." Journal of Intelligent Systems 24, no. 1 (March 1, 2015): 135–43. http://dx.doi.org/10.1515/jisys-2014-0095.

Full text
Abstract:
AbstractExtreme learning machine (ELM) is a recent scheme for single hidden layer feed forward networks (SLFNs). It has attracted much interest in the machine intelligence and pattern recognition fields with numerous real-world applications. The ELM structure has several advantages, such as its adaptability to various problems with a rapid learning rate and low computational cost. However, it has shortcomings in the following aspects. First, it suffers from the irrelevant variables in the input data set. Second, choosing the optimal number of neurons in the hidden layer is not well defined. In case the hidden nodes are greater than the training data, the ELM may encounter the singularity problem, and its solution may become unstable. To overcome these limitations, several methods have been proposed within the regularization framework. In this article, we considered a greedy method for sparse approximation of the output weight vector of the ELM network. More specifically, the orthogonal matching pursuit (OMP) algorithm is embedded to the ELM. This new technique is named OMP-ELM. OMP-ELM has several advantages over regularized ELM methods, such as lower complexity and immunity to the singularity problem. Experimental works on nine commonly used regression problems indicate that the investigated OMP-ELM method confirms these advantages. Moreover, OMP-ELM is compared with the ELM method, the regularized ELM scheme, and artificial neural networks.
APA, Harvard, Vancouver, ISO, and other styles
8

Homayouni, Haleh, and Eghbal G. Mansoori. "Manifold regularization ensemble clustering with many objectives using unsupervised extreme learning machines." Intelligent Data Analysis 25, no. 4 (July 9, 2021): 847–62. http://dx.doi.org/10.3233/ida-205362.

Full text
Abstract:
Spectral clustering has been an effective clustering method, in last decades, because it can get an optimal solution without any assumptions on data’s structure. The basic key in spectral clustering is its similarity matrix. Despite many empirical successes in similarity matrix construction, almost all previous methods suffer from handling just one objective. To address the multi-objective ensemble clustering, we introduce a new ensemble manifold regularization (MR) method based on stacking framework. In our Manifold Regularization Ensemble Clustering (MREC) method, several objective functions are considered simultaneously, as a robust method for constructing the similarity matrix. Using it, the unsupervised extreme learning machine (UELM) is employed to find the generalized eigenvectors to embed the data in low-dimensional space. These eigenvectors are then used as the base point in spectral clustering to find the best partitioning of the data. The aims of this paper are to find robust partitioning that satisfy multiple objectives, handling noisy data, keeping diversity-based goals, and dimension reduction. Experiments on some real-world datasets besides to three benchmark protein datasets demonstrate the superiority of MREC over some state-of-the-art single and ensemble methods.
APA, Harvard, Vancouver, ISO, and other styles
9

Nayef, Bahera Hani, Siti Norul Huda Sheikh Abdullah, Rossilawati Sulaiman, and Zaid Abdi Al Kareem Alyasseri. "VARIANTS OF NEURAL NETWORKS: A REVIEW." Malaysian Journal of Computer Science 35, no. 2 (April 29, 2022): 158–78. http://dx.doi.org/10.22452/mjcs.vol35no2.5.

Full text
Abstract:
Machine learning (ML) techniques are part of artificial intelligence. ML involves imitating human behavior in solving different problems, such as object detection, text handwriting recognition, and image classification. Several techniques can be used in machine learning, such as Neural Networks (NN). The expansion in information technology enables researchers to collect large amounts of various data types. The challenging issue is to uncover neural network parameters suitable for object detection problems. Therefore, this paper presents a literature review of the latest proposed and developed components in neural network techniques to cope with different sizes and data types. A brief discussion is also introduced to demonstrate the different types of neural network parameters, such as activation functions, loss functions, and regularization methods. Moreover, this paper also uncovers parameter optimization methods and hyperparameters of the model, such as weight, the learning rate, and the number of iterations. From the literature, it is notable that choosing the activation function, loss function, number of neural network layers, and data size is the major factor affecting NN performance. Additionally, utilizing deep learning NN resulted in a significant improvement in model performance for a variety of issues, which became the researcher's attention.
APA, Harvard, Vancouver, ISO, and other styles
10

Cai, Yingfeng, Youguo He, Hai Wang, Xiaoqiang Sun, Long Chen, and Haobin Jiang. "Pedestrian detection algorithm in traffic scene based on weakly supervised hierarchical deep model." International Journal of Advanced Robotic Systems 14, no. 1 (February 14, 2016): 172988141769231. http://dx.doi.org/10.1177/1729881417692311.

Full text
Abstract:
The emergence and development of deep learning theory in machine learning field provide new method for visual-based pedestrian recognition technology. To achieve better performance in this application, an improved weakly supervised hierarchical deep learning pedestrian recognition algorithm with two-dimensional deep belief networks is proposed. The improvements are made by taking into consideration the weaknesses of structure and training methods of existing classifiers. First, traditional one-dimensional deep belief network is expanded to two-dimensional that allows image matrix to be loaded directly to preserve more information of a sample space. Then, a determination regularization term with small weight is added to the traditional unsupervised training objective function. By this modification, original unsupervised training is transformed to weakly supervised training. Subsequently, that gives the extracted features discrimination ability. Multiple sets of comparative experiments show that the performance of the proposed algorithm is better than other deep learning algorithms in recognition rate and outperforms most of the existing state-of-the-art methods in non-occlusion pedestrian data set while performs fair in weakly and heavily occlusion data set.
APA, Harvard, Vancouver, ISO, and other styles
11

Almalki, Yassir Edrees, Abdul Qayyum, Muhammad Irfan, Noman Haider, Adam Glowacz, Fahad Mohammed Alshehri, Sharifa K. Alduraibi, et al. "A Novel Method for COVID-19 Diagnosis Using Artificial Intelligence in Chest X-ray Images." Healthcare 9, no. 5 (April 29, 2021): 522. http://dx.doi.org/10.3390/healthcare9050522.

Full text
Abstract:
The Coronavirus disease 2019 (COVID-19) is an infectious disease spreading rapidly and uncontrollably throughout the world. The critical challenge is the rapid detection of Coronavirus infected people. The available techniques being utilized are body-temperature measurement, along with anterior nasal swab analysis. However, taking nasal swabs and lab testing are complex, intrusive, and require many resources. Furthermore, the lack of test kits to meet the exceeding cases is also a major limitation. The current challenge is to develop some technology to non-intrusively detect the suspected Coronavirus patients through Artificial Intelligence (AI) techniques such as deep learning (DL). Another challenge to conduct the research on this area is the difficulty of obtaining the dataset due to a limited number of patients giving their consent to participate in the research study. Looking at the efficacy of AI in healthcare systems, it is a great challenge for the researchers to develop an AI algorithm that can help health professionals and government officials automatically identify and isolate people with Coronavirus symptoms. Hence, this paper proposes a novel method CoVIRNet (COVID Inception-ResNet model), which utilizes the chest X-rays to diagnose the COVID-19 patients automatically. The proposed algorithm has different inception residual blocks that cater to information by using different depths feature maps at different scales, with the various layers. The features are concatenated at each proposed classification block, using the average-pooling layer, and concatenated features are passed to the fully connected layer. The efficient proposed deep-learning blocks used different regularization techniques to minimize the overfitting due to the small COVID-19 dataset. The multiscale features are extracted at different levels of the proposed deep-learning model and then embedded into various machine-learning models to validate the combination of deep-learning and machine-learning models. The proposed CoVIR-Net model achieved 95.7% accuracy, and the CoVIR-Net feature extractor with random-forest classifier produced 97.29% accuracy, which is the highest, as compared to existing state-of-the-art deep-learning methods. The proposed model would be an automatic solution for the assessment and classification of COVID-19. We predict that the proposed method will demonstrate an outstanding performance as compared to the state-of-the-art techniques being used currently.
APA, Harvard, Vancouver, ISO, and other styles
12

Sun, Zhenzhen, and Yuanlong Yu. "Robust multi-class feature selection via l2,0-norm regularization minimization." Intelligent Data Analysis 26, no. 1 (January 14, 2022): 57–73. http://dx.doi.org/10.3233/ida-205724.

Full text
Abstract:
Feature selection is an important data preprocessing in data mining and machine learning, that can reduce the number of features without deteriorating model’s performance. Recently, sparse regression has received considerable attention in feature selection task due to its good performance. However, because the l2,0-norm regularization term is non-convex, this problem is hard to solve, and most of the existing methods relaxed it by l2,1-norm. Unlike the existing methods, this paper proposes a novel method to solve the l2,0-norm regularized least squares problem directly based on iterative hard thresholding, which can produce exact row-sparsity solution for weights matrix, and features can be selected more precisely. Furthermore, two homotopy strategies are derived to reduce the computational time of the optimization method, which are more practical for real-world applications. The proposed method is verified on eight biological datasets, experimental results show that our method can achieve higher classification accuracy with fewer number of selected features than the approximate convex counterparts and other state-of-the-art feature selection methods.
APA, Harvard, Vancouver, ISO, and other styles
13

V. Graça, João, Kuzman Ganchev, and Ben Taskar. "Learning Tractable Word Alignment Models with Complex Constraints." Computational Linguistics 36, no. 3 (September 2010): 481–504. http://dx.doi.org/10.1162/coli_a_00007.

Full text
Abstract:
Word-level alignment of bilingual text is a critical resource for a growing variety of tasks. Probabilistic models for word alignment present a fundamental trade-off between richness of captured constraints and correlations versus efficiency and tractability of inference. In this article, we use the Posterior Regularization framework (Graça, Ganchev, and Taskar 2007) to incorporate complex constraints into probabilistic models during learning without changing the efficiency of the underlying model. We focus on the simple and tractable hidden Markov model, and present an efficient learning algorithm for incorporating approximate bijectivity and symmetry constraints. Models estimated with these constraints produce a significant boost in performance as measured by both precision and recall of manually annotated alignments for six language pairs. We also report experiments on two different tasks where word alignments are required: phrase-based machine translation and syntax transfer, and show promising improvements over standard methods.
APA, Harvard, Vancouver, ISO, and other styles
14

Dou, Wenbang, Weihong Chin, and Naoyuki Kubota. "Multi-Scopic Cognitive Memory System for Continuous Gesture Learning." Biomimetics 8, no. 1 (February 21, 2023): 88. http://dx.doi.org/10.3390/biomimetics8010088.

Full text
Abstract:
With the advancement of artificial intelligence technologies in recent years, research on intelligent robots has progressed. Robots are required to understand human intentions and communicate more smoothly with humans. Since gestures can have a variety of meanings, gesture recognition is one of the essential issues in communication between robots and humans. In addition, robots need to learn new gestures as humans grow. Moreover, individual gestures vary. Because catastrophic forgetting occurs in training new data in traditional gesture recognition approaches, it is necessary to preserve the prepared data and combine it with further data to train the model from scratch. We propose a Multi-scopic Cognitive Memory System (MCMS) that mimics the lifelong learning process of humans and can continuously learn new gestures without forgetting previously learned gestures. The proposed system comprises a two-layer structure consisting of an episode memory layer and a semantic memory layer, with a topological map as its backbone. The system is designed with reference to conventional continuous learning systems in three ways: (i) using a dynamic architecture without setting the network size, (ii) adding regularization terms to constrain learning, and (iii) generating data from the network itself and performing relearning. The episode memory layer clusters the data and learns their spatiotemporal representation. The semantic memory layer generates a topological map based on task-related inputs and stores them as longer-term episode representations in the robot’s memory. In addition, to alleviate catastrophic forgetting, the memory replay function can reinforce memories autonomously. The proposed system could mitigate catastrophic forgetting and perform continuous learning by using both machine learning benchmark datasets and real-world data compared to conventional methods.
APA, Harvard, Vancouver, ISO, and other styles
15

Yanishevskaya, N. A., and I. P. Bolodurina. "APPLICATION OF COMPUTER VISION TECHNOLOGIES FOR THE DEVELOPMENT OF A MODEL FOR THE RECOGNITION OF LESIONS OF CULTIVATED PLANTS." Bulletin of the South Ural State University. Ser. Computer Technologies, Automatic Control & Radioelectronics 21, no. 3 (August 2021): 5–13. http://dx.doi.org/10.14529/ctcr210301.

Full text
Abstract:
In the Russian Federation, the agro-industrial complex is one of the leading sectors of the eco-nomy with a volume of domestic product of 4.5%. Russia owns 10 % of all arable land in the world. According to the data on the sown areas by crops in 2020, most of the agricultural area of Russia is occupied by wheat. The Russian Federation ranks third in the ranking of leading countries in the production of this type of grain crops, as well as leading positions in its export. Brown (leaf) and linear (stem) rust is the most harmful disease of grain crops. It is the reason for the sparseness of wheat crops and leads to a sharp decrease in yield. Therefore, one of the main tasks of farmers is to preserve the crop from diseases. The application of such areas of artificial intelligence as computer vision, machine learning and deep learning is able to cope with this task. These artificial intelligence technologies allow us to successfully solve applied problems of the agro-industrial complex using automated analysis of photographic materials. Aim. To consider the application of computer vision methods for the problem of classification of lesions of cultivated plants on the example of wheat. Materials and methods. The CGIAR Computer Vision for Crop Disease dataset for the crop disease recognition task is taken from the open source Kaggle. It is proposed to use an approach to the re-cognition of lesions of cultivated plants using the well-known neural network models ResNet50, DenseNet169, VGG16 and EfficientNet-B0. Neural network models receive images of wheat as in-put. The output of neural networks is the class of plant damage. To overcome the effect of overfit-ting neural networks, various regularization techniques are investigated. Results. The results of the classification quality, estimated by the software using the F1-score metric, which is the average harmonic between the Precision and Recall measures, are presented. Conclusion. As a result of the conducted research, it was found that the DenseNet model showed the best recognition accuracy us-ing a combination of transfer learning technology and DropOut and L2 regulation technologies to overcome the effect of retraining. The use of this approach allowed us to achieve a recognition ac-curacy of 91%.
APA, Harvard, Vancouver, ISO, and other styles
16

Mohiuddin, Tasnim, and Shafiq Joty. "Unsupervised Word Translation with Adversarial Autoencoder." Computational Linguistics 46, no. 2 (June 2020): 257–88. http://dx.doi.org/10.1162/coli_a_00374.

Full text
Abstract:
Crosslingual word embeddings learned from monolingual embeddings have a crucial role in many downstream tasks, ranging from machine translation to transfer learning. Adversarial training has shown impressive success in learning crosslingual embeddings and the associated word translation task without any parallel data by mapping monolingual embeddings to a shared space. However, recent work has shown superior performance for non-adversarial methods in more challenging language pairs. In this article, we investigate adversarial autoencoder for unsupervised word translation and propose two novel extensions to it that yield more stable training and improved results. Our method includes regularization terms to enforce cycle consistency and input reconstruction, and puts the target encoders as an adversary against the corresponding discriminator. We use two types of refinement procedures sequentially after obtaining the trained encoders and mappings from the adversarial training, namely, refinement with Procrustes solution and refinement with symmetric re-weighting. Extensive experimentations with high- and low-resource languages from two different data sets show that our method achieves better performance than existing adversarial and non-adversarial approaches and is also competitive with the supervised system. Along with performing comprehensive ablation studies to understand the contribution of different components of our adversarial model, we also conduct a thorough analysis of the refinement procedures to understand their effects.
APA, Harvard, Vancouver, ISO, and other styles
17

Ichihashi, Hidetomo, and Katsuhiro Honda. "Application of Kernel Trick to Fuzzy c-Means with Regularization by K-L Information." Journal of Advanced Computational Intelligence and Intelligent Informatics 8, no. 6 (November 20, 2004): 566–72. http://dx.doi.org/10.20965/jaciii.2004.p0566.

Full text
Abstract:
Support vector machines (SVM), kernel principal component analysis (KPCA), and kernel Fisher discriminant analysis (KFD), are examples of successful kernel-based learning methods. By the addition of a regularizer and the kernel trick to a fuzzy counterpart of Gaussian mixture models (GMM), this paper proposes a clustering algorithm in an extended high dimensional feature space. Unlike the global nonlinear approaches, GMM or its fuzzy counterpart is to model nonlinear structure with a collection, or mixture, of local linear sub-models of PCA. When the number of feature vectors and clusters are n and c respectively, this kernel approach can find up to c × n nonzero eigenvalues. A way to control the number of parameters in the mixture of probabilistic principal component analysis (PPCA) is adopted to reduce the number of parameters. The algorithm provides a partitioning with flexible shape of clusters in the original input data space.
APA, Harvard, Vancouver, ISO, and other styles
18

Vasicek, Daniel. "Artificial intelligence and machine learning: Practical aspects of overfitting and regularization." Information Services & Use 39, no. 4 (February 6, 2020): 281–89. http://dx.doi.org/10.3233/isu-190059.

Full text
APA, Harvard, Vancouver, ISO, and other styles
19

Guo, Lihua. "Extreme Learning Machine with Elastic Net Regularization." Intelligent Automation & Soft Computing 26, no. 3 (2020): 421–27. http://dx.doi.org/10.32604/iasc.2020.013918.

Full text
APA, Harvard, Vancouver, ISO, and other styles
20

Zhang, Boyang, Zhao Ma, Yingyi Liu, Haiwen Yuan, and Lingjie Sun. "Ensemble based reactivated regularization extreme learning machine for classification." Neurocomputing 275 (January 2018): 255–66. http://dx.doi.org/10.1016/j.neucom.2017.07.018.

Full text
APA, Harvard, Vancouver, ISO, and other styles
21

Serey, Joel, Luis Quezada, Miguel Alfaro, Guillermo Fuertes, Manuel Vargas, Rodrigo Ternero, Jorge Sabattin, Claudia Duran, and Sebastian Gutierrez. "Artificial Intelligence Methodologies for Data Management." Symmetry 13, no. 11 (October 29, 2021): 2040. http://dx.doi.org/10.3390/sym13112040.

Full text
Abstract:
This study analyses the main challenges, trends, technological approaches, and artificial intelligence methods developed by new researchers and professionals in the field of machine learning, with an emphasis on the most outstanding and relevant works to date. This literature review evaluates the main methodological contributions of artificial intelligence through machine learning. The methodology used to study the documents was content analysis; the basic terminology of the study corresponds to machine learning, artificial intelligence, and big data between the years 2017 and 2021. For this study, we selected 181 references, of which 120 are part of the literature review. The conceptual framework includes 12 categories, four groups, and eight subgroups. The study of data management using AI methodologies presents symmetry in the four machine learning groups: supervised learning, unsupervised learning, semi-supervised learning, and reinforced learning. Furthermore, the artificial intelligence methods with more symmetry in all groups are artificial neural networks, Support Vector Machines, K-means, and Bayesian Methods. Finally, five research avenues are presented to improve the prediction of machine learning.
APA, Harvard, Vancouver, ISO, and other styles
22

Et. al., D. Saravanan ,. "Predict and Measure Air Quality Monitoring System Using Machine Learning." Turkish Journal of Computer and Mathematics Education (TURCOMAT) 12, no. 2 (April 10, 2021): 2562–71. http://dx.doi.org/10.17762/turcomat.v12i2.2217.

Full text
Abstract:
This article looks at how artificial intelligence can help expect the hourly consolidation of air toxinSulphur ozone, element matter (PM2.5), and Sulphur dioxide. As one of the most excellently procedures, AI can efficiently prepare a model on a large amount of data by using large-scale streamlining computations. Even thoughseveral works use AI to predict air quality, most of the earlier studies are limited to long-term data and easilyinstruct regular relapse designs (direct or nonlinear) to expect the hourly air pollution focus. This paper suggestsadvanced analysis to simulate the hourly environmental change focus based on previous days' weather-related data by calculating the expectation for more than 24 hours as an execute multiple tasks learning (MTL) issue. This allows us to choose a suitable model with a variety of regularization strategies. We suggest a useful regularization that maintains the assumption patterns of concurrent hours to be nearby to each other, and we evaluate it to a few common MTL expect completion such as normal Frobenius standard regularization, normal atomicregularization, and '2,1-standard regularization. Our tests revealed that the suggested boundary declining concepts and constant hour-related regularizations outperform open product relapse models and regularizations in terms of execution.
APA, Harvard, Vancouver, ISO, and other styles
23

Shui, Changjian, Boyu Wang, and Christian Gagné. "On the benefits of representation regularization in invariance based domain generalization." Machine Learning 111, no. 3 (January 1, 2022): 895–915. http://dx.doi.org/10.1007/s10994-021-06080-w.

Full text
Abstract:
AbstractA crucial aspect of reliable machine learning is to design a deployable system for generalizing new related but unobserved environments. Domain generalization aims to alleviate such a prediction gap between the observed and unseen environments. Previous approaches commonly incorporated learning the invariant representation for achieving good empirical performance. In this paper, we reveal that merely learning the invariant representation is vulnerable to the related unseen environment. To this end, we derive a novel theoretical analysis to control the unseen test environment error in the representation learning, which highlights the importance of controlling the smoothness of representation. In practice, our analysis further inspires an efficient regularization method to improve the robustness in domain generalization. The proposed regularization is orthogonal to and can be straightforwardly adopted in existing domain generalization algorithms that ensure invariant representation learning. Empirical results show that our algorithm outperforms the base versions in various datasets and invariance criteria.
APA, Harvard, Vancouver, ISO, and other styles
24

Weiss, S. M., and N. Indurkhya. "Rule-based Machine Learning Methods for Functional Prediction." Journal of Artificial Intelligence Research 3 (December 1, 1995): 383–403. http://dx.doi.org/10.1613/jair.199.

Full text
Abstract:
We describe a machine learning method for predicting the value of a real-valued function, given the values of multiple input variables. The method induces solutions from samples in the form of ordered disjunctive normal form (DNF) decision rules. A central objective of the method and representation is the induction of compact, easily interpretable solutions. This rule-based decision model can be extended to search efficiently for similar cases prior to approximating function values. Experimental results on real-world data demonstrate that the new techniques are competitive with existing machine learning and statistical methods and can sometimes yield superior regression performance.
APA, Harvard, Vancouver, ISO, and other styles
25

Hein, Helle, and Ljubov Jaanuska. "Comparison of machine learning methods for crack localization." Acta et Commentationes Universitatis Tartuensis de Mathematica 23, no. 1 (August 9, 2019): 125–42. http://dx.doi.org/10.12697/acutm.2019.23.13.

Full text
Abstract:
In this paper, the Haar wavelet discrete transform, the artificial neural networks (ANNs), and the random forests (RFs) are applied to predict the location and severity of a crack in an Euler–Bernoulli cantilever subjected to the transverse free vibration. An extensive investigation into two data collection sets and machine learning methods showed that the depth of a crack is more difficult to predict than its location. The data set of eight natural frequency parameters produces more accurate predictions on the crack depth; meanwhile, the data set of eight Haar wavelet coefficients produces more precise predictions on the crack location. Furthermore, the analysis of the results showed that the ensemble of 50 ANN trained by Bayesian regularization and Levenberg–Marquardt algorithms slightly outperforms RF.
APA, Harvard, Vancouver, ISO, and other styles
26

Alspector, Joshua, and Thomas Dietterich. "DARPA’s Role in Machine Learning." AI Magazine 41, no. 2 (June 23, 2020): 36–48. http://dx.doi.org/10.1609/aimag.v41i2.5298.

Full text
Abstract:
Machine learning methods provide a way for artificial intelligence systems to learn from experience. This article describes four threads of machine learning research supported and guided by the Defense Advanced Research Projects Agency — probabilistic modeling for speech recognition, probabilistic relational models, the integration of multiple machine learning approaches into a task-specific system, and neural network technology. These threads illustrate the Defense Advanced Research Projects Agency way of creating timely advances in a field.
APA, Harvard, Vancouver, ISO, and other styles
27

Patil, Shruti, Vijayakumar Varadarajan, Siddiqui Mohd Mazhar, Abdulwodood Sahibzada, Nihal Ahmed, Onkar Sinha, Satish Kumar, Kailash Shaw, and Ketan Kotecha. "Explainable Artificial Intelligence for Intrusion Detection System." Electronics 11, no. 19 (September 27, 2022): 3079. http://dx.doi.org/10.3390/electronics11193079.

Full text
Abstract:
Intrusion detection systems are widely utilized in the cyber security field, to prevent and mitigate threats. Intrusion detection systems (IDS) help to keep threats and vulnerabilities out of computer networks. To develop effective intrusion detection systems, a range of machine learning methods are available. Machine learning ensemble methods have a well-proven track record when it comes to learning. Using ensemble methods of machine learning, this paper proposes an innovative intrusion detection system. To improve classification accuracy and eliminate false positives, features from the CICIDS-2017 dataset were chosen. This paper proposes an intrusion detection system using machine learning algorithms such as decision trees, random forests, and SVM (IDS). After training these models, an ensemble technique voting classifier was added and achieved an accuracy of 96.25%. Furthermore, the proposed model also incorporates the XAI algorithm LIME for better explainability and understanding of the black-box approach to reliable intrusion detection. Our experimental results confirmed that XAI LIME is more explanation-friendly and more responsive.
APA, Harvard, Vancouver, ISO, and other styles
28

Pettit, Rowland W., Robert Fullem, Chao Cheng, and Christopher I. Amos. "Artificial intelligence, machine learning, and deep learning for clinical outcome prediction." Emerging Topics in Life Sciences 5, no. 6 (December 20, 2021): 729–45. http://dx.doi.org/10.1042/etls20210246.

Full text
Abstract:
AI is a broad concept, grouping initiatives that use a computer to perform tasks that would usually require a human to complete. AI methods are well suited to predict clinical outcomes. In practice, AI methods can be thought of as functions that learn the outcomes accompanying standardized input data to produce accurate outcome predictions when trialed with new data. Current methods for cleaning, creating, accessing, extracting, augmenting, and representing data for training AI clinical prediction models are well defined. The use of AI to predict clinical outcomes is a dynamic and rapidly evolving arena, with new methods and applications emerging. Extraction or accession of electronic health care records and combining these with patient genetic data is an area of present attention, with tremendous potential for future growth. Machine learning approaches, including decision tree methods of Random Forest and XGBoost, and deep learning techniques including deep multi-layer and recurrent neural networks, afford unique capabilities to accurately create predictions from high dimensional, multimodal data. Furthermore, AI methods are increasing our ability to accurately predict clinical outcomes that previously were difficult to model, including time-dependent and multi-class outcomes. Barriers to robust AI-based clinical outcome model deployment include changing AI product development interfaces, the specificity of regulation requirements, and limitations in ensuring model interpretability, generalizability, and adaptability over time.
APA, Harvard, Vancouver, ISO, and other styles
29

Zhou, Yong, Beizuo Liu, Shixiong Xia, and Bing Liu. "Semi-supervised extreme learning machine with manifold and pairwise constraints regularization." Neurocomputing 149 (February 2015): 180–86. http://dx.doi.org/10.1016/j.neucom.2014.01.073.

Full text
APA, Harvard, Vancouver, ISO, and other styles
30

Campos Souza, Paulo Vitor De, Augusto Junio Guimaraes, Vanessa Souza Ararújo, Thiago Silva Rezende, and Vinicius Jonathan Silva Araújo. "Fuzzy Neural Networks based on Fuzzy Logic Neurons Regularized by Resampling Techniques and Regularization Theory for Regression Problems." Inteligencia Artificial 21, no. 62 (November 9, 2018): 114. http://dx.doi.org/10.4114/intartif.vol22iss63pp114-133.

Full text
Abstract:
This paper presents a novel learning algorithm for fuzzy logic neuron based on neural networks and fuzzy systems able to generate accurate and transparent models. The learning algorithm is based on ideas from Extreme Learning Machine [36], to achieve a low time complexity, and regularization theory, resulting in sparse and accurate models. A compact set of incomplete fuzzy rules can be extracted from the resulting network topology. Experiments considering regression problems are detailed. Results suggest the proposed approach as a promising alternative for pattern recognition with a good accuracy and some level of interpretability.
APA, Harvard, Vancouver, ISO, and other styles
31

Jurewicz, Mateusz, and Leon Derczynski. "Set-to-Sequence Methods in Machine Learning: A Review." Journal of Artificial Intelligence Research 71 (August 12, 2021): 885–924. http://dx.doi.org/10.1613/jair.1.12839.

Full text
Abstract:
Machine learning on sets towards sequential output is an important and ubiquitous task, with applications ranging from language modelling and meta-learning to multi-agent strategy games and power grid optimization. Combining elements of representation learning and structured prediction, its two primary challenges include obtaining a meaningful, permutation invariant set representation and subsequently utilizing this representation to output a complex target permutation. This paper provides a comprehensive introduction to the _eld as well as an overview of important machine learning methods tackling both of these key challenges, with a detailed qualitative comparison of selected model architectures.
APA, Harvard, Vancouver, ISO, and other styles
32

Zharikova, E. P., J. Yu Grigoriev, and A. L. Grigorieva. "Artificial Intelligence Methods for Detecting Water Pollution." IOP Conference Series: Earth and Environmental Science 988, no. 2 (February 1, 2022): 022082. http://dx.doi.org/10.1088/1755-1315/988/2/022082.

Full text
Abstract:
Abstract In the modern world, industrial development often negatively affects the environment, including the state of water bodies. Pollution of various types, from thermal to chemical (oil spills, industrial waste dumping and thermometric disturbances), have a detrimental effect on flora and fauna. Continuous monitoring of water areas allows timely detection of pollution. One of the tasks of analyzing the state of water resources is monitoring the water surface and monitoring the coastal zone. The aim of the study is to compare classical approaches based on the application of spectral characteristics and machine learning methods to the analysis of the state of water bodies. The studies show the disadvantages of classical methods of remote sensing in solving problems of autonomous monitoring, consisting in poor resistance to noise and the need for constant expert assessment. The paper presents solutions to the problem of detecting pollution of water bodies using machine learning methods.
APA, Harvard, Vancouver, ISO, and other styles
33

Bargagli Stoffi, Falco J., Gustavo Cevolani, and Giorgio Gnecco. "Simple Models in Complex Worlds: Occam’s Razor and Statistical Learning Theory." Minds and Machines 32, no. 1 (March 2022): 13–42. http://dx.doi.org/10.1007/s11023-022-09592-z.

Full text
Abstract:
AbstractThe idea that “simplicity is a sign of truth”, and the related “Occam’s razor” principle, stating that, all other things being equal, simpler models should be preferred to more complex ones, have been long discussed in philosophy and science. We explore these ideas in the context of supervised machine learning, namely the branch of artificial intelligence that studies algorithms which balance simplicity and accuracy in order to effectively learn about the features of the underlying domain. Focusing on statistical learning theory, we show that situations exist for which a preference for simpler models (as modeled through the addition of a regularization term in the learning problem) provably slows down, instead of favoring, the supervised learning process. Our results shed new light on the relations between simplicity and truth approximation, which are briefly discussed in the context of both machine learning and the philosophy of science.
APA, Harvard, Vancouver, ISO, and other styles
34

Hashimoto, Daniel A., Elan Witkowski, Lei Gao, Ozanan Meireles, and Guy Rosman. "Artificial Intelligence in Anesthesiology." Anesthesiology 132, no. 2 (February 1, 2020): 379–94. http://dx.doi.org/10.1097/aln.0000000000002960.

Full text
Abstract:
Abstract Artificial intelligence has been advancing in fields including anesthesiology. This scoping review of the intersection of artificial intelligence and anesthesia research identified and summarized six themes of applications of artificial intelligence in anesthesiology: (1) depth of anesthesia monitoring, (2) control of anesthesia, (3) event and risk prediction, (4) ultrasound guidance, (5) pain management, and (6) operating room logistics. Based on papers identified in the review, several topics within artificial intelligence were described and summarized: (1) machine learning (including supervised, unsupervised, and reinforcement learning), (2) techniques in artificial intelligence (e.g., classical machine learning, neural networks and deep learning, Bayesian methods), and (3) major applied fields in artificial intelligence. The implications of artificial intelligence for the practicing anesthesiologist are discussed as are its limitations and the role of clinicians in further developing artificial intelligence for use in clinical care. Artificial intelligence has the potential to impact the practice of anesthesiology in aspects ranging from perioperative support to critical care delivery to outpatient pain management.
APA, Harvard, Vancouver, ISO, and other styles
35

Bektaş, Mustafa, Babs M. Zonderhuis, Henk A. Marquering, Jaime Costa Pereira, George L. Burchell, and Donald L. van der Peet. "Artificial intelligence in hepatFIGopancreaticobiliary surgery: a systematic review." Artificial Intelligence Surgery 2, no. 3 (2022): 132–43. http://dx.doi.org/10.20517/ais.2022.20.

Full text
Abstract:
Aim: The aim of this systematic review was to provide an overview of Machine Learning applications within hepatopancreaticobiliary surgery. The secondary aim was to evaluate the predictive performances of applied Machine Learning models. Methods: A systematic search was conducted in PubMed, EMBASE, Cochrane, and Web of Science. Studies were only eligible for inclusion when they described Machine Learning in hepatopancreaticobiliary surgery. The Cochrane and PROBAST risk of bias tools were used to evaluate the quality of studies and included Machine Learning models. Results: Out of 1821 articles, 52 studies have met the inclusion criteria. The majority of Machine Learning models were developed to predict the course of disease, and postoperative complications. The course of disease has been predicted with accuracies up to 99%, and postoperative complications with accuracies up to 89%. Most studies had a retrospective study design, in which external validation was absent for Machine Learning models. Conclusion: Machine learning models have shown promising accuracies in the prediction of short-term and long-term surgical outcomes after hepatopancreaticobiliary surgery. External validation of Machine Learning models is required to facilitate the clinical introduction of Machine Learning.
APA, Harvard, Vancouver, ISO, and other styles
36

Shi, Tianyi, Wei Huang, Man Zhang, and Jingyue Wu. "Application of Artificial Intelligence in Material Testing." Highlights in Science, Engineering and Technology 1 (June 14, 2022): 171–74. http://dx.doi.org/10.54097/hset.v1i.445.

Full text
Abstract:
Under the development of artificial intelligence technology, the technical level of machine learning and machine vision has been significantly improved. For testing, machine vision can input the characteristics of the inspected object into the computer, while machine learning ability enables the computer to better analyze the characteristics of the inspected object and make the testing conclusion. Compared with traditional testing methods, this process has the characteristics of high accuracy and high speed, and its excellent performance can be used in all aspects of material testing.
APA, Harvard, Vancouver, ISO, and other styles
37

Wu, Chengyuan, and Carol Anne Hargreaves. "Topological Machine Learning for Mixed Numeric and Categorical Data." International Journal on Artificial Intelligence Tools 30, no. 05 (August 2021): 2150025. http://dx.doi.org/10.1142/s0218213021500251.

Full text
Abstract:
Topological data analysis is a relatively new branch of machine learning that excels in studying high-dimensional data, and is theoretically known to be robust against noise. Meanwhile, data objects with mixed numeric and categorical attributes are ubiquitous in real-world applications. However, topological methods are usually applied to point cloud data, and to the best of our knowledge there is no available framework for the classification of mixed data using topological methods. In this paper, we propose a novel topological machine learning method for mixed data classification. In the proposed method, we use theory from topological data analysis such as persistent homology, persistence diagrams and Wasserstein distance to study mixed data. The performance of the proposed method is demonstrated by experiments on a real-world heart disease dataset. Experimental results show that our topological method outperforms several state-of-the-art algorithms in the prediction of heart disease.
APA, Harvard, Vancouver, ISO, and other styles
38

Tiwari, Tanya, Tanuj Tiwari, and Sanjay Tiwari. "How Artificial Intelligence, Machine Learning and Deep Learning are Radically Different?" International Journal of Advanced Research in Computer Science and Software Engineering 8, no. 2 (March 6, 2018): 1. http://dx.doi.org/10.23956/ijarcsse.v8i2.569.

Full text
Abstract:
There is a lot of confusion these days about Artificial Intelligence (AI), Machine Learning (ML) and Deep Learning (DL). A computer system able to perform tasks that normally require human intelligence, such as visual perception, speech recognition, decision-making, and translation between languages. Artificial Intelligence has made it possible. Deep learning is a subset of machine learning, and machine learning is a subset of AI, which is an umbrella term for any computer program that does something smart. In other words, all machine learning is AI, but not all AI is machine learning, and so forth. Machine Learning represents a key evolution in the fields of computer science, data analysis, software engineering, and artificial intelligence. Machine learning (ML)is a vibrant field of research, with a range of exciting areas for further development across different methods and applications. These areas include algorithmic interpretability, robustness, privacy, fairness, inference of causality, human-machine interaction, and security. The goal of ML is never to make “perfect” guesses, because ML deals in domains where there is no such thing. The goal is to make guesses that are good enough to be useful. Deep learning is a particular kind of machine learning that achieves great power and flexibility by learning to represent the world as nested hierarchy of concepts, with each concept defined in relation to simpler concepts, and more abstract representations computed in terms of less abstract ones. This paper gives an overview of artificial intelligence, machine learning & deep learning techniques and compare these techniques.
APA, Harvard, Vancouver, ISO, and other styles
39

Rusova, O., V. Bredikhin, and V. Verbytska. "FOCUS ON ARTIFICIAL INTELLIGENCE FOR PREDICTING THE OUTFLOW OF CLIENTS FROM ON-LINE EDUCATION SITES." Municipal economy of cities 4, no. 171 (October 17, 2022): 2–6. http://dx.doi.org/10.33042/2522-1809-2022-4-171-2-6.

Full text
Abstract:
The article examines the task of assessing the cost of housing in the cities of Ukraine. The purpose of this work is to simplify the determination of the value of apartments on the real estate market using machine learning technologies. To solve this problem, it is proposed to use a program module in Python using the Sequential direct distribution model of the keras library. A program was created that estimates the value of apartments according to their parameters using a neural network. The importance of forecasting in the field of real estate is shown, because the housing market is a systemic part of the regional economy. The results of the software application, which consists of two parts, are presented. The first program collects the necessary data for training a neural network about apartments from the OLX site ads, their structuring and recording in a csv file. The second program provides tools for preliminary analysis of the collected data, after which they are cleaned, divided into training and test samples and trained on their basis by a multilayer neural network of direct propagation using a machine learning algorithm. The layers API of the keras library was used to design the neural network, which allows the user to create arbitrary layers. For regularization, the keras.regularizers tool, which is also in the layers API, is used. To configure model metrics, the compile method was used. Three hidden layers were defined, for each of which 512 neurons were introduced and the Relu activation function was chosen. Calculations of the correlation of prediction indicators and error curves of machine learning are given. As a result of testing the trained neural network on a test set of 652 examples, an average absolute error of 3570.88 was obtained, and the accuracy of the model was approximately 85%. Thus, the neural network has reached an acceptable level of accuracy for estimating the cost of apartments in the city of Kharkiv. Ways to reduce test errors and learning errors using cross-validation are proposed. Concepts of learning hyper-parameters and their regularization are considered Keywords: neural networks, deep learning, machine learning, regression, prediction, estimation, data analysis.
APA, Harvard, Vancouver, ISO, and other styles
40

El-Nabarawy, Islam, and Ashraf M. Abdelbar. "Advanced learning methods and exponent regularization applied to a high order neural network." Neural Computing and Applications 25, no. 3-4 (April 3, 2014): 897–910. http://dx.doi.org/10.1007/s00521-014-1563-7.

Full text
APA, Harvard, Vancouver, ISO, and other styles
41

Tripathi, Diwakar, Damodar Reddy Edla, Annushree Bablani, Alok Kumar Shukla, and B. Ramachandra Reddy. "Experimental analysis of machine learning methods for credit score classification." Progress in Artificial Intelligence 10, no. 3 (March 15, 2021): 217–43. http://dx.doi.org/10.1007/s13748-021-00238-2.

Full text
APA, Harvard, Vancouver, ISO, and other styles
42

M S, Dharmapriya. "Thesis on Machine Learning Methods and Its Applications." International Journal for Research in Applied Science and Engineering Technology 9, no. 10 (October 31, 2021): 746–57. http://dx.doi.org/10.22214/ijraset.2021.38506.

Full text
Abstract:
Abstract: In the 1950s, the concept of machine learning was discovered and developed as a subfield of artificial intelligence. However, there were no significant developments or research on it until this decade. Typically, this field of study has developed and expanded since the 1990s. It is a field that will continue to develop in the future due to the difficulty of analysing and processing data as the number of records and documents increases. Due to the increasing data, machine learning focuses on finding the best model for the new data that takes into account all the previous data. Therefore, machine learning research will continue in correlation with this increasing data. This research focuses on the history of machine learning, the methods of machine learning, its applications, and the research that has been conducted on this topic. Our study aims to give researchers a deeper understanding of machine learning, an area of research that is becoming much more popular today, and its applications. Keywords: Machine Learning, Machine Learning Algorithms, Artificial Intelligence, Big Data.
APA, Harvard, Vancouver, ISO, and other styles
43

Abdelmonem, Ahmed, and Nehal N. Mostafa. "Interpretable Machine Learning Fusion and Data Analytics Models for Anomaly Detection." Fusion: Practice and Applications 3, no. 1 (2021): 54–69. http://dx.doi.org/10.54216/fpa.030104.

Full text
Abstract:
Explainable artificial intelligence received great research attention in the past few years during the widespread of Black-Box techniques in sensitive fields such as medical care, self-driving cars, etc. Artificial intelligence needs explainable methods to discover model biases. Explainable artificial intelligence will lead to obtaining fairness and Transparency in the model. Making artificial intelligence models explainable and interpretable is challenging when implementing black-box models. Because of the inherent limitations of collecting data in its raw form, data fusion has become a popular method for dealing with such data and acquiring more trustworthy, helpful, and precise insights. Compared to other, more traditional-based data fusion methods, machine learning's capacity to automatically learn from experience with nonexplicit programming significantly improves fusion's computational and predictive power. This paper comprehensively studies the most explainable artificial intelligent methods based on anomaly detection. We proposed the required criteria of the transparency model to measure the data fusion analytics techniques. Also, define the different used evaluation metrics in explainable artificial intelligence. We provide some applications for explainable artificial intelligence. We provide a case study of anomaly detection with the fusion of machine learning. Finally, we discuss the key challenges and future directions in explainable artificial intelligence.
APA, Harvard, Vancouver, ISO, and other styles
44

Chen, Zihao, Long Hu, Bao-Ting Zhang, Aiping Lu, Yaofeng Wang, Yuanyuan Yu, and Ge Zhang. "Artificial Intelligence in Aptamer–Target Binding Prediction." International Journal of Molecular Sciences 22, no. 7 (March 30, 2021): 3605. http://dx.doi.org/10.3390/ijms22073605.

Full text
Abstract:
Aptamers are short single-stranded DNA, RNA, or synthetic Xeno nucleic acids (XNA) molecules that can interact with corresponding targets with high affinity. Owing to their unique features, including low cost of production, easy chemical modification, high thermal stability, reproducibility, as well as low levels of immunogenicity and toxicity, aptamers can be used as an alternative to antibodies in diagnostics and therapeutics. Systematic evolution of ligands by exponential enrichment (SELEX), an experimental approach for aptamer screening, allows the selection and identification of in vitro aptamers with high affinity and specificity. However, the SELEX process is time consuming and characterization of the representative aptamer candidates from SELEX is rather laborious. Artificial intelligence (AI) could help to rapidly identify the potential aptamer candidates from a vast number of sequences. This review discusses the advancements of AI pipelines/methods, including structure-based and machine/deep learning-based methods, for predicting the binding ability of aptamers to targets. Structure-based methods are the most used in computer-aided drug design. For this part, we review the secondary and tertiary structure prediction methods for aptamers, molecular docking, as well as molecular dynamic simulation methods for aptamer–target binding. We also performed analysis to compare the accuracy of different secondary and tertiary structure prediction methods for aptamers. On the other hand, advanced machine-/deep-learning models have witnessed successes in predicting the binding abilities between targets and ligands in drug discovery and thus potentially offer a robust and accurate approach to predict the binding between aptamers and targets. The research utilizing machine-/deep-learning techniques for prediction of aptamer–target binding is limited currently. Therefore, perspectives for models, algorithms, and implementation strategies of machine/deep learning-based methods are discussed. This review could facilitate the development and application of high-throughput and less laborious in silico methods in aptamer selection and characterization.
APA, Harvard, Vancouver, ISO, and other styles
45

Litman, D. J. "Cue Phrase Classification Using Machine Learning." Journal of Artificial Intelligence Research 5 (September 1, 1996): 53–94. http://dx.doi.org/10.1613/jair.327.

Full text
Abstract:
Cue phrases may be used in a discourse sense to explicitly signal discourse structure, but also in a sentential sense to convey semantic rather than structural information. Correctly classifying cue phrases as discourse or sentential is critical in natural language processing systems that exploit discourse structure, e.g., for performing tasks such as anaphora resolution and plan recognition. This paper explores the use of machine learning for classifying cue phrases as discourse or sentential. Two machine learning programs (Cgrendel and C4.5) are used to induce classification models from sets of pre-classified cue phrases and their features in text and speech. Machine learning is shown to be an effective technique for not only automating the generation of classification models, but also for improving upon previous results. When compared to manually derived classification models already in the literature, the learned models often perform with higher accuracy and contain new linguistic insights into the data. In addition, the ability to automatically construct classification models makes it easier to comparatively analyze the utility of alternative feature representations of the data. Finally, the ease of retraining makes the learning approach more scalable and flexible than manual methods.
APA, Harvard, Vancouver, ISO, and other styles
46

Rashidi, Hooman H., Nam K. Tran, Elham Vali Betts, Lydia P. Howell, and Ralph Green. "Artificial Intelligence and Machine Learning in Pathology: The Present Landscape of Supervised Methods." Academic Pathology 6 (January 1, 2019): 237428951987308. http://dx.doi.org/10.1177/2374289519873088.

Full text
Abstract:
Increased interest in the opportunities provided by artificial intelligence and machine learning has spawned a new field of health-care research. The new tools under development are targeting many aspects of medical practice, including changes to the practice of pathology and laboratory medicine. Optimal design in these powerful tools requires cross-disciplinary literacy, including basic knowledge and understanding of critical concepts that have traditionally been unfamiliar to pathologists and laboratorians. This review provides definitions and basic knowledge of machine learning categories (supervised, unsupervised, and reinforcement learning), introduces the underlying concept of the bias-variance trade-off as an important foundation in supervised machine learning, and discusses approaches to the supervised machine learning study design along with an overview and description of common supervised machine learning algorithms (linear regression, logistic regression, Naive Bayes, k-nearest neighbor, support vector machine, random forest, convolutional neural networks).
APA, Harvard, Vancouver, ISO, and other styles
47

Mosteiro, Pablo, Jesse Kuiper, Judith Masthoff, Floortje Scheepers, and Marco Spruit. "Bias Discovery in Machine Learning Models for Mental Health." Information 13, no. 5 (May 5, 2022): 237. http://dx.doi.org/10.3390/info13050237.

Full text
Abstract:
Fairness and bias are crucial concepts in artificial intelligence, yet they are relatively ignored in machine learning applications in clinical psychiatry. We computed fairness metrics and present bias mitigation strategies using a model trained on clinical mental health data. We collected structured data related to the admission, diagnosis, and treatment of patients in the psychiatry department of the University Medical Center Utrecht. We trained a machine learning model to predict future administrations of benzodiazepines on the basis of past data. We found that gender plays an unexpected role in the predictions—this constitutes bias. Using the AI Fairness 360 package, we implemented reweighing and discrimination-aware regularization as bias mitigation strategies, and we explored their implications for model performance. This is the first application of bias exploration and mitigation in a machine learning model trained on real clinical psychiatry data.
APA, Harvard, Vancouver, ISO, and other styles
48

Santos, Allan Erlikhman Medeiros, Milene Sabino Lana, and Tiago Martins Pereira. "Evaluation of machine learning methods for rock mass classification." Neural Computing and Applications 34, no. 6 (October 26, 2021): 4633–42. http://dx.doi.org/10.1007/s00521-021-06618-y.

Full text
APA, Harvard, Vancouver, ISO, and other styles
49

Geary, Andrew. "Seismic Soundoff." Leading Edge 39, no. 9 (September 2020): 688. http://dx.doi.org/10.1190/tle39090688.1.

Full text
Abstract:
Machine learning and artificial intelligence are two trendy but important topics in geophysics. In this episode, host Andrew Geary speaks with Distinguished Lecturer Aria Abubakar about his 2020 tour titled, “Potential and challenges of applying artificial intelligence and machine learning methods for geoscience.” This conversation explores machine learning and artificial intelligence from multiple angles. Listen to the full episode at https://seg.org/podcast/post/9101 .
APA, Harvard, Vancouver, ISO, and other styles
50

Zöller, Marc-André, and Marco F. Huber. "Benchmark and Survey of Automated Machine Learning Frameworks." Journal of Artificial Intelligence Research 70 (January 27, 2021): 409–72. http://dx.doi.org/10.1613/jair.1.11854.

Full text
Abstract:
Machine learning (ML) has become a vital part in many aspects of our daily life. However, building well performing machine learning applications requires highly specialized data scientists and domain experts. Automated machine learning (AutoML) aims to reduce the demand for data scientists by enabling domain experts to build machine learning applications automatically without extensive knowledge of statistics and machine learning. This paper is a combination of a survey on current AutoML methods and a benchmark of popular AutoML frameworks on real data sets. Driven by the selected frameworks for evaluation, we summarize and review important AutoML techniques and methods concerning every step in building an ML pipeline. The selected AutoML frameworks are evaluated on 137 data sets from established AutoML benchmark suites.
APA, Harvard, Vancouver, ISO, and other styles
We offer discounts on all premium plans for authors whose works are included in thematic literature selections. Contact us to get a unique promo code!

To the bibliography