Artigos de revistas sobre o tema "Multiagent decision"

Siga este link para ver outros tipos de publicações sobre o tema: Multiagent decision.

Crie uma referência precisa em APA, MLA, Chicago, Harvard, e outros estilos

Selecione um tipo de fonte:

Veja os 50 melhores artigos de revistas para estudos sobre o assunto "Multiagent decision".

Ao lado de cada fonte na lista de referências, há um botão "Adicionar à bibliografia". Clique e geraremos automaticamente a citação bibliográfica do trabalho escolhido no estilo de citação de que você precisa: APA, MLA, Harvard, Chicago, Vancouver, etc.

Você também pode baixar o texto completo da publicação científica em formato .pdf e ler o resumo do trabalho online se estiver presente nos metadados.

Veja os artigos de revistas das mais diversas áreas científicas e compile uma bibliografia correta.

1

Kumar, Akshat, Shlomo Zilberstein e Marc Toussaint. "Probabilistic Inference Techniques for Scalable Multiagent Decision Making". Journal of Artificial Intelligence Research 53 (29 de junho de 2015): 223–70. http://dx.doi.org/10.1613/jair.4649.

Texto completo da fonte
Resumo:
Decentralized POMDPs provide an expressive framework for multiagent sequential decision making. However, the complexity of these models---NEXP-Complete even for two agents---has limited their scalability. We present a promising new class of approximation algorithms by developing novel connections between multiagent planning and machine learning. We show how the multiagent planning problem can be reformulated as inference in a mixture of dynamic Bayesian networks (DBNs). This planning-as-inference approach paves the way for the application of efficient inference techniques in DBNs to multiagent decision making. To further improve scalability, we identify certain conditions that are sufficient to extend the approach to multiagent systems with dozens of agents. Specifically, we show that the necessary inference within the expectation-maximization framework can be decomposed into processes that often involve a small subset of agents, thereby facilitating scalability. We further show that a number of existing multiagent planning models satisfy these conditions. Experiments on large planning benchmarks confirm the benefits of our approach in terms of runtime and scalability with respect to existing techniques.
Estilos ABNT, Harvard, Vancouver, APA, etc.
2

Han, Xiaoyu. "Application of Reinforcement Learning in Multiagent Intelligent Decision-Making". Computational Intelligence and Neuroscience 2022 (16 de setembro de 2022): 1–6. http://dx.doi.org/10.1155/2022/8683616.

Texto completo da fonte
Resumo:
The combination of deep neural networks and reinforcement learning had received more and more attention in recent years, and the attention of reinforcement learning of single agent was slowly getting transferred to multiagent. Regret minimization was a new concept in the theory of gaming. In some game issues that Nash equilibrium was not the optimal solution, the regret minimization had better performance. Herein, we introduce the regret minimization into multiagent reinforcement learning and propose a multiagent regret minimum algorithm. This chapter first introduces the Nash Q-learning algorithm and uses the overall framework of Nash Q-learning to minimize regrets into the multiagent reinforcement learning and then verify the effectiveness of the algorithm in the experiment.
Estilos ABNT, Harvard, Vancouver, APA, etc.
3

Narayanan, Lakshmi Kanthan, Suresh Sankaranarayanan, Joel J. P. C. Rodrigues e Pascal Lorenz. "Multi-Agent-Based Modeling for Underground Pipe Health and Water Quality Monitoring for Supplying Quality Water". International Journal of Intelligent Information Technologies 16, n.º 3 (julho de 2020): 52–79. http://dx.doi.org/10.4018/ijiit.2020070103.

Texto completo da fonte
Resumo:
This article discusses distributed monitoring through the deployment of various multiagents in the IoT-Fog-based water distribution network (WDN). This will ensure the right amount of water supplied with respect to demand forecasted to residents. In addition, underground pipe health is also monitored by means of a multiagent based on hydraulic parameters supplying water forecasted with minimal losses which would minimize the operational and material cost involved in recovery or repair. Lastly, there are agents deployed towards leakage monitoring and anti-theft detection of water. The multiagents act upon various parameters of hydrology and analysis is based on the data acquired by the various sensors deployed in the water distribution network which perform partial automation of the disconnection of the supply during extreme critical conditions.
Estilos ABNT, Harvard, Vancouver, APA, etc.
4

Xiang, Yang, e Frank Hanshar. "Multiagent Decision Making in Collaborative Decision Networks by Utility Cluster Based Partial Evaluation". International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 23, n.º 02 (abril de 2015): 149–91. http://dx.doi.org/10.1142/s0218488515500075.

Texto completo da fonte
Resumo:
We consider optimal multiagent cooperative decision making in stochastic environments. The focus is on simultaneous decision making, during which agents cooperate by limited communication. We model the multiagent system as a collaborative decision network (CDN). Several techniques are developed to improve efficiency for decision making with CDNs. We present an equivalent transformation of CDN subnets to facilitate model manipulation. We propose partial evaluation to allow action profiles evaluated with reduced computation. We decompose a CDN subnet, based on clustering of utility variables. A general simultaneous decision making algorithm suite is developed that embeds these techniques. We show that the new algorithm suite improves efficiency by a combination of a linear factor and an exponential factor.
Estilos ABNT, Harvard, Vancouver, APA, etc.
5

XIANG, YANG, e FRANK HANSHAR. "MULTIAGENT EXPEDITION WITH GRAPHICAL MODELS". International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 19, n.º 06 (dezembro de 2011): 939–76. http://dx.doi.org/10.1142/s0218488511007416.

Texto completo da fonte
Resumo:
We investigate a class of multiagent planning problems termed multiagent expedition, where agents move around an open, unknown, partially observable, stochastic, and physical environment, in pursuit of multiple and alternative goals of different utility. Optimal planning in multiagent expedition is highly intractable. We introduce the notion of conditional optimality, decompose the task into a set of semi-independent optimization subtasks, and apply a decision-theoretic multiagent graphical model to solve each subtask optimally. A set of techniques are proposed to enhance modeling so that the resultant graphical model can be practically evaluated. Effectiveness of the framework and its scalability are demonstrated through experiments. Multiagent expedition can be characterized as decentralized partially observable Markov decision processes (Dec-POMDPs). Hence, this work contributes towards practical planning in Dec-POMDPs.
Estilos ABNT, Harvard, Vancouver, APA, etc.
6

Nunes, Ernesto, Julio Godoy e Maria Gini. "Multiagent Decision Making on Transportation Networks". Journal of Information Processing 22, n.º 2 (2014): 307–18. http://dx.doi.org/10.2197/ipsjjip.22.307.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
7

Maturo, Antonio, e Aldo G. S. Ventre. "Reaching consensus in multiagent decision making". International Journal of Intelligent Systems 25, n.º 3 (março de 2010): 266–73. http://dx.doi.org/10.1002/int.20401.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
8

He, Liu, Haoning Xi, Tangyi Guo e Kun Tang. "A Generalized Dynamic Potential Energy Model for Multiagent Path Planning". Journal of Advanced Transportation 2020 (24 de julho de 2020): 1–14. http://dx.doi.org/10.1155/2020/1360491.

Texto completo da fonte
Resumo:
Path planning for the multiagent, which is generally based on the artificial potential energy field, reflects the decision-making process of pedestrian walking and has great importance on the field multiagent system. In this paper, after setting the spatial-temporal simulation environment with large cells and small time segments based on the disaggregation decision theory of the multiagent, we establish a generalized dynamic potential energy model (DPEM) for the multiagent through four steps: (1) construct the space energy field with the improved Dijkstra algorithm, and obtain the fitting functions to reflect the relationship between speed decline rate and space occupancy of the agent through empirical cross experiments. (2) Construct the delay potential energy field based on the judgement and psychological changes of the multiagent in the situations where the other pedestrians have occupied the bottleneck cell. (3) Construct the waiting potential energy field based on the characteristics of the multiagent, such as dissipation and enhancement. (4) Obtain the generalized dynamic potential energy field by superposing the space potential energy field, delay potential energy field, and waiting potential energy field all together. Moreover, a case study is conducted to verify the feasibility and effectiveness of the dynamic potential energy model. The results also indicate that each agent’s path planning decision such as forward, waiting, and detour in the multiagent system is related to their individual characters and environmental factors. Overall, this study could help improve the efficiency of pedestrian traffic, optimize the walking space, and improve the performance of pedestrians in the multiagent system.
Estilos ABNT, Harvard, Vancouver, APA, etc.
9

Xu, Yang, Xiang Li e Ming Liu. "Modeling and Simulation of Complex Network Attributes on Coordinating Large Multiagent System". Scientific World Journal 2014 (2014): 1–15. http://dx.doi.org/10.1155/2014/412479.

Texto completo da fonte
Resumo:
With the expansion of distributed multiagent systems, traditional coordination strategy becomes a severe bottleneck when the system scales up to hundreds of agents. The key challenge is that in typical large multiagent systems, sparsely distributed agents can only communicate directly with very few others and the network is typically modeled as an adaptive complex network. In this paper, we present simulation testbedCoordSimbuilt to model the coordination of network centric multiagent systems. Based on the token-based strategy, the coordination can be built as a communication decision problem that agents make decisions to target communications and pass them over to the capable agents who will potentially benefit the team most. We have theoretically analyzed that the characters of complex network make a significant difference with both random and intelligent coordination strategies, which may contribute to future multiagent algorithm design.
Estilos ABNT, Harvard, Vancouver, APA, etc.
10

Szymak, Piotr. "Comparison of Centralized, Dispersed and Hybrid Multiagent Control Systems of Underwater Vehicles Team". Solid State Phenomena 180 (novembro de 2011): 114–21. http://dx.doi.org/10.4028/www.scientific.net/ssp.180.114.

Texto completo da fonte
Resumo:
Multiagent systems controlling robots can have different structures, depending on a way of generating decision in these systems. Decisions can be work out in centralized, decentralized or even hybrid way (hybrid system is a connection of both centralized and decentralized systems). In the case of controlling a team of underwater vehicles, it is significant to examine different structures of multiagent systems for choosing the best one for defined underwater task. In the paper, results of operation of three different structures (centralized, dispersed - decentralized and hybrid) of multiagent control systems of underwater vehicles team were presented. The systems were tested in predator-prey problem. In this problem, a team of three underwater vehicles had to catch another underwater robot escaping with larger velocity.
Estilos ABNT, Harvard, Vancouver, APA, etc.
11

Pelta, David A., e Ronald R. Yager. "Analyzing the Robustness of Decision Strategies in Multiagent Decision Making". Group Decision and Negotiation 23, n.º 6 (20 de outubro de 2013): 1403–16. http://dx.doi.org/10.1007/s10726-013-9376-0.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
12

Huang, Zishan. "UAV Intelligent Control Based on Machine Vision and Multiagent Decision-Making". Advances in Multimedia 2022 (27 de maio de 2022): 1–11. http://dx.doi.org/10.1155/2022/8908122.

Texto completo da fonte
Resumo:
In order to improve the effect of UAV intelligent control, this paper will improve machine vision technology. Moreover, this paper adds scale information on the basis of the LSD algorithm, uses the multiline segment standard to merge these candidate line segments for intelligent recognition, and uses the LSD detection algorithm to improve the operating efficiency of the UAV control system and reduce the computational complexity. In addition, this paper combines machine vision technology and multiagent decision-making technology for UAV intelligent control and builds an intelligent control system, which uses intelligent machine vision technology for recognition and multiagent decision-making technology for motion control. The research results show that the UAV intelligent control system based on machine vision and multiagent decision-making proposed in this paper can achieve reliable control of UAVs and improve the work efficiency of UAVs.
Estilos ABNT, Harvard, Vancouver, APA, etc.
13

Singh, Arambam James, Duc Thien Nguyen, Akshat Kumar e Hoong Chuin Lau. "Multiagent Decision Making For Maritime Traffic Management". Proceedings of the AAAI Conference on Artificial Intelligence 33 (17 de julho de 2019): 6171–78. http://dx.doi.org/10.1609/aaai.v33i01.33016171.

Texto completo da fonte
Resumo:
We address the problem of maritime traffic management in busy waterways to increase the safety of navigation by reducing congestion. We model maritime traffic as a large multiagent systems with individual vessels as agents, and VTS authority as the regulatory agent. We develop a maritime traffic simulator based on historical traffic data that incorporates realistic domain constraints such as uncertain and asynchronous movement of vessels. We also develop a traffic coordination approach that provides speed recommendation to vessels in different zones. We exploit the nature of collective interactions among agents to develop a scalable policy gradient approach that can scale up to real world problems. Empirical results on synthetic and real world problems show that our approach can significantly reduce congestion while keeping the traffic throughput high.
Estilos ABNT, Harvard, Vancouver, APA, etc.
14

Sokolowski, John A. "Enhanced Decision Modeling Using Multiagent System Simulation". SIMULATION 79, n.º 4 (abril de 2003): 232–42. http://dx.doi.org/10.1177/0037549703038886.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
15

Gray, Rebecca, Alessio Franci, Vaibhav Srivastava e Naomi Ehrich Leonard. "Multiagent Decision-Making Dynamics Inspired by Honeybees". IEEE Transactions on Control of Network Systems 5, n.º 2 (junho de 2018): 793–806. http://dx.doi.org/10.1109/tcns.2018.2796301.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
16

Petit, Claude, e François-Xavier Magaud. "Multiagent meta-model for strategic decision support". Knowledge-Based Systems 19, n.º 3 (julho de 2006): 202–11. http://dx.doi.org/10.1016/j.knosys.2005.11.009.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
17

Pechoucek, M., J. Vokrinek e P. Becvar. "ExPlanTech: Multiagent Support for Manufacturing Decision Making". IEEE Intelligent Systems 20, n.º 1 (janeiro de 2005): 67–74. http://dx.doi.org/10.1109/mis.2005.6.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
18

Rizk, Yara, Mariette Awad e Edward W. Tunstel. "Decision Making in Multiagent Systems: A Survey". IEEE Transactions on Cognitive and Developmental Systems 10, n.º 3 (setembro de 2018): 514–29. http://dx.doi.org/10.1109/tcds.2018.2840971.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
19

Zeng, Yifeng, e Kim-leng Poh. "Symbolic verification of multiagent graphical decision models". International Journal of Intelligent Systems 23, n.º 11 (novembro de 2008): 1177–95. http://dx.doi.org/10.1002/int.20313.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
20

González-Briones, Alfonso, José A. Castellanos-Garzón, Yeray Mezquita Martín, Javier Prieto e Juan M. Corchado. "A Framework for Knowledge Discovery from Wireless Sensor Networks in Rural Environments: A Crop Irrigation Systems Case Study". Wireless Communications and Mobile Computing 2018 (17 de julho de 2018): 1–14. http://dx.doi.org/10.1155/2018/6089280.

Texto completo da fonte
Resumo:
This paper presents the design and development of an innovative multiagent system based on virtual organizations. The multiagent system manages information from wireless sensor networks for knowledge discovery and decision making in rural environments. The multiagent system has been built over the cloud computing paradigm to provide better flexibility and higher scalability for handling both small- and large-scale projects. The development of wireless sensor network technology has allowed for its extension and application to the rural environment, where the lives of the people interacting with the environment can be improved. The use of “smart” technologies can also improve the efficiency and effectiveness of rural systems. The proposed multiagent system allows us to analyse data collected by sensors for decision making in activities carried out in a rural setting, thus, guaranteeing the best performance in the ecosystem. Since water is a scarce natural resource that should not be wasted, a case study was conducted in an agricultural environment to test the proposed system’s performance in optimizing the irrigation system in corn crops. The architecture collects information about the terrain and the climatic conditions through a wireless sensor network deployed in the crops. This way, the architecture can learn about the needs of the crop and make efficient irrigation decisions. The obtained results are very promising when compared to a traditional automatic irrigation system.
Estilos ABNT, Harvard, Vancouver, APA, etc.
21

Xiao, Liang, e Des Greer. "Linked Argumentation Graphs for Multidisciplinary Decision Support". Healthcare 11, n.º 4 (15 de fevereiro de 2023): 585. http://dx.doi.org/10.3390/healthcare11040585.

Texto completo da fonte
Resumo:
Multidisciplinary clinical decision-making has become increasingly important for complex diseases, such as cancers, as medicine has become very specialized. Multiagent systems (MASs) provide a suitable framework to support multidisciplinary decisions. In the past years, a number of agent-oriented approaches have been developed on the basis of argumentation models. However, very limited work has focused, thus far, on systematic support for argumentation in communication among multiple agents spanning various decision sites and holding varying beliefs. There is a need for an appropriate argumentation scheme and identification of recurring styles or patterns of multiagent argument linking to enable versatile multidisciplinary decision applications. We propose, in this paper, a method of linked argumentation graphs and three types of patterns corresponding to scenarios of agents changing the minds of others (argumentation) and their own (belief revision): the collaboration pattern, the negotiation pattern, and the persuasion pattern. This approach is demonstrated using a case study of breast cancer and lifelong recommendations, as the survival rates of diagnosed cancer patients are rising and comorbidity is the norm.
Estilos ABNT, Harvard, Vancouver, APA, etc.
22

Li, Zhuo, Xu Zhou, Filip De Turck, Taixin Li, Yongmao Ren e Yifang Qin. "Feudal Multiagent Reinforcement Learning for Interdomain Collaborative Routing Optimization". Wireless Communications and Mobile Computing 2022 (27 de março de 2022): 1–11. http://dx.doi.org/10.1155/2022/1231979.

Texto completo da fonte
Resumo:
In view of the inability of traditional interdomain routing schemes to meet the sudden network changes and adapt the routing policy accordingly, many optimization schemes such as modifying Border Gateway Protocol (BGP) parameters and using software-defined network (SDN) to optimize interdomain routing decisions have been proposed. However, with the change and increase of the demand for network data transmission, the high latency and flexibility of these mechanisms have become increasingly prominent. Recent researches have addressed these challenges through multiagent reinforcement learning (MARL), which can be capable of dynamically meeting interdomain requirements, and the multiagent Markov Decision Process (MDP) is introduced to construct this routing optimization problem. Thus, in this paper, an interdomain collaborative routing scheme is proposed in interdomain collaborative architecture. The proposed Feudal Multiagent Actor-Critic (FMAAC) algorithm is designed based on multiagent actor-critic and feudal reinforcement learning to solve this competition-cooperative problem. Our multiagent learns about the optimal interdomain routing decisions, focused on different optimization objectives such as end-to-end delay, throughput, and average delivery rate. Experiments were carried out in the interdomain testbed to verify the convergence and effectiveness of the FMAAC algorithm. Experimental results show that our approach can significantly improve various Quality of Service (QoS) indicators, containing reduced end-to-end delay, increased throughput, and guaranteed over 90% average delivery rate.
Estilos ABNT, Harvard, Vancouver, APA, etc.
23

Chen, Hongjing, Chunhua Hu e Zhi Huang. "Optimal Control of Multiagent Decision-Making Based on Competence Evolution". Discrete Dynamics in Nature and Society 2023 (29 de maio de 2023): 1–22. http://dx.doi.org/10.1155/2023/2179376.

Texto completo da fonte
Resumo:
We employ the theory of rarefied gas dynamics and optimal control to investigate the kinetic model of decision-making. The novelty of this paper is that we develop a kinetic model that takes into account both the influence of agents’ competence and managers’ control on decision-making. After each interaction, in addition to the changes in decision directly caused by communication with other agents, the agents’ competence evolves and indirectly influences the degree of decision adjustment through the compromise function. By adding a control term to the model, the behavior of the managers who require the group to establish consensus is also described, and the concrete expression of the control term that minimizes the cost function is obtained by model predictive control. The Boltzmann equation is constructed to characterize the evolution of the density distribution of agents, and the main properties are discussed. The corresponding Fokker–Planck equation is derived by utilizing the asymptotic technique. Lastly, the direct simulation of the Monte Carlo method is used to simulate the evolution of decisions. The results indicate that the agents’ competence and managers’ control facilitate the consistency of collective decisions.
Estilos ABNT, Harvard, Vancouver, APA, etc.
24

Pan, Yinghui, Jing Tang, Biyang Ma, Yifeng Zeng e Zhong Ming. "Toward data-driven solutions to interactive dynamic influence diagrams". Knowledge and Information Systems 63, n.º 9 (8 de agosto de 2021): 2431–53. http://dx.doi.org/10.1007/s10115-021-01600-5.

Texto completo da fonte
Resumo:
AbstractWith the availability of significant amount of data, data-driven decision making becomes an alternative way for solving complex multiagent decision problems. Instead of using domain knowledge to explicitly build decision models, the data-driven approach learns decisions (probably optimal ones) from available data. This removes the knowledge bottleneck in the traditional knowledge-driven decision making, which requires a strong support from domain experts. In this paper, we study data-driven decision making in the context of interactive dynamic influence diagrams (I-DIDs)—a general framework for multiagent sequential decision making under uncertainty. We propose a data-driven framework to solve the I-DIDs model and focus on learning the behavior of other agents in problem domains. The challenge is on learning a complete policy tree that will be embedded in the I-DIDs models due to limited data. We propose two new methods to develop complete policy trees for the other agents in the I-DIDs. The first method uses a simple clustering process, while the second one employs sophisticated statistical checks. We analyze the proposed algorithms in a theoretical way and experiment them over two problem domains.
Estilos ABNT, Harvard, Vancouver, APA, etc.
25

Wei, Xiaojuan, Meng Jia e Mengke Geng. "A Multiagent Cooperative Decision-Making Method for Adaptive Intersection Complexity Based on Hierarchical RL". Wireless Communications and Mobile Computing 2022 (19 de outubro de 2022): 1–10. http://dx.doi.org/10.1155/2022/9329186.

Texto completo da fonte
Resumo:
In this paper, we propose a multiagent collaboration decision-making method for adaptive intersection complexity based on hierarchical reinforcement learning—H-CommNet, which uses a two-level structure for collaboration: the upper-level policy network fuses information from all agents and learns how to set a subtask for each agent, and the lower-level policy network relies on the local observation of the agent to control the action targets of the agents from each subtask in the upper layer. H-CommNet allows multiagents to complete collaboration on different time scales, and the scale is controllable. It also uses the computational intelligence of invehicle intelligence and edge nodes to achieve joint optimization of computing resources and communication resources. Through the simulation experiments in the intersection environment without traffic lights, the experimental results show that H-CommNet can achieve better results than baseline in different complexity scenarios when using as few resources as possible, and the scalability, flexibility, and control effects have been improved.
Estilos ABNT, Harvard, Vancouver, APA, etc.
26

Yang, Yanhua, e Ligang Yao. "Optimization Method of Power Equipment Maintenance Plan Decision-Making Based on Deep Reinforcement Learning". Mathematical Problems in Engineering 2021 (15 de março de 2021): 1–8. http://dx.doi.org/10.1155/2021/9372803.

Texto completo da fonte
Resumo:
The safe and reliable operation of power grid equipment is the basis for ensuring the safe operation of the power system. At present, the traditional periodical maintenance has exposed the abuses such as deficient maintenance and excess maintenance. Based on a multiagent deep reinforcement learning decision-making optimization algorithm, a method for decision-making and optimization of power grid equipment maintenance plans is proposed. In this paper, an optimization model of power grid equipment maintenance plan that takes into account the reliability and economics of power grid operation is constructed with maintenance constraints and power grid safety constraints as its constraints. The deep distributed recurrent Q-networks multiagent deep reinforcement learning is adopted to solve the optimization model. The deep distributed recurrent Q-networks multiagent deep reinforcement learning uses the high-dimensional feature extraction capabilities of deep learning and decision-making capabilities of reinforcement learning to solve the multiobjective decision-making problem of power grid maintenance planning. Through case analysis, the comparative results show that the proposed algorithm has better optimization and decision-making ability, as well as lower maintenance cost. Accordingly, the algorithm can realize the optimal decision of power grid equipment maintenance plan. The expected value of power shortage and maintenance cost obtained by the proposed method is $71.75$ $MW·H$ and $496000$ $yuan$.
Estilos ABNT, Harvard, Vancouver, APA, etc.
27

Pynadath, D. V., e M. Tambe. "The Communicative Multiagent Team Decision Problem: Analyzing Teamwork Theories and Models". Journal of Artificial Intelligence Research 16 (1 de junho de 2002): 389–423. http://dx.doi.org/10.1613/jair.1024.

Texto completo da fonte
Resumo:
Despite the significant progress in multiagent teamwork, existing research does not address the optimality of its prescriptions nor the complexity of the teamwork problem. Without a characterization of the optimality-complexity tradeoffs, it is impossible to determine whether the assumptions and approximations made by a particular theory gain enough efficiency to justify the losses in overall performance. To provide a tool for use by multiagent researchers in evaluating this tradeoff, we present a unified framework, the COMmunicative Multiagent Team Decision Problem (COM-MTDP). The COM-MTDP model combines and extends existing multiagent theories, such as decentralized partially observable Markov decision processes and economic team theory. In addition to their generality of representation, COM-MTDPs also support the analysis of both the optimality of team performance and the computational complexity of the agents' decision problem. In analyzing complexity, we present a breakdown of the computational complexity of constructing optimal teams under various classes of problem domains, along the dimensions of observability and communication cost. In analyzing optimality, we exploit the COM-MTDP's ability to encode existing teamwork theories and models to encode two instantiations of joint intentions theory taken from the literature. Furthermore, the COM-MTDP model provides a basis for the development of novel team coordination algorithms. We derive a domain-independent criterion for optimal communication and provide a comparative analysis of the two joint intentions instantiations with respect to this optimal policy. We have implemented a reusable, domain-independent software package based on COM-MTDPs to analyze teamwork coordination strategies, and we demonstrate its use by encoding and evaluating the two joint intentions strategies within an example domain.
Estilos ABNT, Harvard, Vancouver, APA, etc.
28

Brito, Robison Cris, Cesar Augusto Tacla e Lúcia Valéria Ramos de Arruda. "A multiagent simulator for supporting logistic decisions of unloading petroleum ships in habors". Pesquisa Operacional 30, n.º 3 (dezembro de 2010): 729–50. http://dx.doi.org/10.1590/s0101-74382010000300012.

Texto completo da fonte
Resumo:
This work presents and evaluates the performance of a simulation model based on multiagent system technology in order to support logistic decisions in a harbor from oil supply chain. The main decisions are concerned to pier allocation, oil discharge, storage tanks management and refinery supply by a pipeline. The real elements as ships, piers, pipelines, and refineries are modeled as agents, and they negotiate by auctions to move oil in this system. The simulation results are compared with results obtained with an optimization mathematical model based on mixed integer linear programming (MILP). Both models are able to find optimal solutions or close to the optimal solution depending on the problem size. In problems with several elements, the multiagent model can find solutions in seconds, while the MILP model presents very high computational time to find the optimal solution. In some situations, the MILP model results in out of memory error. Test scenarios demonstrate the usefulness of the multiagent based simulator in supporting decision taken concerning the logistic in harbors.
Estilos ABNT, Harvard, Vancouver, APA, etc.
29

Rao, Ning, Hua Xu, Yue Zhang, Dan Wang, Lei Jiang e Xiang Peng. "Joint Optimization of Jamming Link and Power Control in Communication Countermeasures: A Multiagent Deep Reinforcement Learning Approach". Wireless Communications and Mobile Computing 2022 (29 de dezembro de 2022): 1–18. http://dx.doi.org/10.1155/2022/7962686.

Texto completo da fonte
Resumo:
Due to the nonconvexity feature of optimal controlling such as jamming link selection and jamming power allocation issues, obtaining the optimal resource allocation strategy in communication countermeasures scenarios is challenging. Thus, we propose a novel decentralized jamming resource allocation algorithm based on multiagent deep reinforcement learning (MADRL) to improve the efficiency of jamming resource allocation in battlefield communication countermeasures. We first model the communication jamming resource allocation problem as a fully cooperative multiagent task, considering the cooperative interrelationship of jamming equipment (JE). Then, to alleviate the nonstationarity feature and high decision dimensions in the multiagent system, we introduce a centralized training with decentralized execution framework (CTDE), which means all JEs are trained with global information and rely on their local observations only while making decisions. Each JE obtains a decentralized policy after the training process. Subsequently, we develop the multiagent soft actor-critic (MASAC) algorithm to enhance the exploration capability of agents and accelerate the learning of cooperative policies among agents by leveraging the maximum policy entropy criterion. Finally, the simulation results are presented to demonstrate that the proposed MASAC algorithm outperforms the existing centralized allocation benchmark algorithms.
Estilos ABNT, Harvard, Vancouver, APA, etc.
30

TSUI, KWOK CHING, e JIMING LIU. "AN EVOLUTIONARY MULTIAGENT DIFFUSION APPROACH TO OPTIMIZATION". International Journal of Pattern Recognition and Artificial Intelligence 16, n.º 06 (setembro de 2002): 715–33. http://dx.doi.org/10.1142/s0218001402001940.

Texto completo da fonte
Resumo:
This article proposes a novel multiagent approach to optimization inspired by diffusion in nature called Evolutionary Multiagent Diffusion (EMD). Each agent in EMD makes the decision to diffuse based on the information shared between its parent and its siblings. The behavior of EMD is analyzed and its relation to similar search algorithms is discussed.
Estilos ABNT, Harvard, Vancouver, APA, etc.
31

Ponnambalam, S. G., Mukund Nilakantan Janardhanan e G. Rishwaraj. "Trust-based decision-making framework for multiagent system". Soft Computing 25, n.º 11 (20 de março de 2021): 7559–75. http://dx.doi.org/10.1007/s00500-021-05715-3.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
32

Vemuri, Ratna kumari, Chinni Bala Vijaya Durga, Syed Abuthahir Syed Ibrahim, Nagaraju Arumalla, Senthilvadivu Subramanian e Lakshmi Bhukya. "Intelligent-of-things multiagent system for smart home energy monitoring". Indonesian Journal of Electrical Engineering and Computer Science 34, n.º 3 (1 de junho de 2024): 1858. http://dx.doi.org/10.11591/ijeecs.v34.i3.pp1858-1867.

Texto completo da fonte
Resumo:
The proliferation of IoT devices has ushered in a new era of smart homes, where efficient energy management is a paramount concern. Multiagent artificial intelligence-of-things (MAIoT) has emerged as a promising approach to address the complex challenges of smart home energy management. This research study examines MAIoT's components, functioning, benefits, and drawbacks. MAIoT systems improve energy efficiency and user comfort by combining multiagent systems and IoT devices. However, privacy, security, interoperability, scalability, and user acceptability must be addressed. As technology advances, MAIoT in smart home energy management will offer more sophisticated and adaptable solutions to cut energy consumption and promote sustainability. This article describes how energy status and internal pricing signals affect group intelligent decision making and the interaction dynamics between consumers or decision makers. In a multiagent configuration based on the new concept of artificial intelligence-of-things, this intelligent home energy management challenge is simulated and illustrated using software and hardware. Based on sufficient experimental simulations, this paper suggested that residential clients can significantly improve their economic benefit and decision-making efficiency.
Estilos ABNT, Harvard, Vancouver, APA, etc.
33

Ling, Jiajing, Kushagra Chandak e Akshat Kumar. "Integrating Knowledge Compilation with Reinforcement Learning for Routes". Proceedings of the International Conference on Automated Planning and Scheduling 31 (17 de maio de 2021): 542–50. http://dx.doi.org/10.1609/icaps.v31i1.16002.

Texto completo da fonte
Resumo:
Sequential multiagent decision-making under partial observability and uncertainty poses several challenges. Although multiagent reinforcement learning (MARL) approaches have increased the scalability, addressing combinatorial domains is still challenging as random exploration by agents is unlikely to generate useful reward signals. We address cooperative multiagent pathfinding under uncertainty and partial observability where agents move from their respective sources to destinations while also satisfying constraints (e.g., visiting landmarks). Our main contributions include: (1) compiling domain knowledge such as underlying graph connectivity and domain constraints into propositional logic based decision diagrams, (2) developing modular techniques to integrate such knowledge with deep MARL algorithms, and (3) developing fast algorithms to query the compiled knowledge for accelerated episode simulation in RL. Empirically, our approach can tractably represent various types of domain constraints, and outperforms previous MARL approaches significantly both in terms of sample complexity and solution quality on a number of instances.
Estilos ABNT, Harvard, Vancouver, APA, etc.
34

Oliehoek, Frans, Stefan Witwicki e Leslie Kaelbling. "A Sufficient Statistic for Influence in Structured Multiagent Environments". Journal of Artificial Intelligence Research 70 (24 de fevereiro de 2021): 789–870. http://dx.doi.org/10.1613/jair.1.12136.

Texto completo da fonte
Resumo:
Making decisions in complex environments is a key challenge in artificial intelligence (AI). Situations involving multiple decision makers are particularly complex, leading to computational intractability of principled solution methods. A body of work in AI has tried to mitigate this problem by trying to distill interaction to its essence: how does the policy of one agent influence another agent? If we can find more compact representations of such influence, this can help us deal with the complexity, for instance by searching the space of influences rather than the space of policies. However, so far these notions of influence have been restricted in their applicability to special cases of interaction. In this paper we formalize influence-based abstraction (IBA), which facilitates the elimination of latent state factors without any loss in value, for a very general class of problems described as factored partially observable stochastic games (fPOSGs). On the one hand, this generalizes existing descriptions of influence, and thus can serve as the foundation for improvements in scalability and other insights in decision making in complex multiagent settings. On the other hand, since the presence of other agents can be seen as a generalization of single agent settings, our formulation of IBA also provides a sufficient statistic for decision making under abstraction for a single agent. We also give a detailed discussion of the relations to such previous works, identifying new insights and interpretations of these approaches. In these ways, this paper deepens our understanding of abstraction in a wide range of sequential decision making settings, providing the basis for new approaches and algorithms for a large class of problems.
Estilos ABNT, Harvard, Vancouver, APA, etc.
35

Jiao, Peng, Kai Xu, Shiguang Yue, Xiangyu Wei e Lin Sun. "A Decentralized Partially Observable Markov Decision Model with Action Duration for Goal Recognition in Real Time Strategy Games". Discrete Dynamics in Nature and Society 2017 (2017): 1–15. http://dx.doi.org/10.1155/2017/4580206.

Texto completo da fonte
Resumo:
Multiagent goal recognition is a tough yet important problem in many real time strategy games or simulation systems. Traditional modeling methods either are in great demand of detailed agents’ domain knowledge and training dataset for policy estimation or lack clear definition of action duration. To solve the above problems, we propose a novel Dec-POMDM-T model, combining the classic Dec-POMDP, an observation model for recognizer, joint goal with its termination indicator, and time duration variables for actions with action termination variables. In this paper, a model-free algorithm named cooperative colearning based on Sarsa is used. Considering that Dec-POMDM-T usually encounters multiagent goal recognition problems with different sorts of noises, partially missing data, and unknown action durations, the paper exploits the SIS PF with resampling for inference under the dynamic Bayesian network structure of Dec-POMDM-T. In experiments, a modified predator-prey scenario is adopted to study multiagent joint goal recognition problem, which is the recognition of the joint target shared among cooperative predators. Experiment results show that (a) Dec-POMDM-T works effectively in multiagent goal recognition and adapts well to dynamic changing goals within agent group; (b) Dec-POMDM-T outperforms traditional Dec-MDP-based methods in terms of precision, recall, andF-measure.
Estilos ABNT, Harvard, Vancouver, APA, etc.
36

Kovařík, Vojtěch, Martin Schmid, Neil Burch, Michael Bowling e Viliam Lisý. "Rethinking formal models of partially observable multiagent decision making". Artificial Intelligence 303 (fevereiro de 2022): 103645. http://dx.doi.org/10.1016/j.artint.2021.103645.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
37

Person, Patrick, Hadhoum Boukachour, Michel Coletta, Thierry Galinho e Frédéric Serin. "Data representation layer in a MultiAgent decision support system". Multiagent and Grid Systems 2, n.º 3 (14 de setembro de 2006): 223–35. http://dx.doi.org/10.3233/mgs-2006-2302.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
38

Dubernet, Thibaut, e Kay W. Axhausen. "Including joint decision mechanisms in a multiagent transport simulation". Transportation Letters 5, n.º 4 (outubro de 2013): 175–83. http://dx.doi.org/10.1179/1942787513y.0000000002.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
39

Qinghe, Hu, Arun Kumar e Zhang Shuang. "A bidding decision model in multiagent supply chain planning". International Journal of Production Research 39, n.º 15 (janeiro de 2001): 3291–301. http://dx.doi.org/10.1080/00207540110060860.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
40

Pelta, David A., e Ronald R. Yager. "Decision Strategies in Mediated Multiagent Negotiations: An Optimization Approach". IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans 40, n.º 3 (maio de 2010): 635–40. http://dx.doi.org/10.1109/tsmca.2009.2036932.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
41

Yichuan Jiang, Jing Hu e Donghui Lin. "Decision Making of Networked Multiagent Systems for Interaction Structures". IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans 41, n.º 6 (novembro de 2011): 1107–21. http://dx.doi.org/10.1109/tsmca.2011.2114343.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
42

Becvar, Petr, Lubo Smidl e Josef Psutka. "An Intelligent Telephony Interface of Multiagent Decision Support Systems". IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 37, n.º 4 (julho de 2007): 553–60. http://dx.doi.org/10.1109/tsmcc.2007.897335.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
43

A. Vidhate, Deepak, e Parag Kulkarni. "Implementation of Multiagent Learning Algorithms for Improved Decision Making". International Journal of Computer Trends and Technology 35, n.º 2 (25 de maio de 2016): 60–66. http://dx.doi.org/10.14445/22312803/ijctt-v35p111.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
44

Dahong, Tang, e Chen Ting. "Multi-criteria Decision Making Problems with Bi-level Multiagent". IFAC Proceedings Volumes 22, n.º 10 (agosto de 1989): 275–79. http://dx.doi.org/10.1016/s1474-6670(17)53185-7.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
45

Sujit, P. B., e Debasish Ghose. "Self Assessment-Based Decision Making for Multiagent Cooperative Search". IEEE Transactions on Automation Science and Engineering 8, n.º 4 (outubro de 2011): 705–19. http://dx.doi.org/10.1109/tase.2011.2155058.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
46

Kuroda, Tatsuaki. "A power index for multistage and multiagent decision systems". Behavioral Science 38, n.º 4 (1993): 255–72. http://dx.doi.org/10.1002/bs.3830380403.

Texto completo da fonte
Estilos ABNT, Harvard, Vancouver, APA, etc.
47

Shi, Jiawei, e Yan Zhou. "Group Decision Making for Product Innovation Based on PZB Model in Fuzzy Environment: A Case from New-Energy Storage Innovation Design". Mathematics 10, n.º 19 (4 de outubro de 2022): 3634. http://dx.doi.org/10.3390/math10193634.

Texto completo da fonte
Resumo:
According to the World Economic Forum, countries and regions should steer their energy systems toward cheaper, safer, and more sustainable energy sources, and move away from their reliance on traditional energy sources. With this trend, it is significant that new-energy battery enterprises should not only maintain their current installed product, but also attract more consumers. Due to the differences in customers, there are different requirements for the products. Thus, this paper chooses new-energy storage product innovation design as the object, and proposes a novel multiagent group decision-making method based on QFD and PZB models in a fuzzy environment. Firstly, extensively collected multiagent (consumer and designer) requirements are transformed into specific functions through an extended multiagent QFD with HFLTS, and the relationship coefficients are derived. Afterward, different design schemes for functional components are evaluated according to the concept of the PZB model. Then, the satisfaction degree interval is calculated for each partial design. On the basis of these indicators, a multiagent multi-objective optimization model is established. Afterward, solving he model through NSGA-II quickly generates the most suitable product innovation design scheme. Lastly, the feasibility and superiority of proposed method are illustrated through innovation design for a new-energy storage battery.
Estilos ABNT, Harvard, Vancouver, APA, etc.
48

Radu, Valentin, Catalin Dumitrescu, Emilia Vasile, Alina Iuliana Tăbîrcă, Maria Cristina Stefan, Liliana Manea e Florin Radu. "Modeling and Prediction of Sustainable Urban Mobility Using Game Theory Multiagent and the Golden Template Algorithm". Electronics 12, n.º 6 (8 de março de 2023): 1288. http://dx.doi.org/10.3390/electronics12061288.

Texto completo da fonte
Resumo:
The current development of multimodal transport networks focuses on the realization of intelligent transport systems (ITS) to manage the prediction of traffic congestion and urban mobility of vehicles and passengers so that alternative routes can be recommended for transport, especially the use of public passenger transport, to achieve sustainable transport. In the article, we propose an algorithm and a methodology for solving multidimensional traffic congestion objectives, especially for intersections, based on combining machine learning with the templates method—the golden template algorithm with the multiagent game theory. Intersections are modeled as independent players who had to reach an agreement using Nash negotiation. The obtained results showed that the Nash negotiation with multiagents and the golden template modeling have superior results to the model predictive control (MPC) algorithm, improving travel time, the length of traffic queues, the efficiency of travel flows in an unknown and dynamic environment, and the coordination of the agents’ actions and decision making. The proposed algorithm can be used in planning public passenger transport on alternative routes and in ITS management decision making.
Estilos ABNT, Harvard, Vancouver, APA, etc.
49

Wang, Baolai, Shengang Li, Xianzhong Gao e Tao Xie. "UAV Swarm Confrontation Using Hierarchical Multiagent Reinforcement Learning". International Journal of Aerospace Engineering 2021 (21 de dezembro de 2021): 1–12. http://dx.doi.org/10.1155/2021/3360116.

Texto completo da fonte
Resumo:
With the development of unmanned aerial vehicle (UAV) technology, UAV swarm confrontation has attracted many researchers’ attention. However, the situation faced by the UAV swarm has substantial uncertainty and dynamic variability. The state space and action space increase exponentially with the number of UAVs, so that autonomous decision-making becomes a difficult problem in the confrontation environment. In this paper, a multiagent reinforcement learning method with macro action and human expertise is proposed for autonomous decision-making of UAVs. In the proposed approach, UAV swarm is modeled as a large multiagent system (MAS) with an individual UAV as an agent, and the sequential decision-making problem in swarm confrontation is modeled as a Markov decision process. Agents in the proposed method are trained based on the macro actions, where sparse and delayed rewards, large state space, and action space are effectively overcome. The key to the success of this method is the generation of the macro actions that allow the high-level policy to find a near-optimal solution. In this paper, we further leverage human expertise to design a set of good macro actions. Extensive empirical experiments in our constructed swarm confrontation environment show that our method performs better than the other algorithms.
Estilos ABNT, Harvard, Vancouver, APA, etc.
50

Panella, Alessandro. "Multiagent Stochastic Planning With Bayesian Policy Recognition". Proceedings of the AAAI Conference on Artificial Intelligence 27, n.º 1 (29 de junho de 2013): 1672–73. http://dx.doi.org/10.1609/aaai.v27i1.8506.

Texto completo da fonte
Resumo:
When operating in stochastic, partially observable, multiagent settings, it is crucial to accurately predict the actions of other agents. In my thesis work, I propose methodologies for learning the policy of external agents from their observed behavior, in the form of finite state controllers. To perform this task, I adopt Bayesian learning algorithms based on nonparametric prior distributions, that provide the flexibility required to infer models of unknown complexity. These methods are to be embedded in decision making frameworks for autonomous planning in partially observable multiagent systems.
Estilos ABNT, Harvard, Vancouver, APA, etc.
Oferecemos descontos em todos os planos premium para autores cujas obras estão incluídas em seleções literárias temáticas. Contate-nos para obter um código promocional único!

Vá para a bibliografia