Journal articles on the topic "Actor-critic algorithm"
Consult the top 50 journal articles for your research on the topic "Actor-critic algorithm."
Wang, Jing, and Ioannis Ch Paschalidis. "An Actor-Critic Algorithm With Second-Order Actor and Critic." IEEE Transactions on Automatic Control 62, no. 6 (June 2017): 2689–703. http://dx.doi.org/10.1109/tac.2016.2616384.
Zheng, Liyuan, Tanner Fiez, Zane Alumbaugh, Benjamin Chasnov, and Lillian J. Ratliff. "Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 8 (June 28, 2022): 9217–24. http://dx.doi.org/10.1609/aaai.v36i8.20908.
Iwaki, Ryo, and Minoru Asada. "Implicit incremental natural actor critic algorithm." Neural Networks 109 (January 2019): 103–12. http://dx.doi.org/10.1016/j.neunet.2018.10.007.
Kim, Gi-Soo, Jane P. Kim, and Hyun-Joon Yang. "Robust Tests in Online Decision-Making." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 9 (June 28, 2022): 10016–24. http://dx.doi.org/10.1609/aaai.v36i9.21240.
Sergey, Denisov, and Jee-Hyong Lee. "Actor-Critic Algorithm with Transition Cost Estimation." International Journal of Fuzzy Logic and Intelligent Systems 16, no. 4 (December 25, 2016): 270–75. http://dx.doi.org/10.5391/ijfis.2016.16.4.270.
Ahmed, Ayman Elshabrawy M. "Controller parameter tuning using actor-critic algorithm." IOP Conference Series: Materials Science and Engineering 610 (October 11, 2019): 012054. http://dx.doi.org/10.1088/1757-899x/610/1/012054.
Ding, Siyuan, Shengxiang Li, Guangyi Liu, Ou Li, Ke Ke, Yijie Bai, and Weiye Chen. "Decentralized Multiagent Actor-Critic Algorithm Based on Message Diffusion." Journal of Sensors 2021 (December 8, 2021): 1–14. http://dx.doi.org/10.1155/2021/8739206.
Hafez, Muhammad Burhan, Cornelius Weber, Matthias Kerzel, and Stefan Wermter. "Deep intrinsically motivated continuous actor-critic for efficient robotic visuomotor skill learning." Paladyn, Journal of Behavioral Robotics 10, no. 1 (January 1, 2019): 14–29. http://dx.doi.org/10.1515/pjbr-2019-0005.
Zhang, Haifei, Jian Xu, Jian Zhang, and Quan Liu. "Network Architecture for Optimizing Deep Deterministic Policy Gradient Algorithms." Computational Intelligence and Neuroscience 2022 (November 18, 2022): 1–10. http://dx.doi.org/10.1155/2022/1117781.
Jain, Arushi, Gandharv Patil, Ayush Jain, Khimya Khetarpal, and Doina Precup. "Variance Penalized On-Policy and Off-Policy Actor-Critic." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 9 (May 18, 2021): 7899–907. http://dx.doi.org/10.1609/aaai.v35i9.16964.
Hendzel, Zenon, and Marcin Szuster. "Discrete Action Dependant Heuristic Dynamic Programming in Control of a Wheeled Mobile Robot." Solid State Phenomena 164 (June 2010): 419–24. http://dx.doi.org/10.4028/www.scientific.net/ssp.164.419.
Su, Jianyu, Stephen Adams, and Peter Beling. "Value-Decomposition Multi-Agent Actor-Critics." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 13 (May 18, 2021): 11352–60. http://dx.doi.org/10.1609/aaai.v35i13.17353.
Qiu, Shuang, Zhuoran Yang, Jieping Ye, and Zhaoran Wang. "On Finite-Time Convergence of Actor-Critic Algorithm." IEEE Journal on Selected Areas in Information Theory 2, no. 2 (June 2021): 652–64. http://dx.doi.org/10.1109/jsait.2021.3078754.
Ding, Feng, Guanfeng Ma, Zhikui Chen, Jing Gao, and Peng Li. "Averaged Soft Actor-Critic for Deep Reinforcement Learning." Complexity 2021 (April 1, 2021): 1–16. http://dx.doi.org/10.1155/2021/6658724.
Hatakeyama, Hiroyuki, Shingo Mabu, Kotaro Hirasawa, and Jinglu Hu. "Genetic Network Programming with Actor-Critic." Journal of Advanced Computational Intelligence and Intelligent Informatics 11, no. 1 (January 20, 2007): 79–86. http://dx.doi.org/10.20965/jaciii.2007.p0079.
Hyeon, Soo-Jong, Tae-Young Kang, and Chang-Kyung Ryoo. "A Path Planning for Unmanned Aerial Vehicles Using SAC (Soft Actor Critic) Algorithm." Journal of Institute of Control, Robotics and Systems 28, no. 2 (February 28, 2022): 138–45. http://dx.doi.org/10.5302/j.icros.2022.21.0220.
Li, Xinzhou, Guifen Chen, Guowei Wu, Zhiyao Sun, and Guangjiao Chen. "Research on Multi-Agent D2D Communication Resource Allocation Algorithm Based on A2C." Electronics 12, no. 2 (January 10, 2023): 360. http://dx.doi.org/10.3390/electronics12020360.
Arvindhan, M., and D. Rajesh Kumar. "Adaptive Resource Allocation in Cloud Data Centers using Actor-Critical Deep Reinforcement Learning for Optimized Load Balancing." International Journal on Recent and Innovation Trends in Computing and Communication 11, no. 5s (May 18, 2023): 310–18. http://dx.doi.org/10.17762/ijritcc.v11i5s.6671.
Seo, Kanghyeon, and Jihoon Yang. "Differentially Private Actor and Its Eligibility Trace." Electronics 9, no. 9 (September 10, 2020): 1486. http://dx.doi.org/10.3390/electronics9091486.
Liao, Junrong, Shiyue Liu, Qinghe Wu, Jiabin Chen, and Fuhua Wei. "PID Control of Permanent Magnet Synchronous Motor Based on Improved Actor-Critic Framework." Journal of Physics: Conference Series 2213, no. 1 (March 1, 2022): 012005. http://dx.doi.org/10.1088/1742-6596/2213/1/012005.
Zhong, Shan, Quan Liu, and QiMing Fu. "Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning." Computational Intelligence and Neuroscience 2016 (2016): 1–15. http://dx.doi.org/10.1155/2016/4824072.
Hiroyasu, Tomoyuki, Akiyuki Nakamura, Mitsunori Miki, Masato Yoshimi, and Hisatake Yokouchi. "The Sensuous Lighting Control System using Actor-Critic Algorithm." Journal of Japan Society for Fuzzy Theory and Intelligent Informatics 23, no. 4 (2011): 501–12. http://dx.doi.org/10.3156/jsoft.23.501.
Borkar, V. S. "An actor-critic algorithm for constrained Markov decision processes." Systems & Control Letters 54, no. 3 (March 2005): 207–13. http://dx.doi.org/10.1016/j.sysconle.2004.08.007.
Li, Shuang, Yanghui Yan, Ju Ren, Yuezhi Zhou, and Yaoxue Zhang. "A Sample-Efficient Actor-Critic Algorithm for Recommendation Diversification." Chinese Journal of Electronics 29, no. 1 (January 1, 2020): 89–96. http://dx.doi.org/10.1049/cje.2019.10.004.
Shi, Wei, Long Chen, and Xia Zhu. "Task Offloading Decision-Making Algorithm for Vehicular Edge Computing: A Deep-Reinforcement-Learning-Based Approach." Sensors 23, no. 17 (September 1, 2023): 7595. http://dx.doi.org/10.3390/s23177595.
Zhou, Chengmin, Bingding Huang, and Pasi Fränti. "A review of motion planning algorithms for intelligent robots." Journal of Intelligent Manufacturing 33, no. 2 (November 25, 2021): 387–424. http://dx.doi.org/10.1007/s10845-021-01867-z.
Yue, Han, Jiapeng Liu, Dongmei Tian, and Qin Zhang. "A Novel Anti-Risk Method for Portfolio Trading Using Deep Reinforcement Learning." Electronics 11, no. 9 (May 7, 2022): 1506. http://dx.doi.org/10.3390/electronics11091506.
Nakamura, Yutaka, Takeshi Mori, Yoichi Tokita, Tomohiro Shibata, and Shin Ishii. "Off-Policy Natural Policy Gradient Method for a Biped Walking Using a CPG Controller." Journal of Robotics and Mechatronics 17, no. 6 (December 20, 2005): 636–44. http://dx.doi.org/10.20965/jrm.2005.p0636.
Hwang, Ha Jun, Jaeyeon Jang, Jongkwan Choi, Jung Ho Bae, Sung Ho Kim, and Chang Ouk Kim. "Stepwise Soft Actor–Critic for UAV Autonomous Flight Control." Drones 7, no. 9 (August 24, 2023): 549. http://dx.doi.org/10.3390/drones7090549.
Takano, Toshiaki, Haruhiko Takase, Hiroharu Kawanaka, and Shinji Tsuruoka. "Merging with Extraction Method for Transfer Learning in Actor-Critic." Journal of Advanced Computational Intelligence and Intelligent Informatics 15, no. 7 (September 20, 2011): 814–21. http://dx.doi.org/10.20965/jaciii.2011.p0814.
Zhang, Haifeng, Weizhe Chen, Zeren Huang, Minne Li, Yaodong Yang, Weinan Zhang, and Jun Wang. "Bi-Level Actor-Critic for Multi-Agent Coordination." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 05 (April 3, 2020): 7325–32. http://dx.doi.org/10.1609/aaai.v34i05.6226.
Wang, Hui, Peng Zhang, and Quan Liu. "An Actor-critic Algorithm Using Cross Evaluation of Value Functions." IAES International Journal of Robotics and Automation (IJRA) 7, no. 1 (March 1, 2018): 39. http://dx.doi.org/10.11591/ijra.v7i1.pp39-47.
Lan, Xuejing, Zhifeng Tan, Tao Zou, and Wenbiao Xu. "CACLA-Based Trajectory Tracking Guidance for RLV in Terminal Area Energy Management Phase." Sensors 21, no. 15 (July 26, 2021): 5062. http://dx.doi.org/10.3390/s21155062.
Zhang, Shangtong, and Hengshuai Yao. "ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search." Proceedings of the AAAI Conference on Artificial Intelligence 33 (July 17, 2019): 5789–96. http://dx.doi.org/10.1609/aaai.v33i01.33015789.
Kimura, Hajime, and Shigenobu Kobayashi. "An Actor-Critic Algorithm Using a Binary Tree Action Selector." Transactions of the Society of Instrument and Control Engineers 37, no. 12 (2001): 1147–55. http://dx.doi.org/10.9746/sicetr1965.37.1147.
Zhong, Shan, Jack Tan, Husheng Dong, Xuemei Chen, Shengrong Gong, and Zhenjiang Qian. "Modeling-Learning-Based Actor-Critic Algorithm with Gaussian Process Approximator." Journal of Grid Computing 18, no. 2 (April 18, 2020): 181–95. http://dx.doi.org/10.1007/s10723-020-09512-4.
Borkar, Vivek S., and Vijaymohan R. Konda. "The actor-critic algorithm as multi-time-scale stochastic approximation." Sadhana 22, no. 4 (August 1997): 525–43. http://dx.doi.org/10.1007/bf02745577.
Itoh, Hideaki, and Kazuyuki Aihara. "Combination of an actor/critic algorithm with goal-directed reasoning." Artificial Life and Robotics 5, no. 4 (December 2001): 233–41. http://dx.doi.org/10.1007/bf02481507.
Su, Jie-Ying, Jia-Lin Kang, and Shi-Shang Jang. "An Actor-Critic Algorithm for the Stochastic Cutting Stock Problem." Processes 11, no. 4 (April 13, 2023): 1203. http://dx.doi.org/10.3390/pr11041203.
Wu, Zhenning, Yiming Deng, and Lixing Wang. "A Pinning Actor-Critic Structure-Based Algorithm for Sizing Complex-Shaped Depth Profiles in MFL Inspection with High Degree of Freedom." Complexity 2021 (April 23, 2021): 1–12. http://dx.doi.org/10.1155/2021/9995033.
Tahami, Ehsan, Amir Homayoun Jafari, and Ali Fallah. "Application of an Evolutionary Actor–Critic Reinforcement Learning Method for the Control of a Three-Link Musculoskeletal Arm During a Reaching Movement." Journal of Mechanics in Medicine and Biology 13, no. 02 (April 2013): 1350040. http://dx.doi.org/10.1142/s0219519413500401.
Doya, Kenji. "Reinforcement Learning in Continuous Time and Space." Neural Computation 12, no. 1 (January 1, 2000): 219–45. http://dx.doi.org/10.1162/089976600300015961.
Chen, Haibo, Zhongwei Huang, Xiaorong Zhao, Xiao Liu, Youjun Jiang, Pinyong Geng, Guang Yang, Yewen Cao, and Deqiang Wang. "Policy Optimization of the Power Allocation Algorithm Based on the Actor–Critic Framework in Small Cell Networks." Mathematics 11, no. 7 (April 2, 2023): 1702. http://dx.doi.org/10.3390/math11071702.
Pradhan, Arabinda, Sukant Kishoro Bisoy, and Mangal Sain. "Action-Based Load Balancing Technique in Cloud Network Using Actor-Critic-Swarm Optimization." Wireless Communications and Mobile Computing 2022 (June 30, 2022): 1–17. http://dx.doi.org/10.1155/2022/6456242.
Yang, Qisong, Thiago D. Simão, Simon H. Tindemans, and Matthijs T. J. Spaan. "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 12 (May 18, 2021): 10639–46. http://dx.doi.org/10.1609/aaai.v35i12.17272.
Avkhimenia, Vadim, Matheus Gemignani, Tim Weis, and Petr Musilek. "Deep Reinforcement Learning-Based Operation of Transmission Battery Storage with Dynamic Thermal Line Rating." Energies 15, no. 23 (November 29, 2022): 9032. http://dx.doi.org/10.3390/en15239032.
Sola, Yoann, Gilles Le Chenadec, and Benoit Clement. "Simultaneous Control and Guidance of an AUV Based on Soft Actor–Critic." Sensors 22, no. 16 (August 14, 2022): 6072. http://dx.doi.org/10.3390/s22166072.
Wang, Xiao, Zhe Ma, Lei Mao, Kewu Sun, Xuhui Huang, Changchao Fan, and Jiake Li. "Accelerating Fuzzy Actor–Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem." Electronics 12, no. 8 (April 13, 2023): 1852. http://dx.doi.org/10.3390/electronics12081852.
Ali, Hamid, Hammad Majeed, Imran Usman, and Khaled A. Almejalli. "Reducing Entropy Overestimation in Soft Actor Critic Using Dual Policy Network." Wireless Communications and Mobile Computing 2021 (June 10, 2021): 1–13. http://dx.doi.org/10.1155/2021/9920591.
Xi, Bao, Rui Wang, Ying-Hao Cai, Tao Lu, and Shuo Wang. "A Novel Heterogeneous Actor-critic Algorithm with Recent Emphasizing Replay Memory." International Journal of Automation and Computing 18, no. 4 (April 23, 2021): 619–31. http://dx.doi.org/10.1007/s11633-021-1296-x.