Journal articles on the topic 'Actor-critic algorithm'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 journal articles for your research on the topic 'Actor-critic algorithm.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.
Wang, Jing, and Ioannis Ch Paschalidis. "An Actor-Critic Algorithm With Second-Order Actor and Critic." IEEE Transactions on Automatic Control 62, no. 6 (June 2017): 2689–703. http://dx.doi.org/10.1109/tac.2016.2616384.
Full textZheng, Liyuan, Tanner Fiez, Zane Alumbaugh, Benjamin Chasnov, and Lillian J. Ratliff. "Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 8 (June 28, 2022): 9217–24. http://dx.doi.org/10.1609/aaai.v36i8.20908.
Full textIwaki, Ryo, and Minoru Asada. "Implicit incremental natural actor critic algorithm." Neural Networks 109 (January 2019): 103–12. http://dx.doi.org/10.1016/j.neunet.2018.10.007.
Full textKim, Gi-Soo, Jane P. Kim, and Hyun-Joon Yang. "Robust Tests in Online Decision-Making." Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 9 (June 28, 2022): 10016–24. http://dx.doi.org/10.1609/aaai.v36i9.21240.
Full textSergey, Denisov, and Jee-Hyong Lee. "Actor-Critic Algorithm with Transition Cost Estimation." International Journal of Fuzzy Logic and Intelligent Systems 16, no. 4 (December 25, 2016): 270–75. http://dx.doi.org/10.5391/ijfis.2016.16.4.270.
Full textAhmed, Ayman Elshabrawy M. "Controller parameter tuning using actor-critic algorithm." IOP Conference Series: Materials Science and Engineering 610 (October 11, 2019): 012054. http://dx.doi.org/10.1088/1757-899x/610/1/012054.
Full textDing, Siyuan, Shengxiang Li, Guangyi Liu, Ou Li, Ke Ke, Yijie Bai, and Weiye Chen. "Decentralized Multiagent Actor-Critic Algorithm Based on Message Diffusion." Journal of Sensors 2021 (December 8, 2021): 1–14. http://dx.doi.org/10.1155/2021/8739206.
Full textHafez, Muhammad Burhan, Cornelius Weber, Matthias Kerzel, and Stefan Wermter. "Deep intrinsically motivated continuous actor-critic for efficient robotic visuomotor skill learning." Paladyn, Journal of Behavioral Robotics 10, no. 1 (January 1, 2019): 14–29. http://dx.doi.org/10.1515/pjbr-2019-0005.
Full textZhang, Haifei, Jian Xu, Jian Zhang, and Quan Liu. "Network Architecture for Optimizing Deep Deterministic Policy Gradient Algorithms." Computational Intelligence and Neuroscience 2022 (November 18, 2022): 1–10. http://dx.doi.org/10.1155/2022/1117781.
Full textJain, Arushi, Gandharv Patil, Ayush Jain, Khimya Khetarpal, and Doina Precup. "Variance Penalized On-Policy and Off-Policy Actor-Critic." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 9 (May 18, 2021): 7899–907. http://dx.doi.org/10.1609/aaai.v35i9.16964.
Full textHendzel, Zenon, and Marcin Szuster. "Discrete Action Dependant Heuristic Dynamic Programming in Control of a Wheeled Mobile Robot." Solid State Phenomena 164 (June 2010): 419–24. http://dx.doi.org/10.4028/www.scientific.net/ssp.164.419.
Full textSu, Jianyu, Stephen Adams, and Peter Beling. "Value-Decomposition Multi-Agent Actor-Critics." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 13 (May 18, 2021): 11352–60. http://dx.doi.org/10.1609/aaai.v35i13.17353.
Full textQiu, Shuang, Zhuoran Yang, Jieping Ye, and Zhaoran Wang. "On Finite-Time Convergence of Actor-Critic Algorithm." IEEE Journal on Selected Areas in Information Theory 2, no. 2 (June 2021): 652–64. http://dx.doi.org/10.1109/jsait.2021.3078754.
Full textDing, Feng, Guanfeng Ma, Zhikui Chen, Jing Gao, and Peng Li. "Averaged Soft Actor-Critic for Deep Reinforcement Learning." Complexity 2021 (April 1, 2021): 1–16. http://dx.doi.org/10.1155/2021/6658724.
Full textHatakeyama, Hiroyuki, Shingo Mabu, Kotaro Hirasawa, and Jinglu Hu. "Genetic Network Programming with Actor-Critic." Journal of Advanced Computational Intelligence and Intelligent Informatics 11, no. 1 (January 20, 2007): 79–86. http://dx.doi.org/10.20965/jaciii.2007.p0079.
Full textHyeon, Soo-Jong, Tae-Young Kang, and Chang-Kyung Ryoo. "A Path Planning for Unmanned Aerial Vehicles Using SAC (Soft Actor Critic) Algorithm." Journal of Institute of Control, Robotics and Systems 28, no. 2 (February 28, 2022): 138–45. http://dx.doi.org/10.5302/j.icros.2022.21.0220.
Full textLi, Xinzhou, Guifen Chen, Guowei Wu, Zhiyao Sun, and Guangjiao Chen. "Research on Multi-Agent D2D Communication Resource Allocation Algorithm Based on A2C." Electronics 12, no. 2 (January 10, 2023): 360. http://dx.doi.org/10.3390/electronics12020360.
Full textArvindhan, M., and D. Rajesh Kumar. "Adaptive Resource Allocation in Cloud Data Centers using Actor-Critical Deep Reinforcement Learning for Optimized Load Balancing." International Journal on Recent and Innovation Trends in Computing and Communication 11, no. 5s (May 18, 2023): 310–18. http://dx.doi.org/10.17762/ijritcc.v11i5s.6671.
Full textSeo, Kanghyeon, and Jihoon Yang. "Differentially Private Actor and Its Eligibility Trace." Electronics 9, no. 9 (September 10, 2020): 1486. http://dx.doi.org/10.3390/electronics9091486.
Full textLiao, Junrong, Shiyue Liu, Qinghe Wu, Jiabin Chen, and Fuhua Wei. "PID Control of Permanent Magnet Synchronous Motor Based on Improved Actor-Critic Framework." Journal of Physics: Conference Series 2213, no. 1 (March 1, 2022): 012005. http://dx.doi.org/10.1088/1742-6596/2213/1/012005.
Full textZhong, Shan, Quan Liu, and QiMing Fu. "Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning." Computational Intelligence and Neuroscience 2016 (2016): 1–15. http://dx.doi.org/10.1155/2016/4824072.
Full textHIROYASU, Tomoyuki, Akiyuki NAKAMURA, Mitsunori MIKI, Masato YOSHIMI, and Hisatake YOKOUCHI. "The Sensuous Lighting Control System using Actor-Critic Algorithm." Journal of Japan Society for Fuzzy Theory and Intelligent Informatics 23, no. 4 (2011): 501–12. http://dx.doi.org/10.3156/jsoft.23.501.
Full textBorkar, V. S. "An actor-critic algorithm for constrained Markov decision processes." Systems & Control Letters 54, no. 3 (March 2005): 207–13. http://dx.doi.org/10.1016/j.sysconle.2004.08.007.
Full textLi, Shuang, Yanghui Yan, Ju Ren, Yuezhi Zhou, and Yaoxue Zhang. "A Sample-Efficient Actor-Critic Algorithm for Recommendation Diversification." Chinese Journal of Electronics 29, no. 1 (January 1, 2020): 89–96. http://dx.doi.org/10.1049/cje.2019.10.004.
Full textShi, Wei, Long Chen, and Xia Zhu. "Task Offloading Decision-Making Algorithm for Vehicular Edge Computing: A Deep-Reinforcement-Learning-Based Approach." Sensors 23, no. 17 (September 1, 2023): 7595. http://dx.doi.org/10.3390/s23177595.
Full textZhou, Chengmin, Bingding Huang, and Pasi Fränti. "A review of motion planning algorithms for intelligent robots." Journal of Intelligent Manufacturing 33, no. 2 (November 25, 2021): 387–424. http://dx.doi.org/10.1007/s10845-021-01867-z.
Full textYue, Han, Jiapeng Liu, Dongmei Tian, and Qin Zhang. "A Novel Anti-Risk Method for Portfolio Trading Using Deep Reinforcement Learning." Electronics 11, no. 9 (May 7, 2022): 1506. http://dx.doi.org/10.3390/electronics11091506.
Full textNakamura, Yutaka, Takeshi Mori, Yoichi Tokita, Tomohiro Shibata, and Shin Ishii. "Off-Policy Natural Policy Gradient Method for a Biped Walking Using a CPG Controller." Journal of Robotics and Mechatronics 17, no. 6 (December 20, 2005): 636–44. http://dx.doi.org/10.20965/jrm.2005.p0636.
Full textHwang, Ha Jun, Jaeyeon Jang, Jongkwan Choi, Jung Ho Bae, Sung Ho Kim, and Chang Ouk Kim. "Stepwise Soft Actor–Critic for UAV Autonomous Flight Control." Drones 7, no. 9 (August 24, 2023): 549. http://dx.doi.org/10.3390/drones7090549.
Full textTakano, Toshiaki, Haruhiko Takase, Hiroharu Kawanaka, and Shinji Tsuruoka. "Merging with Extraction Method for Transfer Learning in Actor-Critic." Journal of Advanced Computational Intelligence and Intelligent Informatics 15, no. 7 (September 20, 2011): 814–21. http://dx.doi.org/10.20965/jaciii.2011.p0814.
Full textZhang, Haifeng, Weizhe Chen, Zeren Huang, Minne Li, Yaodong Yang, Weinan Zhang, and Jun Wang. "Bi-Level Actor-Critic for Multi-Agent Coordination." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 05 (April 3, 2020): 7325–32. http://dx.doi.org/10.1609/aaai.v34i05.6226.
Full textWang, Hui, Peng Zhang, and Quan Liu. "An Actor-critic Algorithm Using Cross Evaluation of Value Functions." IAES International Journal of Robotics and Automation (IJRA) 7, no. 1 (March 1, 2018): 39. http://dx.doi.org/10.11591/ijra.v7i1.pp39-47.
Full textLan, Xuejing, Zhifeng Tan, Tao Zou, and Wenbiao Xu. "CACLA-Based Trajectory Tracking Guidance for RLV in Terminal Area Energy Management Phase." Sensors 21, no. 15 (July 26, 2021): 5062. http://dx.doi.org/10.3390/s21155062.
Full textZhang, Shangtong, and Hengshuai Yao. "ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search." Proceedings of the AAAI Conference on Artificial Intelligence 33 (July 17, 2019): 5789–96. http://dx.doi.org/10.1609/aaai.v33i01.33015789.
Full textKIMURA, Hajime, and Shigenobu KOBAYASHI. "An Actor-Critic Algorithm Using a Binary Tree Action Selector." Transactions of the Society of Instrument and Control Engineers 37, no. 12 (2001): 1147–55. http://dx.doi.org/10.9746/sicetr1965.37.1147.
Full textZhong, Shan, Jack Tan, Husheng Dong, Xuemei Chen, Shengrong Gong, and Zhenjiang Qian. "Modeling-Learning-Based Actor-Critic Algorithm with Gaussian Process Approximator." Journal of Grid Computing 18, no. 2 (April 18, 2020): 181–95. http://dx.doi.org/10.1007/s10723-020-09512-4.
Full textBorkar, Vivek S., and Vijaymohan R. Konda. "The actor-critic algorithm as multi-time-scale stochastic approximation." Sadhana 22, no. 4 (August 1997): 525–43. http://dx.doi.org/10.1007/bf02745577.
Full textItoh, Hideaki, and Kazuyuki Aihara. "Combination of an actor/critic algorithm with goal-directed reasoning." Artificial Life and Robotics 5, no. 4 (December 2001): 233–41. http://dx.doi.org/10.1007/bf02481507.
Full textSu, Jie-Ying, Jia-Lin Kang, and Shi-Shang Jang. "An Actor-Critic Algorithm for the Stochastic Cutting Stock Problem." Processes 11, no. 4 (April 13, 2023): 1203. http://dx.doi.org/10.3390/pr11041203.
Full textWu, Zhenning, Yiming Deng, and Lixing Wang. "A Pinning Actor-Critic Structure-Based Algorithm for Sizing Complex-Shaped Depth Profiles in MFL Inspection with High Degree of Freedom." Complexity 2021 (April 23, 2021): 1–12. http://dx.doi.org/10.1155/2021/9995033.
Full textTAHAMI, EHSAN, AMIR HOMAYOUN JAFARI, and ALI FALLAH. "APPLICATION OF AN EVOLUTIONARY ACTOR–CRITIC REINFORCEMENT LEARNING METHOD FOR THE CONTROL OF A THREE-LINK MUSCULOSKELETAL ARM DURING A REACHING MOVEMENT." Journal of Mechanics in Medicine and Biology 13, no. 02 (April 2013): 1350040. http://dx.doi.org/10.1142/s0219519413500401.
Full textDoya, Kenji. "Reinforcement Learning in Continuous Time and Space." Neural Computation 12, no. 1 (January 1, 2000): 219–45. http://dx.doi.org/10.1162/089976600300015961.
Full textChen, Haibo, Zhongwei Huang, Xiaorong Zhao, Xiao Liu, Youjun Jiang, Pinyong Geng, Guang Yang, Yewen Cao, and Deqiang Wang. "Policy Optimization of the Power Allocation Algorithm Based on the Actor–Critic Framework in Small Cell Networks." Mathematics 11, no. 7 (April 2, 2023): 1702. http://dx.doi.org/10.3390/math11071702.
Full textPradhan, Arabinda, Sukant Kishoro Bisoy, and Mangal Sain. "Action-Based Load Balancing Technique in Cloud Network Using Actor-Critic-Swarm Optimization." Wireless Communications and Mobile Computing 2022 (June 30, 2022): 1–17. http://dx.doi.org/10.1155/2022/6456242.
Full textYang, Qisong, Thiago D. Simão, Simon H. Tindemans, and Matthijs T. J. Spaan. "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning." Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 12 (May 18, 2021): 10639–46. http://dx.doi.org/10.1609/aaai.v35i12.17272.
Full textAvkhimenia, Vadim, Matheus Gemignani, Tim Weis, and Petr Musilek. "Deep Reinforcement Learning-Based Operation of Transmission Battery Storage with Dynamic Thermal Line Rating." Energies 15, no. 23 (November 29, 2022): 9032. http://dx.doi.org/10.3390/en15239032.
Full textSola, Yoann, Gilles Le Chenadec, and Benoit Clement. "Simultaneous Control and Guidance of an AUV Based on Soft Actor–Critic." Sensors 22, no. 16 (August 14, 2022): 6072. http://dx.doi.org/10.3390/s22166072.
Full textWang, Xiao, Zhe Ma, Lei Mao, Kewu Sun, Xuhui Huang, Changchao Fan, and Jiake Li. "Accelerating Fuzzy Actor–Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem." Electronics 12, no. 8 (April 13, 2023): 1852. http://dx.doi.org/10.3390/electronics12081852.
Full textAli, Hamid, Hammad Majeed, Imran Usman, and Khaled A. Almejalli. "Reducing Entropy Overestimation in Soft Actor Critic Using Dual Policy Network." Wireless Communications and Mobile Computing 2021 (June 10, 2021): 1–13. http://dx.doi.org/10.1155/2021/9920591.
Full textXi, Bao, Rui Wang, Ying-Hao Cai, Tao Lu, and Shuo Wang. "A Novel Heterogeneous Actor-critic Algorithm with Recent Emphasizing Replay Memory." International Journal of Automation and Computing 18, no. 4 (April 23, 2021): 619–31. http://dx.doi.org/10.1007/s11633-021-1296-x.
Full text