Artículos de revistas sobre el tema "Safe RL"
Crea una cita precisa en los estilos APA, MLA, Chicago, Harvard y otros
Consulte los 50 mejores artículos de revistas para su investigación sobre el tema "Safe RL".
Junto a cada fuente en la lista de referencias hay un botón "Agregar a la bibliografía". Pulsa este botón, y generaremos automáticamente la referencia bibliográfica para la obra elegida en el estilo de cita que necesites: APA, MLA, Harvard, Vancouver, Chicago, etc.
También puede descargar el texto completo de la publicación académica en formato pdf y leer en línea su resumen siempre que esté disponible en los metadatos.
Explore artículos de revistas sobre una amplia variedad de disciplinas y organice su bibliografía correctamente.
Carr, Steven, Nils Jansen, Sebastian Junges y Ufuk Topcu. "Safe Reinforcement Learning via Shielding under Partial Observability". Proceedings of the AAAI Conference on Artificial Intelligence 37, n.º 12 (26 de junio de 2023): 14748–56. http://dx.doi.org/10.1609/aaai.v37i12.26723.
Texto completoMa, Yecheng Jason, Andrew Shen, Osbert Bastani y Jayaraman Dinesh. "Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning". Proceedings of the AAAI Conference on Artificial Intelligence 36, n.º 5 (28 de junio de 2022): 5404–12. http://dx.doi.org/10.1609/aaai.v36i5.20478.
Texto completoXu, Haoran, Xianyuan Zhan y Xiangyu Zhu. "Constraints Penalized Q-learning for Safe Offline Reinforcement Learning". Proceedings of the AAAI Conference on Artificial Intelligence 36, n.º 8 (28 de junio de 2022): 8753–60. http://dx.doi.org/10.1609/aaai.v36i8.20855.
Texto completoThananjeyan, Brijen, Ashwin Balakrishna, Suraj Nair, Michael Luo, Krishnan Srinivasan, Minho Hwang, Joseph E. Gonzalez, Julian Ibarz, Chelsea Finn y Ken Goldberg. "Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones". IEEE Robotics and Automation Letters 6, n.º 3 (julio de 2021): 4915–22. http://dx.doi.org/10.1109/lra.2021.3070252.
Texto completoSerrano-Cuevas, Jonathan, Eduardo F. Morales y Pablo Hernández-Leal. "Safe reinforcement learning using risk mapping by similarity". Adaptive Behavior 28, n.º 4 (18 de julio de 2019): 213–24. http://dx.doi.org/10.1177/1059712319859650.
Texto completoCheng, Richard, Gábor Orosz, Richard M. Murray y Joel W. Burdick. "End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks". Proceedings of the AAAI Conference on Artificial Intelligence 33 (17 de julio de 2019): 3387–95. http://dx.doi.org/10.1609/aaai.v33i01.33013387.
Texto completoJurj, Sorin Liviu, Dominik Grundt, Tino Werner, Philipp Borchers, Karina Rothemann y Eike Möhlmann. "Increasing the Safety of Adaptive Cruise Control Using Physics-Guided Reinforcement Learning". Energies 14, n.º 22 (12 de noviembre de 2021): 7572. http://dx.doi.org/10.3390/en14227572.
Texto completoSakrihei, Helen. "Using automatic storage for ILL – experiences from the National Repository Library in Norway". Interlending & Document Supply 44, n.º 1 (15 de febrero de 2016): 14–16. http://dx.doi.org/10.1108/ilds-11-2015-0035.
Texto completoDing, Yuhao y Javad Lavaei. "Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with Non-stationary Objectives and Constraints". Proceedings of the AAAI Conference on Artificial Intelligence 37, n.º 6 (26 de junio de 2023): 7396–404. http://dx.doi.org/10.1609/aaai.v37i6.25900.
Texto completoTubeuf, Carlotta, Felix Birkelbach, Anton Maly y René Hofmann. "Increasing the Flexibility of Hydropower with Reinforcement Learning on a Digital Twin Platform". Energies 16, n.º 4 (11 de febrero de 2023): 1796. http://dx.doi.org/10.3390/en16041796.
Texto completoYOON, JAE UNG y JUHONG LEE. "Uncertainty Sequence Modeling Approach for Safe and Effective Autonomous Driving". Korean Institute of Smart Media 11, n.º 9 (31 de octubre de 2022): 9–20. http://dx.doi.org/10.30693/smj.2022.11.9.9.
Texto completoLin, Xingbin, Deyu Yuan y Xifei Li. "Reinforcement Learning with Dual Safety Policies for Energy Savings in Building Energy Systems". Buildings 13, n.º 3 (21 de febrero de 2023): 580. http://dx.doi.org/10.3390/buildings13030580.
Texto completoMarchesini, Enrico, Davide Corsi y Alessandro Farinelli. "Exploring Safer Behaviors for Deep Reinforcement Learning". Proceedings of the AAAI Conference on Artificial Intelligence 36, n.º 7 (28 de junio de 2022): 7701–9. http://dx.doi.org/10.1609/aaai.v36i7.20737.
Texto completoEgleston, David, Patricia Ann Castelli y Thomas George Marx. "Developing, validating, and testing a model of reflective leadership". Leadership & Organization Development Journal 38, n.º 7 (4 de septiembre de 2017): 886–96. http://dx.doi.org/10.1108/lodj-09-2016-0230.
Texto completoHuh, Gene y Wonjae Cha. "Development and Clinical Application of Real-Time Light-Guided Vocal Fold Injection". Journal of The Korean Society of Laryngology, Phoniatrics and Logopedics 33, n.º 1 (30 de abril de 2022): 1–6. http://dx.doi.org/10.22469/jkslp.2022.33.1.1.
Texto completoRamakrishnan, Ramya, Ece Kamar, Debadeepta Dey, Eric Horvitz y Julie Shah. "Blind Spot Detection for Safe Sim-to-Real Transfer". Journal of Artificial Intelligence Research 67 (4 de febrero de 2020): 191–234. http://dx.doi.org/10.1613/jair.1.11436.
Texto completoHao, Hao, Yichen Sun, Xueyun Mei y Yanjun Zhou. "Reverse Logistics Network Design of Electric Vehicle Batteries considering Recall Risk". Mathematical Problems in Engineering 2021 (18 de agosto de 2021): 1–16. http://dx.doi.org/10.1155/2021/5518049.
Texto completoRay, Kaustabha y Ansuman Banerjee. "Horizontal Auto-Scaling for Multi-Access Edge Computing Using Safe Reinforcement Learning". ACM Transactions on Embedded Computing Systems 20, n.º 6 (30 de noviembre de 2021): 1–33. http://dx.doi.org/10.1145/3475991.
Texto completoDelgado, Tomás, Marco Sánchez Sorondo, Víctor Braberman y Sebastián Uchitel. "Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach". Proceedings of the International Conference on Automated Planning and Scheduling 33, n.º 1 (1 de julio de 2023): 569–77. http://dx.doi.org/10.1609/icaps.v33i1.27238.
Texto completoBolster, Lauren, Mark Bosch, Brian Brownbridge y Anurag Saxena. "RAP Trial: Ringer's Lactate and Packed Red Blood Cell Transfusion, An in Vitro Study and Chart Review." Blood 114, n.º 22 (20 de noviembre de 2009): 2105. http://dx.doi.org/10.1182/blood.v114.22.2105.2105.
Texto completoRomey, Aurore, Hussaini G. Ularamu, Abdulnaci Bulut, Syed M. Jamal, Salman Khan, Muhammad Ishaq, Michael Eschbaumer et al. "Field Evaluation of a Safe, Easy, and Low-Cost Protocol for Shipment of Samples from Suspected Cases of Foot-and-Mouth Disease to Diagnostic Laboratories". Transboundary and Emerging Diseases 2023 (5 de agosto de 2023): 1–15. http://dx.doi.org/10.1155/2023/9555213.
Texto completoDai, Juntao, Jiaming Ji, Long Yang, Qian Zheng y Gang Pan. "Augmented Proximal Policy Optimization for Safe Reinforcement Learning". Proceedings of the AAAI Conference on Artificial Intelligence 37, n.º 6 (26 de junio de 2023): 7288–95. http://dx.doi.org/10.1609/aaai.v37i6.25888.
Texto completoKrstić, Mladen, Giulio Paolo Agnusdei, Pier Paolo Miglietta, Snežana Tadić y Violeta Roso. "Applicability of Industry 4.0 Technologies in the Reverse Logistics: A Circular Economy Approach Based on COmprehensive Distance Based RAnking (COBRA) Method". Sustainability 14, n.º 9 (7 de mayo de 2022): 5632. http://dx.doi.org/10.3390/su14095632.
Texto completoPrasetyo, Risky Vitria, Abdul Latief Azis y Soegeng Soegijanto. "Comparison of the efficacy and safety of hydroxyethyl starch 130/0.4 and Ringer's lactate in children with grade III dengue hemorrhagic fever". Paediatrica Indonesiana 49, n.º 2 (30 de abril de 2009): 97. http://dx.doi.org/10.14238/pi49.2.2009.97-103.
Texto completoBöck, Markus, Julien Malle, Daniel Pasterk, Hrvoje Kukina, Ramin Hasani y Clemens Heitzinger. "Superhuman performance on sepsis MIMIC-III data by distributional reinforcement learning". PLOS ONE 17, n.º 11 (3 de noviembre de 2022): e0275358. http://dx.doi.org/10.1371/journal.pone.0275358.
Texto completoLi, Yue, Xiao Yong Bai, Shi Jie Wang, Luo Yi Qin, Yi Chao Tian y Guang Jie Luo. "Evaluating of the spatial heterogeneity of soil loss tolerance and its effects on erosion risk in the carbonate areas of southern China". Solid Earth 8, n.º 3 (29 de mayo de 2017): 661–69. http://dx.doi.org/10.5194/se-8-661-2017.
Texto completoKondrup, Flemming, Thomas Jiralerspong, Elaine Lau, Nathan De Lara, Jacob Shkrob, My Duc Tran, Doina Precup y Sumana Basu. "Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning". Proceedings of the AAAI Conference on Artificial Intelligence 37, n.º 13 (26 de junio de 2023): 15696–702. http://dx.doi.org/10.1609/aaai.v37i13.26862.
Texto completoMiyajima, Hirofumi, Noritaka Shigei, Syunki Makino, Hiromi Miyajima, Yohtaro Miyanishi, Shinji Kitagami y Norio Shiratori. "A proposal of privacy preserving reinforcement learning for secure multiparty computation". Artificial Intelligence Research 6, n.º 2 (23 de mayo de 2017): 57. http://dx.doi.org/10.5430/air.v6n2p57.
Texto completoThananjeyan, Brijen, Ashwin Balakrishna, Ugo Rosolia, Felix Li, Rowan McAllister, Joseph E. Gonzalez, Sergey Levine, Francesco Borrelli y Ken Goldberg. "Safety Augmented Value Estimation From Demonstrations (SAVED): Safe Deep Model-Based RL for Sparse Cost Robotic Tasks". IEEE Robotics and Automation Letters 5, n.º 2 (abril de 2020): 3612–19. http://dx.doi.org/10.1109/lra.2020.2976272.
Texto completoRen, Tianzhu, Yuanchang Xie y Liming Jiang. "Cooperative Highway Work Zone Merge Control Based on Reinforcement Learning in a Connected and Automated Environment". Transportation Research Record: Journal of the Transportation Research Board 2674, n.º 10 (17 de julio de 2020): 363–74. http://dx.doi.org/10.1177/0361198120935873.
Texto completoReda, Ahmad y József Vásárhelyi. "Design and Implementation of Reinforcement Learning for Automated Driving Compared to Classical MPC Control". Designs 7, n.º 1 (29 de enero de 2023): 18. http://dx.doi.org/10.3390/designs7010018.
Texto completoGardille, Arnaud y Ola Ahmad. "Towards Safe Reinforcement Learning via OOD Dynamics Detection in Autonomous Driving System (Student Abstract)". Proceedings of the AAAI Conference on Artificial Intelligence 37, n.º 13 (26 de junio de 2023): 16216–17. http://dx.doi.org/10.1609/aaai.v37i13.26968.
Texto completoFree, David. "In the News". College & Research Libraries News 80, n.º 10 (5 de noviembre de 2019): 541. http://dx.doi.org/10.5860/crln.80.10.541.
Texto completoXu, Xibao, Yushen Chen y Chengchao Bai. "Deep Reinforcement Learning-Based Accurate Control of Planetary Soft Landing". Sensors 21, n.º 23 (6 de diciembre de 2021): 8161. http://dx.doi.org/10.3390/s21238161.
Texto completoSimão, Thiago D., Marnix Suilen y Nils Jansen. "Safe Policy Improvement for POMDPs via Finite-State Controllers". Proceedings of the AAAI Conference on Artificial Intelligence 37, n.º 12 (26 de junio de 2023): 15109–17. http://dx.doi.org/10.1609/aaai.v37i12.26763.
Texto completoZhang, Linrui, Qin Zhang, Li Shen, Bo Yuan, Xueqian Wang y Dacheng Tao. "Evaluating Model-Free Reinforcement Learning toward Safety-Critical Tasks". Proceedings of the AAAI Conference on Artificial Intelligence 37, n.º 12 (26 de junio de 2023): 15313–21. http://dx.doi.org/10.1609/aaai.v37i12.26786.
Texto completoAngele, Martin K., Nadia Smail, Markus W. Knöferl, Alfred Ayala, William G. Cioffi y Irshad H. Chaudry. "l-Arginine restores splenocyte functions after trauma and hemorrhage potentially by improving splenic blood flow". American Journal of Physiology-Cell Physiology 276, n.º 1 (1 de enero de 1999): C145—C151. http://dx.doi.org/10.1152/ajpcell.1999.276.1.c145.
Texto completoStaessens, Tom, Tom Lefebvre y Guillaume Crevecoeur. "Optimizing Cascaded Control of Mechatronic Systems through Constrained Residual Reinforcement Learning". Machines 11, n.º 3 (20 de marzo de 2023): 402. http://dx.doi.org/10.3390/machines11030402.
Texto completoLv, Kexuan, Xiaofei Pei, Ci Chen y Jie Xu. "A Safe and Efficient Lane Change Decision-Making Strategy of Autonomous Driving Based on Deep Reinforcement Learning". Mathematics 10, n.º 9 (5 de mayo de 2022): 1551. http://dx.doi.org/10.3390/math10091551.
Texto completoJurj, Sorin Liviu, Tino Werner, Dominik Grundt, Willem Hagemann y Eike Möhlmann. "Towards Safe and Sustainable Autonomous Vehicles Using Environmentally-Friendly Criticality Metrics". Sustainability 14, n.º 12 (7 de junio de 2022): 6988. http://dx.doi.org/10.3390/su14126988.
Texto completoMaw, Aye Aye, Maxim Tyan, Tuan Anh Nguyen y Jae-Woo Lee. "iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV". Applied Sciences 11, n.º 9 (27 de abril de 2021): 3948. http://dx.doi.org/10.3390/app11093948.
Texto completoCivetta, Joseph M. y Charles L. Fox. "Advantages of Resuscitation with Balanced Hypertonic Sodium Solution in Disasters". Prehospital and Disaster Medicine 1, S1 (1985): 179–80. http://dx.doi.org/10.1017/s1049023x0004437x.
Texto completoWysocka, B. A., Z. Kassam, G. Lockwood, J. Brierley, L. Dawson y J. Ringash. "Assessment of intra and interfractional organ motion during adjuvant radiochemotherapy in gastric cancer". Journal of Clinical Oncology 25, n.º 18_suppl (20 de junio de 2007): 15132. http://dx.doi.org/10.1200/jco.2007.25.18_suppl.15132.
Texto completoNiu, Tong y Mohit Bansal. "AvgOut: A Simple Output-Probability Measure to Eliminate Dull Responses". Proceedings of the AAAI Conference on Artificial Intelligence 34, n.º 05 (3 de abril de 2020): 8560–67. http://dx.doi.org/10.1609/aaai.v34i05.6378.
Texto completoVivek, Kumar, Shah Amiti, Saha Shivshankar y Choudhary Lalit. "Electrolyte and Haemogram changes post large volume liposuction comparing two different tumescent solutions". Indian Journal of Plastic Surgery 47, n.º 03 (septiembre de 2014): 386–93. http://dx.doi.org/10.4103/0970-0358.146604.
Texto completoChebbi, Alif, Massimiliano Tazzari, Cristiana Rizzi, Franco Hernan Gomez Tovar, Sara Villa, Silvia Sbaffoni, Mentore Vaccari y Andrea Franzetti. "Burkholderia thailandensis E264 as a promising safe rhamnolipids’ producer towards a sustainable valorization of grape marcs and olive mill pomace". Applied Microbiology and Biotechnology 105, n.º 9 (20 de abril de 2021): 3825–42. http://dx.doi.org/10.1007/s00253-021-11292-0.
Texto completoBrown, Jennifer R., Matthew S. Davids, Jordi Rodon, Pau Abrisqueta, Coumaran Egile, Rodrigo Ruiz-Soto y Farrukh Awan. "Update On The Safety and Efficacy Of The Pan Class I PI3K Inhibitor SAR245408 (XL147) In Chronic Lymphocytic Leukemia and Non-Hodgkin’s Lymphoma Patients". Blood 122, n.º 21 (15 de noviembre de 2013): 4170. http://dx.doi.org/10.1182/blood.v122.21.4170.4170.
Texto completoTripathi, Malati, Ayushma Adhikari y Bibhushan Neupane. "Misoprostol Versus Oxytocin for Induction of Labour at Term and Post Term Pregnancy of Primigravida". Journal of Universal College of Medical Sciences 6, n.º 2 (3 de diciembre de 2018): 56–59. http://dx.doi.org/10.3126/jucms.v6i2.22497.
Texto completoOlupot-Olupot, Peter, Florence Aloroker, Ayub Mpoya, Hellen Mnjalla, George Passi, Margaret Nakuya, Kirsty Houston et al. "Gastroenteritis Rehydration Of children with Severe Acute Malnutrition (GASTROSAM): A Phase II Randomised Controlled trial: Trial Protocol". Wellcome Open Research 6 (23 de junio de 2021): 160. http://dx.doi.org/10.12688/wellcomeopenres.16885.1.
Texto completoJiang, Jianhua, Yangang Ren, Yang Guan, Shengbo Eben Li, Yuming Yin, Dongjie Yu y Xiaoping Jin. "Integrated decision and control at multi-lane intersections with mixed traffic flow". Journal of Physics: Conference Series 2234, n.º 1 (1 de abril de 2022): 012015. http://dx.doi.org/10.1088/1742-6596/2234/1/012015.
Texto completo