Gotowa bibliografia na temat „Safe RL”
Utwórz poprawne odniesienie w stylach APA, MLA, Chicago, Harvard i wielu innych
Zobacz listy aktualnych artykułów, książek, rozpraw, streszczeń i innych źródeł naukowych na temat „Safe RL”.
Przycisk „Dodaj do bibliografii” jest dostępny obok każdej pracy w bibliografii. Użyj go – a my automatycznie utworzymy odniesienie bibliograficzne do wybranej pracy w stylu cytowania, którego potrzebujesz: APA, MLA, Harvard, Chicago, Vancouver itp.
Możesz również pobrać pełny tekst publikacji naukowej w formacie „.pdf” i przeczytać adnotację do pracy online, jeśli odpowiednie parametry są dostępne w metadanych.
Artykuły w czasopismach na temat "Safe RL"
Carr, Steven, Nils Jansen, Sebastian Junges i Ufuk Topcu. "Safe Reinforcement Learning via Shielding under Partial Observability". Proceedings of the AAAI Conference on Artificial Intelligence 37, nr 12 (26.06.2023): 14748–56. http://dx.doi.org/10.1609/aaai.v37i12.26723.
Pełny tekst źródłaMa, Yecheng Jason, Andrew Shen, Osbert Bastani i Jayaraman Dinesh. "Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning". Proceedings of the AAAI Conference on Artificial Intelligence 36, nr 5 (28.06.2022): 5404–12. http://dx.doi.org/10.1609/aaai.v36i5.20478.
Pełny tekst źródłaXu, Haoran, Xianyuan Zhan i Xiangyu Zhu. "Constraints Penalized Q-learning for Safe Offline Reinforcement Learning". Proceedings of the AAAI Conference on Artificial Intelligence 36, nr 8 (28.06.2022): 8753–60. http://dx.doi.org/10.1609/aaai.v36i8.20855.
Pełny tekst źródłaThananjeyan, Brijen, Ashwin Balakrishna, Suraj Nair, Michael Luo, Krishnan Srinivasan, Minho Hwang, Joseph E. Gonzalez, Julian Ibarz, Chelsea Finn i Ken Goldberg. "Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones". IEEE Robotics and Automation Letters 6, nr 3 (lipiec 2021): 4915–22. http://dx.doi.org/10.1109/lra.2021.3070252.
Pełny tekst źródłaSerrano-Cuevas, Jonathan, Eduardo F. Morales i Pablo Hernández-Leal. "Safe reinforcement learning using risk mapping by similarity". Adaptive Behavior 28, nr 4 (18.07.2019): 213–24. http://dx.doi.org/10.1177/1059712319859650.
Pełny tekst źródłaCheng, Richard, Gábor Orosz, Richard M. Murray i Joel W. Burdick. "End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks". Proceedings of the AAAI Conference on Artificial Intelligence 33 (17.07.2019): 3387–95. http://dx.doi.org/10.1609/aaai.v33i01.33013387.
Pełny tekst źródłaJurj, Sorin Liviu, Dominik Grundt, Tino Werner, Philipp Borchers, Karina Rothemann i Eike Möhlmann. "Increasing the Safety of Adaptive Cruise Control Using Physics-Guided Reinforcement Learning". Energies 14, nr 22 (12.11.2021): 7572. http://dx.doi.org/10.3390/en14227572.
Pełny tekst źródłaSakrihei, Helen. "Using automatic storage for ILL – experiences from the National Repository Library in Norway". Interlending & Document Supply 44, nr 1 (15.02.2016): 14–16. http://dx.doi.org/10.1108/ilds-11-2015-0035.
Pełny tekst źródłaDing, Yuhao, i Javad Lavaei. "Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with Non-stationary Objectives and Constraints". Proceedings of the AAAI Conference on Artificial Intelligence 37, nr 6 (26.06.2023): 7396–404. http://dx.doi.org/10.1609/aaai.v37i6.25900.
Pełny tekst źródłaTubeuf, Carlotta, Felix Birkelbach, Anton Maly i René Hofmann. "Increasing the Flexibility of Hydropower with Reinforcement Learning on a Digital Twin Platform". Energies 16, nr 4 (11.02.2023): 1796. http://dx.doi.org/10.3390/en16041796.
Pełny tekst źródłaRozprawy doktorskie na temat "Safe RL"
Gowda, Malali, R. C. Venu, Mohan Raghupathy, Kan Nobuta, Huameng Li, Rod Wing, Eric Stahlberg i in. "Deep and comparative analysis of the mycelium and appressorium transcriptomes of Magnaporthe grisea using MPSS, RL-SAGE, and oligoarray methods". BioMed Central, 2006. http://hdl.handle.net/10150/610397.
Pełny tekst źródłaCzęści książek na temat "Safe RL"
Lenka, Lalu Prasad, i Mélanie Bouroche. "Safe Lane-Changing in CAVs Using External Safety Supervisors: A Review". W Communications in Computer and Information Science, 527–38. Cham: Springer Nature Switzerland, 2023. http://dx.doi.org/10.1007/978-3-031-26438-2_41.
Pełny tekst źródłaGowda, Malali, i Guo-Liang Wang. "Robust-LongSAGE (RL-SAGE)". W Methods in Molecular Biology, 25–38. Totowa, NJ: Humana Press, 2008. http://dx.doi.org/10.1007/978-1-59745-454-4_2.
Pełny tekst źródłaPiepenbrock, Jelle, Tom Heskes, Mikoláš Janota i Josef Urban. "Guiding an Automated Theorem Prover with Neural Rewriting". W Automated Reasoning, 597–617. Cham: Springer International Publishing, 2022. http://dx.doi.org/10.1007/978-3-031-10769-6_35.
Pełny tekst źródłaSwan, Jerry, Eric Nivel, Neel Kant, Jules Hedges, Timothy Atkinson i Bas Steunebrink. "Where is My Mind?" W The Road to General Intelligence, 17–22. Cham: Springer International Publishing, 2022. http://dx.doi.org/10.1007/978-3-031-08020-3_3.
Pełny tekst źródłaJamal, Mansoor, Zaib Ullah i Musarat Abbas. "Self-Adapted Resource Allocation in V2X Communication". W Workshop Proceedings of the 19th International Conference on Intelligent Environments (IE2023). IOS Press, 2023. http://dx.doi.org/10.3233/aise230018.
Pełny tekst źródłaJara, Claudia, Débora Buendía, Alvaro Ardiles, Pablo Muñoz i Cheril Tapia-Rojas. "Transcranial Red LED Therapy: A Promising Non-Invasive Treatment to Prevent Age-Related Hippocampal Memory Impairment". W Hippocampus - Cytoarchitecture and Diseases. IntechOpen, 2022. http://dx.doi.org/10.5772/intechopen.100620.
Pełny tekst źródłaJiang, Haoge, Xudong Jiang, Kong-Wah Wan i Han Wang. "Deep Reinforcement Learning Based Crowd Navigation via Feature Aggregation from Graph Convolutional Networks". W Advances in Transdisciplinary Engineering. IOS Press, 2023. http://dx.doi.org/10.3233/atde230066.
Pełny tekst źródłaMelo-Pfeifer, Sílvia. "Translanguaging in Multilingual Chat Interaction". W Advances in Educational Technologies and Instructional Design, 188–207. IGI Global, 2016. http://dx.doi.org/10.4018/978-1-5225-0177-0.ch009.
Pełny tekst źródłaHai-Jew, Shalin. "Modeling the Relationship between a Human and a Malicious Artificial Intelligence, Natural-Language ’Bot in an Immersive Virtual World". W Digital Democracy and the Impact of Technology on Governance and Politics, 287–306. IGI Global, 2013. http://dx.doi.org/10.4018/978-1-4666-3637-8.ch016.
Pełny tekst źródłaIachello, F., i R. D. Levine. "Four-Body Algebraic Theory". W Algebraic Theory of Molecules. Oxford University Press, 1995. http://dx.doi.org/10.1093/oso/9780195080919.003.0008.
Pełny tekst źródłaStreszczenia konferencji na temat "Safe RL"
Yang, Wen-Chi, Giuseppe Marra, Gavin Rens i Luc De Raedt. "Safe Reinforcement Learning via Probabilistic Logic Shields". W Thirty-Second International Joint Conference on Artificial Intelligence {IJCAI-23}. California: International Joint Conferences on Artificial Intelligence Organization, 2023. http://dx.doi.org/10.24963/ijcai.2023/637.
Pełny tekst źródłaRahman, Md Asifur, Tongtong Liu i Sarra Alqahtani. "Adversarial Behavior Exclusion for Safe Reinforcement Learning". W Thirty-Second International Joint Conference on Artificial Intelligence {IJCAI-23}. California: International Joint Conferences on Artificial Intelligence Organization, 2023. http://dx.doi.org/10.24963/ijcai.2023/54.
Pełny tekst źródłaSimão, Thiago D. "Safe and Sample-Efficient Reinforcement Learning Algorithms for Factored Environments". W Twenty-Eighth International Joint Conference on Artificial Intelligence {IJCAI-19}. California: International Joint Conferences on Artificial Intelligence Organization, 2019. http://dx.doi.org/10.24963/ijcai.2019/919.
Pełny tekst źródłaZhao, Weiye, Tairan He, Rui Chen, Tianhao Wei i Changliu Liu. "State-wise Safe Reinforcement Learning: A Survey". W Thirty-Second International Joint Conference on Artificial Intelligence {IJCAI-23}. California: International Joint Conferences on Artificial Intelligence Organization, 2023. http://dx.doi.org/10.24963/ijcai.2023/763.
Pełny tekst źródłaBektas, Kemal, i H. Isil Bozma. "APF-RL: Safe Mapless Navigation in Unknown Environments". W 2022 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2022. http://dx.doi.org/10.1109/icra46639.2022.9811537.
Pełny tekst źródłaPerepu, Satheesh K., i M. Saravanan. "Optimize Next State Prediction in Safe RL for 5G Ecosystem". W 2022 14th International Conference on COMmunication Systems & NETworkS (COMSNETS). IEEE, 2022. http://dx.doi.org/10.1109/comsnets53615.2022.9668344.
Pełny tekst źródłaHouston, Vern L., Carl P. Mason, Luigi Arena, Gangming Luo, Aaron C. Beattie, MaryAnne Garbarini i Chaiya Thongpop. "Experimental Assessment and FEA Prediction of the Effects of Prosthetic Socket Geometry on Transtibial Amputee Residual Limb Circulation". W ASME 2001 International Mechanical Engineering Congress and Exposition. American Society of Mechanical Engineers, 2001. http://dx.doi.org/10.1115/imece2001/bed-23093.
Pełny tekst źródłaHopkins, Greg. "Design Qualification and Manufacturing of RTP-1 Tanks and Vessels". W ASME 2002 Pressure Vessels and Piping Conference. ASMEDC, 2002. http://dx.doi.org/10.1115/pvp2002-1250.
Pełny tekst źródłaAltmann, Philipp, Fabian Ritz, Leonard Feuchtinger, Jonas Nüßlein, Claudia Linnhoff-Popien i Thomy Phan. "CROP: Towards Distributional-Shift Robust Reinforcement Learning Using Compact Reshaped Observation Processing". W Thirty-Second International Joint Conference on Artificial Intelligence {IJCAI-23}. California: International Joint Conferences on Artificial Intelligence Organization, 2023. http://dx.doi.org/10.24963/ijcai.2023/380.
Pełny tekst źródłaHutter, Marcus, Samuel Yang-Zhao i Sultan Javed Majeed. "Conditions on Features for Temporal Difference-Like Methods to Converge". W Twenty-Eighth International Joint Conference on Artificial Intelligence {IJCAI-19}. California: International Joint Conferences on Artificial Intelligence Organization, 2019. http://dx.doi.org/10.24963/ijcai.2019/357.
Pełny tekst źródłaRaporty organizacyjne na temat "Safe RL"
Kiefner i Vieth. L51688B A Modified Criterion for Evaluating the Remaining Strength of Corroded Pipe. Chantilly, Virginia: Pipeline Research Council International, Inc. (PRCI), grudzień 1989. http://dx.doi.org/10.55274/r0011347.
Pełny tekst źródła