Journal articles on the topic 'Policy gradients'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 journal articles for your research on the topic 'Policy gradients.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.
Cai, Qingpeng, Ling Pan, and Pingzhong Tang. "Deterministic Value-Policy Gradients." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 04 (April 3, 2020): 3316–23. http://dx.doi.org/10.1609/aaai.v34i04.5732.
Full textWierstra, D., A. Forster, J. Peters, and J. Schmidhuber. "Recurrent policy gradients." Logic Journal of IGPL 18, no. 5 (September 9, 2009): 620–34. http://dx.doi.org/10.1093/jigpal/jzp049.
Full textSehnke, Frank, Christian Osendorfer, Thomas Rückstieß, Alex Graves, Jan Peters, and Jürgen Schmidhuber. "Parameter-exploring policy gradients." Neural Networks 23, no. 4 (May 2010): 551–59. http://dx.doi.org/10.1016/j.neunet.2009.12.004.
Full textZhao, Tingting, Hirotaka Hachiya, Voot Tangkaratt, Jun Morimoto, and Masashi Sugiyama. "Efficient Sample Reuse in Policy Gradients with Parameter-Based Exploration." Neural Computation 25, no. 6 (June 2013): 1512–47. http://dx.doi.org/10.1162/neco_a_00452.
Full textSeno, Takuma, and Michita Imai. "Policy Gradients with Memory-Augmented Critic." Transactions of the Japanese Society for Artificial Intelligence 36, no. 1 (January 1, 2021): B—K71_1–8. http://dx.doi.org/10.1527/tjsai.36-1_b-k71.
Full textMillidge, Beren. "Deep active inference as variational policy gradients." Journal of Mathematical Psychology 96 (June 2020): 102348. http://dx.doi.org/10.1016/j.jmp.2020.102348.
Full textCatling, PC, and RJ Burt. "Studies of the Ground-Dwelling Mammals of Eucalypt Forests in South-Eastern New South Wales: the Effect of Environmental Variables on Distribution and Abundance." Wildlife Research 22, no. 6 (1995): 669. http://dx.doi.org/10.1071/wr9950669.
Full textBaxter, J., P. L. Bartlett, and L. Weaver. "Experiments with Infinite-Horizon, Policy-Gradient Estimation." Journal of Artificial Intelligence Research 15 (November 1, 2001): 351–81. http://dx.doi.org/10.1613/jair.807.
Full textChen, Qiulin, Karen Eggleston, Wei Zhang, Jiaying Zhao, and Sen Zhou. "The Educational Gradient in Health in China." China Quarterly 230 (May 15, 2017): 289–322. http://dx.doi.org/10.1017/s0305741017000613.
Full textPeters, Jan, and Stefan Schaal. "Reinforcement learning of motor skills with policy gradients." Neural Networks 21, no. 4 (May 2008): 682–97. http://dx.doi.org/10.1016/j.neunet.2008.02.003.
Full textZhang, Chuheng, Yuanqi Li, and Jian Li. "Policy Search by Target Distribution Learning for Continuous Control." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 04 (April 3, 2020): 6770–77. http://dx.doi.org/10.1609/aaai.v34i04.6156.
Full textWang, Lin, Xingang Xu, Xuhui Zhao, Baozhu Li, Ruijuan Zheng, and Qingtao Wu. "A randomized block policy gradient algorithm with differential privacy in Content Centric Networks." International Journal of Distributed Sensor Networks 17, no. 12 (December 2021): 155014772110599. http://dx.doi.org/10.1177/15501477211059934.
Full textPersson, Bertil R. R., and Freddy Ståhlberg. "Safety Aspects of Magnetic Resonance Examinations." International Journal of Technology Assessment in Health Care 1, no. 3 (July 1985): 647–65. http://dx.doi.org/10.1017/s0266462300001549.
Full textMontalvo, Javier, Enrique Ruiz-Labrador, Pablo Montoya-Bernabéu, and Belén Acosta-Gallo. "Rural–Urban Gradients and Human Population Dynamics." Sustainability 11, no. 11 (June 1, 2019): 3107. http://dx.doi.org/10.3390/su11113107.
Full textFinch, Brian Karl. "Socioeconomic Gradients and Low Birth-Weight: Empirical and Policy Considerations." Health Services Research 38, no. 6p2 (December 18, 2003): 1819–42. http://dx.doi.org/10.1111/j.1475-6773.2003.00204.x.
Full textTirado, Daniel A., Jordi Pons, Elisenda Paluzie, and Julio Martínez-Galarraga. "Trade policy and wage gradients: evidence from a protectionist turn." Cliometrica 7, no. 3 (January 10, 2013): 295–318. http://dx.doi.org/10.1007/s11698-012-0090-y.
Full textRauber, Paulo, Avinash Ummadisingu, Filipe Mutz, and Jürgen Schmidhuber. "Reinforcement Learning in Sparse-Reward Environments With Hindsight Policy Gradients." Neural Computation 33, no. 6 (May 13, 2021): 1498–553. http://dx.doi.org/10.1162/neco_a_01387.
Full textCrowley, Mark. "Using Equilibrium Policy Gradients for Spatiotemporal Planning in Forest Ecosystem Management." IEEE Transactions on Computers 63, no. 1 (January 2014): 142–54. http://dx.doi.org/10.1109/tc.2013.113.
Full textLiu, Chujun, Andrew Lonsberry, Mark Nandor, Musa Audu, Alexander Lonsberry, and Roger Quinn. "Implementation of Deep Deterministic Policy Gradients for Controlling Dynamic Bipedal Walking." Biomimetics 4, no. 1 (March 22, 2019): 28. http://dx.doi.org/10.3390/biomimetics4010028.
Full textZhang, Yifan, Qinghe Zhao, Zihao Cao, and Shengyan Ding. "Inhibiting Effects of Vegetation on the Characteristics of Runoff and Sediment Yield on Riparian Slope along the Lower Yellow River." Sustainability 11, no. 13 (July 4, 2019): 3685. http://dx.doi.org/10.3390/su11133685.
Full textMARRIOTT, M. J. "Self-Cleansing Sewer Gradients." Water and Environment Journal 8, no. 4 (August 1994): 360–61. http://dx.doi.org/10.1111/j.1747-6593.1994.tb01118.x.
Full textLi, Kai, Yousef Emami, Wei Ni, Eduardo Tovar, and Zhu Han. "Onboard Deep Deterministic Policy Gradients for Online Flight Resource Allocation of UAVs." IEEE Networking Letters 2, no. 3 (September 2020): 106–10. http://dx.doi.org/10.1109/lnet.2020.3002341.
Full textGrondman, Ivo, Lucian Busoniu, Gabriel A. D. Lopes, and Robert Babuska. "A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients." IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 42, no. 6 (November 2012): 1291–307. http://dx.doi.org/10.1109/tsmcc.2012.2218595.
Full textChu, Baeksuk, Daehie Hong, and Jooyoung Park. "Tunnel ventilation control via an actor-critic algorithm employing nonparametric policy gradients." Journal of Mechanical Science and Technology 23, no. 2 (February 2009): 311–23. http://dx.doi.org/10.1007/s12206-008-0924-5.
Full textScott-Marshall, Heather. "Occupational Gradients in Work-Related Insecurity and Health: Interrogating the Links." International Journal of Health Services 49, no. 2 (March 6, 2019): 212–36. http://dx.doi.org/10.1177/0020731419832243.
Full textSeo, Paul Hongsuck, Piyush Sharma, Tomer Levinboim, Bohyung Han, and Radu Soricut. "Reinforcing an Image Caption Generator Using Off-Line Human Feedback." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 03 (April 3, 2020): 2693–700. http://dx.doi.org/10.1609/aaai.v34i03.5655.
Full textCao, Yongcan, and Huixin Zhan. "Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets." Journal of Artificial Intelligence Research 70 (January 20, 2021): 319–49. http://dx.doi.org/10.1613/jair.1.12270.
Full textOhmann, Janet L., and Matthew J. Gregory. "Predictive mapping of forest composition and structure with direct gradient analysis and nearest- neighbor imputation in coastal Oregon, U.S.A." Canadian Journal of Forest Research 32, no. 4 (April 1, 2002): 725–41. http://dx.doi.org/10.1139/x02-011.
Full textChen, Yizhu, Nuanyin Xu, Qianru Yu, and Luo Guo. "Ecosystem Service Response to Human Disturbance in the Yangtze River Economic Belt: A Case of Western Hunan, China." Sustainability 12, no. 2 (January 8, 2020): 465. http://dx.doi.org/10.3390/su12020465.
Full textJiang, Zhong An, and Lan Jiang. "Gradient Analysis on Occupational Safety in Mine and Economic and Social Development." Advanced Materials Research 524-527 (May 2012): 3107–11. http://dx.doi.org/10.4028/www.scientific.net/amr.524-527.3107.
Full textGRAHAM, HILARY. "Tackling Inequalities in Health in England: Remedying Health Disadvantages, Narrowing Health Gaps or Reducing Health Gradients?" Journal of Social Policy 33, no. 1 (January 2004): 115–31. http://dx.doi.org/10.1017/s0047279403007220.
Full textTangkaratt, Voot, Syogo Mori, Tingting Zhao, Jun Morimoto, and Masashi Sugiyama. "Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation." Neural Networks 57 (September 2014): 128–40. http://dx.doi.org/10.1016/j.neunet.2014.06.006.
Full textSchellberg, J., and L. da S. Pontes. "Plant functional traits and nutrient gradients on grassland." Grass and Forage Science 67, no. 3 (April 5, 2012): 305–19. http://dx.doi.org/10.1111/j.1365-2494.2012.00867.x.
Full textYasir, Achmad Ichwan, and Gede Putra Kusuma. "Deep Deterministic Policy Gradients for Optimizing Simulated PoA Blockchain Networks Based on Healthcare Data Characteristics." Advances in Science, Technology and Engineering Systems Journal 6, no. 1 (February 2021): 757–64. http://dx.doi.org/10.25046/aj060183.
Full textSchulze, Sören, Johannes Leuschner, and Emily J. King. "Blind Source Separation in Polyphonic Music Recordings Using Deep Neural Networks Trained via Policy Gradients." Signals 2, no. 4 (October 7, 2021): 637–61. http://dx.doi.org/10.3390/signals2040039.
Full textKoerselman, Willem, Domien Claessens, Paul ten Den, and Erik van Winden. "Dynamic hydrochemical and vegetation gradients in fens." Wetlands Ecology and Management 1, no. 2 (March 1990): 73–84. http://dx.doi.org/10.1007/bf00177282.
Full textGyuris, E. "Factors that control the emergence of green turtle hatchlings from the nest." Wildlife Research 20, no. 3 (1993): 345. http://dx.doi.org/10.1071/wr9930345.
Full textCai, Han, Kun Ma, and Yunjian Luo. "Geographical Modeling of Spatial Interaction between Built-Up Land Sprawl and Cultivated Landscape Eco-Security under Urbanization Gradient." Sustainability 11, no. 19 (October 5, 2019): 5513. http://dx.doi.org/10.3390/su11195513.
Full textCascone, Valeria, Ilaria Barone, and Jacopo Boaga. "Velocity gradients choice affecting seismic site response in deep alluvial basins: Application to the Venetian Plain (Northern Italy)." Journal of Geophysics and Engineering 19, no. 1 (January 25, 2022): 1–13. http://dx.doi.org/10.1093/jge/gxab067.
Full textBose, Arnab, Aditya Ramji, Jarnail Singh, and Dhairya Dholakia. "A case study for sustainable development action using financial gradients." Energy Policy 47 (June 2012): 79–86. http://dx.doi.org/10.1016/j.enpol.2012.03.038.
Full textSudhakar, G., B. Jyothi, and V. Venkateswarlu. "Role of diatoms as indicators of pollution gradients." Environmental Monitoring and Assessment 33, no. 2 (November 1994): 85–99. http://dx.doi.org/10.1007/bf00548591.
Full textPark, Jeiyoon, Chanhee Lee, Chanjun Park, Kuekyeng Kim, and Heuiseok Lim. "Variational Reward Estimator Bottleneck: Towards Robust Reward Estimator for Multidomain Task-Oriented Dialogue." Applied Sciences 11, no. 14 (July 19, 2021): 6624. http://dx.doi.org/10.3390/app11146624.
Full textCantinotti, Massimiliano, Pietro Marchese, Marco Scalese, Eliana Franchi, Nadia Assanta, Martin Koestenberger, Jef Van den Eynde, Shelby Kutty, and Raffaele Giordano. "Normal Values and Patterns of Normality and Physiological Variability of Mitral and Tricuspid Inflow Pulsed Doppler in Healthy Children." Healthcare 10, no. 2 (February 11, 2022): 355. http://dx.doi.org/10.3390/healthcare10020355.
Full textSiddiqi, Arjumand, Ichiro Kawachi, Lisa Berkman, S. V. Subramanian, and Clyde Hertzman. "Variation of Socioeconomic Gradients in Children's Developmental Health across Advanced Capitalist Societies: Analysis of 22 Oecd Nations." International Journal of Health Services 37, no. 1 (January 2007): 63–87. http://dx.doi.org/10.2190/ju86-457p-7656-w4w7.
Full textJi, Chaonan, Uta Heiden, Tobia Lakes, and Hannes Feilhauer. "Are urban material gradients transferable between areas?" International Journal of Applied Earth Observation and Geoinformation 100 (August 2021): 102332. http://dx.doi.org/10.1016/j.jag.2021.102332.
Full textSorace, Alberto, and Marco Gustin. "Distribution of generalist and specialist predators along urban gradients." Landscape and Urban Planning 90, no. 3-4 (April 2009): 111–18. http://dx.doi.org/10.1016/j.landurbplan.2008.10.019.
Full textMa, Lin, Yueyao Wang, Ze Liang, Jiaqi Ding, Jiashu Shen, Feili Wei, and Shuangcheng Li. "Changing Effect of Urban Form on the Seasonal and Diurnal Variations of Surface Urban Heat Island Intensities (SUHIIs) in More Than 3000 Cities in China." Sustainability 13, no. 5 (March 7, 2021): 2877. http://dx.doi.org/10.3390/su13052877.
Full textNahry and Noor Syiffa Fadillah. "The Empirical Study on the Impact of Road Gradient and Truck Composition on the Toll Road Traffic Performance." E3S Web of Conferences 65 (2018): 09003. http://dx.doi.org/10.1051/e3sconf/20186509003.
Full textLiu, Guoqing, Li Zhao, Feidiao Yang, Jiang Bian, Tao Qin, Nenghai Yu, and Tie-Yan Liu. "Trust Region Evolution Strategies." Proceedings of the AAAI Conference on Artificial Intelligence 33 (July 17, 2019): 4352–59. http://dx.doi.org/10.1609/aaai.v33i01.33014352.
Full textHamin, Elisabeth, Yaser Abunnasr, Max Roman Dilthey, Pamela Judge, Melissa Kenney, Paul Kirshen, Thomas Sheahan, et al. "Pathways to Coastal Resiliency: The Adaptive Gradients Framework." Sustainability 10, no. 8 (July 26, 2018): 2629. http://dx.doi.org/10.3390/su10082629.
Full text