Journal articles on the topic "Policy gradients"
The top 50 journal articles for research on the topic "Policy gradients".
Cai, Qingpeng, Ling Pan, and Pingzhong Tang. "Deterministic Value-Policy Gradients." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 04 (April 3, 2020): 3316–23. http://dx.doi.org/10.1609/aaai.v34i04.5732.
Wierstra, D., A. Forster, J. Peters, and J. Schmidhuber. "Recurrent policy gradients." Logic Journal of IGPL 18, no. 5 (September 9, 2009): 620–34. http://dx.doi.org/10.1093/jigpal/jzp049.
Sehnke, Frank, Christian Osendorfer, Thomas Rückstieß, Alex Graves, Jan Peters, and Jürgen Schmidhuber. "Parameter-exploring policy gradients." Neural Networks 23, no. 4 (May 2010): 551–59. http://dx.doi.org/10.1016/j.neunet.2009.12.004.
Zhao, Tingting, Hirotaka Hachiya, Voot Tangkaratt, Jun Morimoto, and Masashi Sugiyama. "Efficient Sample Reuse in Policy Gradients with Parameter-Based Exploration." Neural Computation 25, no. 6 (June 2013): 1512–47. http://dx.doi.org/10.1162/neco_a_00452.
Seno, Takuma, and Michita Imai. "Policy Gradients with Memory-Augmented Critic." Transactions of the Japanese Society for Artificial Intelligence 36, no. 1 (January 1, 2021): B-K71_1–8. http://dx.doi.org/10.1527/tjsai.36-1_b-k71.
Millidge, Beren. "Deep active inference as variational policy gradients." Journal of Mathematical Psychology 96 (June 2020): 102348. http://dx.doi.org/10.1016/j.jmp.2020.102348.
Catling, PC, and RJ Burt. "Studies of the Ground-Dwelling Mammals of Eucalypt Forests in South-Eastern New South Wales: the Effect of Environmental Variables on Distribution and Abundance." Wildlife Research 22, no. 6 (1995): 669. http://dx.doi.org/10.1071/wr9950669.
Baxter, J., P. L. Bartlett, and L. Weaver. "Experiments with Infinite-Horizon, Policy-Gradient Estimation." Journal of Artificial Intelligence Research 15 (November 1, 2001): 351–81. http://dx.doi.org/10.1613/jair.807.
Chen, Qiulin, Karen Eggleston, Wei Zhang, Jiaying Zhao, and Sen Zhou. "The Educational Gradient in Health in China." China Quarterly 230 (May 15, 2017): 289–322. http://dx.doi.org/10.1017/s0305741017000613.
Peters, Jan, and Stefan Schaal. "Reinforcement learning of motor skills with policy gradients." Neural Networks 21, no. 4 (May 2008): 682–97. http://dx.doi.org/10.1016/j.neunet.2008.02.003.
Zhang, Chuheng, Yuanqi Li, and Jian Li. "Policy Search by Target Distribution Learning for Continuous Control." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 04 (April 3, 2020): 6770–77. http://dx.doi.org/10.1609/aaai.v34i04.6156.
Wang, Lin, Xingang Xu, Xuhui Zhao, Baozhu Li, Ruijuan Zheng, and Qingtao Wu. "A randomized block policy gradient algorithm with differential privacy in Content Centric Networks." International Journal of Distributed Sensor Networks 17, no. 12 (December 2021): 155014772110599. http://dx.doi.org/10.1177/15501477211059934.
Persson, Bertil R. R., and Freddy Ståhlberg. "Safety Aspects of Magnetic Resonance Examinations." International Journal of Technology Assessment in Health Care 1, no. 3 (July 1985): 647–65. http://dx.doi.org/10.1017/s0266462300001549.
Montalvo, Javier, Enrique Ruiz-Labrador, Pablo Montoya-Bernabéu, and Belén Acosta-Gallo. "Rural–Urban Gradients and Human Population Dynamics." Sustainability 11, no. 11 (June 1, 2019): 3107. http://dx.doi.org/10.3390/su11113107.
Finch, Brian Karl. "Socioeconomic Gradients and Low Birth-Weight: Empirical and Policy Considerations." Health Services Research 38, no. 6p2 (December 18, 2003): 1819–42. http://dx.doi.org/10.1111/j.1475-6773.2003.00204.x.
Tirado, Daniel A., Jordi Pons, Elisenda Paluzie, and Julio Martínez-Galarraga. "Trade policy and wage gradients: evidence from a protectionist turn." Cliometrica 7, no. 3 (January 10, 2013): 295–318. http://dx.doi.org/10.1007/s11698-012-0090-y.
Rauber, Paulo, Avinash Ummadisingu, Filipe Mutz, and Jürgen Schmidhuber. "Reinforcement Learning in Sparse-Reward Environments With Hindsight Policy Gradients." Neural Computation 33, no. 6 (May 13, 2021): 1498–553. http://dx.doi.org/10.1162/neco_a_01387.
Crowley, Mark. "Using Equilibrium Policy Gradients for Spatiotemporal Planning in Forest Ecosystem Management." IEEE Transactions on Computers 63, no. 1 (January 2014): 142–54. http://dx.doi.org/10.1109/tc.2013.113.
Liu, Chujun, Andrew Lonsberry, Mark Nandor, Musa Audu, Alexander Lonsberry, and Roger Quinn. "Implementation of Deep Deterministic Policy Gradients for Controlling Dynamic Bipedal Walking." Biomimetics 4, no. 1 (March 22, 2019): 28. http://dx.doi.org/10.3390/biomimetics4010028.
Zhang, Yifan, Qinghe Zhao, Zihao Cao, and Shengyan Ding. "Inhibiting Effects of Vegetation on the Characteristics of Runoff and Sediment Yield on Riparian Slope along the Lower Yellow River." Sustainability 11, no. 13 (July 4, 2019): 3685. http://dx.doi.org/10.3390/su11133685.
Marriott, M. J. "Self-Cleansing Sewer Gradients." Water and Environment Journal 8, no. 4 (August 1994): 360–61. http://dx.doi.org/10.1111/j.1747-6593.1994.tb01118.x.
Li, Kai, Yousef Emami, Wei Ni, Eduardo Tovar, and Zhu Han. "Onboard Deep Deterministic Policy Gradients for Online Flight Resource Allocation of UAVs." IEEE Networking Letters 2, no. 3 (September 2020): 106–10. http://dx.doi.org/10.1109/lnet.2020.3002341.
Grondman, Ivo, Lucian Busoniu, Gabriel A. D. Lopes, and Robert Babuska. "A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients." IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 42, no. 6 (November 2012): 1291–307. http://dx.doi.org/10.1109/tsmcc.2012.2218595.
Chu, Baeksuk, Daehie Hong, and Jooyoung Park. "Tunnel ventilation control via an actor-critic algorithm employing nonparametric policy gradients." Journal of Mechanical Science and Technology 23, no. 2 (February 2009): 311–23. http://dx.doi.org/10.1007/s12206-008-0924-5.
Scott-Marshall, Heather. "Occupational Gradients in Work-Related Insecurity and Health: Interrogating the Links." International Journal of Health Services 49, no. 2 (March 6, 2019): 212–36. http://dx.doi.org/10.1177/0020731419832243.
Seo, Paul Hongsuck, Piyush Sharma, Tomer Levinboim, Bohyung Han, and Radu Soricut. "Reinforcing an Image Caption Generator Using Off-Line Human Feedback." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 03 (April 3, 2020): 2693–700. http://dx.doi.org/10.1609/aaai.v34i03.5655.
Cao, Yongcan, and Huixin Zhan. "Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets." Journal of Artificial Intelligence Research 70 (January 20, 2021): 319–49. http://dx.doi.org/10.1613/jair.1.12270.
Ohmann, Janet L., and Matthew J. Gregory. "Predictive mapping of forest composition and structure with direct gradient analysis and nearest-neighbor imputation in coastal Oregon, U.S.A." Canadian Journal of Forest Research 32, no. 4 (April 1, 2002): 725–41. http://dx.doi.org/10.1139/x02-011.
Chen, Yizhu, Nuanyin Xu, Qianru Yu, and Luo Guo. "Ecosystem Service Response to Human Disturbance in the Yangtze River Economic Belt: A Case of Western Hunan, China." Sustainability 12, no. 2 (January 8, 2020): 465. http://dx.doi.org/10.3390/su12020465.
Jiang, Zhong An, and Lan Jiang. "Gradient Analysis on Occupational Safety in Mine and Economic and Social Development." Advanced Materials Research 524-527 (May 2012): 3107–11. http://dx.doi.org/10.4028/www.scientific.net/amr.524-527.3107.
Graham, Hilary. "Tackling Inequalities in Health in England: Remedying Health Disadvantages, Narrowing Health Gaps or Reducing Health Gradients?" Journal of Social Policy 33, no. 1 (January 2004): 115–31. http://dx.doi.org/10.1017/s0047279403007220.
Tangkaratt, Voot, Syogo Mori, Tingting Zhao, Jun Morimoto, and Masashi Sugiyama. "Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation." Neural Networks 57 (September 2014): 128–40. http://dx.doi.org/10.1016/j.neunet.2014.06.006.
Schellberg, J., and L. da S. Pontes. "Plant functional traits and nutrient gradients on grassland." Grass and Forage Science 67, no. 3 (April 5, 2012): 305–19. http://dx.doi.org/10.1111/j.1365-2494.2012.00867.x.
Yasir, Achmad Ichwan, and Gede Putra Kusuma. "Deep Deterministic Policy Gradients for Optimizing Simulated PoA Blockchain Networks Based on Healthcare Data Characteristics." Advances in Science, Technology and Engineering Systems Journal 6, no. 1 (February 2021): 757–64. http://dx.doi.org/10.25046/aj060183.
Schulze, Sören, Johannes Leuschner, and Emily J. King. "Blind Source Separation in Polyphonic Music Recordings Using Deep Neural Networks Trained via Policy Gradients." Signals 2, no. 4 (October 7, 2021): 637–61. http://dx.doi.org/10.3390/signals2040039.
Koerselman, Willem, Domien Claessens, Paul ten Den, and Erik van Winden. "Dynamic hydrochemical and vegetation gradients in fens." Wetlands Ecology and Management 1, no. 2 (March 1990): 73–84. http://dx.doi.org/10.1007/bf00177282.
Gyuris, E. "Factors that control the emergence of green turtle hatchlings from the nest." Wildlife Research 20, no. 3 (1993): 345. http://dx.doi.org/10.1071/wr9930345.
Cai, Han, Kun Ma, and Yunjian Luo. "Geographical Modeling of Spatial Interaction between Built-Up Land Sprawl and Cultivated Landscape Eco-Security under Urbanization Gradient." Sustainability 11, no. 19 (October 5, 2019): 5513. http://dx.doi.org/10.3390/su11195513.
Cascone, Valeria, Ilaria Barone, and Jacopo Boaga. "Velocity gradients choice affecting seismic site response in deep alluvial basins: Application to the Venetian Plain (Northern Italy)." Journal of Geophysics and Engineering 19, no. 1 (January 25, 2022): 1–13. http://dx.doi.org/10.1093/jge/gxab067.
Bose, Arnab, Aditya Ramji, Jarnail Singh, and Dhairya Dholakia. "A case study for sustainable development action using financial gradients." Energy Policy 47 (June 2012): 79–86. http://dx.doi.org/10.1016/j.enpol.2012.03.038.
Sudhakar, G., B. Jyothi, and V. Venkateswarlu. "Role of diatoms as indicators of pollution gradients." Environmental Monitoring and Assessment 33, no. 2 (November 1994): 85–99. http://dx.doi.org/10.1007/bf00548591.
Park, Jeiyoon, Chanhee Lee, Chanjun Park, Kuekyeng Kim, and Heuiseok Lim. "Variational Reward Estimator Bottleneck: Towards Robust Reward Estimator for Multidomain Task-Oriented Dialogue." Applied Sciences 11, no. 14 (July 19, 2021): 6624. http://dx.doi.org/10.3390/app11146624.
Cantinotti, Massimiliano, Pietro Marchese, Marco Scalese, Eliana Franchi, Nadia Assanta, Martin Koestenberger, Jef Van den Eynde, Shelby Kutty, and Raffaele Giordano. "Normal Values and Patterns of Normality and Physiological Variability of Mitral and Tricuspid Inflow Pulsed Doppler in Healthy Children." Healthcare 10, no. 2 (February 11, 2022): 355. http://dx.doi.org/10.3390/healthcare10020355.
Siddiqi, Arjumand, Ichiro Kawachi, Lisa Berkman, S. V. Subramanian, and Clyde Hertzman. "Variation of Socioeconomic Gradients in Children's Developmental Health across Advanced Capitalist Societies: Analysis of 22 OECD Nations." International Journal of Health Services 37, no. 1 (January 2007): 63–87. http://dx.doi.org/10.2190/ju86-457p-7656-w4w7.
Ji, Chaonan, Uta Heiden, Tobia Lakes, and Hannes Feilhauer. "Are urban material gradients transferable between areas?" International Journal of Applied Earth Observation and Geoinformation 100 (August 2021): 102332. http://dx.doi.org/10.1016/j.jag.2021.102332.
Sorace, Alberto, and Marco Gustin. "Distribution of generalist and specialist predators along urban gradients." Landscape and Urban Planning 90, no. 3-4 (April 2009): 111–18. http://dx.doi.org/10.1016/j.landurbplan.2008.10.019.
Ma, Lin, Yueyao Wang, Ze Liang, Jiaqi Ding, Jiashu Shen, Feili Wei, and Shuangcheng Li. "Changing Effect of Urban Form on the Seasonal and Diurnal Variations of Surface Urban Heat Island Intensities (SUHIIs) in More Than 3000 Cities in China." Sustainability 13, no. 5 (March 7, 2021): 2877. http://dx.doi.org/10.3390/su13052877.
Nahry and Noor Syiffa Fadillah. "The Empirical Study on the Impact of Road Gradient and Truck Composition on the Toll Road Traffic Performance." E3S Web of Conferences 65 (2018): 09003. http://dx.doi.org/10.1051/e3sconf/20186509003.
Liu, Guoqing, Li Zhao, Feidiao Yang, Jiang Bian, Tao Qin, Nenghai Yu, and Tie-Yan Liu. "Trust Region Evolution Strategies." Proceedings of the AAAI Conference on Artificial Intelligence 33 (July 17, 2019): 4352–59. http://dx.doi.org/10.1609/aaai.v33i01.33014352.
Hamin, Elisabeth, Yaser Abunnasr, Max Roman Dilthey, Pamela Judge, Melissa Kenney, Paul Kirshen, Thomas Sheahan, et al. "Pathways to Coastal Resiliency: The Adaptive Gradients Framework." Sustainability 10, no. 8 (July 26, 2018): 2629. http://dx.doi.org/10.3390/su10082629.