Journal articles on the topic 'Neural audio synthesis'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 journal articles for your research on the topic 'Neural audio synthesis.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse journal articles on a wide variety of disciplines and organise your bibliography correctly.
Li, Dongze, Kang Zhao, Wei Wang, Bo Peng, Yingya Zhang, Jing Dong, and Tieniu Tan. "AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis." Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 4 (March 24, 2024): 3037–45. http://dx.doi.org/10.1609/aaai.v38i4.28086.
Vyawahare, Prof D. G. "Image to Audio Conversion for Blind People Using Neural Network." International Journal for Research in Applied Science and Engineering Technology 11, no. 12 (December 31, 2023): 1949–57. http://dx.doi.org/10.22214/ijraset.2023.57712.
Kiefer, Chris. "Sample-level sound synthesis with recurrent neural networks and conceptors." PeerJ Computer Science 5 (July 8, 2019): e205. http://dx.doi.org/10.7717/peerj-cs.205.
Liu, Yunyi, and Craig Jin. "Impact on quality and diversity from integrating a reconstruction loss into neural audio synthesis." Journal of the Acoustical Society of America 154, no. 4_supplement (October 1, 2023): A99. http://dx.doi.org/10.1121/10.0022922.
Khandelwal, Karan, Krishiv Pandita, Kshitij Priyankar, Kumar Parakram, and Tejaswini K. "Svara Rachana - Audio Driven Facial Expression Synthesis." International Journal for Research in Applied Science and Engineering Technology 12, no. 5 (May 31, 2024): 2024–29. http://dx.doi.org/10.22214/ijraset.2024.62019.
VOITKO, Viktoriia, Svitlana BEVZ, Sergii BURBELO, and Pavlo STAVYTSKYI. "AUDIO GENERATION TECHNOLOGY OF A SYSTEM OF SYNTHESIS AND ANALYSIS OF MUSIC COMPOSITIONS." Herald of Khmelnytskyi National University 305, no. 1 (February 23, 2022): 64–67. http://dx.doi.org/10.31891/2307-5732-2022-305-1-64-67.
Li, Naihan, Yanqing Liu, Yu Wu, Shujie Liu, Sheng Zhao, and Ming Liu. "RobuTrans: A Robust Transformer-Based Text-to-Speech Model." Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 05 (April 3, 2020): 8228–35. http://dx.doi.org/10.1609/aaai.v34i05.6337.
Hryhorenko, N., N. Larionov, and V. Bredikhin. "RESEARCH OF THE PROCESS OF VISUAL ART TRANSMISSION IN MUSIC AND THE CREATION OF COLLECTIONS FOR PEOPLE WITH VISUAL IMPAIRMENTS." Municipal economy of cities 6, no. 180 (December 4, 2023): 2–6. http://dx.doi.org/10.33042/2522-1809-2023-6-180-2-6.
Andreu, Sergi, and Monica Villanueva Aylagas. "Neural Synthesis of Sound Effects Using Flow-Based Deep Generative Models." Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment 18, no. 1 (October 11, 2022): 2–9. http://dx.doi.org/10.1609/aiide.v18i1.21941.
Li, Naihan, Shujie Liu, Yanqing Liu, Sheng Zhao, and Ming Liu. "Neural Speech Synthesis with Transformer Network." Proceedings of the AAAI Conference on Artificial Intelligence 33 (July 17, 2019): 6706–13. http://dx.doi.org/10.1609/aaai.v33i01.33016706.
Li, Yusen, Ying Shen, and Dongqing Wang. "DIFFBAS: An Advanced Binaural Audio Synthesis Model Focusing on Binaural Differences Recovery." Applied Sciences 14, no. 8 (April 17, 2024): 3385. http://dx.doi.org/10.3390/app14083385.
Roebel, Axel, and Frederik Bous. "Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet." Information 13, no. 3 (February 23, 2022): 103. http://dx.doi.org/10.3390/info13030103.
García, Víctor, Inma Hernáez, and Eva Navas. "Evaluation of Tacotron Based Synthesizers for Spanish and Basque." Applied Sciences 12, no. 3 (February 7, 2022): 1686. http://dx.doi.org/10.3390/app12031686.
Prihasto, Bima, and Nur Fajri Azhar. "Evaluation of Recurrent Neural Network Based on Indonesian Speech Synthesis for Small Datasets." Advances in Science and Technology 104 (February 2021): 17–25. http://dx.doi.org/10.4028/www.scientific.net/ast.104.17.
Venkatesh, Satvik, David Moffat, and Eduardo Reck Miranda. "Investigating the Effects of Training Set Synthesis for Audio Segmentation of Radio Broadcast." Electronics 10, no. 7 (March 31, 2021): 827. http://dx.doi.org/10.3390/electronics10070827.
Tao Chen. "Music Tone Synthesis based Anti-Interference Dynamic Integral Neural Network optimized with Artificial Hummingbird Optimization Algorithm." Journal of Electrical Systems 20, no. 3s (April 4, 2024): 2665–76. http://dx.doi.org/10.52783/jes.3162.
Serebryanaya, L. V., and I. E. Lasy. "Automatic recognition and representation of text in the form of audio stream." Doklady BGUIR 19, no. 6 (October 1, 2021): 51–58. http://dx.doi.org/10.35596/1729-7648-2021-19-6-51-58.
Patnaik, W. Shivani. "Background Noise Suppression in Audio File using LSTM Network." International Journal for Research in Applied Science and Engineering Technology 10, no. 6 (June 30, 2022): 1310–16. http://dx.doi.org/10.22214/ijraset.2022.44109.
Mu, Jin. "Pose Estimation-Assisted Dance Tracking System Based on Convolutional Neural Network." Computational Intelligence and Neuroscience 2022 (June 3, 2022): 1–10. http://dx.doi.org/10.1155/2022/2301395.
Shejole, Prof Sakshi, Piyush Jaiswal, Neha Karmal, Vivek Patil, and Samnan Shaikh. "Autotuned Voice Cloning Enabling Multilingualism." International Journal for Research in Applied Science and Engineering Technology 11, no. 5 (May 31, 2023): 5945–49. http://dx.doi.org/10.22214/ijraset.2023.52906.
Rodríguez Fernández-Peña, Alfonso Carlos. "AI is great, isn’t it? Tone direction and illocutionary force delivery of tag ques-tions in Amazon’s AI NTTS Polly." Journal of Experimental Phonetics 32 (November 28, 2023): 227–42. http://dx.doi.org/10.1344/efe-2023-32-227-242.
Modi, Rohan. "Transcript Anatomization with Multi-Linguistic and Speech Synthesis Features." International Journal for Research in Applied Science and Engineering Technology 9, no. VI (June 20, 2021): 1755–58. http://dx.doi.org/10.22214/ijraset.2021.35371.
Kazakova, M. A., and A. P. Sultanova. "Analysis of natural language processing technology: modern problems and approaches." Advanced Engineering Research 22, no. 2 (July 11, 2022): 169–76. http://dx.doi.org/10.23947/2687-1653-2022-22-2-169-176.
Mandeel, Ali Raheem, Mohammed Salah Al-Radhi, and Tamás Gábor Csapó. "Speaker Adaptation Experiments with Limited Data for End-to-End Text-To-Speech Synthesis using Tacotron2." Infocommunications journal 14, no. 3 (2022): 55–62. http://dx.doi.org/10.36244/icj.2022.3.7.
Thoidis, Iordanis, Lazaros Vrysis, Dimitrios Markou, and George Papanikolaou. "Temporal Auditory Coding Features for Causal Speech Enhancement." Electronics 9, no. 10 (October 16, 2020): 1698. http://dx.doi.org/10.3390/electronics9101698.
Vishwakama, Ramesh, Ramashish Yadav, Harsheet Sharma, and Dr Saurabh Suman. "Automated Leaf Disease Detection System with Machine Learning." International Journal for Research in Applied Science and Engineering Technology 12, no. 2 (February 29, 2024): 814–19. http://dx.doi.org/10.22214/ijraset.2024.58449.
Kane, Joseph, Michael N. Johnstone, and Patryk Szewczyk. "Voice Synthesis Improvement by Machine Learning of Natural Prosody." Sensors 24, no. 5 (March 1, 2024): 1624. http://dx.doi.org/10.3390/s24051624.
Ravikiran K, Neerav Nishant, M Sreedhar, N.Kavitha, Mathur N. Kathiravan, and Geetha A. "Deep learning methods and integrated digital image processing techniques for detecting and evaluating wheat stripe rust disease." Scientific Temper 14, no. 03 (September 30, 2023): 864–69. http://dx.doi.org/10.58414/scientifictemper.2023.14.3.47.
Gromov, N. V., and T. A. Levanova. "WaveNet vocoder for prediction of time series with extreme events." Genes & Cells 18, no. 4 (December 15, 2023): 847–49. http://dx.doi.org/10.17816/gc623433.
Hakim, Heba, and Ali Marhoon. "Indoor Low Cost Assistive Device using 2D SLAM Based on LiDAR for Visually Impaired People." Iraqi Journal for Electrical and Electronic Engineering 15, no. 2 (December 1, 2019): 115–21. http://dx.doi.org/10.37917/ijeee.15.2.12.
Bai, Jinqiang, Zhaoxiang Liu, Yimin Lin, Ye Li, Shiguo Lian, and Dijun Liu. "Wearable Travel Aid for Environment Perception and Navigation of Visually Impaired People." Electronics 8, no. 6 (June 20, 2019): 697. http://dx.doi.org/10.3390/electronics8060697.
Nicol, Rozenn, and Jean-Yves Monfort. "Acoustic research for telecoms: bridging the heritage to the future." Acta Acustica 7 (2023): 64. http://dx.doi.org/10.1051/aacus/2023056.
Yu, Junxiao, Zhengyuan Xu, Xu He, Jian Wang, Bin Liu, Rui Feng, Songsheng Zhu, Wei Wang, and Jianqing Li. "DIA-TTS: Deep-Inherited Attention-Based Text-to-Speech Synthesizer." Entropy 25, no. 1 (December 26, 2022): 41. http://dx.doi.org/10.3390/e25010041.
Wang, Tianmeng. "Research and Application Analysis of Correlative Optimization Algorithms for GAN." Highlights in Science, Engineering and Technology 57 (July 11, 2023): 141–47. http://dx.doi.org/10.54097/hset.v57i.9992.
Dorofeeva, S. V. "Neuroplasicity and the developmental dyslexia intervention." Genes & Cells 18, no. 4 (December 15, 2023): 706–9. http://dx.doi.org/10.17816/gc623418.
Hood, Graeme, Kieran Hand, Emma Cramp, Philip Howard, Susan Hopkins, and Diane Ashiru-Oredope. "Measuring Appropriate Antibiotic Prescribing in Acute Hospitals: Development of a National Audit Tool Through a Delphi Consensus." Antibiotics 8, no. 2 (April 29, 2019): 49. http://dx.doi.org/10.3390/antibiotics8020049.
Li, Wanting, Yiting Chen, and Buzhou Tang. "Improving Generative Adversarial Network based Vocoding Through Multi-Scale Convolution." ACM Transactions on Asian and Low-Resource Language Information Processing, August 16, 2023. http://dx.doi.org/10.1145/3610532.
Lluís, Francesc, Vasileios Chatziioannou, and Alex Hofmann. "Points2Sound: from mono to binaural audio using 3D point cloud scenes." EURASIP Journal on Audio, Speech, and Music Processing 2022, no. 1 (December 29, 2022). http://dx.doi.org/10.1186/s13636-022-00265-4.
Khanjani, Zahra, Gabrielle Watson, and Vandana P. Janeja. "Audio deepfakes: A survey." Frontiers in Big Data 5 (January 9, 2023). http://dx.doi.org/10.3389/fdata.2022.1001063.
Dyer, Mark. "Neural Synthesis as a Methodology for Art-Anthropology in Contemporary Music." Organised Sound, September 16, 2022, 1–8. http://dx.doi.org/10.1017/s1355771822000371.
Comanducci, Luca, Fabio Antonacci, and Augusto Sarti. "Synthesis of soundfields through irregular loudspeaker arrays based on convolutional neural networks." EURASIP Journal on Audio, Speech, and Music Processing 2024, no. 1 (March 28, 2024). http://dx.doi.org/10.1186/s13636-024-00337-7.
Patole, Prof Mrunalinee, Akhilesh Pandey, Kaustubh Bhagwat, Mukesh Vaishnav, and Salikram Chadar. "A Survey on “Text-to-Speech Systems for Real-Time Audio Synthesis”." International Journal of Advanced Research in Science, Communication and Technology, June 10, 2021, 375–79. http://dx.doi.org/10.48175/ijarsct-1400.
Angrick, Miguel, Maarten C. Ottenhoff, Lorenz Diener, Darius Ivucic, Gabriel Ivucic, Sophocles Goulis, Jeremy Saal, et al. "Real-time synthesis of imagined speech processes from minimally invasive recordings of neural activity." Communications Biology 4, no. 1 (September 23, 2021). http://dx.doi.org/10.1038/s42003-021-02578-0.
Zhang, Ni. "Informatization Integration Strategy of Modern Vocal Music Teaching and Traditional Music Culture in Colleges and Universities in the Era of Artificial Intelligence." Applied Mathematics and Nonlinear Sciences, December 2, 2023. http://dx.doi.org/10.2478/amns.2023.2.01333.
Hayes, Ben, Jordie Shier, György Fazekas, Andrew McPherson, and Charalampos Saitis. "A review of differentiable digital signal processing for music and speech synthesis." Frontiers in Signal Processing 3 (January 11, 2024). http://dx.doi.org/10.3389/frsip.2023.1284100.
Kohler, Jonas, Maarten C. Ottenhoff, Sophocles Goulis, Miguel Angrick, Albert J. Colon, Louis Wagner, Simon Tousseyn, Pieter L. Kubben, and Christian Herff. "Synthesizing Speech from Intracranial Depth Electrodes using an Encoder-Decoder Framework." Neurons, Behavior, Data analysis, and Theory, December 9, 2022. http://dx.doi.org/10.51628/001c.57524.
Simionato, Riccardo, Stefano Fasciani, and Sverre Holm. "Physics-informed differentiable method for piano modeling." Frontiers in Signal Processing 3 (February 13, 2024). http://dx.doi.org/10.3389/frsip.2023.1276748.
Кожирбаев, Ж. М. "ҚАЗАҚ ТІЛІ ҮШІН ИНТЕГРАЛДЫҚ (END-TO-END) СӨЙЛЕУ СИНТЕЗІ." BULLETIN Series Physical and Mathematical Sciences 79, no. 3(2022) (September 25, 2023). http://dx.doi.org/10.51889/9340.2022.21.68.023.
Alsaadawı, Hussein Farooq Tayeb, and Resul Daş. "Multimodal Emotion Recognition Using Bi-LG-GCN for MELD Dataset." Balkan Journal of Electrical and Computer Engineering, October 16, 2023. http://dx.doi.org/10.17694/bajece.1372107.
Mithoowani, Siraj, Andrew Mulloy, Augustin Toma, and Ameen Patel. "To err is human: A case-based review of cognitive bias and its role in clinical decision making." Canadian Journal of General Internal Medicine 12, no. 2 (August 30, 2017). http://dx.doi.org/10.22374/cjgim.v12i2.166.