Academic literature on the topic 'NVLink'

Consult the lists of relevant articles, books, theses, conference reports, and other scholarly sources on the topic 'NVLink.'


Journal articles on the topic "NVLink"

1

Foley, Denis, and John Danskin. "Ultra-Performance Pascal GPU and NVLink Interconnect." IEEE Micro 37, no. 2 (March 2017): 7–17. http://dx.doi.org/10.1109/mm.2017.37.

2

IBM POWER9 NPU team. "Functionality and performance of NVLink with IBM POWER9 processors." IBM Journal of Research and Development 62, no. 4/5 (July 1, 2018): 9:1–9:10. http://dx.doi.org/10.1147/jrd.2018.2846978.

3

Rosenberg, Duane, Pablo D. Mininni, Raghu Reddy, and Annick Pouquet. "GPU Parallelization of a Hybrid Pseudospectral Geophysical Turbulence Framework Using CUDA." Atmosphere 11, no. 2 (February 8, 2020): 178. http://dx.doi.org/10.3390/atmos11020178.

Abstract:
An existing hybrid MPI-OpenMP scheme is augmented with a CUDA-based fine grain parallelization approach for multidimensional distributed Fourier transforms, in a well-characterized pseudospectral fluid turbulence code. Basics of the hybrid scheme are reviewed, and heuristics provided to show a potential benefit of the CUDA implementation. The method draws heavily on the CUDA runtime library to handle memory management and on the cuFFT library for computing local FFTs. The manner in which the interfaces to these libraries are constructed, and ISO bindings utilized to facilitate platform portability, are discussed. CUDA streams are implemented to overlap data transfer with cuFFT computation. Testing with a baseline solver demonstrated significant aggregate speed-up over the hybrid MPI-OpenMP solver by offloading to GPUs on an NVLink-based test system. While the batch streamed approach provided little benefit with NVLink, we saw a performance gain of 30% when tuned for the optimal number of streams on a PCIe-based system. It was found that strong GPU scaling is nearly ideal, in all cases. Profiling of the CUDA kernels shows that the transform computation achieves 15% of the attainable peak FlOp-rate based on a roofline model for the system. In addition to speed-up measurements for the fiducial solver, we also considered several other solvers with different numbers of transform operations and found that aggregate speed-ups are nearly constant for all solvers.
4

Li, Ang, Shuaiwen Leon Song, Jieyang Chen, Jiajia Li, Xu Liu, Nathan R. Tallent, and Kevin J. Barker. "Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect." IEEE Transactions on Parallel and Distributed Systems 31, no. 1 (January 1, 2020): 94–110. http://dx.doi.org/10.1109/tpds.2019.2928289.

5

Szénási, Sándor. "Solving the inverse heat conduction problem using NVLink capable Power architecture." PeerJ Computer Science 3 (November 20, 2017): e138. http://dx.doi.org/10.7717/peerj-cs.138.

Abstract:
The accurate knowledge of Heat Transfer Coefficients is essential for the design of precise heat transfer operations. The determination of these values requires Inverse Heat Transfer Calculations, which are usually based on heuristic optimisation techniques, like Genetic Algorithms or Particle Swarm Optimisation. The main bottleneck of these heuristics is the high computational demand of the cost function calculation, which is usually based on heat transfer simulations producing the thermal history of the workpiece at given locations. This Direct Heat Transfer Calculation is a well parallelisable process, making it feasible to implement an efficient GPU kernel for this purpose. This paper presents a novel step forward: based on the special requirements of the heuristics solving the inverse problem (executing hundreds of simulations in a parallel fashion at the end of each iteration), it is possible to gain a higher level of parallelism using multiple graphics accelerators. The results show that this implementation (running on 4 GPUs) is about 120 times faster than a traditional CPU implementation using 20 cores. The latest developments of the GPU-based High Power Computations area were also analysed, like the new NVLink connection between the host and the devices, which tries to solve the long time existing data transfer handicap of GPU programming.
6

Zhang, Ge, and Larissa Hjorth. "Live-streaming, games and politics of gender performance: The case of Nüzhubo in China." Convergence: The International Journal of Research into New Media Technologies 25, no. 5-6 (November 8, 2017): 807–25. http://dx.doi.org/10.1177/1354856517738160.

Abstract:
In the emerging scholarship on live-streaming sites, the role of gender has been relatively overlooked. This article aims to address this oversight by capturing the controversial rise of nüzhubo (Chinese for ‘female casters’) in the Chinese live-streaming platform, Douyu. Through ethnographic research on Douyu over 2 years, we have witnessed female performers who – motivated by both entrepreneurial spirit and creative agency – have embraced new forms of performative practices in, and around, video game commentary cultures. We begin with a brief contextualizing the gendered nature of media in the history of Chinese video sites and how theories around gender – especially gender performativity – might be adapted. While acknowledging the homogenizing effect of the term nüzhubo, we focus on two performers on Douyu – Hani9 and Nvliu – that are challenging conventional nüzhubo tropes. We argue for a situated notion of gender performativity that also engages with the platform-specific social, cultural and technical infrastructures – ‘platformativity’ to use Thomas Lamarre’s word.

Dissertations / Theses on the topic "NVLink"

1

Stodůlka, Martin. "Akcelerace ultrazvukových simulací pomocí multi-GPU systémů" [Acceleration of ultrasound simulations using multi-GPU systems]. Master's thesis, Vysoké učení technické v Brně. Fakulta informačních technologií, 2021. http://www.nusl.cz/ntk/nusl-445538.

Abstract:
The main focus of this project is the use of multi-GPU systems and CUDA unified memory. Its goal is to accelerate computation of the 2D and 3D FFT, which is the main part of simulations in the k-Wave library. k-Wave is a C++/MATLAB library used for simulating the propagation of ultrasonic waves in 1D, 2D, or 3D space. Acceleration of these functions is necessary because the simulations are computationally intensive.

Book chapters on the topic "NVLink"

1

Pearson, Carl, I.-Hsin Chung, Zehra Sura, Wen-Mei Hwu, and Jinjun Xiong. "NUMA-Aware Data-Transfer Measurements for Power/NVLink Multi-GPU Systems." In Lecture Notes in Computer Science, 448–54. Cham: Springer International Publishing, 2018. http://dx.doi.org/10.1007/978-3-030-02465-9_32.


Conference papers on the topic "NVLink"

1

Danskin, John, and Denis Foley. "Pascal GPU with NVLink." In 2016 IEEE Hot Chips 28 Symposium (HCS). IEEE, 2016. http://dx.doi.org/10.1109/hotchips.2016.7936202.

2

Buono, Daniele, Fausto Artico, Fabio Checconi, Jee W. Choi, Xinyu Que, and Lars Schneidenbach. "Data Analytics with NVLink." In CF '17: Computing Frontiers Conference. New York, NY, USA: ACM, 2017. http://dx.doi.org/10.1145/3075564.3075569.

3

Christensen, Cameron, Thomas Fogal, Nathan Luehr, and Cliff Woolley. "Topology-aware image compositing using NVLink." In 2016 IEEE 6th Symposium on Large Data Analysis and Visualization (LDAV). IEEE, 2016. http://dx.doi.org/10.1109/ldav.2016.7874334.

4

Choi, Yea Rem, Vsevolod Nikolskiy, and Vladimir Stegailov. "Matrix-Matrix Multiplication Using Multiple GPUs Connected by Nvlink." In 2020 Global Smart Industry Conference (GloSIC). IEEE, 2020. http://dx.doi.org/10.1109/glosic50886.2020.9267865.

5

Gocho, Masato, and Takehiro Hoshino. "Performance Evaluation of Parallel Stripmap CS-SAR Imaging on NVLink-Connected GPUs." In 2018 IEEE International Conference on Cluster Computing (CLUSTER). IEEE, 2018. http://dx.doi.org/10.1109/cluster.2018.00027.

6

Appelhans, David, and Bob Walkup. "Leveraging NVLINK and asynchronous data transfer to scale beyond the memory capacity of GPUs." In SC '17: The International Conference for High Performance Computing, Networking, Storage and Analysis. New York, NY, USA: ACM, 2017. http://dx.doi.org/10.1145/3148226.3148232.

7

Hu, Yang, Huiba Li, and Yuxing Peng. "NVLAN: A Novel VLAN Technology for Scalable Multi-tenant Datacenter Networks." In 2014 Second International Conference on Advanced Cloud and Big Data (CBD). IEEE, 2014. http://dx.doi.org/10.1109/cbd.2014.32.
