Kliknij ten link, aby zobaczyć inne rodzaje publikacji na ten temat: Sparse Matrix Vector Multiplication.

Artykuły w czasopismach na temat „Sparse Matrix Vector Multiplication”

Utwórz poprawne odniesienie w stylach APA, MLA, Chicago, Harvard i wielu innych

Wybierz rodzaj źródła:

Sprawdź 50 najlepszych artykułów w czasopismach naukowych na temat „Sparse Matrix Vector Multiplication”.

Przycisk „Dodaj do bibliografii” jest dostępny obok każdej pracy w bibliografii. Użyj go – a my automatycznie utworzymy odniesienie bibliograficzne do wybranej pracy w stylu cytowania, którego potrzebujesz: APA, MLA, Harvard, Chicago, Vancouver itp.

Możesz również pobrać pełny tekst publikacji naukowej w formacie „.pdf” i przeczytać adnotację do pracy online, jeśli odpowiednie parametry są dostępne w metadanych.

Przeglądaj artykuły w czasopismach z różnych dziedzin i twórz odpowiednie bibliografie.

1

Tao, Yuan, Yangdong Deng, Shuai Mu, Zhenzhong Zhang, Mingfa Zhu, Limin Xiao i Li Ruan. "GPU accelerated sparse matrix-vector multiplication and sparse matrix-transpose vector multiplication". Concurrency and Computation: Practice and Experience 27, nr 14 (7.10.2014): 3771–89. http://dx.doi.org/10.1002/cpe.3415.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
2

Filippone, Salvatore, Valeria Cardellini, Davide Barbieri i Alessandro Fanfarillo. "Sparse Matrix-Vector Multiplication on GPGPUs". ACM Transactions on Mathematical Software 43, nr 4 (23.03.2017): 1–49. http://dx.doi.org/10.1145/3017994.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
3

ERHEL, JOCELYNE. "SPARSE MATRIX MULTIPLICATION ON VECTOR COMPUTERS". International Journal of High Speed Computing 02, nr 02 (czerwiec 1990): 101–16. http://dx.doi.org/10.1142/s012905339000008x.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
4

Haque, Sardar Anisul, Shahadat Hossain i M. Moreno Maza. "Cache friendly sparse matrix-vector multiplication". ACM Communications in Computer Algebra 44, nr 3/4 (28.01.2011): 111–12. http://dx.doi.org/10.1145/1940475.1940490.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
5

Bienz, Amanda, William D. Gropp i Luke N. Olson. "Node aware sparse matrix–vector multiplication". Journal of Parallel and Distributed Computing 130 (sierpień 2019): 166–78. http://dx.doi.org/10.1016/j.jpdc.2019.03.016.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
6

Heath, L. S., C. J. Ribbens i S. V. Pemmaraju. "Processor-efficient sparse matrix-vector multiplication". Computers & Mathematics with Applications 48, nr 3-4 (sierpień 2004): 589–608. http://dx.doi.org/10.1016/j.camwa.2003.06.009.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
7

Yang, Xintian, Srinivasan Parthasarathy i P. Sadayappan. "Fast sparse matrix-vector multiplication on GPUs". Proceedings of the VLDB Endowment 4, nr 4 (styczeń 2011): 231–42. http://dx.doi.org/10.14778/1938545.1938548.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
8

Romero, L. F., i E. L. Zapata. "Data distributions for sparse matrix vector multiplication". Parallel Computing 21, nr 4 (kwiecień 1995): 583–605. http://dx.doi.org/10.1016/0167-8191(94)00087-q.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
9

Zardoshti, Pantea, Farshad Khunjush i Hamid Sarbazi-Azad. "Adaptive sparse matrix representation for efficient matrix–vector multiplication". Journal of Supercomputing 72, nr 9 (28.11.2015): 3366–86. http://dx.doi.org/10.1007/s11227-015-1571-0.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
10

Yzelman, A. N., i Rob H. Bisseling. "Cache-Oblivious Sparse Matrix–Vector Multiplication by Using Sparse Matrix Partitioning Methods". SIAM Journal on Scientific Computing 31, nr 4 (styczeń 2009): 3128–54. http://dx.doi.org/10.1137/080733243.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
11

Sun, C. C., J. Götze, H. Y. Jheng i S. J. Ruan. "Sparse matrix-vector multiplication on network-on-chip". Advances in Radio Science 8 (22.12.2010): 289–94. http://dx.doi.org/10.5194/ars-8-289-2010.

Pełny tekst źródła
Streszczenie:
Abstract. In this paper, we present an idea for performing matrix-vector multiplication by using Network-on-Chip (NoC) architecture. In traditional IC design on-chip communications have been designed with dedicated point-to-point interconnections. Therefore, regular local data transfer is the major concept of many parallel implementations. However, when dealing with the parallel implementation of sparse matrix-vector multiplication (SMVM), which is the main step of all iterative algorithms for solving systems of linear equation, the required data transfers depend on the sparsity structure of the matrix and can be extremely irregular. Using the NoC architecture makes it possible to deal with arbitrary structure of the data transfers; i.e. with the irregular structure of the sparse matrices. So far, we have already implemented the proposed SMVM-NoC architecture with the size 4×4 and 5×5 in IEEE 754 single float point precision using FPGA.
Style APA, Harvard, Vancouver, ISO itp.
12

Isupov, Konstantin. "Multiple-precision sparse matrix–vector multiplication on GPUs". Journal of Computational Science 61 (maj 2022): 101609. http://dx.doi.org/10.1016/j.jocs.2022.101609.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
13

Zou, Dan, Yong Dou, Song Guo i Shice Ni. "High performance sparse matrix-vector multiplication on FPGA". IEICE Electronics Express 10, nr 17 (2013): 20130529. http://dx.doi.org/10.1587/elex.10.20130529.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
14

Gao, Jiaquan, Yifei Xia, Renjie Yin i Guixia He. "Adaptive diagonal sparse matrix-vector multiplication on GPU". Journal of Parallel and Distributed Computing 157 (listopad 2021): 287–302. http://dx.doi.org/10.1016/j.jpdc.2021.07.007.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
15

Yzelman, A. N., i Rob H. Bisseling. "Two-dimensional cache-oblivious sparse matrix–vector multiplication". Parallel Computing 37, nr 12 (grudzień 2011): 806–19. http://dx.doi.org/10.1016/j.parco.2011.08.004.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
16

Yilmaz, Buse, Bariş Aktemur, MaríA J. Garzarán, Sam Kamin i Furkan Kiraç. "Autotuning Runtime Specialization for Sparse Matrix-Vector Multiplication". ACM Transactions on Architecture and Code Optimization 13, nr 1 (5.04.2016): 1–26. http://dx.doi.org/10.1145/2851500.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
17

MUKADDES, ABUL MUKID MOHAMMAD, MASAO OGINO i RYUJI SHIOYA. "PERFORMANCE EVALUATION OF DOMAIN DECOMPOSITION METHOD WITH SPARSE MATRIX STORAGE SCHEMES IN MODERN SUPERCOMPUTER". International Journal of Computational Methods 11, supp01 (listopad 2014): 1344007. http://dx.doi.org/10.1142/s0219876213440076.

Pełny tekst źródła
Streszczenie:
The use of proper data structures with corresponding algorithms is critical to achieve good performance in scientific computing. The need of sparse matrix vector multiplication in each iteration of the iterative domain decomposition method has led to implementation of a variety of sparse matrix storage formats. Many storage formats have been presented to represent sparse matrix and integrated in the method. In this paper, the storage efficiency of those sparse matrix storage formats are evaluated and compared. The performance results of sparse matrix vector multiplication used in the domain decomposition method is considered. Based on our experiments in the FX10 supercomputer system, some useful conclusions that can serve as guidelines for the optimization of domain decomposition method are extracted.
Style APA, Harvard, Vancouver, ISO itp.
18

Liu, Sheng, Yasong Cao i Shuwei Sun. "Mapping and Optimization Method of SpMV on Multi-DSP Accelerator". Electronics 11, nr 22 (11.11.2022): 3699. http://dx.doi.org/10.3390/electronics11223699.

Pełny tekst źródła
Streszczenie:
Sparse matrix-vector multiplication (SpMV) solves the product of a sparse matrix and dense vector, and the sparseness of a sparse matrix is often more than 90%. Usually, the sparse matrix is compressed to save storage resources, but this causes irregular access to dense vectors in the algorithm, which takes a lot of time and degrades the SpMV performance of the system. In this study, we design a dedicated channel in the DMA to implement an indirect memory access process to speed up the SpMV operation. On this basis, we propose six SpMV algorithm schemes and map them to optimize the performance of SpMV. The results show that the M processor’s SpMV performance reached 6.88 GFLOPS. Besides, the average performance of the HPCG benchmark is 2.8 GFLOPS.
Style APA, Harvard, Vancouver, ISO itp.
19

Jao, Nicholas, Akshay Krishna Ramanathan, John Sampson i Vijaykrishnan Narayanan. "Sparse Vector-Matrix Multiplication Acceleration in Diode-Selected Crossbars". IEEE Transactions on Very Large Scale Integration (VLSI) Systems 29, nr 12 (grudzień 2021): 2186–96. http://dx.doi.org/10.1109/tvlsi.2021.3114186.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
20

Kamin, Sam, María Jesús Garzarán, Barış Aktemur, Danqing Xu, Buse Yılmaz i Zhongbo Chen. "Optimization by runtime specialization for sparse matrix-vector multiplication". ACM SIGPLAN Notices 50, nr 3 (12.05.2015): 93–102. http://dx.doi.org/10.1145/2775053.2658773.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
21

Fernandez, D. M., D. Giannacopoulos i W. J. Gross. "Efficient Multicore Sparse Matrix-Vector Multiplication for FE Electromagnetics". IEEE Transactions on Magnetics 45, nr 3 (marzec 2009): 1392–95. http://dx.doi.org/10.1109/tmag.2009.2012640.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
22

Shantharam, Manu, Anirban Chatterjee i Padma Raghavan. "Exploiting dense substructures for fast sparse matrix vector multiplication". International Journal of High Performance Computing Applications 25, nr 3 (sierpień 2011): 328–41. http://dx.doi.org/10.1177/1094342011414748.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
23

Gao, Jiaquan, Panpan Qi i Guixia He. "Efficient CSR-Based Sparse Matrix-Vector Multiplication on GPU". Mathematical Problems in Engineering 2016 (2016): 1–14. http://dx.doi.org/10.1155/2016/4596943.

Pełny tekst źródła
Streszczenie:
Sparse matrix-vector multiplication (SpMV) is an important operation in computational science and needs be accelerated because it often represents the dominant cost in many widely used iterative methods and eigenvalue problems. We achieve this objective by proposing a novel SpMV algorithm based on the compressed sparse row (CSR) on the GPU. Our method dynamically assigns different numbers of rows to each thread block and executes different optimization implementations on the basis of the number of rows it involves for each block. The process of accesses to the CSR arrays is fully coalesced, and the GPU’s DRAM bandwidth is efficiently utilized by loading data into the shared memory, which alleviates the bottleneck of many existing CSR-based algorithms (i.e., CSR-scalar and CSR-vector). Test results on C2050 and K20c GPUs show that our method outperforms a perfect-CSR algorithm that inspires our work, the vendor tuned CUSPARSE V6.5 and CUSP V0.5.1, and three popular algorithms clSpMV, CSR5, and CSR-Adaptive.
Style APA, Harvard, Vancouver, ISO itp.
24

Maggioni, Marco, i Tanya Berger-Wolf. "Optimization techniques for sparse matrix–vector multiplication on GPUs". Journal of Parallel and Distributed Computing 93-94 (lipiec 2016): 66–86. http://dx.doi.org/10.1016/j.jpdc.2016.03.011.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
25

Geus, Roman, i Stefan Röllin. "Towards a fast parallel sparse symmetric matrix–vector multiplication". Parallel Computing 27, nr 7 (czerwiec 2001): 883–96. http://dx.doi.org/10.1016/s0167-8191(01)00073-4.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
26

Zhang, Jilin, Enyi Liu, Jian Wan, Yongjian Ren, Miao Yue i Jue Wang. "Implementing Sparse Matrix-Vector Multiplication with QCSR on GPU". Applied Mathematics & Information Sciences 7, nr 2 (1.03.2013): 473–82. http://dx.doi.org/10.12785/amis/070207.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
27

Feng, Xiaowen, Hai Jin, Ran Zheng, Zhiyuan Shao i Lei Zhu. "A segment-based sparse matrix-vector multiplication on CUDA". Concurrency and Computation: Practice and Experience 26, nr 1 (7.12.2012): 271–86. http://dx.doi.org/10.1002/cpe.2978.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
28

Neves, Samuel, i Filipe Araujo. "Straight-line programs for fast sparse matrix-vector multiplication". Concurrency and Computation: Practice and Experience 27, nr 13 (28.01.2014): 3245–61. http://dx.doi.org/10.1002/cpe.3211.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
29

Nastea, Sorin G., Ophir Frieder i Tarek El-Ghazawi. "Load-Balanced Sparse Matrix–Vector Multiplication on Parallel Computers". Journal of Parallel and Distributed Computing 46, nr 2 (listopad 1997): 180–93. http://dx.doi.org/10.1006/jpdc.1997.1361.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
30

He, Guixia, i Jiaquan Gao. "A Novel CSR-Based Sparse Matrix-Vector Multiplication on GPUs". Mathematical Problems in Engineering 2016 (2016): 1–12. http://dx.doi.org/10.1155/2016/8471283.

Pełny tekst źródła
Streszczenie:
Sparse matrix-vector multiplication (SpMV) is an important operation in scientific computations. Compressed sparse row (CSR) is the most frequently used format to store sparse matrices. However, CSR-based SpMVs on graphic processing units (GPUs), for example, CSR-scalar and CSR-vector, usually have poor performance due to irregular memory access patterns. This motivates us to propose a perfect CSR-based SpMV on the GPU that is called PCSR. PCSR involves two kernels and accesses CSR arrays in a fully coalesced manner by introducing a middle array, which greatly alleviates the deficiencies of CSR-scalar (rare coalescing) and CSR-vector (partial coalescing). Test results on a single C2050 GPU show that PCSR fully outperforms CSR-scalar, CSR-vector, and CSRMV and HYBMV in the vendor-tuned CUSPARSE library and is comparable with a most recently proposed CSR-based algorithm, CSR-Adaptive. Furthermore, we extend PCSR on a single GPU to multiple GPUs. Experimental results on four C2050 GPUs show that no matter whether the communication between GPUs is considered or not PCSR on multiple GPUs achieves good performance and has high parallel efficiency.
Style APA, Harvard, Vancouver, ISO itp.
31

Liu, Yongchao, i Bertil Schmidt. "LightSpMV: Faster CUDA-Compatible Sparse Matrix-Vector Multiplication Using Compressed Sparse Rows". Journal of Signal Processing Systems 90, nr 1 (10.01.2017): 69–86. http://dx.doi.org/10.1007/s11265-016-1216-4.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
32

Giannoula, Christina, Ivan Fernandez, Juan Gómez-Luna, Nectarios Koziris, Georgios Goumas i Onur Mutlu. "Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures". ACM SIGMETRICS Performance Evaluation Review 50, nr 1 (20.06.2022): 33–34. http://dx.doi.org/10.1145/3547353.3522661.

Pełny tekst źródła
Streszczenie:
Several manufacturers have already started to commercialize near-bank Processing-In-Memory (PIM) architectures, after decades of research efforts. Near-bank PIM architectures place simple cores close to DRAM banks. Recent research demonstrates that they can yield significant performance and energy improvements in parallel applications by alleviating data access costs. Real PIM systems can provide high levels of parallelism, large aggregate memory bandwidth and low memory access latency, thereby being a good fit to accelerate the Sparse Matrix Vector Multiplication (SpMV) kernel. SpMV has been characterized as one of the most significant and thoroughly studied scientific computation kernels. It is primarily a memory-bound kernel with intensive memory accesses due its algorithmic nature, the compressed matrix format used, and the sparsity patterns of the input matrices given. This paper provides the first comprehensive analysis of SpMV on a real-world PIM architecture, and presents SparseP, the first SpMV library for real PIM architectures. We make two key contributions. First, we design efficient SpMV algorithms to accelerate the SpMV kernel in current and future PIM systems, while covering a wide variety of sparse matrices with diverse sparsity patterns. Second, we provide the first comprehensive analysis of SpMV on a real PIM architecture. Specifically, we conduct our rigorous experimental analysis of SpMV kernels in the UPMEM PIM system, the first publicly-available real-world PIM architecture. Our extensive evaluation provides new insights and recommendations for software designers and hardware architects to efficiently accelerate the SpMV kernel on real PIM systems. For more information about our thorough characterization on the SpMV PIM execution, results, insights and the open-source SparseP software package [21], we refer the reader to the full version of the paper [3, 4]. The SparseP software package is publicly and freely available at https://github.com/CMU-SAFARI/SparseP.
Style APA, Harvard, Vancouver, ISO itp.
33

Dubois, David, Andrew Dubois, Thomas Boorman, Carolyn Connor i Steve Poole. "Sparse Matrix-Vector Multiplication on a Reconfigurable Supercomputer with Application". ACM Transactions on Reconfigurable Technology and Systems 3, nr 1 (styczeń 2010): 1–31. http://dx.doi.org/10.1145/1661438.1661440.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
34

Catalyurek, U. V., i C. Aykanat. "Hypergraph-partitioning-based decomposition for parallel sparse-matrix vector multiplication". IEEE Transactions on Parallel and Distributed Systems 10, nr 7 (lipiec 1999): 673–93. http://dx.doi.org/10.1109/71.780863.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
35

Toledo, S. "Improving the memory-system performance of sparse-matrix vector multiplication". IBM Journal of Research and Development 41, nr 6 (listopad 1997): 711–25. http://dx.doi.org/10.1147/rd.416.0711.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
36

Williams, Samuel, Leonid Oliker, Richard Vuduc, John Shalf, Katherine Yelick i James Demmel. "Optimization of sparse matrix–vector multiplication on emerging multicore platforms". Parallel Computing 35, nr 3 (marzec 2009): 178–94. http://dx.doi.org/10.1016/j.parco.2008.12.006.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
37

Peters, Alexander. "Sparse matrix vector multiplication techniques on the IBM 3090 VF". Parallel Computing 17, nr 12 (grudzień 1991): 1409–24. http://dx.doi.org/10.1016/s0167-8191(05)80007-9.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
38

Li, ShiGang, ChangJun Hu, JunChao Zhang i YunQuan Zhang. "Automatic tuning of sparse matrix-vector multiplication on multicore clusters". Science China Information Sciences 58, nr 9 (24.06.2015): 1–14. http://dx.doi.org/10.1007/s11432-014-5254-x.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
39

Dehn, T., M. Eiermann, K. Giebermann i V. Sperling. "Structured sparse matrix-vector multiplication on massively parallel SIMD architectures". Parallel Computing 21, nr 12 (grudzień 1995): 1867–94. http://dx.doi.org/10.1016/0167-8191(95)00055-0.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
40

Zeiser, Andreas. "Fast Matrix-Vector Multiplication in the Sparse-Grid Galerkin Method". Journal of Scientific Computing 47, nr 3 (26.11.2010): 328–46. http://dx.doi.org/10.1007/s10915-010-9438-2.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
41

Yang, Bing, Shuo Gu, Tong-Xiang Gu, Cong Zheng i Xing-Ping Liu. "Parallel Multicore CSB Format and Its Sparse Matrix Vector Multiplication". Advances in Linear Algebra & Matrix Theory 04, nr 01 (2014): 1–8. http://dx.doi.org/10.4236/alamt.2014.41001.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
42

Ahmad, Khalid, Hari Sundar i Mary Hall. "Data-driven Mixed Precision Sparse Matrix Vector Multiplication for GPUs". ACM Transactions on Architecture and Code Optimization 16, nr 4 (10.01.2020): 1–24. http://dx.doi.org/10.1145/3371275.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
43

Tao, Yuan, i Huang Zhi-Bin. "Shuffle Reduction Based Sparse Matrix-Vector Multiplication on Kepler GPU". International Journal of Grid and Distributed Computing 9, nr 10 (31.10.2016): 99–106. http://dx.doi.org/10.14257/ijgdc.2016.9.10.09.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
44

Dehnavi, Maryam Mehri, David M. Fernandez i Dennis Giannacopoulos. "Finite-Element Sparse Matrix Vector Multiplication on Graphic Processing Units". IEEE Transactions on Magnetics 46, nr 8 (sierpień 2010): 2982–85. http://dx.doi.org/10.1109/tmag.2010.2043511.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
45

Liang, Yun, Wai Teng Tang, Ruizhe Zhao, Mian Lu, Huynh Phung Huynh i Rick Siow Mong Goh. "Scale-Free Sparse Matrix-Vector Multiplication on Many-Core Architectures". IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 36, nr 12 (grudzień 2017): 2106–19. http://dx.doi.org/10.1109/tcad.2017.2681072.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
46

Aktemur, Barış. "A sparse matrix-vector multiplication method with low preprocessing cost". Concurrency and Computation: Practice and Experience 30, nr 21 (25.05.2018): e4701. http://dx.doi.org/10.1002/cpe.4701.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
47

Chen, Xinhai, Peizhen Xie, Lihua Chi, Jie Liu i Chunye Gong. "An efficient SIMD compression format for sparse matrix-vector multiplication". Concurrency and Computation: Practice and Experience 30, nr 23 (29.06.2018): e4800. http://dx.doi.org/10.1002/cpe.4800.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
48

Karsavuran, M. Ozan, Kadir Akbudak i Cevdet Aykanat. "Locality-Aware Parallel Sparse Matrix-Vector and Matrix-Transpose-Vector Multiplication on Many-Core Processors". IEEE Transactions on Parallel and Distributed Systems 27, nr 6 (1.06.2016): 1713–26. http://dx.doi.org/10.1109/tpds.2015.2453970.

Pełny tekst źródła
Style APA, Harvard, Vancouver, ISO itp.
49

Burkhardt, Paul. "Optimal Algebraic Breadth-First Search for Sparse Graphs". ACM Transactions on Knowledge Discovery from Data 15, nr 5 (26.06.2021): 1–19. http://dx.doi.org/10.1145/3446216.

Pełny tekst źródła
Streszczenie:
There has been a rise in the popularity of algebraic methods for graph algorithms given the development of the GraphBLAS library and other sparse matrix methods. An exemplar for these approaches is Breadth-First Search (BFS). The algebraic BFS algorithm is simply a recurrence of matrix-vector multiplications with the n × n adjacency matrix, but the many redundant operations over nonzeros ultimately lead to suboptimal performance. Therefore an optimal algebraic BFS should be of keen interest especially if it is easily integrated with existing matrix methods. Current methods, notably in the GraphBLAS, use a Sparse Matrix masked-Sparse Vector multiplication in which the input vector is kept in a sparse representation in each step of the BFS, and nonzeros in the vector are masked in subsequent steps. This has been an area of recent research in GraphBLAS and other libraries. While in theory, these masking methods are asymptotically optimal on sparse graphs, many add work that leads to suboptimal runtime. We give a new optimal, algebraic BFS for sparse graphs, thus closing a gap in the literature. Our method multiplies progressively smaller submatrices of the adjacency matrix at each step. Let n and m refer to the number of vertices and edges, respectively. On a sparse graph, our method takes O ( n ) algebraic operations as opposed to O ( m ) operations needed by theoretically optimal sparse matrix approaches. Thus, for sparse graphs, it matches the bounds of the best-known sequential algorithm, and on a Parallel Random Access Machine, it is work-optimal. Our result holds for both directed and undirected graphs. Compared to a leading GraphBLAS library, our method achieves up to 24x faster sequential time, and for parallel computation, it can be 17x faster on large graphs and 12x faster on large-diameter graphs.
Style APA, Harvard, Vancouver, ISO itp.
50

Fitriyani, Fitriyani. "Sliced Coordinate List Implementation Analysis on Sparse Matrix-Vector Multiplication Using Compute Unified Device Architecture". International Journal on Information and Communication Technology (IJoICT) 2, nr 1 (1.07.2016): 13. http://dx.doi.org/10.21108/ijoict.2016.21.71.

Pełny tekst źródła
Streszczenie:
<p>Matrices are one of the most used data representation form from real-world problems. Lot of matrix was formed very big but sparse, hence information inside the matrix is relatively small compared to its size. This caused into heavy computational resources needed to process those matrices within short time. One of the solutions to do an efficient process to the sparse matrix is to form it into a specialized form of sparse matrix, such as Sliced Coordinate List (SCOO). SCOO format for sparse matrix has been developed and combined within an implementation using Compute Unified Device Architecture (CUDA). In this research, performance of SCOO implementation using GPU – CUDA will be compared to the other sparse matrix format named Coordinate List (COO) based on its memory usage and execution time. Results obtained from this research show that although SCOO implementation for sparse matrix use memory 1.000529 larger than COO format, its serial performance is 3.18 faster than serial COO, besides that, if SCOO implementation is conducted parallel using GPU – CUDA then its performance can be achieved around 123.8 faster than parallel COO or 77 times faster than parallel COO using one of the available library for CUDA, named CUSP.</p>
Style APA, Harvard, Vancouver, ISO itp.
Oferujemy zniżki na wszystkie plany premium dla autorów, których prace zostały uwzględnione w tematycznych zestawieniach literatury. Skontaktuj się z nami, aby uzyskać unikalny kod promocyjny!

Do bibliografii