
Recognition of convolutional neural network based on CUDA Technology (1506.00074v2)

Published 30 May 2015 in cs.DC and cs.NE

Abstract: To address whether the Graphics Processing Unit (GPU), a stream processor with high floating-point performance, is applicable to neural networks, this paper proposes a parallel recognition algorithm for Convolutional Neural Networks (CNNs). It adopts Compute Unified Device Architecture (CUDA) technology, defines the parallel data structures, and describes the mechanism for mapping computing tasks onto CUDA. It compares the parallel recognition algorithm implemented on a GPU of the GTX200 hardware architecture with the serial algorithm on a CPU; the parallel version is nearly 60 times faster. The results show that GPUs, being based on a stream processor architecture, are more applicable than CPUs to some neural-network-related applications.

Authors (6)
  1. Yi-bin Huang (1 paper)
  2. Kang Li (207 papers)
  3. Ge Wang (214 papers)
  4. Min Cao (22 papers)
  5. Pin Li (2 papers)
  6. Yu-jia Zhang (2 papers)
Citations (7)
