Skip to content
Change the repository type filter

All

    Repositories list

    • GPU implementation of Xnor network on inference level.
      Cuda
      6000Updated Jul 30, 2020Jul 30, 2020
    • Simd

      Public
      C++ image processing and machine learning library with using of SIMD: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX-512, VMX(Altivec) and VSX(Power7), NEON for ARM.
      C++
      MIT License
      409000Updated Dec 19, 2019Dec 19, 2019
    • ROCm Software Platform Documentation
      C++
      92000Updated Aug 27, 2019Aug 27, 2019
    • Generate a quantization parameter file for ncnn framework int8 inference
      Python
      BSD 3-Clause "New" or "Revised" License
      153000Updated Jul 25, 2019Jul 25, 2019
    • vulkan subgroups example for reduce and scan
      C++
      MIT License
      3000Updated May 9, 2019May 9, 2019
    • C++
      Apache License 2.0
      37000Updated Jan 10, 2019Jan 10, 2019
    • GLSL-Card

      Public
      着色器语言 GLSL (opengl-shader-language)入门大全
      298000Updated Jan 10, 2019Jan 10, 2019
    • Minimal Example of Using Vulkan for Compute Operations. Only ~400LOC.
      C++
      MIT License
      71000Updated Nov 5, 2018Nov 5, 2018
    • VKL

      Public
      An abstraction layer on-top of Vulkan to help reduce boiler-plate code.
      C
      MIT License
      1000Updated Oct 3, 2018Oct 3, 2018
    • ucc162.3

      Public
      A lightweight open-source C compiler for research and education.
      C
      139000Updated Jul 8, 2018Jul 8, 2018
    • ⚡ 6.824: Distributed Systems (Spring 2017). A course which present abstractions and implementation techniques for engineering distributed systems.
      Go
      77000Updated May 29, 2018May 29, 2018
    • video distributed compressor with ffmpeg
      Go
      8000Updated Apr 1, 2018Apr 1, 2018
    • ffmpeg build scripts for android ndk usage (including x264)
      Shell
      87000Updated Mar 31, 2018Mar 31, 2018
    • MIT课程《Distributed Systems 》学习和翻译
      Go
      663000Updated Mar 1, 2018Mar 1, 2018
    • Depth_conv for MobileNet
      Cuda
      5000Updated Nov 22, 2017Nov 22, 2017
    • The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Intel MKL(CPU) and cuBLAS(CUDA) on different matrix sizes/vendor's hardwares/OS. Out-of-the-box easy as MSVC, MinGW, Linux(CentOS) x86_64 binary provided. 在不同矩阵大小/硬件/操作系统下比较几个BLAS库的sgemm函数性能,提供binary,开盒即用。
      C
      MIT License
      6000Updated Nov 9, 2017Nov 9, 2017
    • tensor

      Public
      A Modern C++ Heterogeneous Computing Library
      C++
      MIT License
      38000Updated Oct 24, 2017Oct 24, 2017
    • darknet

      Public
      Convolutional Neural Networks
      C
      Other
      21k000Updated Oct 10, 2017Oct 10, 2017
    • 用python学习rgbd-slam系列
      Python
      65000Updated Sep 27, 2017Sep 27, 2017
    • mit6.828

      Public
      C
      Apache License 2.0
      21000Updated Jul 11, 2017Jul 11, 2017
    • C++
      1000Updated May 1, 2017May 1, 2017
    • SLAM 开发学习资源与经验分享
      954000Updated Apr 28, 2017Apr 28, 2017
    • Winograd-based convolution implementation in OpenCL
      C
      11000Updated Jan 22, 2017Jan 22, 2017
    • CNN_CUDA

      Public
      C++
      1000Updated Jan 16, 2017Jan 16, 2017
    • C++
      BSD 2-Clause "Simplified" License
      2000Updated Nov 4, 2016Nov 4, 2016
    • A demonstration of speeding up a 1D convolution using SSE
      C
      11000Updated Sep 6, 2016Sep 6, 2016
    • maxas

      Public
      Assembler for NVIDIA Maxwell architecture
      CSS
      MIT License
      161000Updated Jun 10, 2016Jun 10, 2016
    • Deep learning with Caffe on phones, with OpenCL support for CPU and GPU devices.
      C++
      Other
      19k000Updated Mar 11, 2016Mar 11, 2016
    • XNet

      Public
      Simple CuDNN wrapper
      C++
      13000Updated Nov 29, 2015Nov 29, 2015
    • C++
      1000Updated Aug 31, 2015Aug 31, 2015