TAG»CUDA«

OCTOBER2023

ARTICLE

Benchmarking Bit Errors in Quantized Neural Networks with PyTorch

ADVERSARIAL MACHINE LEARNING C CUDA DEEP LEARNING DNN ACCELERATORS PYTHON PYTORCH

Similar to my article series on adversarial robustness, I was planning to have a series on bit errors robustness accompanied by PyTorch code. Instead, due to time constraints, I decided to condense the information into a single article. The code for the originally planned six articles is available on GitHub.

More ...

JUNE2023

PROJECT

C CUDA DEEP LEARNING DNN ACCELERATORS PYTHON PYTORCH

OPEN SOURCE Bit Error Robustness in PyTorch Article Series I was planning to have an article series on bit error robustness in deep learning — similar to my article series on adversarial robustness — with accompanying PyTorch code. However, the recent progress in machine learning made me focus on other projects. Nevertheless, the articles should […]

More ...

NOVEMBER2022

PROJECT

C++ COMPUTER VISION CUDA DEEP LEARNING LUA TORCH

Torch/CUDA implementation of batch normalization for OctNets.

More ...

AUGUST2022

PROJECT

C CUDA DEEP LEARNING MACHINE LEARNING PYTHON PYTORCH

A template for extending PyTorch using C/CUDA operations.

More ...

AUGUST2022

PROJECT

C COMPUTER VISION CUDA DEEP LEARNING LUA MACHINE LEARNING TORCH

Basic and advanced torch examples, template for implementing custom C/CUDA modules and implementations of variational auto-encoders.

More ...

AUGUST2022

PROJECT

C++ COMPUTER VISION CUDA DEEP LEARNING MACHINE LEARNING PYTHON

3D mesh fusion, voxelization and evaluation for computer vision research.

More ...

APRIL2020

ARTICLE

Implementing Custom PyTorch Tensor Operations in C and CUDA

C CUDA DEEP LEARNING PYTORCH

PyTorch, alongside TensorFlow, has become standard among deep learning researchers and practitioners. While PyTorch provides a large variety in terms of tensor operations or deep learning layers, some specialized operations still need to be implemented manually. In cases where runtime is crucial, this should be done in C or CUDA for supporting both CPU and GPU computation. In this article, I want to provide a simple example and framework for extending PyTorch with custom C and CUDA operations using CFFI for Python and CuPy.

More ...

JANUARY2019

ARTICLE

Convolutional Batch Normalization for OctNets

C++ COMPUTER VISION CUDA DEEP LEARNING LUA TORCH

During my master thesis I partly worked on OctNets, octree-bases convolutional neural networks for efficient learning in 3D. Among others, I implemented convolutional batch normalization for OctNets. This article briefly discusses the implementation, which will be available on GitHub.

More ...

AUGUST2018

ARTICLE

Implementing Torch Modules in C/CUDA

C++ CUDA DEEP LEARNING LUA TORCH

Torch is a framework for scientific computing in LUA. However, it has mostly been used for deep learning research as it provides efficient and comfortable C/CUDA implementations of a wide range of (convolutional and/or recurrent) neural network components. In this article, I want to provide a code template allowing to easily extend torch.nn by custom modules implemented in C and/or CUDA without knowledge of Torch’s core.

More ...

APRIL2018

ARTICLE

Compiling OpenCV 2.4.x with CUDA 9

C++ COMPUTER VISION CUDA OPENCV

Currently, both OpenCV 2 and OpenCV 3 seem to have some minor issues with CUDA 9. However, CUDA 9 is required for the latest generation of NVidia graphics cards. In this article, based on this StackOverflow question, I want to discuss a very simple patch to get OpenCV 2 running with CUDA 9.

More ...

OLDER

IAM

DAVIDSTUTZ

TAG»CUDA«

ARTICLE

Benchmarking Bit Errors in Quantized Neural Networks with PyTorch

PROJECT

Bit Error Robustness in PyTorch Article Series

PROJECT

OctNet Batch Normalization

PROJECT

PyTorch C/CUDA Module Template

PROJECT

Torch Examples, Guides and Resources

PROJECT

Mesh Fusion, Voxelization & Evaluation

ARTICLE

Implementing Custom PyTorch Tensor Operations in C and CUDA

ARTICLE

Convolutional Batch Normalization for OctNets

ARTICLE

Implementing Torch Modules in C/CUDA

ARTICLE

Compiling OpenCV 2.4.x with CUDA 9

SEARCHTHEBLOG

ARCHIVES

TAGS