AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU)...

Last updated: 4 days ago

libfabric

Open Fabric Interfaces

Last updated: 4 days ago

pytorch_scatter

PyTorch Extension Library of Optimized Scatter Operations

Last updated: 4 days ago

MITuna

Last updated: 4 days ago

TransferBench

TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)

Last updated: 4 days ago

pytorch-lightning

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Last updated: 4 days ago

recommenders-addons

Additional utils and helpers to extend TensorFlow when building recommendation systems, contributed and maintained by SIG Recommenders.

Last updated: 4 days ago

rtg_tracer

Last updated: 4 days ago

kineto

A CPU+GPU profiling library that provides access to timeline traces and hardware performance counters.

Last updated: 4 days ago

rocWMMA

rocWMMA

Last updated: 4 days ago

sukha-tools

Profiling tools intended for NVIDIA or AMD GPUs

Last updated: 4 days ago

triton

Development repository for the Triton language and compiler

Last updated: 4 days ago

hipify_torch

Last updated: 4 days ago

composable_kernel_remove

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

Last updated: 4 days ago

gpufort

GPUFORT: source-to-source (S2S) translation tool for CUDA Fortran and Fortran+X in the spirit of hipify

Last updated: 4 days ago

tensorflow-build

Build-related tools for TensorFlow

Last updated: 4 days ago

aws-ofi-rccl

Last updated: 4 days ago

rocGemmDriver

rocGemmDriver

Last updated: 4 days ago

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Last updated: 4 days ago

hipOMB

OSU MPI benchmarks with ROCm support

Last updated: 4 days ago