HGPU group (@hgpu) 's Twitter Profile
HGPU group

@hgpu

High performance computing on graphics processing units (GPU): AMD/ATI, nVidia, Intel Xeon Phi, CUDA, OpenCL, OpenGL, GPGPU, HPC

ID: 301967669

linkhttp://hgpu.org/ calendar_today20-05-2011 11:21:39

10,10K Tweet

3,3K Followers

118 Following

HGPU group (@hgpu) 's Twitter Profile Photo

LiteGD: Lightweight and dynamic GPU Dispatching for Large-scale Heterogeneous Clusters #GPUcluster hgpu.org/?p=29950

HGPU group (@hgpu) 's Twitter Profile Photo

A Novel Compiler Transformation for Fast Sparse Matrix Multiplication in GPUs #CUDA #Compilers #Sparse #MatrixMultiplication hgpu.org/?p=29951

HGPU group (@hgpu) 's Twitter Profile Photo

Engineering Supercomputing Platforms for Biomolecular Applications #CUDA #ROCm #Biology #Biomolecules #MolecularDynamics #HPC #Physics #Package hgpu.org/?p=29954

HGPU group (@hgpu) 's Twitter Profile Photo

GCStack+GCScaler: Fast and Accurate GPU Performance Analyses Using Fine-Grained Stall Cycle Accounting and Interval Analysis #CUDA #Performance hgpu.org/?p=29982

HGPU group (@hgpu) 's Twitter Profile Photo

No More Shading Languages: Compiling C++ to Vulkan Shaders #Vulkan #Compilers #GLSL #Rendering #Raytracing #Package hgpu.org/?p=29983

HGPU group (@hgpu) 's Twitter Profile Photo

P4OMP: Retrieval-Augmented Prompting for OpenMP Parallelism in Serial Code #OpenMP #LLM #HPC #CodeGeneration hgpu.org/?p=30004

HGPU group (@hgpu) 's Twitter Profile Photo

ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks #CUDA #OpenMP #LLM #CodeGeneration #Benchmarking #Package hgpu.org/?p=30005

HGPU group (@hgpu) 's Twitter Profile Photo

Libra: Synergizing CUDA and Tensor Cores for High-Performance Sparse Matrix Multiplication #CUDA #Sparse #SpMM #DeepLearning #DL #Package hgpu.org/?p=30006

HGPU group (@hgpu) 's Twitter Profile Photo

Accelerated discovery and design of Fe-Co-Zr magnets with tunable magnetic anisotropy through machine learning and parallel computing #CUDA #Physics #MaterialsScience #CondensedMatter #MachineLearning #ML #Package hgpu.org/?p=30007

HGPU group (@hgpu) 's Twitter Profile Photo

KIS-S: A GPU-Aware Kubernetes Inference Simulator with RL-Based Auto-Scaling #GPU #Kubernets #Package hgpu.org/?p=30034

HGPU group (@hgpu) 's Twitter Profile Photo

Demystifying NCCL: An In-depth Analysis of GPU Communication Protocols and Algorithms #CUDA #GPUcluster #Communication hgpu.org/?p=30035

HGPU group (@hgpu) 's Twitter Profile Photo

Serving LLMs in HPC Clusters: A Comparative Study of Qualcomm Cloud AI 100 Ultra and High-Performance GPUs #Qualcomm #Cloud #LLM #HPC #DeepLearning #DL hgpu.org/?p=30036

HGPU group (@hgpu) 's Twitter Profile Photo

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems #CUDA #TaskScheduling #Package hgpu.org/?p=30037

HGPU group (@hgpu) 's Twitter Profile Photo

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation #CUDA #HPC #LLM #CodeGeneration #Package hgpu.org/?p=30038