lizamd

Megatron-LM_disable_TE Public

Modified from the ali branch, for Llama training

Python

sglang Public

Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python

ao Public

Forked from pytorch/ao

PyTorch native quantization and sparsity for training and inference

Python

flash-attention-benchmark Public

Flash Attention Kernel Benchmark: CK vs Triton vs PyTorch on AMD MI300X