Optimize rmsnorm/layernorm to get better performance than aiter/triton #610
Open
cschenjunlin wants to merge 3 commits into