-
PH.D. student at Sun Yat-sen university
-
LLM Inference, HPC, Simulaters, GPU, architecture
- xDiT: Fix parallel vae link
- DistVAE: Fix batch dimension link
- vLLM: [Benchmark] Refactor sample_requests in benchmark_throughput link
- vLLM: [Bugfix] fix automatic prefix args and add log info link
- vLLM: [Minor Fix] Fix comments in benchmark_serving link
- vLLM: [Minor Fix] Remove unused code in benchmark_prefix_caching.py link
- TVM: [Doc] Fix minor error in "Expressions in Relay" link
- TVM: [Doc] Fix minor error in doc (Add an operator to Relay) link