Skip to content

Commit

Permalink
Update publications.md
Browse files Browse the repository at this point in the history
  • Loading branch information
bohanzhuang authored Sep 25, 2024
1 parent 3196b62 commit 84e88c3
Showing 1 changed file with 26 additions and 22 deletions.
48 changes: 26 additions & 22 deletions _pages/publications.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,35 +7,45 @@ author_profile: true

(Selected Publications. \* equal contribution, \# corresponding author)

**2024**

## Preprint

- **MiniCache: KV Cache Compression in Depth Dimension for Large Language Models**
- **ME-Switch: A Memory-Efficient Expert Switching Framework for Large Language Models**

**Akide Liu**, Jing Liu, Zizheng Pan, Yefei He, Gholamreza Haffari, Bohan Zhuang\#
**Jing Liu**, Ruihao Gong, Mingyang Zhang, Yefei He, Jianfei Cai, Bohan Zhuang\#

[[Paper](https://arxiv.org/abs/2405.14366)]
[[Paper](https://arxiv.org/abs/2406.09041)]


- **ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification**
- **T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching**

**Yefei He**, Luoming Zhang, Weijia Wu, Jing Liu, Hong Zhou, Bohan Zhuang\#
**Zizheng Pan**, Bohan Zhuang\#, De-An Huang, Weili Nie, Zhiding Yu, Chaowei Xiao, Jianfei Cai, Anima Anandkumar

[[Paper](https://arxiv.org/abs/2405.14256)]
[[Paper](https://arxiv.org/abs/2402.14167)][[Project](https://t-stitch.github.io/)][[Code](https://github.com/NVlabs/T-Stitch)]


- **ME-Switch: A Memory-Efficient Expert Switching Framework for Large Language Models**
## 2024

**Jing Liu**, Ruihao Gong, Mingyang Zhang, Yefei He, Jianfei Cai, Bohan Zhuang\#

[[Paper](https://arxiv.org/abs/2406.09041)]
- **MiniCache: KV Cache Compression in Depth Dimension for Large Language Models**

**Akide Liu**, Jing Liu, Zizheng Pan, Yefei He, Gholamreza Haffari, Bohan Zhuang\#

- **T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching**
[[Paper](https://arxiv.org/abs/2405.14366)] **NeurIPS 2024**

**Zizheng Pan**, Bohan Zhuang\#, De-An Huang, Weili Nie, Zhiding Yu, Chaowei Xiao, Jianfei Cai, Anima Anandkumar

[[Paper](https://arxiv.org/abs/2402.14167)][[Project](https://t-stitch.github.io/)][[Code](https://github.com/NVlabs/T-Stitch)]
- **ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification**

**Yefei He**, Luoming Zhang, Weijia Wu, Jing Liu, Hong Zhou, Bohan Zhuang\#

[[Paper](https://arxiv.org/abs/2405.14256)] **NeurIPS 2024**


- **MVSplat360: Benchmarking 360° Generalizable 3D Novel View Synthesis from Sparse Views**

**Yuedong Chen**, Chuanxia Zheng, Haofei Xu, Bohan Zhuang, Andrea Vedaldi, Tat-Jen Cham, Jianfei Cai

[[Paper]()] **NeurIPS 2024**


- **GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI**
Expand Down Expand Up @@ -123,16 +133,10 @@ Guoan Wang, **Jin Ye**, Junlong Cheng, Tianbin Li, Zhaolin Chen, Jianfei Cai, Ju
[[Paper](https://arxiv.org/abs/2407.04938)] **MICCAI 2024**


- **Streaming Video Diffusion: Online Video Editing with Diffusion Models**

**Feng Chen**, Zhen Yang, Bohan Zhuang, Qi Wu

[[Paper](https://arxiv.org/abs/2405.19726)]




**2023**
## 2023


- **Stitchable Neural Networks**
Expand Down Expand Up @@ -220,7 +224,7 @@ Guoan Wang, **Jin Ye**, Junlong Cheng, Tianbin Li, Zhaolin Chen, Jianfei Cai, Ju



**2022**
## 2022

- **EcoFormer: Energy-Saving Attention with Linear Complexity**

Expand Down Expand Up @@ -298,7 +302,7 @@ Guoan Wang, **Jin Ye**, Junlong Cheng, Tianbin Li, Zhaolin Chen, Jianfei Cai, Ju



**2021**
## 2021


- **Scalable Vision Transformers with Hierarchical Pooling**
Expand Down

0 comments on commit 84e88c3

Please sign in to comment.