Merge pull request #1 from JerryYin777/dev
Dev
Showing 11 changed files with 108 additions and 25 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,6 +1,6 @@ | ||
|
||
title: Congrui Yin's Homepage | ||
page-top-title: Congrui Yin (殷骢睿) | ||
top-section-bg-text: An LLM Designer and Efficient Machine Learning System Nerd | ||
home-subtitle: Congrui Yin | 殷骢睿 | ||
copyright-text: '© Congrui Yin 2023. All Rights Reserved.' | ||
page-top-title: Jerry (Congrui) Yin's Homepage | ||
top-section-bg-text: A NLP & MLSys Nerd | ||
home-subtitle: Jerry (Congrui) Yin (殷骢睿) | ||
copyright-text: '© Congrui Yin 2024. All Rights Reserved.' |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,27 +1,56 @@ | ||
|
||
#### News | ||
|
||
<strong style="color:red;"><strong>I’m actively applying for a MLsys Ph.D. position in 2025 Fall! </strong></strong> If you need a student who is familiar with <strong style="color:red;"><strong>both NLP and computer systems </strong></strong> with <strong style="color:red;"><strong>extensive industry experiences</strong></strong>, feel free to <a href="#contact-info">Contact Me</a>! | ||
|
||
|
||
#### Biography | ||
I am currently a junior Undergraduate Student pursuing a bachelor's Degree in computer science at College of Liberal Arts, University of Minnesota Twins Cities. | ||
I am currently a senior Undergraduate Student pursuing a bachelor's Degree in computer science at College of Liberal Arts, University of Minnesota Twin Cities, supervised by Prof. [Zirui Liu](https://zirui-ray-liu.github.io/). In the summer of 2023, I visited [TsinghuaNLP](https://github.com/thunlp) and conducted research under Prof. [Zhiyuan Liu](https://nlp.csai.tsinghua.edu.cn/~lzy/). | ||
|
||
I have experience in NLP and computer systems(both architecture and high performance machine learning systems), along with extensive industry research internship experience. This includes: | ||
|
||
* Participating in the pretraining of the Yi-Large model at 01.AI. | ||
* Contributing to ML Infra of the pretraining of the on-device small model [MiniCPM-2B](https://huggingface.co/collections/openbmb/minicpm-2b-65d48bf958302b9fd25b698f) at ModelBest (with TsinghuaNLP) | ||
* Participating in the finetuning of the CodeLLM [Raccoon](https://raccoon.sensetime.com/code) (Copilot-like) at SenseTime (with CUHK MMLab). | ||
|
||
#### Research Interests | ||
|
||
My current passion revolves around building **EFFICIENT** system solutions to AGI and LLM(VLM) for **RELIABLE** Hardware Design, this includes: | ||
|
||
1. Machine Learning System | ||
* Training: Design more effective training system and algorithms, examples include [BMTrain](https://github.com/OpenBMB/BMTrain). | ||
* Quantization (e.g. Attempting to finetune Llama 3.1 405B on a single A100 80GB GPU), the other example includes [IAPT](https://arxiv.org/pdf/2405.18203). | ||
* Long context inference: example includes [Cross Layer Attention](https://github.com/JerryYin777/Cross-Layer-Attention). | ||
2. LLM(VLM) for RELIABLE Hardware Design | ||
* Synthesise pretraining and finetuning common knowledge of CodeLLM, exploring the boundary capabilities of LLM/VLM for hardware design (e.g. pretrain/finetune a VerilogLLM). | ||
* Align the simulation code with the waveform image data to finetune VerilogVLM. | ||
|
||
#### Misc | ||
|
||
Before transferring to the University of Minnesota, I studied at Nanchang University, majoring in Artificial Intelligence in a top-tier class with a School Academic Special Scholarship. I was the leader of Nanchang University Supercomputer Cluster Team ([NCUSCC](https://ncuscc.github.io/)) Leader, with experience of ASC22 and SC23(IndySCC). | ||
|
||
I am passionate about open source and firmly believe in its potential to disseminate knowledge widely, leverage technology to lead innovation to the world and contribute to the advancement of human society. I am proud to have garnered over **1k stars** and acquired over **250 followers** on [GitHub](https://github.com/JerryYin777). I occasionally share my explorations in the machine learning system and LLM field on [Zhihu](https://www.zhihu.com/people/ycr222/posts) in Mandarin. | ||
|
||
#### Contact<p id="contact-info"></p> | ||
|
||
My research interests lie in Large Multimodal Models (LMMs) and their application in diverse practical scenarios, such as biological and system large models. My focus also extends to developing efficient machine learning systems aimed at expediting the training and inference processing of LMMs (especially LLMs), leveraging expertise in high-performance computing and distributed systems. | ||
✉️ [yin00486 [at] umn.edu](mailto:[email protected]) | ||
|
||
Before transferring to the University of Minnesota, I studied at Nanchang University, majoring in Artificial Intelligence in a top-tier class with a School Academic Special Scholarship. I was honored to be advised by Professor [Zichen Xu](https://good.ncu.edu.cn/Pages/Professor.html) at [GOOD LAB](https://good.ncu.edu.cn) starting from March 2022, where my focus was on solving data-centric challenges and building efficient and reliable systems. I was the leader of Nanchang University Supercomputer Cluster Team ([NCUSCC](https://hpc.ncuscc.tech/)) Leader, with experience of ASC22 and SC23(IndySCC). | ||
<!-- Before transferring to the University of Minnesota, I studied at Nanchang University, majoring in Artificial Intelligence in a top-tier class with a School Academic Special Scholarship. I was honored to be advised by Professor [Zichen Xu](https://good.ncu.edu.cn/Pages/Professor.html) at [GOOD LAB](https://good.ncu.edu.cn) starting from March 2022, where my focus was on solving data-centric challenges and building efficient and reliable systems. I was the leader of Nanchang University Supercomputer Cluster Team ([NCUSCC](https://hpc.ncuscc.tech/)) Leader, with experience of ASC22 and SC23(IndySCC). --> | ||
|
||
I was also fortunately recruited as a research assistant at **TOP** NLP Lab [TsinghuaNLP](https://github.com/thunlp) in Beijing from July to September 2023, advised by Professor [Zhiyuan Liu](https://nlp.csai.tsinghua.edu.cn/~lzy/), trying to build efficient distributed large language model training framework [BMTrain](https://github.com/OpenBMB/BMTrain) and Develop 10B Chinese LLM [CPM-Bee](https://github.com/OpenBMB/CPM-Bee/blob/main/README_en.md). | ||
<!-- I was also fortunately recruited as a research assistant at [TsinghuaNLP](https://github.com/thunlp) in Beijing from July to September 2023, advised by Professor [Zhiyuan Liu](https://nlp.csai.tsinghua.edu.cn/~lzy/), [Weilin Zhao](https://achazwl.github.io/) and [Xu Han](https://scholar.google.com/citations?user=rz4rOSMAAAAJ&hl=zh-CN), trying to build efficient distributed large language model training framework [BMTrain](https://github.com/OpenBMB/BMTrain) and Develop 2B on-device Chinese LLM [MiniCPM](https://huggingface.co/collections/openbmb/minicpm-2b-65d48bf958302b9fd25b698f) at ModelBest (面壁智能). I also interned at 01.AI (零一万物) and SenseTime Research (商汤研究院) as algorithm intern. --> | ||
|
||
I am passionate about open source and firmly believe in its potential to disseminate knowledge widely, leverage technology to lead innovation to the world and contribute to the advancement of human society. I am proud to have garnered over **1000 stars** and acquired **155 followers** on GitHub. It is gratifying to know that my open-source projects have benefitted numerous individuals, and I have personally gained valuable knowledge from the open-source community. | ||
<!-- I am passionate about open source and firmly believe in its potential to disseminate knowledge widely, leverage technology to lead innovation to the world and contribute to the advancement of human society. I am proud to have garnered over **1000 stars** and acquired over**200 followers** on GitHub. It is gratifying to know that my open-source projects have benefitted numerous individuals, and I have personally gained valuable knowledge from the open-source community. --> | ||
|
||
#### Contact | ||
* Github: [JerryYin777](https://github.com/JerryYin777) | ||
<!-- #### Contact | ||
* Email: yin00486 [at] umn.edu | ||
* Google Scholar:[Congrui Yin's Google Scholar](https://scholar.google.com/citations?hl=en&user=7gsdLw4AAAAJ) | ||
#### Education | ||
2023.12 - Present, Computer Science, College of Liberal Arts, University of Minnesota Twin Cities. | ||
2021.09 - 2023.12, Artificial Intelligence, School of Information Engineering, Nanchang University. | ||
#### Research Interests | ||
Natural Language Processing, Machine Learning System, Human Computer Interaction. | ||
Natural Language Processing, Machine Learning System. | ||
#### Skills | ||
* **Natural Language Processing:** Proficient in using the PyTorch framework, with the ability to reproduce mainstream large-scale models in the industry (such as Baichuan, llama2, Qwen). Proficient in using quantization and inference tools such as QLoRA, vLLM, and skilled in distributed parallel training (using training tools such as BMTrain, DeepSpeed). | ||
|
@@ -30,4 +59,4 @@ Natural Language Processing, Machine Learning System, Human Computer Interaction | |
* **Computer System Architecture:** Familiar with GPU architecture and RISC-V instruction set, and has participated in the [One Life, One Core](https://ysyx.oscc.cc/) project. | ||
* **Other:** Understanding of serverless computing architecture, experience in applying federated learning in network security, involvement in both front-end and back-end development, familiarity with Linux operating system and operation and maintenance ( maintained a large server cluster with 21 nodes in the GOOD LAB). Additionally, has researched computer vision and reinforcement learning in the field of artificial intelligence. | ||
* **Other:** Understanding of serverless computing architecture, experience in applying federated learning in network security, involvement in both front-end and back-end development, familiarity with Linux operating system and operation and maintenance ( maintained a large server cluster with 21 nodes in the GOOD LAB). Additionally, has researched computer vision and reinforcement learning in the field of artificial intelligence. --> |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
<!-- <a href="https://github.com/OpenBMB/MiniCPM"><img src="https://github-readme-stats.vercel.app/api/pin/?username=OpenBMB&repo=MiniCPM" alt="Readme Card" /></a></p><p><a href="https://github.com/thunlp/OpenDelta"><img src="https://github-readme-stats.vercel.app/api/pin/?username=thunlp&repo=OpenDelta" alt="Readme Card" /></a></p><p><a href="https://github.com/thunlp/OpenPrompt"><img src="https://github-readme-stats.vercel.app/api/pin/?username=thunlp&repo=OpenPrompt" alt="Readme Card" /></a></p><p><a href="https://github.com/thunlp/PromptPapers"><img src="https://github-readme-stats.vercel.app/api/pin/?username=thunlp&repo=PromptPapers" alt="Readme Card" /></a></p><p><a href="https://github.com/OpenBMB/InfiniteBench"><img src="https://github-readme-stats.vercel.app/api/pin/?username=OpenBMB&repo=InfiniteBench" alt="Readme Card" /></a></p><p><a href="https://github.com/OpenBMB/BMPrinciples"><img src="https://github-readme-stats.vercel.app/api/pin/?username=OpenBMB&repo=BMPrinciples" alt="Readme Card" /></a></p> --> | ||
|
||
<div style="display: flex; flex-wrap: wrap; justify-content: space-around;"> | ||
<a href="https://github.com/OpenBMB/MiniCPM"><img src="https://github-readme-stats.vercel.app/api/pin/?username=OpenBMB&repo=MiniCPM" alt="Readme Card" /></a> | ||
<a href="https://github.com/OpenBMB/BMTrain"><img src="https://github-readme-stats.vercel.app/api/pin/?username=OpenBMB&repo=BMTrain" alt="Readme Card" /></a> | ||
<a href="https://github.com/CGCL-codes/naturalcc"><img src="https://github-readme-stats.vercel.app/api/pin/?username=CGCL-codes&repo=naturalcc" alt="Readme Card" /></a> | ||
<a href="https://github.com/JerryYin777/FPGA_Competition-RISC-V_Processor-in-PGL22G"><img src="https://github-readme-stats.vercel.app/api/pin/?username=JerryYin777&repo=FPGA_Competition-RISC-V_Processor-in-PGL22G" alt="Readme Card" /></a> | ||
<a href="https://github.com/JerryYin777/PaperHelper"><img src="https://github-readme-stats.vercel.app/api/pin/?username=JerryYin777&repo=PaperHelper" alt="Readme Card" /></a> | ||
<a href="https://github.com/JerryYin777/Cross-Layer-Attention"><img src="https://github-readme-stats.vercel.app/api/pin/?username=JerryYin777&repo=Cross-Layer-Attention" alt="Readme Card" /></a> | ||
<a href="https://github.com/JerryYin777/ASC22-Yuan"><img src="https://github-readme-stats.vercel.app/api/pin/?username=NCUSCC&repo=ASC22-Yuan" alt="Readme Card" /></a> | ||
<a href="https://github.com/JerryYin777/NanoGPT-Pytorch2.0-Implementation"><img src="https://github-readme-stats.vercel.app/api/pin/?username=JerryYin777&repo=NanoGPT-Pytorch2.0-Implementation" alt="Readme Card" /></a> | ||
<a href="https://github.com/JerryYin777/Cr_Research_Toolchain"><img src="https://github-readme-stats.vercel.app/api/pin/?username=JerryYin777&repo=Cr_Research_Toolchain" alt="Readme Card" /></a> | ||
|
||
|
||
</div> | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,7 @@ | ||
For full paper list (not now, but I'm sure there will be more great work in the future), please refer to my [Google Scholar](https://scholar.google.com/citations?user=7gsdLw4AAAAJ&hl=en) | ||
|
||
- *X. Gao, W. Zhu, J. Gao and <strong><strong>C. Yin</strong></strong>. (2023). F-PABEE: Flexible-Patience-Based Early Exiting For Single-Label and Multi-Label Text Classification Tasks. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).* [[Paper]](https://ieeexplore.ieee.org/abstract/document/10095864) | ||
- *W. Zhu, Y. Ni, <strong><strong>C. Yin</strong></strong>, A. Tian, X. Wang, G. Xie. (2024). IAPT: Instance-Aware Prompt Tuning for Large Language Models. The 62nd Annual Meeting of the Association for Computational Linguistics <strong><strong>(ACL 2024)</strong></strong>.* [[Paper]](https://arxiv.org/pdf/2405.18203) | ||
|
||
- *<strong><strong>C. Yin</strong></strong>. (2023). Multi-scale and multi-task learning for human audio forensics based on convolutional networks. International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023).* [[Paper]](https://doi.org/10.1117/12.2681344) | ||
- *X. Gao, W. Zhu, J. Gao and <strong><strong>C. Yin</strong></strong>. (2023). F-PABEE: Flexible-Patience-Based Early Exiting For Single-Label and Multi-Label Text Classification Tasks. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing <strong><strong>(ICASSP 2023)</strong></strong>.* [[Paper]](https://ieeexplore.ieee.org/abstract/document/10095864) | ||
|
||
- *<strong><strong>C. Yin</strong></strong>. (2023). Multi-scale and multi-task learning for human audio forensics based on convolutional networks. International Conference on Image, Signal Processing, and Pattern Recognition <strong><strong>(ISPP 2023)</strong></strong>.* [[Paper]](https://doi.org/10.1117/12.2681344) |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
|
||
- Reviewer for EMNLP'2024 | ||
- Reviewer for ACL'2024 |
Binary file not shown.
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -2,7 +2,7 @@ | |
|
||
|
||
:root{ | ||
--h-title-color:#3948d2; | ||
--h-title-color:#000000; | ||
} | ||
|
||
.header { | ||
|