Merge pull request #1 from JerryYin777/dev
Dev
JerryYin777 authored Aug 9, 2024
2 parents 0092dca + 1f1f5c7 commit 3394807
Showing 11 changed files with 108 additions and 25 deletions.
4 changes: 2 additions & 2 deletions contents/awards.md
@@ -1,7 +1,7 @@

- **Dean's List (All Semester)** at University of Minnesota Twin Cities.



- Special Corporate Scholarships, 2023. **(1/30)**

- School Special Academic Scholarship, 2023. **(1%)**

8 changes: 4 additions & 4 deletions contents/config.yml
@@ -1,6 +1,6 @@

title: Congrui Yin's Homepage
page-top-title: Congrui Yin (殷骢睿)
top-section-bg-text: An LLM Designer and Efficient Machine Learning System Nerd
home-subtitle: Congrui Yin | 殷骢睿
copyright-text: '© Congrui Yin 2023. All Rights Reserved.'
page-top-title: Jerry (Congrui) Yin's Homepage
top-section-bg-text: An NLP & MLSys Nerd
home-subtitle: Jerry (Congrui) Yin (殷骢睿)
copyright-text: '© Congrui Yin 2024. All Rights Reserved.'
49 changes: 39 additions & 10 deletions contents/home.md
@@ -1,27 +1,56 @@

#### News

<strong style="color:red;"><strong>I’m actively applying for an MLSys Ph.D. position for Fall 2025! </strong></strong> If you need a student who is familiar with <strong style="color:red;"><strong>both NLP and computer systems </strong></strong> and has <strong style="color:red;"><strong>extensive industry experience</strong></strong>, feel free to <a href="#contact-info">Contact Me</a>!


#### Biography
I am currently a junior Undergraduate Student pursuing a bachelor's Degree in computer science at College of Liberal Arts, University of Minnesota Twins Cities.
I am currently a senior undergraduate student pursuing a bachelor's degree in computer science at the College of Liberal Arts, University of Minnesota Twin Cities, supervised by Prof. [Zirui Liu](https://zirui-ray-liu.github.io/). In the summer of 2023, I visited [TsinghuaNLP](https://github.com/thunlp) and conducted research under Prof. [Zhiyuan Liu](https://nlp.csai.tsinghua.edu.cn/~lzy/).

I have experience in NLP and computer systems (both architecture and high-performance machine learning systems), along with extensive industry research internship experience. This includes:

* Participating in the pretraining of the Yi-Large model at 01.AI.
* Contributing to the ML infrastructure for pretraining the on-device small model [MiniCPM-2B](https://huggingface.co/collections/openbmb/minicpm-2b-65d48bf958302b9fd25b698f) at ModelBest (with TsinghuaNLP).
* Participating in the finetuning of the CodeLLM [Raccoon](https://raccoon.sensetime.com/code) (Copilot-like) at SenseTime (with CUHK MMLab).

#### Research Interests

My current passion revolves around building **EFFICIENT** system solutions for AGI and LLM (VLM) for **RELIABLE** hardware design. This includes:

1. Machine Learning System
* Training: designing more effective training systems and algorithms; examples include [BMTrain](https://github.com/OpenBMB/BMTrain).
* Quantization (e.g. attempting to finetune Llama 3.1 405B on a single A100 80GB GPU); another example is [IAPT](https://arxiv.org/pdf/2405.18203).
* Long-context inference: examples include [Cross Layer Attention](https://github.com/JerryYin777/Cross-Layer-Attention).
2. LLM(VLM) for RELIABLE Hardware Design
* Synthesize the common knowledge of CodeLLM pretraining and finetuning, exploring the capability boundaries of LLM/VLM for hardware design (e.g. pretraining/finetuning a VerilogLLM).
* Align the simulation code with the waveform image data to finetune a VerilogVLM.

#### Misc

Before transferring to the University of Minnesota, I studied at Nanchang University, majoring in Artificial Intelligence in a top-tier class with a School Academic Special Scholarship. I was the leader of the Nanchang University Supercomputer Cluster Team ([NCUSCC](https://ncuscc.github.io/)), with experience in ASC22 and SC23 (IndySCC).

I am passionate about open source and firmly believe in its potential to disseminate knowledge widely, drive innovation through technology, and contribute to the advancement of human society. I am proud to have garnered over **1k stars** and over **250 followers** on [GitHub](https://github.com/JerryYin777). I occasionally share my explorations in machine learning systems and LLMs on [Zhihu](https://www.zhihu.com/people/ycr222/posts) in Mandarin.

#### Contact<p id="contact-info"></p>

My research interests lie in Large Multimodal Models (LMMs) and their application in diverse practical scenarios, such as biological and system large models. My focus also extends to developing efficient machine learning systems aimed at expediting the training and inference processing of LMMs (especially LLMs), leveraging expertise in high-performance computing and distributed systems.
✉️ [yin00486 [at] umn.edu](mailto:[email protected])

Before transferring to the University of Minnesota, I studied at Nanchang University, majoring in Artificial Intelligence in a top-tier class with a School Academic Special Scholarship. I was honored to be advised by Professor [Zichen Xu](https://good.ncu.edu.cn/Pages/Professor.html) at [GOOD LAB](https://good.ncu.edu.cn) starting from March 2022, where my focus was on solving data-centric challenges and building efficient and reliable systems. I was the leader of Nanchang University Supercomputer Cluster Team ([NCUSCC](https://hpc.ncuscc.tech/)) Leader, with experience of ASC22 and SC23(IndySCC).
<!-- Before transferring to the University of Minnesota, I studied at Nanchang University, majoring in Artificial Intelligence in a top-tier class with a School Academic Special Scholarship. I was honored to be advised by Professor [Zichen Xu](https://good.ncu.edu.cn/Pages/Professor.html) at [GOOD LAB](https://good.ncu.edu.cn) starting from March 2022, where my focus was on solving data-centric challenges and building efficient and reliable systems. I was the leader of Nanchang University Supercomputer Cluster Team ([NCUSCC](https://hpc.ncuscc.tech/)) Leader, with experience of ASC22 and SC23(IndySCC). -->

I was also fortunately recruited as a research assistant at **TOP** NLP Lab [TsinghuaNLP](https://github.com/thunlp) in Beijing from July to September 2023, advised by Professor [Zhiyuan Liu](https://nlp.csai.tsinghua.edu.cn/~lzy/), trying to build efficient distributed large language model training framework [BMTrain](https://github.com/OpenBMB/BMTrain) and Develop 10B Chinese LLM [CPM-Bee](https://github.com/OpenBMB/CPM-Bee/blob/main/README_en.md).
<!-- I was also fortunately recruited as a research assistant at [TsinghuaNLP](https://github.com/thunlp) in Beijing from July to September 2023, advised by Professor [Zhiyuan Liu](https://nlp.csai.tsinghua.edu.cn/~lzy/), [Weilin Zhao](https://achazwl.github.io/) and [Xu Han](https://scholar.google.com/citations?user=rz4rOSMAAAAJ&hl=zh-CN), trying to build efficient distributed large language model training framework [BMTrain](https://github.com/OpenBMB/BMTrain) and Develop 2B on-device Chinese LLM [MiniCPM](https://huggingface.co/collections/openbmb/minicpm-2b-65d48bf958302b9fd25b698f) at ModelBest (面壁智能). I also interned at 01.AI (零一万物) and SenseTime Research (商汤研究院) as algorithm intern. -->

I am passionate about open source and firmly believe in its potential to disseminate knowledge widely, leverage technology to lead innovation to the world and contribute to the advancement of human society. I am proud to have garnered over **1000 stars** and acquired **155 followers** on GitHub. It is gratifying to know that my open-source projects have benefitted numerous individuals, and I have personally gained valuable knowledge from the open-source community.
<!-- I am passionate about open source and firmly believe in its potential to disseminate knowledge widely, leverage technology to lead innovation to the world and contribute to the advancement of human society. I am proud to have garnered over **1000 stars** and acquired over**200 followers** on GitHub. It is gratifying to know that my open-source projects have benefitted numerous individuals, and I have personally gained valuable knowledge from the open-source community. -->

#### Contact
* Github: [JerryYin777](https://github.com/JerryYin777)
<!-- #### Contact
* Email: yin00486 [at] umn.edu
* Google Scholar:[Congrui Yin's Google Scholar](https://scholar.google.com/citations?hl=en&user=7gsdLw4AAAAJ)
#### Education
2023.12 - Present, Computer Science, College of Liberal Arts, University of Minnesota Twin Cities.
2021.09 - 2023.12, Artificial Intelligence, School of Information Engineering, Nanchang University.
#### Research Interests
Natural Language Processing, Machine Learning System, Human Computer Interaction.
Natural Language Processing, Machine Learning System.
#### Skills
* **Natural Language Processing:** Proficient in using the PyTorch framework, with the ability to reproduce mainstream large-scale models in the industry (such as Baichuan, llama2, Qwen). Proficient in using quantization and inference tools such as QLoRA, vLLM, and skilled in distributed parallel training (using training tools such as BMTrain, DeepSpeed).
@@ -30,4 +59,4 @@ Natural Language Processing, Machine Learning System, Human Computer Interaction
* **Computer System Architecture:** Familiar with GPU architecture and RISC-V instruction set, and has participated in the [One Life, One Core](https://ysyx.oscc.cc/) project.
* **Other:** Understanding of serverless computing architecture, experience in applying federated learning in network security, involvement in both front-end and back-end development, familiarity with Linux operating system and operation and maintenance ( maintained a large server cluster with 21 nodes in the GOOD LAB). Additionally, has researched computer vision and reinforcement learning in the field of artificial intelligence.
* **Other:** Understanding of serverless computing architecture, experience in applying federated learning in network security, involvement in both front-end and back-end development, familiarity with Linux operating system and operation and maintenance ( maintained a large server cluster with 21 nodes in the GOOD LAB). Additionally, has researched computer vision and reinforcement learning in the field of artificial intelligence. -->
16 changes: 16 additions & 0 deletions contents/project.md
@@ -0,0 +1,16 @@
<!-- <a href="https://github.com/OpenBMB/MiniCPM"><img src="https://github-readme-stats.vercel.app/api/pin/?username=OpenBMB&amp;repo=MiniCPM" alt="Readme Card" /></a></p><p><a href="https://github.com/thunlp/OpenDelta"><img src="https://github-readme-stats.vercel.app/api/pin/?username=thunlp&amp;repo=OpenDelta" alt="Readme Card" /></a></p><p><a href="https://github.com/thunlp/OpenPrompt"><img src="https://github-readme-stats.vercel.app/api/pin/?username=thunlp&amp;repo=OpenPrompt" alt="Readme Card" /></a></p><p><a href="https://github.com/thunlp/PromptPapers"><img src="https://github-readme-stats.vercel.app/api/pin/?username=thunlp&amp;repo=PromptPapers" alt="Readme Card" /></a></p><p><a href="https://github.com/OpenBMB/InfiniteBench"><img src="https://github-readme-stats.vercel.app/api/pin/?username=OpenBMB&amp;repo=InfiniteBench" alt="Readme Card" /></a></p><p><a href="https://github.com/OpenBMB/BMPrinciples"><img src="https://github-readme-stats.vercel.app/api/pin/?username=OpenBMB&amp;repo=BMPrinciples" alt="Readme Card" /></a></p> -->

<div style="display: flex; flex-wrap: wrap; justify-content: space-around;">
<a href="https://github.com/OpenBMB/MiniCPM"><img src="https://github-readme-stats.vercel.app/api/pin/?username=OpenBMB&amp;repo=MiniCPM" alt="Readme Card" /></a>
<a href="https://github.com/OpenBMB/BMTrain"><img src="https://github-readme-stats.vercel.app/api/pin/?username=OpenBMB&amp;repo=BMTrain" alt="Readme Card" /></a>
<a href="https://github.com/CGCL-codes/naturalcc"><img src="https://github-readme-stats.vercel.app/api/pin/?username=CGCL-codes&amp;repo=naturalcc" alt="Readme Card" /></a>
<a href="https://github.com/JerryYin777/FPGA_Competition-RISC-V_Processor-in-PGL22G"><img src="https://github-readme-stats.vercel.app/api/pin/?username=JerryYin777&amp;repo=FPGA_Competition-RISC-V_Processor-in-PGL22G" alt="Readme Card" /></a>
<a href="https://github.com/JerryYin777/PaperHelper"><img src="https://github-readme-stats.vercel.app/api/pin/?username=JerryYin777&amp;repo=PaperHelper" alt="Readme Card" /></a>
<a href="https://github.com/JerryYin777/Cross-Layer-Attention"><img src="https://github-readme-stats.vercel.app/api/pin/?username=JerryYin777&amp;repo=Cross-Layer-Attention" alt="Readme Card" /></a>
<a href="https://github.com/JerryYin777/ASC22-Yuan"><img src="https://github-readme-stats.vercel.app/api/pin/?username=NCUSCC&amp;repo=ASC22-Yuan" alt="Readme Card" /></a>
<a href="https://github.com/JerryYin777/NanoGPT-Pytorch2.0-Implementation"><img src="https://github-readme-stats.vercel.app/api/pin/?username=JerryYin777&amp;repo=NanoGPT-Pytorch2.0-Implementation" alt="Readme Card" /></a>
<a href="https://github.com/JerryYin777/Cr_Research_Toolchain"><img src="https://github-readme-stats.vercel.app/api/pin/?username=JerryYin777&amp;repo=Cr_Research_Toolchain" alt="Readme Card" /></a>


</div>

7 changes: 5 additions & 2 deletions contents/publications.md
@@ -1,4 +1,7 @@
For the full paper list (not long yet, but I'm sure there will be more great work in the future), please refer to my [Google Scholar](https://scholar.google.com/citations?user=7gsdLw4AAAAJ&hl=en).

- *X. Gao, W. Zhu, J. Gao and <strong><strong>C. Yin</strong></strong>. (2023). F-PABEE: Flexible-Patience-Based Early Exiting For Single-Label and Multi-Label Text Classification Tasks. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).* [[Paper]](https://ieeexplore.ieee.org/abstract/document/10095864)
- *W. Zhu, Y. Ni, <strong><strong>C. Yin</strong></strong>, A. Tian, X. Wang, G. Xie. (2024). IAPT: Instance-Aware Prompt Tuning for Large Language Models. The 62nd Annual Meeting of the Association for Computational Linguistics <strong><strong>(ACL 2024)</strong></strong>.* [[Paper]](https://arxiv.org/pdf/2405.18203)

- *<strong><strong>C. Yin</strong></strong>. (2023). Multi-scale and multi-task learning for human audio forensics based on convolutional networks. International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2023).* [[Paper]](https://doi.org/10.1117/12.2681344)
- *X. Gao, W. Zhu, J. Gao and <strong><strong>C. Yin</strong></strong>. (2023). F-PABEE: Flexible-Patience-Based Early Exiting For Single-Label and Multi-Label Text Classification Tasks. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing <strong><strong>(ICASSP 2023)</strong></strong>.* [[Paper]](https://ieeexplore.ieee.org/abstract/document/10095864)

- *<strong><strong>C. Yin</strong></strong>. (2023). Multi-scale and multi-task learning for human audio forensics based on convolutional networks. International Conference on Image, Signal Processing, and Pattern Recognition <strong><strong>(ISPP 2023)</strong></strong>.* [[Paper]](https://doi.org/10.1117/12.2681344)
3 changes: 3 additions & 0 deletions contents/service.md
@@ -0,0 +1,3 @@

- Reviewer for EMNLP'2024
- Reviewer for ACL'2024
42 changes: 37 additions & 5 deletions index.html
@@ -3,7 +3,7 @@

<head>
<meta charset="utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1, shrink-to-fit=no" />
<meta name="viewport" content="width=device-width, initial-scale=1, shrink-to-fit=no" />
<meta name="description" content="" />
<meta name="author" content="" />
<title id="title"></title>
@@ -44,9 +44,16 @@
<li class="nav-item">
<a class="nav-link me-lg-3" href="#publications">PUBLICATIONS</a>
</li>
<li class="nav-item">
<a class="nav-link me-lg-3" href="#project">SIDE PROJECTS</a>
</li>
<li class="nav-item">
<a class="nav-link me-lg-3" href="#awards">AWARDS</a>
</li>
<li class="nav-item">
<a class="nav-link me-lg-3" href="#service">SERVICE</a>
</li>

<li class="nav-item">
<a class="nav-link me-lg-3" href="docs/Congrui_Yin_CV.pdf">CV</a>
</li>
@@ -59,10 +66,10 @@
<!-- Top Section -->
<section class="top-section" style="background-image: url('static/assets/img/MIT.png');">
<div class="top-section-content">
<div class="container px-5">
<div class="container px-1">
<div class="row">
<div class="col-lg-8">
<h2 id="top-section-bg-text" class="text-white display-3 lh-1 mb-4 font-alt"></h2>
<h2 id="top-section-bg-text" class="text-light display-3 lh-1 mb-4 font-alt"></h2>
</div>
<div class="col-lg-4">
<div class="user-photo photo"><img class="shadow"
@@ -101,18 +108,43 @@ <h2 id="publications-subtitle"><i class="bi bi-file-text-fill"></i>&nbsp;PUBLICA
</section>
<!-- Publications -->

<!-- Side Project -->
<section class="bg-gradient-primary-to-secondary-light mt5 md5" id="project">
<div class="container px-5">
<header>
<!-- <h2 id="achievements-subtitle"><i class="bi bi-trophy-fill"></i>&nbsp;ACHIEVEMENTS</h2> -->
<h2 id="achievements-subtitle"><i class="bi bi-code-slash"></i>&nbsp; SIDE PROJECTS</h2>
</header>
<div class="main-body" id="project-md">
</div>
</div>
</section>
<!-- Side Project -->

<!-- Awards -->
<section class="bg-gradient-primary-to-secondary-light mt5 md5" id="awards">
<section class="bg-gradient-primary-to-secondary-gray mt5 md5" id="awards">
<div class="container px-5">
<header>
<h2 id="awards-subtitle"><i class="bi bi-award-fill"></i>AWARDS </h2>
<h2 id="awards-subtitle"><i class="bi bi-award-fill"></i>&nbsp;AWARDS </h2>
</header>
<div class="main-body" id="awards-md"></div>

</div>
</section>
<!-- Awards -->

<!-- Service -->
<section class="bg-gradient-primary-to-secondary-light mt5 md5" id="service">
<div class="container px-5">
<header>
<h2 id="awards-subtitle"><i class="bi bi-chat-left-text"></i>&nbsp;SERVICE </h2>
</header>
<div class="main-body" id="service-md"></div>
</div>
</section>
<!-- Service -->



<!-- Footer-->
<footer class="bg-bottom text-center py-5">
Binary file removed static/assets/img/Jerry.jpg
Binary file not shown.
Binary file modified static/assets/img/photo.jpg
2 changes: 1 addition & 1 deletion static/css/main.css
@@ -2,7 +2,7 @@


:root{
--h-title-color:#3948d2;
--h-title-color:#000000;
}

.header {
2 changes: 1 addition & 1 deletion static/js/scripts.js
@@ -2,7 +2,7 @@

const content_dir = 'contents/'
const config_file = 'config.yml'
const section_names = ['home', 'publications', 'awards']
const section_names = ['home', 'publications', 'awards', 'project', 'service']


window.addEventListener('DOMContentLoaded', event => {
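
The `section_names` change adds `project` and `service`, which line up with the new `contents/project.md` and `contents/service.md` files and with the `project-md` and `service-md` containers in `index.html`. The rest of `scripts.js` is not shown in this diff, so the following is only a sketch of how such a mapping might look, assuming each section name corresponds to `contents/<name>.md` and to an element with id `<name>-md`. `sectionTargets` is a hypothetical helper, not a function from the repository.

```javascript
// Values copied from the diff above.
const content_dir = 'contents/';
const section_names = ['home', 'publications', 'awards', 'project', 'service'];

// Hypothetical helper (not in the repository): derive, for each section,
// the Markdown file to load and the id of the element expected to hold it.
function sectionTargets(names, dir) {
  return names.map((name) => ({
    name,
    url: `${dir}${name}.md`,   // e.g. contents/project.md
    elementId: `${name}-md`,   // e.g. project-md in index.html
  }));
}

// List the pairs so a missing file or container is easy to spot.
for (const target of sectionTargets(section_names, content_dir)) {
  console.log(`${target.url} -> #${target.elementId}`);
}
```

Under that assumption, a name added to the array without both a matching Markdown file and a matching container would likely render nothing, which may be why this commit touches all three places together.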
