这个需要多大得显存可以跑起来RTX4090 24G可以吗 #11

anstonjie · 2024-08-17T01:02:06Z

System Info / 系統信息

这个需要多大得显存可以跑起来RTX4090 24G可以吗

Who can help? / 谁可以帮助到您？

No response

Information / 问题信息

The official example scripts / 官方的示例脚本
My own modified scripts / 我自己修改的脚本和任务

Reproduction / 复现过程

这个需要多大得显存可以跑起来RTX4090 24G可以吗

Expected behavior / 期待表现

这个需要多大得显存可以跑起来RTX4090 24G可以吗

bys0318 · 2024-08-17T14:30:20Z

你好，我这边用H800是可以在24g显存之内完成1w字生成的

bys0318 · 2024-08-17T14:30:34Z

显存占用大概在20g左右

anstonjie · 2024-08-18T12:51:36Z

你好，我的RTX 4090 16G的，试了可以运行，但是每运行一次输出，显存就增高一些，输出三次左右显存就爆了，这个是什么原因，是不是有历史记录的原因

…

---- 回复的原邮件 ---- | 发件人 | Yushi ***@***.***> | | 日期 | 2024年08月17日 22:30 | | 收件人 | ***@***.***> | | 抄送至 | ***@***.***>***@***.***> | | 主题 | Re: [THUDM/LongWriter] 这个需要多大得显存可以跑起来RTX4090 24G可以吗 (Issue #11) | 显存占用大概在20g左右 — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: ***@***.***>

anstonjie · 2024-08-18T12:53:16Z

第一次输出差不多8000多字，第二次9000多字，第三次也差不多8000多字，第四次的时候显存就满了，然后字马上变少

…

---- 回复的原邮件 ---- | 发件人 | Yushi ***@***.***> | | 日期 | 2024年08月17日 22:30 | | 收件人 | ***@***.***> | | 抄送至 | ***@***.***>***@***.***> | | 主题 | Re: [THUDM/LongWriter] 这个需要多大得显存可以跑起来RTX4090 24G可以吗 (Issue #11) | 显存占用大概在20g左右 — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: ***@***.***>

bys0318 · 2024-08-18T16:04:25Z

请试试在每次生成后加一下这行代码，释放未使用的显存：

torch.cuda.empty_cache()

anstonjie · 2024-08-18T16:23:38Z

好的，我试试

…

---- 回复的原邮件 ---- | 发件人 | Yushi ***@***.***> | | 日期 | 2024年08月19日 00:04 | | 收件人 | ***@***.***> | | 抄送至 | ***@***.***>***@***.***> | | 主题 | Re: [THUDM/LongWriter] 这个需要多大得显存可以跑起来RTX4090 24G可以吗 (Issue #11) | 请试试在每次生成后加一下 torch.cuda.empty_cache() — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: ***@***.***>

allenxml · 2024-08-23T16:28:08Z

在ollama中导入q4的gguf格式模型，在openwebui中提问，输出速度很慢，ollama主机4060ti显存你8G，显卡核心频率经常在210，很少到最大频率，7950x的CPU占用率50％。

rrgkGitHub · 2024-08-23T20:32:30Z

用bitsandbytes 4bit量化加载模型后，12G显存可以运行。速度中等偏慢，但可以接受。

mrpen2 · 2024-08-28T10:05:16Z

用vllm加速的话需要多少显存？

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

这个需要多大得显存可以跑起来RTX4090 24G可以吗 #11

这个需要多大得显存可以跑起来RTX4090 24G可以吗 #11

anstonjie commented Aug 17, 2024

bys0318 commented Aug 17, 2024

bys0318 commented Aug 17, 2024

anstonjie commented Aug 18, 2024 via email

anstonjie commented Aug 18, 2024 via email

bys0318 commented Aug 18, 2024 •

edited

Loading

anstonjie commented Aug 18, 2024 via email

allenxml commented Aug 23, 2024

rrgkGitHub commented Aug 23, 2024

mrpen2 commented Aug 28, 2024

这个需要多大得显存可以跑起来RTX4090 24G可以吗 #11

这个需要多大得显存可以跑起来RTX4090 24G可以吗 #11

Comments

anstonjie commented Aug 17, 2024

System Info / 系統信息

Who can help? / 谁可以帮助到您？

Information / 问题信息

Reproduction / 复现过程

Expected behavior / 期待表现

bys0318 commented Aug 17, 2024

bys0318 commented Aug 17, 2024

anstonjie commented Aug 18, 2024 via email

anstonjie commented Aug 18, 2024 via email

bys0318 commented Aug 18, 2024 • edited Loading

anstonjie commented Aug 18, 2024 via email

allenxml commented Aug 23, 2024

rrgkGitHub commented Aug 23, 2024

mrpen2 commented Aug 28, 2024

bys0318 commented Aug 18, 2024 •

edited

Loading