[Mthreads] support llama 13B train #9666
Thanks for your contribution!
llm/mthreads/llama/README.md
Outdated
| ```
|
| ### (2) Training:
| 1. Multi-node, multi-GPU inference
This is training, not inference, right?

Fixed.
|
| Run the following command to perform inference:
| ```bash
| bash run_dist.sh 10.10.10.123 # assuming the master IP is 10.10.10.123; run this command on each node
Please state which directory the run_dist.sh script should be run from; users may not be able to find it.

Added a command to enter the directory.
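The directory step mentioned in the reply above could look like the following sketch. It assumes `run_dist.sh` sits next to the README under `llm/mthreads/llama` in a PaddleNLP checkout; that location, the `PADDLENLP_ROOT` variable, and the `launch_cmd` helper are illustrative assumptions, not taken from the PR. The helper only prints the command, so the invocation can be checked on each node before it is actually run.

```bash
#!/usr/bin/env bash
# Hypothetical sketch: build the per-node launch command for run_dist.sh.
# Assumption: the script lives in llm/mthreads/llama of the PaddleNLP checkout.
PADDLENLP_ROOT="${PADDLENLP_ROOT:-PaddleNLP}"   # assumed checkout location

launch_cmd() {
    # $1 is the master node IP (10.10.10.123 in the README example).
    local master_ip="$1"
    echo "cd ${PADDLENLP_ROOT}/llm/mthreads/llama && bash run_dist.sh ${master_ip}"
}

# Print the command to run on every node.
launch_cmd 10.10.10.123
```

Every node passes the same master IP, which matches the README comment about running the command on each node.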
| 3. Install Paddle
| ```
| # PaddlePaddle (飞桨) deep learning framework, which provides the underlying compute capabilities
| git clone git@github.com:PaddlePaddle/Paddle.git -b release-musa/2.6
Has all the Paddle code this depends on been merged into this official branch? Can this branch be run as-is?

Yes, all of this code has been merged.
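| 4. Clone the PaddleNLP repository and install its dependencies
| ```
| # PaddleNLP is a natural language processing and large language model (LLM) development library built on PaddlePaddle (飞桨). It hosts a variety of large models implemented on the PaddlePaddle framework, including llama2-13B. To make PaddleNLP easier to use, you need to clone the entire repository.
| git clone git@github.com:shang-mt/PaddleNLP.git -b mthreads-llama-13B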
You could open a PR to PaddleNLP from this branch and leave it unmerged, just to keep it on record.

#9193
That PR was submitted earlier, and it can be run directly.
llm/mthreads/llama/README.md
Outdated
| @@ -0,0 +1,77 @@
| ## 🚣♂️ Running the llama2-13b model on MTT S4000 with PaddleNLP 🚣
Suggested change:
| - ## 🚣♂️ Running the llama2-13b model on MTT S4000 with PaddleNLP 🚣
| + ## 🚣♂️ Running llama2-13b model pre-training on MTT S4000 with PaddleNLP 🚣
We do support pre-training, right?

Yes.
Codecov Report
All modified and coverable lines are covered by tests ✅
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #9666      +/-   ##
===========================================
+ Coverage    52.56%   52.79%   +0.23%
===========================================
  Files          721      718       -3
  Lines       116012   112241    -3771
===========================================
- Hits         60976    59262    -1714
+ Misses       55036    52979    -2057
ZHUI left a comment
LGTM