Skip to content

[docs - WIP]: llama3 and Qwen 2.5 CPT and pre training recipes#80

Merged
tscholak merged 12 commits into
mainfrom
raymond/llama-recipe
Mar 5, 2025
Merged

[docs - WIP]: llama3 and Qwen 2.5 CPT and pre training recipes#80
tscholak merged 12 commits into
mainfrom
raymond/llama-recipe

Conversation

@RaymondLi0
Copy link
Copy Markdown
Contributor

@RaymondLi0 RaymondLi0 commented Dec 2, 2024

✨ Description

Closes #70

🔍 Type of change

Select all that apply:

  • 🐛 Bug fix (non-breaking change that addresses a specific issue)
  • 🚀 New feature (non-breaking change that adds functionality)
  • ⚠️ Breaking change (a change that could affect existing functionality)
  • 📈 Performance improvement/optimization (improves speed, memory usage, or efficiency)
  • 🛠️ Code refactor (non-functional changes that improve code readability, structure, etc.)
  • 📦 Dependency bump (updates dependencies, including Dockerfile or package changes)
  • 📝 Documentation change (updates documentation, including new content or typo fixes)
  • 🔧 Infrastructure/Build change (affects build process, CI/CD, or dependencies)

Comment thread docs/recipes/train-llama-8b.md Outdated
@tscholak tscholak mentioned this pull request Feb 27, 2025
@bigximik bigximik changed the title [docs - WIP]: llama3 CPT recipe [docs - WIP]: llama3 and Qwen 2.5 CPT and pre training recipes Mar 4, 2025
Copy link
Copy Markdown
Collaborator

@tscholak tscholak left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Excellent work, thanks @RaymondLi0 and @bigximik!

@tscholak tscholak merged commit 33cd664 into main Mar 5, 2025
@tscholak tscholak deleted the raymond/llama-recipe branch March 5, 2025 15:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[docs] Llama 3.1 8B continual pretraining recipe

3 participants