Update README.md
Browse files
README.md
CHANGED
|
@@ -6,9 +6,11 @@ tags:
|
|
| 6 |
- unsloth
|
| 7 |
- llama
|
| 8 |
- trl
|
| 9 |
-
license: llama3.
|
| 10 |
language:
|
| 11 |
- en
|
|
|
|
|
|
|
| 12 |
---
|
| 13 |
|
| 14 |
# ReasoningCore-Llama-3B-R1-aligned
|
|
@@ -112,6 +114,7 @@ Please use "Please reason step by step, and put your final answer within \boxed{
|
|
| 112 |
---
|
| 113 |
|
| 114 |
### Safety Fine‑Tuning & Data Strategy
|
|
|
|
| 115 |
|
| 116 |
#### Objectives:
|
| 117 |
- Provide a reliable tool for building secure and helpful reasoning systems.
|
|
@@ -164,7 +167,7 @@ Please use "Please reason step by step, and put your final answer within \boxed{
|
|
| 164 |
|
| 165 |
### Conclusion
|
| 166 |
|
| 167 |
-
**ReasoningCore
|
| 168 |
|
| 169 |
For further details, questions, or feedback, please email [email protected]
|
| 170 |
|
|
|
|
| 6 |
- unsloth
|
| 7 |
- llama
|
| 8 |
- trl
|
| 9 |
+
license: llama3.2
|
| 10 |
language:
|
| 11 |
- en
|
| 12 |
+
datasets:
|
| 13 |
+
- gretelai/gretel-safety-alignment-en-v1
|
| 14 |
---
|
| 15 |
|
| 16 |
# ReasoningCore-Llama-3B-R1-aligned
|
|
|
|
| 114 |
---
|
| 115 |
|
| 116 |
### Safety Fine‑Tuning & Data Strategy
|
| 117 |
+
SFT fine tuning wit unsloth with dataset
|
| 118 |
|
| 119 |
#### Objectives:
|
| 120 |
- Provide a reliable tool for building secure and helpful reasoning systems.
|
|
|
|
| 167 |
|
| 168 |
### Conclusion
|
| 169 |
|
| 170 |
+
**ReasoningCore-Llama-3B-R1-aligned** represents a significant advancement in multilingual, reasoning‑enhanced language models. Optimized for tasks requiring deep reasoning, contextual understanding, and safe, helpful interactions, it offers a powerful tool for both commercial and research applications. We invite developers and researchers to explore its capabilities and contribute to building secure, innovative AI systems.
|
| 171 |
|
| 172 |
For further details, questions, or feedback, please email [email protected]
|
| 173 |
|