---
license: cc-by-nc-4.0
language:
- en
base_model:
- redmint/studybuddy-ai
new_version: redmint/studybuddy-V2
pipeline_tag: text-generation
library_name: adapter-transformers
tags:
- art
- education
- love
---

# StudyBuddy-AI v2 (LoRA Fine-Tuned Upgrade)

This model is an upgraded version of **redmint/studybuddy-ai**, fine-tuned with LoRA adapters to improve **instruction following, truthfulness, and multi-step reasoning performance**.

## What This Version Improves

* Stronger multi-step reasoning, trained on a 500-example custom reasoning dataset.
* Higher factual accuracy, trained on a 500-example faithfulness dataset.
* Better general instruction handling, trained on a 1000-example general dataset.
* More stable generation thanks to an improved tokenization and padding setup.

## Training Setup

* **Base model:** `redmint/studybuddy-ai`
* **Method:** LoRA fine-tuning
* **Hardware:** Google Colab T4
* **Steps:** ~500 for Stage 1 + ~500 for Stage 2
* **Datasets:**
  * `general.jsonl` (1000 lines)
  * `multi-stepV2.jsonl` (500 lines)
  * `stayfaithful.jsonl` (500 lines)

## Files in This Model

* LoRA weight files (`adapter_config.json`, `adapter_model.bin`)
* Tokenizer files (`tokenizer.json`, `special_tokens_map.json`, etc.)
* Training configuration and HF metadata

## **🚀 How to Use**

```python
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel

# Load the base model first, then attach the LoRA adapter from this repo.
base = AutoModelForCausalLM.from_pretrained("redmint/studybuddy-ai")
model = PeftModel.from_pretrained(base, "redmint/studybuddy-v2")
tokenizer = AutoTokenizer.from_pretrained("redmint/studybuddy-v2")

# Pass inputs as keyword arguments so the attention mask is included.
inputs = tokenizer("Explain gravity.", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

## Notes

* This repo contains **LoRA adapters**, not a full standalone model.
* To deploy the upgraded model, always load the **base model** + these **LoRA weights**.

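
To make the note above concrete, here is a minimal sketch of why an adapter cannot run on its own. LoRA stores only two small matrices, `A` and `B`, and the effective weight is the base weight plus a scaled low-rank update, `W + (alpha / rank) * (B @ A)`. The shapes, values, `alpha`, and `rank` below are toy numbers chosen for illustration; they are assumptions, not the settings in this repo's `adapter_config.json`.

```python
# Toy illustration of how a LoRA adapter modifies a base weight matrix.
# All shapes and values are hypothetical and NOT taken from this repo.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def apply_lora(base_weight, lora_a, lora_b, alpha, rank):
    """Return W + (alpha / rank) * (B @ A), the effective merged weight."""
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(base_weight, delta)
    ]

# Base weight W (3x3): this part lives in the base model checkpoint.
W = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 0.0],
     [0.0, 0.0, 1.0]]

# Rank-1 adapter: B is 3x1 and A is 1x3. Only these are stored in an adapter.
B = [[1.0], [2.0], [3.0]]
A = [[0.5, 0.0, -0.5]]

merged = apply_lora(W, A, B, alpha=2, rank=1)
for row in merged:
    print(row)
```

Without `W`, the adapter matrices describe only a change to the weights, which is why the base model must be loaded first. In PEFT, calling `merge_and_unload()` on a loaded LoRA model folds the adapter into the base weights in a comparable way, which can be convenient for deployment.
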
## License

This model is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).

You are free to:

- Use the model for personal or educational purposes
- Modify, remix, or build upon it

**You may not**:

- Sell, sublicense, or use the model or its outputs for any commercial purposes

© 2025 Micah Miles

## NOTICE

This project is licensed for **non-commercial use only**.
For commercial licensing or partnership inquiries, contact: micahmiles21@gmail.com

*Message to future me*, "I appreciate the discipline and resilience you've managed to maintain during this whole project regardless of the trials and tribulations, and for that reason, regardless of how it goes from here.. *I'm proud of you dawg*"😂

To anyone else reading this: keep shooting for the stars, and always remember..

**SKY IS NOT THE LIMIT**