KoBART Summarization - Fine-tuned on XL-Sum (Korean)

์ด ๋ชจ๋ธ์€ gogamza/kobart-summarization์„ ๊ธฐ๋ฐ˜์œผ๋กœ XL-Sum ํ•œ๊ตญ์–ด ๋ฐ์ดํ„ฐ์…‹์„ ํ™œ์šฉํ•ด LoRA(Low-Rank Adaptation) ๊ธฐ๋ฒ•์œผ๋กœ ํŒŒ์ธํŠœ๋‹ํ•œ ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. ๋‰ด์Šค ๊ธฐ์‚ฌ์™€ ๊ฐ™์€ ๊ธด ํ…์ŠคํŠธ๋ฅผ ํ•ต์‹ฌ ๋ฌธ์žฅ์œผ๋กœ ์••์ถ•ํ•˜๋Š” ๋ฐ ํŠนํ™”๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค.

How to Use

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("rudalson/kobart-summarization-ko")
model = AutoModelForSeq2SeqLM.from_pretrained("rudalson/kobart-summarization-ko")

text = """5์›” 1์ผ ๋…ธ๋™์ ˆ๋ถ€ํ„ฐ 5์ผ ์–ด๋ฆฐ์ด๋‚ ๊นŒ์ง€ ์ด์–ด์ง€๋Š” ์ตœ๋Œ€ 5์ผ๊ฐ„์˜ ํ™ฉ๊ธˆ์—ฐํœด๋ฅผ ์•ž๋‘๊ณ  ๊ตญ๋‚ด ์ฃผ์š” ๊ด€๊ด‘์ง€์˜ ์ˆ™๋ฐ• ์š”๊ธˆ์ด ์ฒœ์ •๋ถ€์ง€๋กœ ์น˜์†Ÿ๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.
์ค‘๋™์ „์Ÿ ์—ฌํŒŒ๋กœ ํ•ญ๊ณต ์œ ๋ฅ˜๋น„๊ฐ€ ์˜ค๋ฅด์ž ํ•ด์™ธ ๋Œ€์‹  ๊ตญ๋‚ด ์—ฌํ–‰์œผ๋กœ ์ˆ˜์š”๊ฐ€ ๋ชฐ๋ฆฐ ๋ฐ๋‹ค, ์ผ๋ณธ๊ณผ ์ค‘๊ตญ์˜ ์—ฐํœด๊นŒ์ง€ ๊ฒน์น˜๋ฉฐ ์ˆ™์†Œ ๊ตฌํ•˜๊ธฐ๊ฐ€ ๊ทธ์•ผ๋ง๋กœ 'ํ•˜๋Š˜์˜ ๋ณ„๋”ฐ๊ธฐ'๊ฐ€ ๋œ ์ƒํ™ฉ์ž…๋‹ˆ๋‹ค."""

inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=1024)
summary_ids = model.generate(
    inputs["input_ids"], 
    num_beams=4, 
    max_length=128,
    min_length=10,
    no_repeat_ngram_size=3,
    repetition_penalty=1.2,
    early_stopping=True
)

print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))

Training Detail

  • Training Data: XL-Sum (Korean)
  • Technique: LoRA
  • Epochs: 5
  • Learning Rate: 2e-5

Evaluation

ํ‰๊ฐ€ ์‹œ ํ˜•ํƒœ์†Œ ๋ถ„์„๊ธฐ Kiwi๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ํ† ํฐํ™” ํ›„ ์ธก์ •๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

Metric Score
ROUGE-1 11.63%
ROUGE-L 11.57%

Support

  • SSAFY Tesla V100-PCIE-32GB
Downloads last month
197
Safetensors
Model size
0.1B params
Tensor type
F16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for rudalson/kobart-summarization-xlsum

Finetuned
(5)
this model

Dataset used to train rudalson/kobart-summarization-xlsum