Granite-Embedding-97M-Multilingual-R2-GGUF

import numpy as np
from llama_cpp import Llama
from sentence_transformers import SentenceTransformer
from sentence_transformers.util import cos_sim

model = SentenceTransformer(
    "ibm-granite/granite-embedding-97m-multilingual-r2",
)
llama = Llama.from_pretrained(
    repo_id="mykor/granite-embedding-97m-multilingual-r2-GGUF",
    filename="granite-embedding-97M-multilingual-r2-BF16.gguf",
    verbose=False,
    embedding=True,
    n_ctx=0,
)

text = """์ด์   ๊นจ์–ด๋‚˜ ์„ค๋ ˆ๋Š” ์ด ๋งˆ์Œ ๋”ฐ๋ผ
๊ธฐ๋‹ค๋ฆฌ๋˜ ์˜ค๋Š˜์„ ๋งŒ๋“ค ๊ฑฐ์•ผ

์ด ๋งŽ์€ ์‹œ๊ฐ„์„ ๋‚œ
ํ˜๋ฆฌ๊ณ  ์‹ถ์ง€๋Š” ์•Š์•„

์ด์ œ ๋‚˜๋Š” ๋ถ„๋ช…ํžˆ
์ค€๋น„๊ฐ€ ๋ผ ์žˆ๋Š”๋ฐ

์–ด๋–ค ๋ฌด๊ธฐ๋ ฅํ•จ์ด ์™€๋„
๋‚˜์—๊ฒ ์˜๋ฏธ๊ฐ€ ์—†์„ ๋งŒํผ

๋˜๋‹ค์‹œ wake up ๋” wake up
๋‹ฌ๊ถˆ์ ธ ๋” ๋œจ๊ฒ๊ฒŒ

๊ตณ์ด ๋ด์ค„ ํ•„์š” ์—†์–ด ์–ด์ฐจํ”ผ never give up
๋ญ๊ฐ€ ๋ค๋ฒผ์˜ค๋“  ๋‚˜๋Š” ์–ด์ฉŒ๋ผ ํ•˜๋“ฏ์ด
๋ญ๋“  ํ•  ์ˆ˜ ์žˆ์„ ๊ฑฐ ๊ฐ™์•„

์ด์   ๊นจ์–ด๋‚˜ ์„ค๋ ˆ๋Š” ์ด ๋งˆ์Œ ๋”ฐ๋ผ
๊ธฐ๋‹ค๋ฆฌ๋˜ ์˜ค๋Š˜์„ ๋งŒ๋“ค ๊ฑฐ์•ผ

๋„์‹œ ์‚ฌ์ด์— ํ”ผ์–ด๋‚˜ ์ด๊ฒจ๋‚ด์˜จ ๊ฝƒ์ฒ˜๋Ÿผ
๋ถ„๋ช… ํ”ผ์›Œ๋‚ผ ๊ฑฐ์•ผ ๋‚˜๋ฅผ

๋‚˜์—๊ฒŒ ๋‚จ์•„ ์žˆ๋Š” ๊ฒŒ ์–ผ๋งˆ ๋˜์ง€ ์•Š๋Š” ๊ฒƒ ๊ฐ™์•„๋„
๋‘ ๋ฒˆ ๋‹ค์‹  ๋‚ด๊ฒŒ ๋ถ€๋„๋Ÿฝ์ง€ ์•Š๊ฒŒ
๋˜‘๋ฐ”๋กœ ๋ด

๊ตณ์ด ๋ด์ค„ ํ•„์š” ์—†์–ด ์–ด์ฐจํ”ผ never give up
๋ญ๊ฐ€ ๋ค๋ฒผ์˜ค๋“  ๋‚˜๋Š” ์–ด์ฉŒ๋ผ ํ•˜๋“ฏ์ด
๋ญ๋“  ์ด๊ธธ ์ˆ˜ ์žˆ์„๊ฑฐ์•ผ

์ด์   ๊นจ์–ด๋‚˜ ์„ค๋ ˆ๋Š” ์ด ๋งˆ์Œ ๋”ฐ๋ผ
๊ธฐ๋‹ค๋ฆฌ๋˜ ์˜ค๋Š˜์„ ๋งŒ๋“ค ๊ฑฐ์•ผ

๋„์‹œ ์‚ฌ์ด์— ํ”ผ์–ด๋‚˜ ์ด๊ฒจ๋‚ด์˜จ ๊ฝƒ์ฒ˜๋Ÿผ
๋ถ„๋ช… ํ”ผ์›Œ๋‚ผ ๊ฑฐ์•ผ ๋‚˜๋ฅผ

timeโ€™s up ๋ช‡ ๋ฒˆ์„ ํ•ด๋„ Overcharge
๊ทธ๋งŒํ•ด๋„ ๋œ๋‹ค ํ•ด๋„ (๊ทธ๋งŒํ•ด๋„ ๋œ๋‹ค ํ•ด๋„)
no matter no matter ์ˆจ์ด ์ฐจ์˜ฌ๋ผ๋„
์ด ๊ธธ์„ ๋ฐ”๋ž€๋‹ค๋ฉด

ํ•˜๋Š˜ ์ € ๋„ˆ๋จธ ๋ป—์–ด๊ฐ€๋Š” ๋น› ๋”ฐ๋ผ
์ด ๋งˆ์Œ๋„ ๋‚ ๋ ค ๊ณง ๋‹ฟ์„ ๊ฒƒ๋งŒ ๊ฐ™์•„

๋„์‹œ ์‚ฌ์ด์— ํ”ผ์–ด๋‚˜ ์ด๊ฒจ๋‚ด์˜จ ๊ฝƒ์ฒ˜๋Ÿผ
๋ถ„๋ช… ํ”ผ์›Œ๋‚ผ ๊ฑฐ์•ผ ๋‚˜๋ฅผ"""

embed1 = model.encode(text)
embed2 = np.array(llama.embed(text), dtype=np.float32)
print(cos_sim(embed1, embed2).item())
0.7681353688240051

๋ชจ๋ธ ์„ฑ๋Šฅ ์ •์ƒํ™”์— ๋Œ€ํ•œ ์•„์ด๋””์–ด๊ฐ€ ์žˆ์œผ์‹  ๋ถ„์€ ์ œ๋ณด ๋ถ€ํƒ๋“œ๋ฆฝ๋‹ˆ๋‹ค.

๋ชจ๋ธ ์ถœ๋ ฅ์ด ์›๋ณธ๊ณผ ๋‹ค๋ฅธ ์ด์œ ๋Š”, hidden activation์ด ๋‹ค๋ฅด๊ธฐ ๋•Œ๋ฌธ์ž…๋‹ˆ๋‹ค.

llama.cpp์˜ modern bert ๊ตฌํ˜„์€ modern bert์˜ ์›๋ณธ์„ ๋”ฐ๋ผ gelu๋ฅผ ์‚ฌ์šฉํ•˜๋Š”๋ฐ,

ibm-granite/granite-embedding-97m-multilingual-r2 ๋ชจ๋ธ์€ silu๋ฅผ ์‚ฌ์šฉํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

์ด ์ฐจ์ด๋กœ ์ธํ•ด ์ถœ๋ ฅ ๊ฐ’์— ์ฐจ์ด๊ฐ€ ์ƒ๊น๋‹ˆ๋‹ค.

์ด ๋ฌธ์ œ๋Š” llama.cpp์—์„œ ์ˆ˜์ •๋˜์–ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.

Downloads last month
1,760
GGUF
Model size
97.4M params
Architecture
modern-bert
Hardware compatibility
Log In to add your hardware

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for mykor/granite-embedding-97m-multilingual-r2-GGUF

Quantized
(3)
this model