Chao-Chun (Joe) Hsu's picture

Chao-Chun (Joe) Hsu

joe32140

·

https://chaochunhsu.github.io

AI & ML interests

Hi, I am Joe!

Recent Activity

liked a model 2 days ago

Parallia/ClinicalEncoder25-Diagnosable-Colbert-L2-for-medical-texts

updated a collection about 1 month ago

Light-Weight Code Retrieval Models

upvoted a collection about 2 months ago

View all activity

Organizations

upvoted a collection about 2 months ago

Sarashina2.2

Large Language Models developed by SB Intuitions. Pretrained and instruction-tuned models are available in three sizes: 0.5B, 1B, and 3B. • 6 items • Updated Mar 5 • 8

upvoted an article 3 months ago

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

+4

Oct 1

•

132

upvoted a collection 4 months ago

EmbeddingGemma

3 items • Updated Sep 11 • 105

upvoted an article 4 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

+4

Sep 4

•

267

upvoted an article 6 months ago

Article

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

Jul 1

•

132

upvoted a collection 7 months ago

Qwen3-Embedding

6 items • Updated about 2 hours ago • 143

upvoted a collection 8 months ago

Qwen3

84 items • Updated about 2 hours ago • 1.53k

upvoted a paper 8 months ago

FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents

Paper • 2504.13128 • Published Apr 17 • 7

upvoted a collection 9 months ago

AceMath

We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated 8 days ago • 16

upvoted a collection 10 months ago

reranking series v2

V2 crispy rerank series • 3 items • Updated Jun 25 • 25

upvoted a paper 11 months ago

CG-RAG: Research Question Answering by Citation Graph Retrieval-Augmented LLMs

Paper • 2501.15067 • Published Jan 25 • 1

upvoted a collection 11 months ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated about 2 hours ago • 126

upvoted a collection 12 months ago

🏟️ Long Code Arena

All the resources for our Long Code Arena benchmark! • 13 items • Updated Mar 14 • 6

upvoted a paper 12 months ago

Measuring Taiwanese Mandarin Language Understanding

Paper • 2403.20180 • Published Mar 29, 2024 • 6

upvoted 2 collections about 1 year ago

OLMoE (November 2024)

Artifacts for open mixture-of-experts language models. • 13 items • Updated 8 days ago • 31

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 151

upvoted an article about 1 year ago

Article

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

Oct 24, 2024

•

14