merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Linear merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

model_name: "pre-cursa-o1-v1.6"
models:
  - model: marcuscedricridia/cursa-o1-7b
    parameters:
      weight: 1.0
  - model: marcuscedricridia/absolute-o1-7b
    parameters:
      weight: 1.0
  - model: marcuscedricridia/cursa-o1-7b
    parameters:
      weight: 1.0
merge_method: linear
normalize: false
int8_mask: true
dtype: bfloat16
tokenizer_source: "union"  # or "base" or a model path
chat_template: "auto"  # or a template name or Jinja2 template

Downloads last month: 4

Safetensors

Model size

8B params

Tensor type

BF16

Model tree for marcuscedricridia/pre-cursa-o1-v1.6

marcuscedricridia/absolute-o1-7b

marcuscedricridia/cursa-o1-7b

Merge model

this model

Merges

1 model

Paper for marcuscedricridia/pre-cursa-o1-v1.6

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

Paper • 2203.05482 • Published Mar 10, 2022 • 8