Commit c5f9118

chore(model gallery): add nousresearch_hermes-4-70b (#6187)
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent d829227 commit c5f9118

1 file changed: +28 -0 lines changed

gallery/index.yaml

Lines changed: 28 additions & 0 deletions
@@ -12284,6 +12284,34 @@
     - filename: Wingless_Imp_8B.i1-Q4_K_M.gguf
       sha256: 3a5ff776ab3286f43937c3c2d8e2e1e09c5ea1c91a79945c34ec071e23f31e3b
       uri: huggingface://mradermacher/Wingless_Imp_8B-i1-GGUF/Wingless_Imp_8B.i1-Q4_K_M.gguf
+- !!merge <<: *llama31
+  name: "nousresearch_hermes-4-70b"
+  icon: https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/roT9o5bMYBtQziRMlaSDf.jpeg
+  urls:
+    - https://huggingface.co/NousResearch/Hermes-4-70B
+    - https://huggingface.co/bartowski/NousResearch_Hermes-4-70B-GGUF
+  description: |
+    Hermes 4 70B is a frontier, hybrid-mode reasoning model based on Llama-3.1-70B by Nous Research that is aligned to you.
+
+    Read the Hermes 4 technical report here: Hermes 4 Technical Report
+
+    Chat with Hermes in Nous Chat: https://chat.nousresearch.com
+
+    Training highlights include a newly synthesized post-training corpus emphasizing verified reasoning traces, massive improvements in math, code, STEM, logic, creativity, and format-faithful outputs, while preserving general assistant quality and broadly neutral alignment.
+    What’s new vs Hermes 3
+
+    Post-training corpus: Massively increased dataset size from 1M samples and 1.2B tokens to ~5M samples / ~60B tokens blended across reasoning and non-reasoning data.
+    Hybrid reasoning mode with explicit <think>…</think> segments when the model decides to deliberate, and options to make your responses faster when you want.
+    Reasoning that is top quality, expressive, improves math, code, STEM, logic, and even creative writing and subjective responses.
+    Schema adherence & structured outputs: trained to produce valid JSON for given schemas and to repair malformed objects.
+    Much easier to steer and align: extreme improvements on steerability, especially on reduced refusal rates.
+  overrides:
+    parameters:
+      model: NousResearch_Hermes-4-70B-Q4_K_M.gguf
+  files:
+    - filename: NousResearch_Hermes-4-70B-Q4_K_M.gguf
+      sha256: ab9b59dd1df27c039952915aa4669a82b5f45e5e9532b98679c65dffe2fe9ee2
+      uri: huggingface://bartowski/NousResearch_Hermes-4-70B-GGUF/NousResearch_Hermes-4-70B-Q4_K_M.gguf
 - &deepseek
   url: "github:mudler/LocalAI/gallery/deepseek.yaml@master" ## Deepseek
   name: "deepseek-coder-v2-lite-instruct"
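
The new entry pairs a `huggingface://` URI with a `sha256` digest, so a downloaded file can be checked against the gallery before use. The following is a minimal sketch of that check, not LocalAI's own implementation: the helper names `split_hf_uri`, `sha256_of_file`, and `matches_gallery_checksum` are hypothetical, and the assumption that the URI has the shape `huggingface://<owner>/<repo>/<file>` is inferred only from the entries in this diff.

```python
import hashlib

# Values copied from the gallery entry added in this commit.
EXPECTED_SHA256 = "ab9b59dd1df27c039952915aa4669a82b5f45e5e9532b98679c65dffe2fe9ee2"
GALLERY_URI = (
    "huggingface://bartowski/NousResearch_Hermes-4-70B-GGUF/"
    "NousResearch_Hermes-4-70B-Q4_K_M.gguf"
)


def split_hf_uri(uri: str) -> tuple[str, str, str]:
    """Split 'huggingface://owner/repo/file' into (owner, repo, file).

    Hypothetical helper: the three-part layout is an assumption based on
    the URIs shown in the diff.
    """
    prefix = "huggingface://"
    if not uri.startswith(prefix):
        raise ValueError(f"not a huggingface:// URI: {uri!r}")
    parts = uri[len(prefix):].split("/", 2)
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected owner/repo/file in {uri!r}")
    return parts[0], parts[1], parts[2]


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so a multi-gigabyte GGUF never sits in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_gallery_checksum(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Return True when the file's SHA-256 equals the gallery's digest."""
    return sha256_of_file(path) == expected.lower()
```

For example, after downloading the file named in `overrides.parameters.model`, calling `matches_gallery_checksum("NousResearch_Hermes-4-70B-Q4_K_M.gguf")` should return `True` only if the local copy is byte-identical to the one the gallery entry describes.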
