|
12284 | 12284 | - filename: Wingless_Imp_8B.i1-Q4_K_M.gguf
|
12285 | 12285 | sha256: 3a5ff776ab3286f43937c3c2d8e2e1e09c5ea1c91a79945c34ec071e23f31e3b
|
12286 | 12286 | uri: huggingface://mradermacher/Wingless_Imp_8B-i1-GGUF/Wingless_Imp_8B.i1-Q4_K_M.gguf
|
| 12287 | +- !!merge <<: *llama31 |
| 12288 | + name: "nousresearch_hermes-4-70b" |
| 12289 | + icon: https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/roT9o5bMYBtQziRMlaSDf.jpeg |
| 12290 | + urls: |
| 12291 | + - https://huggingface.co/NousResearch/Hermes-4-70B |
| 12292 | + - https://huggingface.co/bartowski/NousResearch_Hermes-4-70B-GGUF |
| 12293 | + description: | |
| 12294 | + Hermes 4 70B is a frontier, hybrid-mode reasoning model based on Llama-3.1-70B by Nous Research that is aligned to you. |
| 12295 | + |
| 12296 | + Read the Hermes 4 technical report here: Hermes 4 Technical Report |
| 12297 | + |
| 12298 | + Chat with Hermes in Nous Chat: https://chat.nousresearch.com |
| 12299 | + |
| 12300 | + Training highlights include a newly synthesized post-training corpus emphasizing verified reasoning traces, massive improvements in math, code, STEM, logic, creativity, and format-faithful outputs, while preserving general assistant quality and broadly neutral alignment. |
| 12301 | + What’s new vs Hermes 3 |
| 12302 | + |
| 12303 | + Post-training corpus: Massively increased dataset size from 1M samples and 1.2B tokens to ~5M samples / ~60B tokens blended across reasoning and non-reasoning data. |
| 12304 | + Hybrid reasoning mode with explicit <think>…</think> segments when the model decides to deliberate, and options to make your responses faster when you want. |
| 12305 | + Reasoning that is top quality, expressive, improves math, code, STEM, logic, and even creative writing and subjective responses. |
| 12306 | + Schema adherence & structured outputs: trained to produce valid JSON for given schemas and to repair malformed objects. |
| 12307 | + Much easier to steer and align: extreme improvements on steerability, especially on reduced refusal rates. |
| 12308 | + overrides: |
| 12309 | + parameters: |
| 12310 | + model: NousResearch_Hermes-4-70B-Q4_K_M.gguf |
| 12311 | + files: |
| 12312 | + - filename: NousResearch_Hermes-4-70B-Q4_K_M.gguf |
| 12313 | + sha256: ab9b59dd1df27c039952915aa4669a82b5f45e5e9532b98679c65dffe2fe9ee2 |
| 12314 | + uri: huggingface://bartowski/NousResearch_Hermes-4-70B-GGUF/NousResearch_Hermes-4-70B-Q4_K_M.gguf |
12287 | 12315 | - &deepseek
|
12288 | 12316 | url: "github:mudler/LocalAI/gallery/deepseek.yaml@master" ## Deepseek
|
12289 | 12317 | name: "deepseek-coder-v2-lite-instruct"
|
|
0 commit comments