Skip to content
Discussion options

You must be logged in to vote

You are running out of memory because the context of this model is 256k and requires ~25GB. You should be able to run with -c 32768 and probably higher depending on how much free memory you have.

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@cristianadam
Comment options

@cristianadam
Comment options

@ggerganov
Comment options

Answer selected by cristianadam
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants