Repeated tokens generated from 'generate.py' running on GPU #284

@shengzhelyu65

Dear Authors,

Thanks for sharing this amazing project. I tested the BitNet inference kernel on an RTX 3090 running Ubuntu, following the commands in README.md, but the output consists of a single repeated token. For example:

Could you help me explain Python?

OfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOfOf

Could you help me figure out what might be wrong here? Thanks.
