Skip to content

Conversation

PawelPeczek-Roboflow
Copy link
Collaborator

@PawelPeczek-Roboflow PawelPeczek-Roboflow commented Jul 23, 2025

Description

inference-exp support for recent extensions introduced to RF API weights-provider:

  • torch-script backend
  • modelFeatures manifested for model packages to denote special (and optional) features of the model that this package bring - implemented in forward compatible mode - stale inference-exp should automatically reject the model packages for a specific backend which was registered with features newer than known in the time when inference-exp model implementation was created - this should improve stability of packages selected while auto-negotiation.

Type of change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • This change requires a documentation update

How has this change been tested, please provide a testcase or example of how you tested the change?

  • new tests in CI
  • old tests still 🟢

Any specific deployment considerations

For example, documentation changes, usability, usage/costs, secrets, etc.

Docs

  • Docs updated? What were the changes:

@@ -12,7 +13,15 @@
EMBEDDING_TASK = "embedding"


REGISTERED_MODELS: Dict[Tuple[ModelArchitecture, TaskType, BackendType], LazyClass] = {
@dataclass(frozen=True)
class RegistryEntry:
Copy link
Contributor

@hansent hansent Jul 24, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

would it simplify by having it be just RegistryEntry? with e.g. dataclass:

  • module_name
  • class_name
  • Optional[model_feature]

then below in the list contributors don't have to choose or wonder about difference between using LazyClass and RegistryEntry. could instantiate LazyClass or resolve directly in resolve_model_class if we don't do it elsewhere?

@@ -49,8 +50,10 @@ def rank_model_packages(
# discarded in the previous stage.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

re-reading the whole comment her made me think maybe at some point might consider giving a mechanism to configure ranking preference. Maybe configuring explicitly on AutoModel by passing constraints is already enough (but only gives control over filtering, not ranking). Just a thought not something we need to worry about now.

hansent
hansent previously approved these changes Jul 24, 2025
Copy link
Contributor

@hansent hansent left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. left some comments, mostly food for thought / future

@hansent
Copy link
Contributor

hansent commented Sep 4, 2025

@PawelPeczek-Roboflow Are these changes strictly tied to new version of registry service? wondering if we can merge / merge partially to avoid merge conflicts.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants