Skip to content
@TinyDataML

TinyData AI, where the data are valued

TinyData aim to adopt a data-centric AI philosophy to accelerate the process of deploying AI products to the business scenarios in industry.

TinyData AI Logo

  👋 Welcome to TinyData AI

  TinyData AI, where data is valued.

  Welcome to TinyData AI! We are a team of AI researchers and engineers with our roots at Peking University.  
  Our mission is to bridge the gap between cutting-edge academic research and real-world industry applications through the Data-Centric AI paradigm.


🚀 Our Philosophy: Data-Centric AI

We firmly believe that high-quality, high-signal data is the cornerstone of any great AI system. While traditional Model-Centric AI development focuses on iterating on code and algorithms, we champion a shift in focus.

In complex, real-world applications, systematically improving the data often yields more significant and reliable performance gains. All our tools are built around this core principle, designed to help developers efficiently understand, clean, augment, and manage their data to unlock the full potential of AI.


🛠️ Our Core Projects

We are building a suite of tools, prefixed with "Tiny", to create data-driven productivity systems for various AI domains.

Project Description Status
⭐️ Tiny3D The first Data-Centric AI production system for 3D applications. 🚀 Active Development
Tiny2D A next-generation AI production system for 2D vision. 🌱 Early Stage
TinyLabeling A deep learning tool for automated data annotation. 🌱 Early Stage
TinyMedical A next-generation production system for Medical AI. (See Collaborations) 🔒 Private

🤝 Collaborations

Our academic roots drive our passion for collaboration. We partner with leading institutions to bridge the gap between AI research and real-world impact.

Our key collaborations include:

  • Peking University Third Hospital: We are partnering to develop cutting-edge Medical AI solutions with our TinyMedical system, aiming to bring data-centric methodologies to critical healthcare challenges.

✨ Featured Project: Tiny3D

Tiny3D is our flagship project, a comprehensive production system for 3D object detection services.

It is built with four transformative features:

  • Performance Optimization Engine: A Data-Centric approach to help users easily achieve high-accuracy and high-speed 3D detection services.
  • One-Line Full Pipeline: Complete the entire workflow—from dataset editing and model training to compression and deployment—with a single line of code.
  • Fine-grained Data Editing: Supports granular operations on datasets of any size, down to a single data point.
  • User-Friendly Web Interface: (Planned) A low-code, visual interface to enhance team collaboration and productivity.

👉 Learn more about Tiny3D here!


💬 Get Involved

We are an open and dynamic community, welcoming developers, researchers, and students interested in Data-Centric AI! You can get involved in many ways:

  • Try our projects: Take Tiny3D for a spin and share your feedback.
  • Submit Issues: Found a bug or have a feature request? Open an issue in the relevant repository.
  • Contribute Code: Pull Requests are always welcome, from small bug fixes to new features.
  • Join the Discussion: Share your ideas with the community on our [Discussions](https://github.com/orgs/TinyData AI/discussions) tab. (Note: Please enable this feature in your organization's settings.)

Let's build the next generation of AI development tools together!

Popular repositories Loading

  1. Tiny3D Tiny3D Public template

    Tiny3D is a first Data-Centric 3D AI service production system.

    Python 441 44

  2. .github .github Public

    TinyData Team aim to adopt a data-centric AI philosophy to accelerate the process of deploying AI products to the business scenarios in industry.

  3. TinyLabeling TinyLabeling Public

    an deep learning data auto-labeling tool.

  4. Tiny2D Tiny2D Public

    Tiny2D is a next generation of 2D AI service production system.

  5. UER-py UER-py Public

    Forked from dbiir/UER-py

    Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

    Python

  6. dygiepp dygiepp Public

    Forked from dwadden/dygiepp

    Span-based system for named entity, relation, and event extraction.

    Python

Repositories

Showing 6 of 6 repositories
  • .github Public

    TinyData Team aim to adopt a data-centric AI philosophy to accelerate the process of deploying AI products to the business scenarios in industry.

    TinyDataML/.github’s past year of commit activity
    0 0 0 0 Updated Sep 10, 2025
  • dygiepp Public Forked from dwadden/dygiepp

    Span-based system for named entity, relation, and event extraction.

    TinyDataML/dygiepp’s past year of commit activity
    Python 0 MIT 122 0 0 Updated Jun 1, 2023
  • Tiny3D Public template

    Tiny3D is a first Data-Centric 3D AI service production system.

    TinyDataML/Tiny3D’s past year of commit activity
    Python 441 44 24 3 Updated Apr 21, 2023
  • UER-py Public Forked from dbiir/UER-py

    Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

    TinyDataML/UER-py’s past year of commit activity
    Python 0 Apache-2.0 529 0 0 Updated Mar 13, 2023
  • Tiny2D Public

    Tiny2D is a next generation of 2D AI service production system.

    TinyDataML/Tiny2D’s past year of commit activity
    0 0 0 0 Updated Oct 27, 2022
  • TinyLabeling Public

    an deep learning data auto-labeling tool.

    TinyDataML/TinyLabeling’s past year of commit activity
    0 0 0 0 Updated Sep 2, 2022

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…