AIAXIO-AI Matched To Your Need

15,370 AI tools for 3,203 Tasks

DataFlow logo

DataFlow

1.0.0

11

0

Data Processing
Easily prepare data using AI-driven functions.
Input:
Output:
DataFlow screenshot
Updated: Mar 20, 2026 Free

Description

OpenDCAI/DataFlow is a tool designed for data preparation and training purposes. It aims to create, improve, assess, and filter premium data for AI applications from varied inputs like PDFs, simple text, and lower-grade Question-Answer datasets.

This system is intended to enhance the efficiency of large language models (LLMs) through focused training in areas such as healthcare, finance, law, and academic studies.

The system employs an operator-based structure to convert the entire data refinement procedure into a pipeline that is reproducible, reusable, and shareable. This functions as the fundamental framework for the Data-Centric AI community.

Furthermore, OpenDCAI/DataFlow offers an intelligent agent feature capable of dynamically building new pipelines by either combining existing operators or developing new ones as needed.

This tool supports the development of superior LLM training datasets from unprocessed data by utilizing visual, low-code pipelines with versatile arrangement across sectors and applications.

The tool also incorporates functionalities for text, math, and code data production, along with instruments like AgenticRAG and Text2SQL for data generation. Additional capabilities encompass extensive PDF to QA conversion and structured data retrieval.

Pricing Plans

Model
free
Packages
1 Package
Price Start From
free
Payment Model
Not specified

Releases

Initial launch of DataFlow.

Reviews

Pros & Cons

Pros

Produces high-quality data

Improves noisy sources

Assesses data quality

Cons

Noisy data refinement unclear

Limited languages support

Lacks multi-platform support

Q&A

New Released

New Released