Dingo FAQs

Question 1

What is Dingo and what does it do?

Accepted Answer

Dingo is a data quality evaluation tool that automatically detects data quality issues in datasets. It uses built-in rules and LLM-based methods for comprehensive analysis of text and image data.

Question 2

What types of data can Dingo evaluate?

Accepted Answer

Dingo supports text and image data modalities. It is suitable for pre-training, fine-tuning, and evaluation datasets. It can process local files (JSON, JSONL, plaintext) and Hugging Face datasets.

Question 3

How can I use Dingo?

Accepted Answer

Dingo offers a CLI and SDK for easy integration into existing workflows. You can also use the GUI to visualize evaluation results. LLM-based evaluations are also supported.

Question 4

Does Dingo support LLMs for data quality evaluation?

Accepted Answer

Yes, Dingo supports LLM-based evaluation using models like OpenAI (GPT-4o) and Llama3. It offers pre-defined prompts for assessing various quality dimensions like completeness, effectiveness, and security.

Question 5

Is Dingo open source?

Accepted Answer

Yes, Dingo is an open-source project with a permissive license, enabling you to use, modify, and distribute it freely. It also has a growing community on GitHub.

Dingo

About

Key Features

Use Cases

Dingo

About

Key Features

Use Cases