data-profiling

Here are 175 public repositories matching this topic...

OpenMetadata

The Open Context Layer for Data and AI. OpenMetadata is the open platform for building trusted data context and business semantics for humans, AI assistants, and agents.

  • Updated Jun 7, 2026
  • TypeScript

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

  • Updated Jan 13, 2026
  • Python

odd-platform

The first open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.

  • Updated Jun 4, 2026
  • Java

desbordante-core

Desbordante is a high-performance data profiler that can discover many different patterns in data using various algorithms. It also lets you run data-cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

  • Updated Jun 1, 2026
  • C++

dqo

Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, and let DQOps run them daily to detect data quality issues.

  • Updated Jan 5, 2026
  • Java

data-observability-installer

Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.

  • Updated May 15, 2026
  • Python
