Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
-
Updated
Mar 5, 2020 - Scala
Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Automated assistance for the schema development lifecycle
Infer a JSON schema from example data, produce nonsense synthetic data (drivel) according to the schema
A tool to automatically infer columns data types in .csv files
NoSQL Data Engineering
A Polars plugin for JSON schema inference from string columns using genson-rs.
Schema inference for semistructured data using Formal Concept Analysis
Learn API schemas from live traffic and generate OpenAPI specs. Analyze MCP/JSON-RPC agent sessions for wasted calls. Zero-infrastructure local reverse proxy.
A tiny CLI that reads JSON and infers a clean, type-oriented YAML summary. Perfect for exploring APIs, documenting unknown data, or bootstrapping schemas.
Lattice is a JSON document store with real-time schema processing, indexing during ingestion, and a SQL-like interface for querying data
ColumnCore is a high-performance analytical database system designed for beginners or small projects. It supports a rich SQL dialect, runs within the same process as the application, has a vectorized query execution engine, and uses a columnar storage format.
Infer robust Pydantic v2 models from messy, evolving JSON streams
Inference-driven schema mapping engine for Python and TypeScript. 7 built-in scorers, domain dictionaries (healthcare/finance/ecommerce), confidence calibration, cross-language accuracy benchmark (F1 0.84), and full Python↔TypeScript parity.
A deterministic engine that transforms messy, user-uploaded CSVs into clean, schema-compliant, import-ready data.
Um servidor Model Context Protocol (MCP) para interagir com MongoDB, permitindo que IAs descubram schemas automaticamente e executem queries, agregações e CRUD via linguagem natural.
AI-powered synthetic data generator with automatic schema inference using LangChain + Groq. Upload a CSV, get realistic fake data via Faker — with an interactive Streamlit UI and chat-style prompts for on-demand row generation.
AI-powered ETL: LLM schema inference, automated data cleaning, anomaly detection, and intelligent transformation suggestions
Dump, diff, and schema-profile Airflow XCom payloads from the CLI or a lightweight web viewer — fills the gap Airflow's UI leaves around cross-task debugging, payload size limits, and schema drift.
Add a description, image, and links to the schema-inference topic page so that developers can more easily learn about it.
To associate your repository with the schema-inference topic, visit your repo's landing page and select "manage topics."