open-dataset

Here are 22 public repositories matching this topic...

gifale95 / eeg_encoding

Use DNNs to build encoding models of EEG visual responses.

deep-neural-networks eeg computational-neuroscience human-vision things-database open-dataset neural-encoding visual-object-recognition

Updated Apr 21, 2026
Python

openrepair / data

Source of the Open Repair Alliance downloadable datasets

data open-data repair open-dataset

Updated Mar 22, 2026

aloth / cred-1

CRED-1: An Open Multi-Signal Domain Credibility Dataset (2,672 domains)

Updated Apr 21, 2026
Python

Ijwi-ry-Ikirundi-AI / Kirundi_Dataset

🇧🇮 The first large-scale, open-source speech and text dataset for the Kirundi language. Building AI models for 12M+ Kirundi speakers through community collaboration. Includes ASR, TTS, and MT capabilities.

nlp machine-learning text-to-speech ai machine-translation tts speech-recognition african-languages community-driven asr burundi speech-dataset open-dataset kirundi low-resource-language

Updated Apr 26, 2026
Jupyter Notebook

emailmarketingdataset / Open-Email-Marketing-Dataset

The Open Email Marketing Dataset, free to use without any restrictions.

email-marketing lead-generation jsonl gdpr-compliant cold-email marketing-dataset open-dataset llm-training-data b2b-dataset verified-emails seo-dataset

Updated Jul 12, 2025

piazzai / smj-18-19552

Data and code for SMJ 2019 paper

Updated Oct 31, 2025
R

oddsflowai-team / oddsflow-ai-football-value-signals

Open, verifiable AI-driven football market analytics project for detecting mispriced bookmaker odds.

probability football probability-statistics sports-analytics market-analysis data-transparency ai-model open-dataset odds-analysis auditability value-betting

Updated Apr 19, 2026

lod-db / orc-losses

🇷🇺 Russian losses in 🇺🇦 Ukraine since the start of the 2022 full-scale invasion 🪖.

opendata open-data dataset ukraine russia war geopolitics open-dataset russia-ukraine-war

Updated Apr 28, 2026

33k0 / PALADIN-Framework

A Framework for Robust, Self-Recovering Tool-Using Language Model Agents — trained on 50K+ failure-annotated trajectories for fault-tolerant reasoning and recovery.

open-source fault-tolerance reproducible-research language-model error-recovery resilience robustness failure-injection paladin llm open-dataset tool-using-agent ai-robustness toolbench self-recovering-agents recovery-dataset

Updated Nov 10, 2025
Python

dialogue-for-dignity / Dialogue-for-Dignity

A small dialogue dataset exploring the boundaries of machine decision-making, agency, and alignment. Useful for fine-tuning conversational agents or testing moral reasoning.

dialogue language-model ai-ethics open-dataset llm-fine-tuning

Updated Apr 16, 2025

piazzai / os-ms-21-15751

Data and code for OS 2024 paper

Updated Oct 31, 2025
R

bnyusntryo / live-italy

📊 Fetch real-time Italian statistics on economy, population, health, and more with the Live Italy API Wrapper, fully supporting TypeScript.

statistics telegram matlab switzerland live open-data flutter hacktoberfest public-api italy telegrambot lake alps covid-19 logistic-model-epidemics italy-dataset open-dataset italy-proxy

Updated Apr 28, 2026
HTML

RoxyAPI / astrology-api-benchmark

Reproducible MIT-licensed accuracy benchmark for any astrology API. 210 planet positions across 21 charts verified against NASA JPL Horizons DE441.

Updated Apr 28, 2026
Python

PocketNugget / Coherence-assessment-of-generated-realistic-images

Dataset of 4,368 AI-generated images based on COCO for assessing coherence and realism in synthetic imagery.

machine-learning computer-vision synthetic-data coco-dataset ai-generated-images open-dataset image-coherence-assessment realistic-image-generation ai-evaluation-metrics image-realism

Updated Oct 24, 2024

rozetyp / win95stack-benchmark

LLM behavioral benchmark from 25-month narrative gameplay. 540 runs, 6 models, pre-registered statistical analysis. GPT-4o-mini shows a perfect binary switch on a social decision from prompt framing alone.

gemini claude narrative-game chi-square-analysis open-dataset prompt-engineering llm-evaluation llm-agents gpt-4o llm-benchmark behavioral-benchmark

Updated Apr 21, 2026
TypeScript

AccessToNutritionInitiative / kenya-market-assesment

Data from ATNi's 2025 East Africa Market Assessment (EAMA) on company nutrition policies and product portfolios in Kenya.

open-source open-data africa kenya nutrition east-africa market-assessment micronutrient food-systems open-dataset food-system eama fortification atni

Updated Aug 6, 2025

AccessToNutritionInitiative / tanzania-market-assesment

Data from ATNi's 2025 East Africa Market Assessment (EAMA) on company nutrition policies and product portfolios in Tanzania.

open-source open-data africa nutrition tanzania east-africa market-assessment micronutrient food-systems open-dataset food-system eama fortification atni

Updated Aug 6, 2025

howmuchtoinsure / us-car-insurance-rates-by-state-2026

This dataset (updated monthly) covers average full coverage rates for all 50 states plus Washington DC, based on a standardized driver profile (25–55 years old, clean record, full coverage). Data is aggregated from official state insurance department filings and annual market conduct reports.

open-data dataset united-states car-insurance open-dataset

Updated Apr 25, 2026

mma735 / TFM-DS

Comparison of distributed machine learning techniques applied to openly available datasets

privacy distributed-deep-learning distributed-machine-learning federated-learning gossip-learning split-learning open-dataset

Updated Feb 16, 2024
Jupyter Notebook

smenjas / sf-data

Some of San Francisco's open data

opendata open-data san-francisco open-datasets open-dataset

Updated Sep 21, 2025
JavaScript
