A GPipe implementation in PyTorch
-
Updated
Jul 25, 2024 - Python
A GPipe implementation in PyTorch
An I/O benchmark for deep Learning applications
Very-Low Overhead Checkpointing System
Sayiir — simple, embeddable durable workflow engine in Rust, node.js/python bindings. Checkpoint-based recovery, no deterministic replay. Simplified aternative to Temporal, Restate, Airflow..
Extending DOLFINx with checkpointing functionality
Keras wrapper that autosaves what ModelCheckpoint cannot.
Weavegraph rust graph/agent/node
Git-like branching, checkpointing, and comparison for AI agent execution paths. pip install agentgit
[WIP] Debug TypeScript/JavaScript via TUI. Checkpoint functions, edit state, skip execution. Written in Rust 🦀
A Python package for performing memory-intensive computations in parallel using chunks and checkpointing.
A Python package for checkpointing, saving, and loading objects.
Zero-cost, crash-proof LLM pipeline orchestrator. Features disk-based checkpointing, free-tier routing, and structured output. (LangGraph / CrewAI alternative)
This FLINK project will consume streams from an azure event-hub and produce to a different event-hub ,and the config files for deploying the same in kubernetes
A lightweight checkpointing program written in C.
Code and tutorial on integrating wandb sweeps with Slurm pre-emption
Hangman Game Word Predictor (Character-level attention)
Slash-first TUI and local web ops UI for AI-agent-driven research automation, from paper collection to experiment execution and paper drafting.
This is a standalone flink producer using for testing the flink-consume-produce-ek repo contents
Add a description, image, and links to the checkpointing topic page so that developers can more easily learn about it.
To associate your repository with the checkpointing topic, visit your repo's landing page and select "manage topics."