AIND Ephys Pipeline

aind-ephys-pipeline

Electrophysiology analysis pipeline with SpikeInterface.

Overview

The pipeline is built on Nextflow. Its steps are summarized in the architecture diagram and key points below.

Pipeline Architecture

flowchart LR
    %% Input/Output
    input[("📥 Input<br/>Ephys Data")]
    output[("📤 Output<br/>NWB + QC + Viz")]

    %% Deployment
    subgraph deploy["🚀 Deployment"]
        direction TB
        co["Code Ocean<br/><a href='pipeline/main.nf'>main.nf</a>"]
        sl["SLURM/Local<br/><a href='pipeline/main_multi_backend.nf'>main_multi_backend.nf</a>"]
    end

    %% Infrastructure (link to detailed docs)
    containers["☁️ <b><a href='https://aind-ephys-pipeline.readthedocs.io/en/latest/architecture.html#infrastructure-components'>Containers</a></b><br/>4 images from GHCR"]
    models["🤗 <b><a href='https://aind-ephys-pipeline.readthedocs.io/en/latest/architecture.html#infrastructure-components'>ML Models</a></b><br/>UnitRefine classifiers"]

    %% Pipeline (simplified)
    subgraph pipeline["📊 <b><a href='https://aind-ephys-pipeline.readthedocs.io/en/latest/architecture.html'>Processing Pipeline</a></b> (11 steps)"]
        direction TB
        prep["1. Job Dispatch<br/>2. Preprocessing"]
        sort["3. Spike Sorting<br/>(KS2.5/KS4/SC2)"]
        post["4-6. Postprocessing<br/>Curation<br/>Visualization"]
        final["7-11. Results Collection<br/>Quality Control<br/>NWB Export"]

        prep --> sort --> post --> final
    end

    %% Connections
    input --> pipeline
    deploy -.->|"orchestrates"| pipeline
    containers -.->|"provide runtime"| pipeline
    models -.->|"used in curation"| pipeline
    pipeline --> output

    %% Styling
    classDef infra fill:#fff4e6,stroke:#ff9800,stroke-width:2px
    classDef data fill:#e8f5e9,stroke:#4caf50,stroke-width:3px

    class containers,models,deploy infra
    class input,output data

📖 View detailed architecture diagram with all infrastructure components, step details, and data flow.

Key Points:

- Two deployment modes: Code Ocean (main.nf) uses branch-based sorter selection; SLURM/Local (main_multi_backend.nf) uses parameter-based selection.
- 11 processing steps:
  - 1. Job dispatch
  - 2. Preprocessing
  - 3. Spike sorting (KS2.5/KS4/SC2)
  - 4-6. Postprocessing → Curation → Visualization
  - 7-11. Results collection → Quality control → NWB export
- Infrastructure: 4 container images from GHCR and UnitRefine ML models from Hugging Face.
- Parallelization: steps run in parallel per probe/shank. Versions are controlled via capsule_versions.env.
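The per-probe/shank parallelism noted above can be illustrated with a minimal Python sketch. This is not how the pipeline is implemented (Nextflow handles the scheduling). The stream names, the `process_stream` placeholder, and the `run_all` helper are all hypothetical and exist only to show the fan-out pattern.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical probe/shank names; real names depend on the recording.
STREAMS = ["ProbeA_shank0", "ProbeA_shank1", "ProbeB_shank0"]

# Stage names taken from the step list; the work itself is a placeholder.
STAGES = ["preprocessing", "spike sorting", "postprocessing"]


def process_stream(stream: str) -> str:
    """Placeholder for the per-stream chain of stages (assumption)."""
    completed = []
    for stage in STAGES:
        # A real stage would read the previous stage's output here.
        completed.append(stage)
    return f"{stream}: " + " -> ".join(completed)


def run_all(streams: list[str]) -> list[str]:
    """Run the chain for every stream concurrently, one worker per stream."""
    if not streams:
        return []
    with ThreadPoolExecutor(max_workers=len(streams)) as pool:
        # map() returns results in the same order as the input streams.
        return list(pool.map(process_stream, streams))


if __name__ == "__main__":
    for line in run_all(STREAMS):
        print(line)
```

Because each probe/shank is processed independently, no stream waits on another; only the final collection step needs all of them.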

See the detailed architecture documentation for complete infrastructure details, data flow, and numbered step-by-step breakdown.

Documentation

The documentation is available at ReadTheDocs.

Code Ocean Deployment (AIND)

At AIND, the pipeline is deployed on the Code Ocean platform. Because Code Ocean does not currently support conditional processes, pipelines that run different sorters or AIND-specific options are implemented in separate branches.

The following pipeline branches are deployed in Code Ocean:

main/co_kilosort4: pipeline with Kilosort4 sorter
co_kilosort25: pipeline with Kilosort2.5 sorter
co_spykingcircus2: pipeline with Spyking Circus 2 sorter
co_kilosort25_opto: pipeline with Kilosort2.5 sorter and optogenetics artifact removal
co_kilosort4_opto: pipeline with Kilosort4 sorter and optogenetics artifact removal
co_spykingcircus2_opto: pipeline with Spyking Circus 2 sorter and optogenetics artifact removal
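Branch-based selection amounts to a lookup from (sorter, optogenetics option) to a branch name. The Python sketch below encodes the list above as a table. The `branch_for` helper and the short sorter keys (`kilosort4`, `kilosort25`, `spykingcircus2`) are hypothetical names chosen for illustration and are not part of the pipeline.

```python
# Branch table copied from the list above.
# Key: (sorter key, optogenetics artifact removal enabled).
BRANCHES = {
    ("kilosort4", False): "main/co_kilosort4",  # listed as "main/co_kilosort4"
    ("kilosort25", False): "co_kilosort25",
    ("spykingcircus2", False): "co_spykingcircus2",
    ("kilosort25", True): "co_kilosort25_opto",
    ("kilosort4", True): "co_kilosort4_opto",
    ("spykingcircus2", True): "co_spykingcircus2_opto",
}


def branch_for(sorter: str, opto: bool = False) -> str:
    """Return the listed branch for a sorter key (hypothetical helper)."""
    key = (sorter.lower(), opto)
    if key not in BRANCHES:
        known = sorted({name for name, _ in BRANCHES})
        raise ValueError(f"unknown sorter {sorter!r}; expected one of {known}")
    return BRANCHES[key]


if __name__ == "__main__":
    print(branch_for("kilosort4"))
    print(branch_for("spykingcircus2", opto=True))
```

With parameter-based selection (the SLURM/Local mode), the same choice would be passed as a run-time parameter instead of being fixed by the branch that is checked out.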
