Project Setup

This project uses Pixi to manage environments and dependencies.

1. Install Pixi (if not installed yet)

Check whether Pixi is already available:

pixi --version

If the command is not found, install Pixi:

curl -fsSL https://pixi.sh/install.sh | bash

After installation, restart your terminal (or reload your shell config) so pixi is on PATH.

2. Enter the Pixi environment

From the project root (where pixi.toml is located), run:

pixi shell

Pixi will create/sync the environment and open a shell with all project dependencies.

3. Run commands inside Pixi

Examples:

pixi run python --version
pixi run python benchmark_modalities.py

5. Set Up PAM for Audio Quality Assessment

For experiments with calibrated quality scores, the audio quality scorer expects a local PAM repository at:

./PAM

From the project root, run:

git clone https://github.com/soham97/PAM.git PAM
pixi run pip install -r PAM/requirements.txt

pixi run python -c "from PAM import PAM; print('PAM import OK')"

Run Experiments

Benchmark Unmodified Model

Use benchmark_modalities.py to run the base Qwen model without Quality Aware Attention.

Example:

pixi run python benchmark_modalities.py \
  --dataset meld \
  --classification-task emotion \
  --split test \
  --modalities text,audio,video \
  --batch-size 4

Useful options:

--noisy-modalities audio,image,text,video to load noisy variants for selected modalities
--noise-severity <S> to filter noisy variants to a specific severity
--stratified-samples <N>, --total-samples <N>, --start-at-sample <idx> to control evaluation size
--qwen-model-id <hf-model-id> to switch checkpoints (default: Qwen/Qwen2.5-Omni-7B)

Outputs are written to out/... (predictions and error rows), unless overridden with --out-path and --out-error-path.

Benchmark Model with Quality Aware Attention

Use benchmark_scored_modalities.py to run the model with Quality Aware Attention (QAA), where modality quality scores are used to scale first-layer attention.

Example (calibrated quality scores):

pixi run python benchmark_scored_modalities.py \
  --dataset imdb \
  --split test \
  --modalities text,audio,video \
  --quality-calibration \
  --batch-size 4

Quality scoring modes:

Add --qwen-quality to estimate quality scores with Qwen (cannot be combined with --quality-calibration)
Add --quality-calibration to use ecdf calibrated quality scores (cannot be combined with --qwen-quality)

Extra QAA controls:

--qaa-normalization-mode global|exclude_unscaled depending on whether quality scores are normalized across all samples or only among the scaled modalities for each sample
--force-quality-scores-one or --force-modality-quality-scores text=0.2,audio=0.9
--quality-placebo-random --quality-placebo-random-seed <seed> for placebo runs

This script writes:

prediction CSV (--out-path)
error CSV (--out-error-path)
per-sample quality score CSV (--quality-score-out-path)

Name		Name	Last commit message	Last commit date
Latest commit History 174 Commits
analysis		analysis
batch_scripts		batch_scripts
ecdf_manifest		ecdf_manifest
tests		tests
utils		utils
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
benchmark_modalities.py		benchmark_modalities.py
benchmark_scored_modalities.py		benchmark_scored_modalities.py
pixi.lock		pixi.lock
pixi.toml		pixi.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Setup

1. Install Pixi (if not installed yet)

2. Enter the Pixi environment

3. Run commands inside Pixi

5. Set Up PAM for Audio Quality Assessment

Run Experiments

Benchmark Unmodified Model

Benchmark Model with Quality Aware Attention

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Project Setup

1. Install Pixi (if not installed yet)

2. Enter the Pixi environment

3. Run commands inside Pixi

5. Set Up PAM for Audio Quality Assessment

Run Experiments

Benchmark Unmodified Model

Benchmark Model with Quality Aware Attention

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages