Output related questions #109

@JensUweUlrich

Hi,

I have some questions regarding the output of chopper layout. I calculated the layout for the viral RefSeq and got the following header lines as part of the output:

#HIGH_LEVEL_IBF max_bin_id:173
#MERGED_BIN_0 max_bin_id:153
#MERGED_BIN_1 max_bin_id:196
#MERGED_BIN_2 max_bin_id:117
#MERGED_BIN_3 max_bin_id:253
#MERGED_BIN_4 max_bin_id:127
#MERGED_BIN_5 max_bin_id:167
.
.
.
#MERGED_BIN_447 max_bin_id:34
#FILES  BIN_INDICES     NUMBER_OF_BINS
files.renamed/GCF_002826665.1_genomic.fna.gz    0;0     1;1
files.renamed/GCF_002219365.1_genomic.fna.gz    0;1     1;1
files.renamed/GCF_003847265.1_genomic.fna.gz    0;2     1;1
files.renamed/GCF_002826065.1_genomic.fna.gz    0;3     1;2
files.renamed/GCF_000915375.1_genomic.fna.gz    0;5     1;1
.
.
.
files.renamed/GCF_001995575.1_genomic.fna.gz    432     1
files.renamed/GCF_001041755.1_genomic.fna.gz    433;0   1;35
files.renamed/GCF_001502095.1_genomic.fna.gz    433;35  1;29
files.renamed/GCF_000903335.1_genomic.fna.gz    434     1
files.renamed/GCF_002116175.1_genomic.fna.gz    435;0   1;35
files.renamed/GCF_016811445.1_genomic.fna.gz    435;35  1;29
files.renamed/GCF_001308775.1_genomic.fna.gz    436     1
files.renamed/GCF_001041035.1_genomic.fna.gz    437     1
files.renamed/GCF_000865825.1_genomic.fna.gz    438     1
files.renamed/GCF_002826725.1_genomic.fna.gz    439;0   1;40
files.renamed/GCF_000839765.1_genomic.fna.gz    439;40  1;24
files.renamed/GCF_001602085.1_genomic.fna.gz    440     1
files.renamed/GCF_002628245.1_genomic.fna.gz    441     1
files.renamed/GCF_000887095.1_genomic.fna.gz    442     1
files.renamed/GCF_000924835.1_genomic.fna.gz    443     1
files.renamed/GCF_000922335.1_genomic.fna.gz    444     1
files.renamed/GCF_001654305.1_genomic.fna.gz    445     1
files.renamed/GCF_000848085.2_genomic.fna.gz    446;0   1;32
files.renamed/GCF_000923135.1_genomic.fna.gz    446;32  1;32
files.renamed/GCF_001316375.1_genomic.fna.gz    447;0   1;34
files.renamed/GCF_000875305.1_genomic.fna.gz    447;34  1;30
files.renamed/GCF_000893455.1_genomic.fna.gz    448     1

As far as I can see, these are all merged bins, but what does max_bin_id refer to? And how can I infer the topology of the hierarchy from the output?
How can I interpret the BIN_INDICES and NUMBER_OF_BINS columns?
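
To make the question about the two columns concrete, here is a minimal Python sketch that tabulates them from an excerpt like the one above. It assumes only what is visible in the pasted output: rows are whitespace-separated, lines starting with `#` are headers, and the values in BIN_INDICES and NUMBER_OF_BINS are separated by `;`. The function names `parse_layout_line` and `parse_layout` are hypothetical helpers and not part of chopper.

```python
from collections import Counter


def parse_layout_line(line):
    """Split one data row into (filename, bin_indices, number_of_bins).

    Assumption: the ';'-separated values are read as plain integers,
    in the order they appear. Their meaning is not interpreted here.
    """
    filename, indices, counts = line.split()
    return (
        filename,
        [int(x) for x in indices.split(";")],
        [int(x) for x in counts.split(";")],
    )


def parse_layout(text):
    """Parse all data rows, skipping '#' header lines and elided rows."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blanks, header lines, and anything without three columns
        # (for example the '.' placeholder lines in the excerpt).
        if not line or line.startswith("#") or len(line.split()) != 3:
            continue
        rows.append(parse_layout_line(line))
    return rows


sample = """\
#FILES  BIN_INDICES     NUMBER_OF_BINS
files.renamed/GCF_001041755.1_genomic.fna.gz    433;0   1;35
files.renamed/GCF_001502095.1_genomic.fna.gz    433;35  1;29
.
files.renamed/GCF_000903335.1_genomic.fna.gz    434     1
"""

rows = parse_layout(sample)

# How many rows carry one value per column versus two?
depth_counts = Counter(len(indices) for _, indices, _ in rows)

for filename, indices, counts in rows:
    print(filename, indices, counts)
print(dict(depth_counts))
```

In the excerpt, rows that share the same first index line up on their second values: for 433, the second index 0 plus the second count 35 gives the next row's second index 35 (likewise 0;3 with 1;2 is followed by 0;5). That pattern would be consistent with the second pair describing a start position and a width inside bin 433, but this is only an inference from the sample, not confirmed behavior.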

Cheers
Jens
