From CDS block information in ENA assembly files, the code needs to determine whether the protein associated with the CDS is a fragment or not. I'm thinking this might be as simple as checking for the ambiguous location symbols/formatting but I may need to write code to get the nucleotide sequence from the bottom of the Record entry and search the string for start and stop codons. Ideally, I don't have to do that but I might.
From CDS block information in ENA assembly files, the code needs to determine whether the protein associated with the CDS is a fragment or not. I'm thinking this might be as simple as checking for the ambiguous location symbols/formatting but I may need to write code to get the nucleotide sequence from the bottom of the Record entry and search the string for start and stop codons. Ideally, I don't have to do that but I might.