The official website of the HHMI Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science program.

Welcome to the forums at Please feel free to ask any questions related to the SEA-PHAGES program. Any logged-in user may post new topics and reply to existing topics. If you'd like to see a new forum created, please contact us using our form or email us at

All posts created by welkin

| posted 24 Feb, 2016 15:03
Nope, the tRNA cassettes are weird, and no you don't have to have a promoter for each of them or for each protein encoding gene interspersed within the cassette. We tend to steer clear of a tRNA and a protein occupying the same space, but there are definitely genomes where they get pretty close (take a look at the Ms). The only promoter rule is still about switching directions– presumably, switching transcriptional directions would still require two promoters even for tRNAs.
Posted in: tRNAsHow close can one pack protein and tRNA's genes
| posted 24 Feb, 2016 15:00
It does not strictly refer to NCBI. Frankly, I would prefer the phagesdb match over a random gene in GenBank that was likely deposited by auto-annotation software with no personal review.

The matches retrieved by BLAST at NCBI are also sorted according to age; identical newer hits are listed first. The point of listing the BLAST data is to demonstrate that you are examining similar genes and looking to see which starts have been used previously. Using the top hit for your notes is a convenience rather than a requirement: GenBank used to only list our phages rather than tons of auto-annotated prophages, so it didn't really matter where you got the alignment from. Noting the alignment of your gene to similar phage gene makes much more sense to me!
Posted in: Notes and Final FilesBLAST notes: PhagesDB or GenBank?
| posted 16 Feb, 2016 18:46
It does not mean accession number. The conserved domain database is part of NCBI and is searched by default when you BLAST a protein against GenBank through your browser rather than through DNA Master. If you have a CDD match, you will see a separate page with your CDD results, like in your attached figure. For this result, you would write "CDD, phage capsid".
Phage capsid is the domain ID.
You can also see CDD matches in Phamerator (the yellow boxes that appear on your genome map in your genes that you can browse over).
Posted in: Notes and Final FilesConserved Domain ID in notes
| posted 16 Feb, 2016 18:34
We want to make certain we are completing the information in DNA Master correctly. At our recent training we were given the two choices for the "LO:" description.

LO: Longest Reasonable ORF -or-
Not Longest Reasonable ORF (explain in notes below)

I think we have some confusion over this language and a worksheet we 'borrowed' from another school. That worksheet actually has the student record the length of the longest open reading frame. Am I correct that the DNA Master "Notes" section does not require this information?

Correct. you do not need to include the length of the open reading frame.

When would one choose an ORF that is not the longest reasonable ORF? If there was a long overlap, that would make it unreasonable? (We are finding some 90+bp overlaps that have been used in PhageDB.) If the SD sequence was 3' of the ATG that would make it unreasonable? What else might cause one to select an ORF other than the "Longest Reasonable ORF"?

the most common example of "not the longest reasonable ORF" is that the start chosen leaves a gap between the gene you are working on and the upstream gene that could be made smaller by choosing a alternate start. So I am not looking so much for reasonable vs not reasonable, as longest vs not longest. People were taking "longest" as the most important criteria, and saying that genes were not the longest ORF because there was one distant upstream start codon that could cause a 90% overlap with the upstream gene. That is not a "reasonable" start to consider.

If something has a 90bp overlap, I definitely want to know about it.

Can you give me some scenarios that might lead one to select the "longest 'unreasonable' ORF" or a "shorter-than-longest reasonable ORF"?

Sure. when the comparative genomics, like through STarterator or BLAST, shows that the start that gives you the longest ORF in your phage gene in your genome isn't present in closely related genes, and one that gives you a shorter gene product is. Solid comparative data trumps everything.

I hope that helps!
Posted in: DNA MasterLO: Designation Question
| posted 06 Oct, 2015 15:14
Yes. Your IT people can install DNA Master and give it admin rights for all users regardless of their privileges. It will run fine.

Posted in: DNA MasterRunning with Administrator Privileges