SEA-PHAGES Logo

The official website of the HHMI Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science program.

Welcome to the forums at seaphages.org. Please feel free to ask any questions related to the SEA-PHAGES program. Any logged-in user may post new topics and reply to existing topics. If you'd like to see a new forum created, please contact us using our form or email us at info@seaphages.org.

All posts created by anders

| posted 07 Nov, 2023 06:25
In the other thread about immunty repressors (https://seaphages.org/forums/topic/4952/), Debbie invited folks to post wet bench data.

My students and I have some preliminary evidence that the A1-like immunity repressor in the F1 phages Coco12 and Phanphagia is sufficient to defend against the A1 phage Bxb1.

Briefly,
–they made lysogens of five F1 phages, and found that lysogens of Coco12 and Phanphagia reduced plaque formation of superinfecting A1 phage Bxb1 (EOP ~10^-3). The other three lysogens (carrying GUmbie, Gandalph, and Veteran) had no effect on Bxb1 superinfection.

–They observed that Coco12 and Phanphagia carry the A1-like immunity repressor gene, wereas the other F1 phages in their tests do not. Perhaps this A1 repressor gene is causing the defense against Bxb1.

–They cloned the A1-like repressor ORF into the expression plasmid pLAM12, and they cloned the two gene operon including the A1-like repressor and its upstream gene into the integration plasmid pMH94. They transformed M.smeg with the plasmids and tested for Bxb1 plaque formation.

–Both plasmid clones appear to be sufficient for defense against Bxb1 infection, compared to strains carrying empty plasmids.

I'm attaching a summary of the experiments.
Posted in: Cluster F Annotation Tips2 Repressors in some Cluster F phages
| posted 09 Jun, 2023 22:08
As I tried to apply this to some proteins that we are annotating, I was confused because I couldn't replicate Rick's result with Q1 being in the first alpha helix.

Pollenz
Note that Usavi70 has alignments to the subjects with Q1 to all hits. Amino acid 1 is the start of a helix and there is also a second helix in Usavi70, but the Usavi70 has NO beta sheet that precedes the hits compared to the queries. See that the deeper analysis of the PDB queries show that they are all aligning well past amino acid 1 and all have a 7-9 amino acid beta sheet that precedes the helix 1 that is aligned to Usavi70.

Thus, the ribbon helix helix IS NOT supported for Usavi70 and should not be called. NCBI conserved domains also does not support either an HTH or DNA binding call as well. So best to leave a hypothetical NKF since all components of the domain are not present.

When I used the protein sequence that Tejas posted, I got HHPred alignments that did predict some beta sheet in front of the first alpha helix.

Then I realized that Rick may have used the draft genome protein that began at the Glimmer auto-annotated start, which was downstream of the start that Tejas had used for the amino acid sequence she posted.

If we use a start codon even further upstream, to make the protein longer at its N-terminus, we can get a pretty good HHPred alignment to ribbon-helix-helix structures that includes a full-length beta strand.

I've attached an image of the HHPred alignment with the longer amino acid sequence.

My interpretation is that HHPred alignments do support the ribbon-helix-helix domain, if translation begins at a start codon far enough upstream to contain the predicted beta strand.

For Usavi, it will come down to which start codon the annotators choose as the most appropriate start. For the genes we're working on, we will pay close attention to the key features of ribbon-helix-helix that Rick nicely explained above! ~Kirk
Posted in: Functional AnnotationUsavi_draft_70: potentially ribbon-helix-helix?
| posted 20 May, 2021 19:41
This is a follow-up to the toxin BrnT posts https://seaphages.org/forums/topic/4458/.

Like Paola (K5), SoSeph (K5) also has a pair of genes that looks like the BrnTA toxin/antitoxin pair of genes [SoSeph_draft_88 (57177-57455) and SoSeph_draft_89 (57457-57888 )]. As Jordan, Veronique and Kristen described in the BrnT discussion, the downstream antitoxin gene has a ribbon-helix-helix structure. In HHPred, it hits the KacA antitoxin crystal in and RelD and CopG antitoxin CCD domains, all of which are also members of type II toxin-antitoxin systems.

We propose that this ribbon-helix-helix protein be called "antitoxin in toxin/antitoxin system, BrnA-like". It has the predicted ribbon-helix-helix motif of the antitoxin, the predicted size (ie, larger than the toxin), and the predicted synteny (immediately downstream of the toxin in a potential two-gene operon).

BrnTA structure-function paper (Heaton et al., 2012) https://www.jbc.org/article/S0021-9258(20)53133-5/fulltext

The official function list includes the BrnT toxin but does not include the partner BrnA antitoxin.
Edited 20 May, 2021 19:51
Posted in: Request a new function on the SEA-PHAGES official listantitoxin in toxin/antitoxin system, BrnA-like
| posted 14 May, 2021 19:14
Thanks for this, Chris. I would like to change my suggestion to the 34328 start!

Knowing that mass spec data exist, I've been applying the "second-of-tandem-start-codons" rule when there is a tandem set at 4bp and 1bp overlaps, choosing the 1 bp overlap start. In my mind, I had thought the data applied to cases where a ribosome was potentially restarting after termination of the upstream gene (4 bp and 1 bp overlaps).

But the statement you quoted in the guide suggests it is more broadly observed.

Which makes me wonder: are the mass spec data relative to tandem start codons published someplace? It would be interesting to look at the range of cases observed.
Posted in: Choosing Start SitesF1 gene needs help on start site
| posted 13 May, 2021 22:26
Hi, here's how I think about this. I see that the coding potential extends further upstream than the Glimmer start at 34172, which strongly suggests that the start is further upstream. So the only possible starts are two that are right next to each other upstream of this, 34331 and 34328. Neither of them have a significant match to a Shine-Dalgarno sequence, so they seem equal with regard to SD. To me, it seems like an arbitrary choice to choose one over the other, and there would be a very minor difference in the protein of one amino acid at the N-terminus. Sounds like GM called the 34331 site in your analysis (it did not in my DNA Master auto-annotation just now). This makes the longest possible ORF and would match 1:1 with the start of SiSi_42 and two others. This would tip the balance for me and I would choose the 34331 start. Best wishes! -Kirk
Posted in: Choosing Start SitesF1 gene needs help on start site
| posted 07 Jan, 2021 23:38
Hi Maria,
The official function list does have Holliday junction resolvase. I found it near the bottom of the spreadsheet when we annotated some M2s recently.
Best, Kirk
Posted in: Functional AnnotationRuvC resolvase vs Holliday Junction Resolvase
| posted 09 Dec, 2020 00:02
Hi,

We're curious to hear what others think about this situation. We're working on CrunchyBoi and PineapplePluto and here's what we see.

CrunchyBoi, PineapplePluto, and Pabst (EK1) share a ~100 bp insertion on the right end of the genome relative to other EKs. Glimmer calls a gene in this region in all three, and there is coding potential across the orf in self-trained Genemark. The orf fills the gap created by the insertion. Its overlaps are a bit large (7 bp overlap with the upstream gene, and 14 bp overlap with the downstream gene). SD z=2.5. Nucleotide substitutions in the ORF among the three genomes change 14 codons: 10/14 codon changes are synonymous and do not alter the AA, and 3 of the 4 nonsynonymous changes make chemically conserved AA changes (V->A, N->D, E->K; non-conserved = W->G). We might expect to see this pattern if natural selection is keeping this ORF open and producing a protein. (I have attached views of the data.)

If the orf was in the middle of a genome, we would probably call it as a small gene. But the EKs are circularly permuted, and this reverse ORF sits over the boundary at the genome ends, causing us to proceed with more caution.

Should we call it a gene?

thanks!
Kirk
Edited 09 Dec, 2020 01:06
Posted in: Gene or not a Genegene at end of some EK1s?
| posted 03 Dec, 2020 01:10
Hi,
I'm having new problems with Genome Comparison in DNA Master. When I use it to look at map comparison and ANIs, the results show zero alignments were found: every gene in the map comparison is black and all the ANIs read 0.000. Genomes previously analyzed are fine, as if the results had been stored. But adding new genomes for analysis throws "File read" and "FASTA error" warnings.

I found that this problem does not happen when I use an older version (5.23.3, build 2657), which is the current state of DNA Master on most of our lab computers because we didn't use DNA Master in class this fall. But when a lab computer version gets updated (now 5.23.6, build 2674), the above problem is recreated.

I know that doing a fresh installation on my home computer does not help, and I have confirmed the Watson release code is set in preferences. Are there any other troubleshooting things I can do, or is something broken in the most recent update?

thanks, Kirk
Posted in: DNA MasterGenome Comparison
| posted 21 Aug, 2020 22:08
I can see it now! Thanks Steve
Posted in: PhameratorPhamerator down?
| posted 20 Aug, 2020 22:53
Thanks for posting this, Steve. I got a new router last week and am having the same issues. The modem and router are separate. When I connect directly by ethernet to the modem, I get to phamerator.org just fine. But when I try to connect using the router (wifi or ethernet), I get a 502 Bad Gateway response after a while. Despite that error through a browser, I can detect phamerator just fine with a ping request. I have tried to turn off the "Armor" security on the router, but it's not clear yet that this is actually off.

Kirk
Posted in: PhameratorPhamerator down?