More about the Blueprint Tool
Foldit players posed several great questions about the Blueprint tool for our last Science chat, but we didn’t have time to answer all of them. We're long overdue for an in-depth explanation about the Blueprint tool, but it seems that players are finding the tool useful and we'd like to share more about it's mechanics. In particular, we hope this blog post can shed some light on the following question:
It has been pointed out that removing Blueprint tool constraints towards the end allows for substantial score improvement. Why is this, as it seems counterintuitive? – gitwut
Before I answer this question, I’d like to offer a little more background on the Blueprint tool:
There are two motivations behind the Blueprint tool: The first is simply to make “ideal loops” more accessible to players. The Ideal Loop Filter has helped Foldit designs tremendously, and the recent top-scoring designs have all had excellent loops. However, it seemed that players were required to do a lot of work in order to satisfy that filter. Hopefully, the Blueprint tool has made it easier (especially for beginners) to satisfy the Ideal Loop Filter.
The second motivation for developing the Blueprint tool is to provide an alternative design process. Some of us suspect that bad Foldit backbones are the result of aggressive loop building in middle- or late-game strategies. For example, suppose you're designing a protein and decide to form the loops last: by the time you build loops, you may have already cemented your helices and sheets into place and optimized the core packing of your protein, and as a result the backbone does not have a lot flexibility for rebuilding loops. The endpoints of two neighboring beta strands may be positioned such that there is no stable loop to bridge them. Aggressively using Rebuild/Remix to force a loop between incompatible endpoints is akin to hammering a square peg into a round hole. It will be impossible to close the loop without compromising the geometry of the backbone. We had hoped the Blueprint tool could be used early in the design process to quickly construct a "healthy" rough draft of a design, which could be gradually optimized without compromising the backbone geometry.
BuildingBlock Torsion Constraints
To answer gitwut's question, BuildingBlocks include torsion constraints. Torsion constraints force a residue to a certain region of the Ramachandran Map—much like Rubber Bands (which represent distance constraints) force two residues to be a certain distance from one another. When constraints are present, Wiggling a solution will not produce points as quickly, but the solution will try to follow the constraints. Broadly speaking, constraints allow us to redirect Wiggle toward a desired result, usually sacrificing short-term gains to find an ultimately better model.
Placing a BuildingBlock loop onto the Blueprint Panel introduces torsion constraints to the loop residues (likewise, removing the BuildingBlock removes the constraints). The torsion constraints are intended to preserve the BuildingBlock loop while a player develops the rest of his or her design. Constraints are needed in this case because the Foldit energy function does not necessarily favor the BuildingBlock loops. In fact, we don't fully understand why the BuildingBlock loops are so prevalent in natural proteins. These loops may be favored for reasons that are not explicitly modeled in Foldit—like folding kinetics, or more complex entropic effects. (In contrast, helices and sheets are naturally stabilized by hydrogen bond forces, which are captured by the Foldit energy function.) Without the torsion constraints, Wiggle is prone to obliterate the BuildingBlock loop in favor of more short-sighted energy gains. We intended that players might keep the constraints around to preserve the BuildingBlock loops until a design-in-progress has settled into a mature fold—only then removing the constraints for late-game refinement.
To make things even more complicated, note that we've manually adjusted how BuildingBlocks are applied through the Blueprint Panel. That is, when you drag a BuildingBlock onto the Blueprint Panel and the protein backbone snaps into place, this initial "adjusted" form is only a rough approximation of the loop's optimal form. When you Wiggle the loop, the torsion constraints will drag the backbone to its optimal shape, which may be slightly different from initial adjusted shape (this is particularly noticeable for β-hairpins BuildingBlocks). This is because the BuildingBlock loops are derived from native proteins, which never have perfectly ideal helices and sheets. If you were to apply the optimal BuildingBlock loops to Foldit's ideal beta strands, the ideal beta strands would not align to form hydrogen bonds (Figure A, above). In order to make the tool more user-friendly, we adjusted the optimal BuildingBlocks so that the hairpin loops would be compatible with Foldit's ideal sheets. Thus, a BuildingBlock hairpin will initially snap two ideal strands into perfect alignment (Figure B); and subsequent Wiggling will allow the beta strands to flex slightly, so that the BuildingBlock loop can relax into its optimal form (Figure C).
As an aside, some astute Foldit players have noticed that the BuildingBlocks collection is missing a BAAB β-hairpin, which is a stable loop frequently found in nature. As it turns out, this loop induces significant deformation of the adjacent beta strands. As much as we tried, we were unable to adjust the BAAB BuildingBlock so that it would be reasonably compatible with Foldit's ideal beta strands, and that particular loop was omitted from the BuildingBlocks collection.( Posted by bkoep 73 547 | Mon, 01/30/2017 - 20:33 | 7 comments )
Tuberculosis Challenge – Alternate Target
Tuberculosis (TB) is a disease that affects millions of people. We have posted a protein drug target puzzle previously on this topic. In our continued effort to make a dent in this disease, we have also partnered with the Sacchettini lab at Texas A&M University to post another drug target puzzle for TB.
The Sacchettini lab is working in collaboration with other groups on understanding biology and virulence factors of tuberculosis bacteria. The ability of Mycobacterium tuberculosis, which causes TB, to survive inside the host depends on sensing the environment and launching appropriate responses to stimuli. This means that specific protein production levels are strictly controlled and tuned. The machinery and the players of this carefully orchestrated battle against our immune system are poorly characterized. In general, protein production levels could be regulated on multiple levels and by different means. One of the ways involves small non-coding RNA molecules which aid in efficient translation of some mRNAs into proteins and the degradation of others. In pathogenic bacteria specifically, the regulation of production of the proteins required for virulence and intracellular survival has been shown to depend on small RNAs. Reviewed here (Oliva G., Sahr T., Buchrieser C. (2015). Small RNAs, 5’ UTR elements and RNA-binding proteins in intracellular bacteria: impact on metabolism and virulence. FEMS Microbiol. Rev. 39 331–349. :
To carry out their missions, small RNAs require protective chaperon protein – Hfq. Specifically, the protein structure adopts an Sm like fold composed of 6 subunits forming a homo-hexameric ring. Hfq and Sm proteins have been identified in numerous bacteria, yet no known homologs have been annotated in Mycobacterium tuberculosis genome. Through careful examination of secondary structure patterns predictions of the Mycobacterium tuberculosis proteome, Rv3208A has been proposed as a possible Hfq candidate.
If we were able to solve the structure, it would mean that we learn about machinery which has been shown to be important for virulence in other pathogens but is not characterized in Mycobacterium yet. By targeting this RNA chaperon protein, instrumental to any small RNA mediated responses, scientists can prevent Mycobacterium tuberculosis from survival inside human host.
Right now, the protein has been crystallized and diffraction data are available, but none of the models that scientists have created have helped to solve the phase and build the structure. By posting this protein, we are hoping that everyone can come up with a model that will help resolve the structure. As always, we are committed to publishing the work and sharing models created by Foldit players. Lets make a dent in TB!( Posted by free_radical 73 1765 | Tue, 11/29/2016 - 19:29 | 6 comments )
This blog post introduces a new tool for Foldit protein design. The Blueprint Panel displays the amino acid sequence and secondary structure of your protein. By default, each letter of the sequence is colored according to the φ and ψ torsions at that position in the structure, following the same ABEGO coloring scheme used by the Rama Map. Above the sequence, a secondary structure diagram reflects the sheet and helix assignment at each position. The Auto Structures button will detect sheets and helices, and make the appropriate secondary structure assignments.
The Blueprint panel is accompanied by the Building Blocks panel. Building blocks represent discrete patterns of protein backbone that can be applied to your protein structure. The building blocks provided here correspond to specific loops that are frequently observed in natural proteins (they were previously known as "Ideal Loops" in the Rama Map). All building blocks are meant to connect secondary structure elements directly, with a sheet or helix on either side. The appropriate use of a building block is dependent on this secondary structure "context." For example, a Helix-Sheet building block would make a good connection between a helix at position 20 and a sheet at position 23, but would not work well if the helix and sheet positions were reversed.
Click-and-drag a building block onto the Blueprint panel to apply the building block to your structure. Applied building blocks will remain outlined in the Blueprint panel. Applied building blocks will continue to exert torsional constraints wherever they are placed—these constraints behave like rubber bands for φ and ψ torsions, and will try to keep residues close to the original building block shape whenever you use Wiggle.
Click-and-drag a building block off of the Blueprint panel to remove the building block; this will also remove the associated torsional constraints. It is recommended that you leave building blocks and torsional constraints in place while you continue to fold a protein.
The Building Block panel includes two special building blocks next to the Context menu: one block is all-helix and another is all-sheet. These special building blocks can be placed on the Blueprint panel to shape residues into ideal helices and ideal sheets. They do not exert torsional constraints, and disappear immediately after they are applied.
The Blueprint Panel will be enabled in specific design puzzles. It can be accessed from the Actions menu in the Original Interface, or from the Main menu in the Selection Interface. Try it out now in Puzzle 1305!( Posted by bkoep 73 547 | Thu, 10/27/2016 - 22:14 | 0 comments )
New Tool Preview!
We’re excited about a new Foldit tool that has been developed for protein design! The Blueprint panel, alongside its partner Building Blocks panel, is meant to ease the construction of “ideal” loops. Check out the video below to see a prototype in action! We hope to start testing the new feature in devprev in a matter of days!( Posted by inkycatz 73 1765 | Tue, 10/25/2016 - 15:10 | 0 comments )
Foldit Plays for Jain Foundation / DYSF
As another example of applying Foldit to human disease, this month we have a puzzle on the protein dysferlin. The deficiency or absence of dysferlin causes one genetic type of Limb Girdle Muscular Dystrophy. Muscular dystrophy caused by dysferlin has autosomal recessive inheritance (meaning it is equally likely to affect females and males) and typical onset between the ages of 15 and 30. The UW Institute for Protein Design is conducting a research project on the structure and function of dysferlin for the Jain Foundation, a nonprofit foundation based in Seattle which supports research and the development of treatments for dysferlinopathy. The exact function of dysferlin is not completely understood, but it is thought to be involved in repair of the muscle cell membrane if it is damaged, and in resetting the muscle to a quiescent state following contraction. Sept. 30 is Limb Girdle Muscular Dystrophy Awareness Day, and we are introducing Puzzle 1291: Dysferlin C2B Domain to commemorate this day and to spread awareness to the Foldit community.
The following video features an interview with a neurologist on Limb Girdle Muscular Dystrophy, and with a patient who has dysferlin deficiency.
Ferlins are a family of transmembrane proteins which contain multiple C2 domains. The N-terminus is located inside the cell, and there is a single transmembrane domain near the C-terminus, which is located on the cell’s exterior. Ferlins are thought to participate in membrane fusion events and are involved in a variety of functions in many organisms. The first ferlin to be described is fer-1 in C. elegans, which is required for sperm function and hence fertility (giving rise to the name “fer”). Ferlins has also been described in drosophila and sea urchins. Deficiencies in two of the five mammalian ferlins have been associated with human disease. Otoferlin is required for transduction of signals from the inner ear to the nervous system for hearing, and its deficiency is a genetic cause of deafness. The most abundant dysferlin isoform in skeletal muscle is 2080 amino acids long, and contains at least seven C2 domains as well as additional protein domains of other types.( Posted by inkycatz 73 1765 | Thu, 09/29/2016 - 19:33 | 1 comment )