Big Update: Experimental Client (Drug Design)
There is a new update to the ligand design client (experimental update group). In order to get the experimental client to work, follow the steps outlined here. There is a new puzzle, two new filters, and two new tools released with this update. Also, several bugs have been addressed and others noted:
Bug fixes that are complete
• Fix atom tree crashes when wiggling complex molecules/molecules clashing with backbone
• Fix errors associated with MMFF minimization
• Fix several loading issues
• Move MMFF to wiggle/shake/design submenu in selection tool
• Add new icon for MMFF
Known issues on the schedule for fixing
• When switching from design panel to undo, it is possible to have spheres left behind from the design panel.
• Original interface needs a position for MMFF button
• Switching tracks in undo menu works – occasionally
• Graphics for drawn 2D small molecules in Ligand Queue Tool needs to be greatly improved
• Scaling of bonus for Rule of Five and Similarity filter
New puzzle released
When a patient is diagnosed with HIV, they are almost immediately given anti-viral drugs that target HIV Protease 1. This protease is essential for the life cycle of HIV and inhibiting it helps prevent the spread of HIV within the body. To inhibit the protease, scientists have developed small molecules that trap the protease in a conformation that stops the enzyme from working normally.
To aid the scientists in their designs, they often look at specific ligand centric metrics. Ligand centric methods focus on attributes of the small molecule and ignore the protein. Two metrics, ligand similarity and Lipenski’s Rule of Five are often used during the design process. For this puzzle, we have provided two new filters that help track ligand similarity and Lipenski’s Rule of Five along with two new tools.
The new Rule of Five filter
Lipenski’s Rule of Five is a set of rules that are used to evaluate designed small molecules. The rules are based off of observations for known chemical entities that have made it to market. This is a ligand centric filter, meaning it only takes into account the small molecule being designed, not the protein – small molecule complex. You will be penalized if you violate the rules during design. The rules are:
• No more than 5 hydrogen bond donors
• No more than 10 hydrogen bond acceptors
• Molecular weight less than 500 daltons
• A logP of 5 or less
Molecular weight is determined by summing all the elements weights in a small molecule. For example, carbon has a weight of 12 and hydrogen has a weight of 1. If you have 6 carbons and 6 hydrogens, the weight would be 78. LogP is a measure of how soluble a small molecule is. The more soluble a molecule is, the more likely it is to be taken up into your body. LogP is determined based on the type of atoms present in a small molecule.
All of Lipenski’s rules are reported by the Rule of Five Filter; however, you can view these properties by clicking the ligand view tool.
The new Ligand Similarity filter
Ligand Similarity is used to describe the similarity between small molecules and known drugs. In the new HIV Protease puzzle, the Ligand Similarity filter is used specifically to identify how similar the designed small molecules are to known “tight” binders. This means that the higher the bonus received, the closer that you are to known chemical entities. This tops out at 75% similarity, to encourage looking for new designs. If you show the filter while using it, atoms that are similar to known binders will be highlighted. Changing those highlighted atoms will result in divergence away from known binders.
The new Ligand Queue tool
This tool will be changed from its current state to work more like the remix tool. The purpose of the Ligand Queue tool is to provide a set of small molecules to players; the set comes from actual experiments -– high-throughput screens, virtual high-throughput screens, and automated designs. For the HIV Protease puzzle, the small molecules that are shown are very low binders to HIV. Modifications of these ligands can result in making a small molecule that tightly binds HIV Protease. The ligands provide a good starting spot for design. This tool is still in a rough developmental state, but it is currently useful for providing context for a new concept.
High throughput screening and virtual screening are tools that scientists use to quickly identify small molecules that might bind a protein target. This is usually done by a robot that will screen millions of compounds rapidly. After the compounds have been screened, a process of identifying the molecule and modifying it occurs. This whole process is one of the first things done for each new protein target. We will discuss this more in a later blog post!
The new Ligand View tool
The ligand view tool provides ligand centric values that help you decide if the small molecule being designed is “drug-like”. The current values reported by the ligand view tool will be later incorporated into a filter to further guide you in your designs. In addition to the ligand centric metrics, there is also an iso surface tool that draws an isosurface around the ligand binding pocket to help better show packing against the ligand.
Enjoy! Post your questions and comments here and we'll do our best to get them addressed in future updates.( Posted by free_radical 105 1787 | Wed, 05/18/2016 - 19:52 | 3 comments )
Remix and Fragment Insertion
This post will cover the new Remix tool and the idea of Fragment Insertion in protein design.
A Fragment is a shape for a piece of protein backbone. Fragments can be of any size. A fragment of size 3 will be a shape for 3 residues in a row, size 9 will be for 9 residues.
When we insert a fragment, we are copying the shape of that fragment onto a piece of backbone. Think of it as copy/paste for a piece of backbone shape. Below you can see 3 different fragments in turquoise that were copied onto the backbone by Remix.
The collection of fragments that we copy/paste from is called a fragment library.
We want our fragment library to be filled with the best fragments possible - fragments that we’re confident are good shapes that will give our folds the highest chance of success.
So where does our fragment library come from?
Often times the best approach to protein folding (or anything, really) is to take what works and re-use it.
We have thousands of proteins from nature whose shape we already know. We’re certain that those shapes work because we have physical proof. By looking at these known shapes, we can look for fragments that are common in many natural proteins. We take these and make our fragment library out of them.
Then, when someone needs a shape for a piece of backbone, we look into our library and find fragments which we can copy/paste onto our protein. The tool that does this looking up and copy/paste is called the fragment picker.
Foldit's Fragment Pickers
Rebuild was the first and original fragment picker in Foldit. Rebuild picks from a library of fragments of size 3. When you run rebuild on a piece of backbone, it picks a random sub-piece of size 3 within your selection, looks up a fragment, then copies and pastes it onto your protein.
There are two problems with Rebuild. The first is that only having fragments of size 3 means that if you want a bigger fragment, you’re going to have to combine several smaller fragments, which is less scientifically valid.
The second and larger problem is that Rebuild isn’t very particular about which shapes it chooses. You just ask for fragments of size 3 and it gives them to you*. Then Rebuild does a bunch of work behind the scenes to try and make the fragments fit into your selection.
You can see the actual fragments Rebuild is trying to insert (no behind the scenes work to make it fit) by putting a cutpoint at one end of the selection and disabling cutpoint forces. Take a look at some of the results below:
As you can see, these fragments really don't fit very well. The blue band represents the gap between where the endpoint is and where it needs to be. The only way to make them "fit" requires destroying the original fragment in the process.
Remix tries to solve both of these problems. Firstly, Remix's fragment library has fragments from size 3 up to size 9.
Second, and more importantly, when you ask for a fragment out of Remix, it instead looks for a fragment that will naturally fit between the ends of your selection.
Here are some results of Remix without any modification after insertion:
All of these fragments just fit. The yellow band shows you the cutpoint is already close enough to be closed. In reality, we still "fix" the fragments from Remix as well, but they only need minor adjustment, so the fragment is left intact.
What this means is that Remix is much better at leaving you with more scientifically valid fragments.
* Rebuild does take your backbone sequence and secondary structure into account when doing a lookup, but no conformation information.
Using The New Remix
Remixing through the UI
To Remix a piece of backbone, select the piece and hit the Remix button (or, in the original interface, right click and hit the Remix button). This will pop up the Remix UI.
Let's go over the UI here. First off, there are arrow keys left and right. These let you cycle through the various fragments that Remix found for this selection. You can see which fragment you're currently looking at in the text below the buttons. The first fragment is always what you started with before you ran Remix, and won't change anything.
The Stop button accepts the currently shown fragment. You can also use the stop button in the upper left hand corner of the screen, and it will have the same effect.
Next to the text showing which fragment you have selected, you can also see a score. This score allows you to get an idea of how well fragments score without having to close the tool and shake the selection. Keep in mind it's only a rough estimate, and is only useful for comparing the results relatively. Your final score will likely be nothing like the score shown here.
Lastly, we have the Plus button. This button gives you access to the quicksave functionality of the new Remix tool.
When you press the button, you will see a new button pop up above.
Pressing this new Plus button will quicksave this fragment to Slot 1.
After saving, you can click that quicksave button to go back to that fragment. Pressing Plus for an additional fragment will give you the option of saving to a new quicksave slot, or overwriting an existing one. You can also press the Stop button that replaced the Plus in order to cancel the quicksave.
When you've got all the fragments you want, press Stop and these fragments will be available in your Quicksave slots. Pressing Ctrl-1 through Ctrl-8 will give you access to them.
Remixing with Scripts
Remix can also be accessed via scripts. Here's a quick tutorial on how to use it:
The function call for Remix is
When you run this function, it will Remix your selection and place up to num_results different results into your quicksave slots, starting at slot start_quicksave. It will return the number of results that were actually inserted, as sometimes there will not be as many as you have requested available.
Let's look at an example:
If there were 3 or more results, this would print "3" and place the results in quicksave slots 5,6,7.
If there were only two results available, it would print "2" and you would only have results in quicksave slots 5 and 6.
Fragment picking is best used for figuring out the loops of the protein. Loop shapes vary a lot more than other secondary structure, and so finding good loops is harder, and using actual fragments from real proteins becomes more important.
In general, it is best to use the larger fragments, since that gives you a bigger piece of good backbone in a way that several smaller Remixes may not.
Don't put too much value in the estimated score shown in the Remix UI. Differences of less than 100 points aren't very meaningful.
In the event that Remix doesn't find anything, try selecting one more or one less residue on either side of the selection. Often times this will be enough to give you a better range of results. This is easy to do in the Selection Interface, but requires some secondary structure reassigning in the Original Interface.
Lastly, after inserting a fragment, any changes to that selection will put you further and further from the fragment. As such, it's best if you can find a fragment that requires minimal modification to make it into your final design.( Posted by jflat06 105 1056 | Wed, 05/11/2016 - 00:16 | 4 comments )
Drug Design Update: Merk Molecular Force Field
Today we are releasing a new way to wiggle small molecules, the Merk Molecular Force Field (MMFF). This wiggle action uses a different scoring scheme (MMFF) than the current wiggle button. In the current implementation of wiggle, it is possible for the small molecule to “fold” in on itself. With MMFF, this shouldn’t be a problem. MMFF is also very good at optimizing hydrogen bonds and the torsional space of the small molecule. There are some optimization problems that need to be worked out with MMFF (as you can see from the videos), but for now, give it a try and let us know what you think! It should be noted that this is only available through the selection interface, for now.
In addition to the new wiggle of molecules, there are numerous bug fixes in this release. These include fixes for:
1) When switching menus, custom geometry does not disappear. (changing from design to pulling the molecule around)
2) Game freezes when rotamers are generated
3) Too much output in logs
4) When loop building around the ligand, game crashes
5) Replacing atoms crashes - a lot
6) Selection interface quirks
Enjoy, and please post all your bugs in feedback and suggestions and science questions right here in the thread.( Posted by free_radical 105 1787 | Tue, 05/03/2016 - 19:06 | 10 comments )
Drug Design Update: Tool Talk
We are now ready to deploy a series of tools for small molecule drug discovery! Our goal is to release new tools on a rolling basis for the next couple of weeks. Because the tools are still being tested and not guaranteed to be bug free, we have created an “experimental” user group. This user group is open to everyone who wants to test the drug discovery tools; however, we do ask, that you report any bugs that you find or any suggestions that you have for the interface. After all the tools have been tested, we will release the drug discovery tools to Foldit’s main client.
You should expect this build to be experimental. This means that you have a chance of losing your designs. Because of this, it is highly recommend that you keep a separate install outside of your main client. This will ensure that you do not lose your main game puzzle progress.
In order to get access for the experimental group, follow the steps for access to the devprev build. Instead of replacing “main” with “devprev” in the options.txt file, type “experimental”.
"update_group" : "experimental"
The first tool is a very simple tool and is an expansion of the very first puzzle’s tool released for small molecule drug discovery. It is simply named, “Ligand Design Tool”. This tool provides the ability to change the identity of elements, create bonds between atoms, add a predefined set of fragments, delete atoms, and delete bonds. The Ligand Design Tool is available in both the Selection Interface and in the original Foldit interface.
You may want to change your view to the ligand design view. This will let you see the hydrogens on the small molecule, which can be used to extend the ligand by a single atom. You can access this by changing the view options in the advanced options menu.
In addition to releasing the first tool, we have three new tutorials. The tutorials are a work in progress and will be updated when we have some down time. The tutorials are particularly exciting as they follow the progression of a scientific team as they design an inhibitor for the FKB binding protein.
The puzzle that we are using for testing is Dihydrofulate Reductase (DHFR). Read on to get some more information on DHFR!
Finally, here is what to expect in the upcoming weeks. We will have a new blog post for each topic explaining in detail the science behind each tool and the tool's purpose:
Ligand minimization with a new force field, MMFF, and a set of new filters to guide you in designing small molecules, the rule of five filter and a similarity filter. We will have an explanation of all these features later on, but here is a video of the MMFF minimization.
There is also the ligand queueing interface. This tool allows us to give you small molecules that have been pre-identified (through experiments or virtual high throughput screening) that might bind in the target protein. This tool will also allow you to share your small molecule designs between your teams.
Finally, we are also creating a tool that lets you design small molecules like medicinal chemists. This is called reaction based drug design and provides a synthetic pathway for organic chemists to design the small molecule you create.
Additionally, I should be around in chat on Friday (April 29, 2016) around 2PM Eastern to help answer questions! Think of it like an "office hour", where you can drop in and get things answered versus an actual scientist chat.
A Ramachandran plot is a way to examine the backbone conformation of each residue in a protein. It was first used by G.N. Ramachandran et al. in 1963 to describe stable arrangements of individual residues of a protein. Today, a Ramachandran plot is frequently used by crystallographers to identify protein models with an unrealistic backbone.
As many of you may recall, each residue of a protein has two rotatable bonds, which we designate φ and ψ. If we take a protein structure and measure the rotations about these bonds (between -180 and 180 degrees), then we can plot each residue with respect to its φ (x-axis) and ψ (y-axis). The result is a Ramachandran plot, where each black point is a residue of the protein:
Certain rotations are more stable than others: white areas of the Rama plot are unstable, and a residue in this space will have a bad backbone score; colored areas of the Rama plot are more stable, and a residue in this space will have a better backbone score.
The stable areas of the Rama Map in Foldit are divided into four regions, called ABEGO regions, and are colored accordingly:
- Red: Right-handed helix (characteristic of α-helix)
- Blue: Right-handed strand (characteristic of β-strand)
- Green: Left-handed helix (uncommon, except for GLY)
- Yellow: Left-handed strand (very uncommon, except for GLY)
Because the 20 different amino acid types have different properties, each amino acid type has a slightly different Rama profile. For example, most amino acids have a side chain that would clash with the backbone in a left-handed helix, so maps of these residues have only a faint green region. However, glycine has no side chain and can easily adopt a left-handed helix conformation, so its map has a large, intense green region.
Mouse over a point in the Rama Map to see its residue type and number in the upper right corner.
Click on a point to see the specific Rama profile for its amino acid type; this also selects the residue in Selection Mode.
Click and drag a point to change the φ and ψ rotations of a single residue's backbone.
The viewport at the top of the Rama Map will focus on a selected residue, and simply shows the local configuration of the protein backbone around the selected residue. Each residue in the viewport is colored according to the ABEGO region in which it lies. The ABEGO coloring scheme can also be applied to the main Foldit console in the View Options with View->AbegoColor.
When designing a protein, there are usually a number of different loop backbones that can connect α-helices and β-strands. However, we've found that certain types of loops occur frequently in native proteins, and that these "ideal" loops can be distinguished by ABEGO patterns. For example, the most common way to connect two β-strands is by a short hairpin, with two residues in left-handed helix (green) conformation.
The Foldit Rama Map includes a gallery of ideal loops, located in the drop menus in the upper right corner. Each drop menu displays a handful of ideal loops that can be used to connect some combination of α-helices and β-strands. These are provided as a reference for Foldit players, and we encourage players to try to incorporate these loop structures in their designs. Within each drop menu, the most common loops are listed at the top, but a less common loop may be preferred depending on the precise layout of α-helices and β-sheets in a design!
The Rama Map will be available to use in selected design puzzles. It can be accessed from the Actions menu in the Original Interface; or from the Main menu in Selection Interface. Try out the new Rama Map in the latest design puzzle!( Posted by bkoep 105 1360 | Wed, 03/16/2016 - 01:53 | 10 comments )