puzzle picture
1501: Revisiting Puzzle 58: Insulin Mutant
Status: Closed


1Beta Folders10,036100
2L'Alliance Francophone10,02374
3Go Science9,99354
4Void Crushers9,93638
5Anthropic Dreams9,93127
8Marvin's bunch9,7798
10Hun-Magyar Csapat9,4373
11DU CHEM 3705 Winter 20188,9742
14Trinity Biology8,7301
15BCC 8,7001
16Italiani Al Lavoro8,5961
17E?????! Heureka!8,4631
18DSN @ Home8,3491

Need this puzzle? Log in to download.  


NinjaGreg's picture
User offline. Last seen 2 days 3 hours ago. Offline
Joined: 05/21/2010
Groups: Go Science
Just out of curiosity ..

Was this intended to be two separate lengths? The numbering is continuous across the gap, but it's not clear if the two gap ends are supposed to remain close to one another.

bkoep's picture
User offline. Last seen 1 day 3 hours ago. Offline
Joined: 11/15/2012
Groups: Foldit Staff
Yes, two chains

This puzzle contains two distinct chains that can be moved independently of one another. The chains may interact with one another and form disulfides with one another, but there is no reason that the gap ends should remain close together.

Foldit automatically renumbers all the residues in a puzzle. If there are multiple chains in a puzzle, then Foldit renumbers the first chain (from the N- to C-terminus), and then continues this numbering at the next chain (from its N- to C-terminus). The main purpose of the Foldit numbering is so that the program can keep track of each residue. (Importantly, the Foldit numbering does not correspond to conventional numbering that we use to talk about proteins. So, for example, if you read something on Wikipedia about "position 22 of insulin," that is not the same as residue number 22 in this Foldit puzzle.)

The synthesis of insulin in the body is actually a multi-step process. It actually starts out as a single chain, forms disulfide bonds, and then the middle portion of the single chain is excised by proteases, resulting in two distinct protein chains connected by disulfide bonds.

NinjaGreg's picture
User offline. Last seen 2 days 3 hours ago. Offline
Joined: 05/21/2010
Groups: Go Science
Good information!

Thanks for the explanation, very interesting. Fascinating how this all is so intricate!

(Side note: if you could delete the duplicate post below, I'd appreciate it. I hit save twice, apparently.)

LociOiling's picture
User offline. Last seen 1 hour 55 min ago. Offline
Joined: 12/27/2012
Groups: Beta Folders
super subtle stuff

Totally geeking out, but here goes.

This puzzle has two chains, so there are two C terminals. (Also, two N terminals.)

The main reason that matters is atom numbering - the C terminal has an extra oxygen backbone atom, so the beta carbon, the first atom of the sidechain, is atom 6 instead of atom 5.

This scrap of code bands the beta carbons of the C terminals in 1501 to demonstrate:

band.AddBetweenSegments ( 21, 50, 6, 6 )

Lots of banding recipes offer to band beta carbons, but most probably miss the target when there are multiple chains. They'll end up banding to the extra oxygen atom in every C terminal except the last one. (Missed it by *this much*!)

This protein is available in the PDB as 1A7F. The two chains are identified as A and B. The A chain is segments 1 - 21 in Foldit, and the B chain is segments 22 - 50. (In the PDB, each chain is numbered starting at residue 1.)

If you look at the PDB header for 1A7F, the SSBOND records give the disulfide bridges:

SSBOND   1 CYS A  6  CYS A 11   1555   1555  2.02  
SSBOND   2 CYS A  7  CYS B  7   1555   1555  2.02  
SSBOND   3 CYS A 20  CYS B 19   1555   1555  2.02  

(I removed some excess spaces there, Fortran fans beware. I'm actually not sure what the 1555s mean, but the 2.02s are the distance in Angstroms between the two sulfur atoms in the bridge.)

The first bridge is within the A chain, but the other two connect the A chain to the B chain. So "CYS A 6" and "CYS A 11" connects the cysteines at segments 6 and 11. Both are on the A chain, so the numbers are the same in Foldit.

The next bridge connects "CYS A 7" to "CYS B 7". Both are residue 7 in PDB terms, each in its own chain.

To convert the B chain number to the Foldit number, just add 21, the length of the A chain. So "CYS B 7" refers to Foldit segment 28. In the third bridge, "CYS B 19" becomes segment 40.

The bridges in bridge wiggle terms are:

6,11 7,28 20,40 

Some puzzles may have more than two chains. The recent 1448: Classroom Puzzle: Amyloid Protein is the only one I can think of, but there may have been others. Since Foldit doesn't have chain ids, the only way to detect N terminals and C terminals in the middle of the protein is to look at the atom count from structure.GetAtomCount and compare it to the normal value for the same amino acid in the middle of the chain.

The N terminals have *two* extra atoms, but they're both hydrogens, which get counted after other atoms. So the extra N terminal atoms won't throw off beta carbon banding recipes. The extra oxygen in the C terminal is what changes the number of the beta carbon from 5 to 6.

Enough already. One day print protein will identify chains, and maybe the some weak spots in the wiki will get patched up.

frood66's picture
User offline. Last seen 2 days 12 hours ago. Offline
Joined: 09/20/2011
Groups: Marvin's bunch

this has already been mentioned regarding 1498...same prob here?

bkoep's picture
User offline. Last seen 1 day 3 hours ago. Offline
Joined: 11/15/2012
Groups: Foldit Staff
Which bug?

Are you referring to the erroneous high scores, or something else? Have you also experienced problems in this puzzle?

User login
Download links:
  Windows    OSX    Linux  
(10.12 or later)

Are you new to Foldit? Click here.

Are you a student? Click here.

Are you an educator? Click here.
Social Media

Only search fold.it
Other Games: Mozak
Recommend Foldit
Top New Users

Developed by: UW Center for Game Science, UW Institute for Protein Design, Northeastern University, Vanderbilt University Meiler Lab, UC Davis
Supported by: DARPA, NSF, NIH, HHMI, Amazon, Microsoft, Adobe, Boehringer Ingelheim, RosettaCommons