puzzle picture
1501: Revisiting Puzzle 58: Insulin Mutant
Status: Closed

Summary

Name: 1501: Revisiting Puzzle 58: Insulin Mutant
Status: Closed
Created: 03/25/2018
Points: 100
Expired: 04/02/2018 - 23:00
Difficulty: Intermediate
Description: This is a throwback puzzle to the early days of Foldit. These are the two chains of a bio-engineered variant of human insulin, which contains six cysteine residues that oxidize to form three disulfide bonds. We are revisiting old Foldit puzzles so we can see how useful the recent additions to the game have been.

Sequence:
GIVEQCCTSICSLYQLENYCN / FVNQHLCGSHLVEALELVCGERGGFYTPK
Categories: Overall, Prediction

Top Groups

RankGroupScorePoints
1Beta Folders10,036100
2L'Alliance Francophone10,02374
3Go Science9,99354
4Void Crushers9,93638
5Anthropic Dreams9,93127

Top Evolvers

Top Soloists

RankPlayerGroupScorePoints
1RockOn 78 75  10,390100
2eusair 78 1584  10,33897
3retiredmichael 16 3 Beta Folders10,03294
4nicobul 42 22 L'Alliance Francophone10,02391
5LociOiling 2 1 Beta Folders10,00687


Need this puzzle? Log in to download.  

Comments

NinjaGreg's picture
User offline. Last seen 21 hours 18 min ago. Offline
Joined: 05/21/2010
Groups: Go Science
Just out of curiosity ..

Was this intended to be two separate lengths? The numbering is continuous across the gap, but it's not clear if the two gap ends are supposed to remain close to one another.

bkoep's picture
User offline. Last seen 53 min 2 sec ago. Offline
Joined: 11/15/2012
Groups: None
Yes, two chains

This puzzle contains two distinct chains that can be moved independently of one another. The chains may interact with one another and form disulfides with one another, but there is no reason that the gap ends should remain close together.

Foldit automatically renumbers all the residues in a puzzle. If there are multiple chains in a puzzle, then Foldit renumbers the first chain (from the N- to C-terminus), and then continues this numbering at the next chain (from its N- to C-terminus). The main purpose of the Foldit numbering is so that the program can keep track of each residue. (Importantly, the Foldit numbering does not correspond to conventional numbering that we use to talk about proteins. So, for example, if you read something on Wikipedia about "position 22 of insulin," that is not the same as residue number 22 in this Foldit puzzle.)

The synthesis of insulin in the body is actually a multi-step process. It actually starts out as a single chain, forms disulfide bonds, and then the middle portion of the single chain is excised by proteases, resulting in two distinct protein chains connected by disulfide bonds.

NinjaGreg's picture
User offline. Last seen 21 hours 18 min ago. Offline
Joined: 05/21/2010
Groups: Go Science
Good information!

Thanks for the explanation, very interesting. Fascinating how this all is so intricate!

(Side note: if you could delete the duplicate post below, I'd appreciate it. I hit save twice, apparently.)

LociOiling's picture
User offline. Last seen 8 hours 17 min ago. Offline
Joined: 12/27/2012
Groups: Beta Folders
super subtle stuff

Totally geeking out, but here goes.

This puzzle has two chains, so there are two C terminals. (Also, two N terminals.)

The main reason that matters is atom numbering - the C terminal has an extra oxygen backbone atom, so the beta carbon, the first atom of the sidechain, is atom 6 instead of atom 5.

This scrap of code bands the beta carbons of the C terminals in 1501 to demonstrate:

band.AddBetweenSegments ( 21, 50, 6, 6 )

Lots of banding recipes offer to band beta carbons, but most probably miss the target when there are multiple chains. They'll end up banding to the extra oxygen atom in every C terminal except the last one. (Missed it by *this much*!)

This protein is available in the PDB as 1A7F. The two chains are identified as A and B. The A chain is segments 1 - 21 in Foldit, and the B chain is segments 22 - 50. (In the PDB, each chain is numbered starting at residue 1.)

If you look at the PDB header for 1A7F, the SSBOND records give the disulfide bridges:

SSBOND   1 CYS A  6  CYS A 11   1555   1555  2.02  
SSBOND   2 CYS A  7  CYS B  7   1555   1555  2.02  
SSBOND   3 CYS A 20  CYS B 19   1555   1555  2.02  

(I removed some excess spaces there, Fortran fans beware. I'm actually not sure what the 1555s mean, but the 2.02s are the distance in Angstroms between the two sulfur atoms in the bridge.)

The first bridge is within the A chain, but the other two connect the A chain to the B chain. So "CYS A 6" and "CYS A 11" connects the cysteines at segments 6 and 11. Both are on the A chain, so the numbers are the same in Foldit.

The next bridge connects "CYS A 7" to "CYS B 7". Both are residue 7 in PDB terms, each in its own chain.

To convert the B chain number to the Foldit number, just add 21, the length of the A chain. So "CYS B 7" refers to Foldit segment 28. In the third bridge, "CYS B 19" becomes segment 40.

The bridges in bridge wiggle terms are:

6,11 7,28 20,40 

Some puzzles may have more than two chains. The recent 1448: Classroom Puzzle: Amyloid Protein is the only one I can think of, but there may have been others. Since Foldit doesn't have chain ids, the only way to detect N terminals and C terminals in the middle of the protein is to look at the atom count from structure.GetAtomCount and compare it to the normal value for the same amino acid in the middle of the chain.

The N terminals have *two* extra atoms, but they're both hydrogens, which get counted after other atoms. So the extra N terminal atoms won't throw off beta carbon banding recipes. The extra oxygen in the C terminal is what changes the number of the beta carbon from 5 to 6.

Enough already. One day print protein will identify chains, and maybe the some weak spots in the wiki will get patched up.

frood66's picture
User is online Online
Joined: 09/20/2011
Groups: Marvin's bunch
bug?

this has already been mentioned regarding 1498...same prob here?

bkoep's picture
User offline. Last seen 53 min 2 sec ago. Offline
Joined: 11/15/2012
Groups: None
Which bug?

Are you referring to the erroneous high scores, or something else? Have you also experienced problems in this puzzle?

Get Started: Download
  Windows    OSX    Linux  
Windows
(7/8/10)
OSX
(10.7 or later)
Linux
(64-bit)

Are you new to Foldit? Click here.

Are you a student? Click here.

Are you an educator? Click here.
Search
Only search fold.it
Recommend Foldit
User login
Soloists
Evolvers
Groups
Topics
Top New Users
Sitemap

Developed by: UW Center for Game Science, UW Institute for Protein Design, Northeastern University, Vanderbilt University Meiler Lab, UC Davis
Supported by: DARPA, NSF, NIH, HHMI, Amazon, Microsoft, Adobe, RosettaCommons