Was this intended to be two separate lengths? The numbering is continuous across the gap, but it's not clear if the two gap ends are supposed to remain close to one another.
This puzzle contains two distinct chains that can be moved independently of one another. The chains may interact with one another and form disulfides with one another, but there is no reason that the gap ends should remain close together.
Foldit automatically renumbers all the residues in a puzzle. If there are multiple chains in a puzzle, then Foldit renumbers the first chain (from the N- to C-terminus), and then continues this numbering at the next chain (from its N- to C-terminus). The main purpose of the Foldit numbering is so that the program can keep track of each residue. (Importantly, the Foldit numbering does not correspond to conventional numbering that we use to talk about proteins. So, for example, if you read something on Wikipedia about "position 22 of insulin," that is not the same as residue number 22 in this Foldit puzzle.)
The synthesis of insulin in the body is actually a multi-step process. It actually starts out as a single chain, forms disulfide bonds, and then the middle portion of the single chain is excised by proteases, resulting in two distinct protein chains connected by disulfide bonds.
Thanks for the explanation, very interesting. Fascinating how this all is so intricate!
(Side note: if you could delete the duplicate post below, I'd appreciate it. I hit save twice, apparently.)
Totally geeking out, but here goes.
This puzzle has two chains, so there are two C terminals. (Also, two N terminals.)
The main reason that matters is atom numbering - the C terminal has an extra oxygen backbone atom, so the beta carbon, the first atom of the sidechain, is atom 6 instead of atom 5.
This scrap of code bands the beta carbons of the C terminals in 1501 to demonstrate:
band.AddBetweenSegments ( 21, 50, 6, 6 )
Lots of banding recipes offer to band beta carbons, but most probably miss the target when there are multiple chains. They'll end up banding to the extra oxygen atom in every C terminal except the last one. (Missed it by *this much*!)
This protein is available in the PDB as 1A7F. The two chains are identified as A and B. The A chain is segments 1 - 21 in Foldit, and the B chain is segments 22 - 50. (In the PDB, each chain is numbered starting at residue 1.)
If you look at the PDB header for 1A7F, the SSBOND records give the disulfide bridges:
SSBOND 1 CYS A 6 CYS A 11 1555 1555 2.02
SSBOND 2 CYS A 7 CYS B 7 1555 1555 2.02
SSBOND 3 CYS A 20 CYS B 19 1555 1555 2.02
(I removed some excess spaces there, Fortran fans beware. I'm actually not sure what the 1555s mean, but the 2.02s are the distance in Angstroms between the two sulfur atoms in the bridge.)
The first bridge is within the A chain, but the other two connect the A chain to the B chain. So "CYS A 6" and "CYS A 11" connects the cysteines at segments 6 and 11. Both are on the A chain, so the numbers are the same in Foldit.
The next bridge connects "CYS A 7" to "CYS B 7". Both are residue 7 in PDB terms, each in its own chain.
To convert the B chain number to the Foldit number, just add 21, the length of the A chain. So "CYS B 7" refers to Foldit segment 28. In the third bridge, "CYS B 19" becomes segment 40.
The bridges in bridge wiggle terms are:
6,11 7,28 20,40
Some puzzles may have more than two chains. The recent 1448: Classroom Puzzle: Amyloid Protein is the only one I can think of, but there may have been others. Since Foldit doesn't have chain ids, the only way to detect N terminals and C terminals in the middle of the protein is to look at the atom count from structure.GetAtomCount and compare it to the normal value for the same amino acid in the middle of the chain.
The N terminals have *two* extra atoms, but they're both hydrogens, which get counted after other atoms. So the extra N terminal atoms won't throw off beta carbon banding recipes. The extra oxygen in the C terminal is what changes the number of the beta carbon from 5 to 6.
Enough already. One day print protein will identify chains, and maybe the some weak spots in the wiki will get patched up.
this has already been mentioned regarding 1498...same prob here?
Are you referring to the erroneous high scores, or something else? Have you also experienced problems in this puzzle?