The Science Behind Foldit
What is protein folding?
What is a protein? Proteins are the workhorses in every cell of every living thing. Your body is made up of trillions of cells, of all different kinds: muscle cells, brain cells, blood cells, and more. Inside those cells, proteins are allowing your body to do what it does: break down food to power your muscles, send signals through your brain that control the body, and transport nutrients through your blood. Proteins come in thousands of different varieties, but they all have a lot in common. For instance, they're made of the same stuff: every protein consists of a long chain of joined-together amino acids.
What are amino acids? Amino acids are small molecules made up of atoms of carbon, oxygen, nitrogen, sulfur, and hydrogen. To make a protein, the amino acids are joined in an unbranched chain, like a line of people holding hands. Just as the line of people has their legs and feet "hanging" off the chain, each amino acid has a small group of atoms (called a sidechain) sticking off the main chain (backbone) that connects them all together. There are 20 different kinds of amino acids, which differ from one another based on what atoms are in their sidechains. These 20 amino acids fall into different groups based on their chemical properties: acidic or alkaline, hydrophilic (water-loving) or hydrophobic (greasy).
What shape will a protein fold into? Even though proteins are just a long chain of amino acids, they don't like to stay stretched out in a straight line. The protein folds up to make a compact blob, but as it does, it keeps some amino acids near the center of the blob, and others outside; and it keeps some pairs of amino acids close together and others far apart. Every kind of protein folds up into a very specific shape -- the same shape every time. Most proteins do this all by themselves, although some need extra help to fold into the right shape. The unique shape of a particular protein is the most stable state it can adopt. Picture a ball at the top of a hill -- the ball will always roll down to the bottom. If you try to put the ball back on top it will still roll down to the bottom of the hill because that is where it is most stable.
Why is shape important? This structure specifies the function of the protein. For example, a protein that breaks down glucose so the cell can use the energy stored in the sugar will have a shape that recognizes the glucose and binds to it (like a lock and key) and chemically reactive amino acids that will react with the glucose and break it down to release the energy.
What do proteins do? Proteins are involved in almost all of the processes going on inside your body: they break down food to power your muscles, send signals through your brain that control the body, and transport nutrients through your blood. Many proteins act as enzymes, meaning they catalyze (speed up) chemical reactions that wouldn't take place otherwise. But other proteins power muscle contractions, or act as chemical messages inside the body, or hundreds of other things. Here's a small sample of what proteins do:
- Amylase starts the process of breaking down starch from food into forms the body can use.
- Alcohol dehydrogenase transforms alcohol from beer/wine/liquor into a non-toxic form that the body uses for food.
- Hemoglobin carries oxygen in our blood.
- Fibrin forms a scab to protect cuts as they heal.
- Collagen gives structure and support to our skin, tendons, and even bones.
- Actin is one of the major proteins in our muscles.
- Growth hormone helps regulate the growth of children into adults.
- Potassium channels help send signals through the brain and other nerve cells.
- Insulin regulates the amount of sugar in the blood and is used to treat diabetes.
Proteins are present in all living things, even plants, bacteria, and viruses. Some organisms have proteins that give them their special characteristics:
- Photosystem I is a collection of proteins in plants that captures sunlight for photosynthesis.
- Luciferase catalyzes the chemical reaction that makes fireflies glow.
- Hemagglutinin helps the influenza virus invade our cells.
You can find more information on the rules of protein folding in our FAQ.
Why is this game important?
What big problems is this game tackling?
- Protein structure prediction: As described above, knowing the structure of a protein is key to understanding how it works and to targeting it with drugs. A small protein can consist of 100 amino acids, while some human proteins can be huge (1000 amino acids). The number of different ways even a small protein can fold is astronomical because there are so many degrees of freedom. Figuring out which of the many, many possible structures is the best one is regarded as one of the hardest problems in biology today and current methods take a lot of money and time, even for computers. Foldit attempts to predict the structure of a protein by taking advantage of humans' puzzle-solving intuitions and having people play competitively to fold the best proteins.
- Protein design: Since proteins are part of so many diseases, they can also be part of the cure. Players can design brand new proteins that could help prevent or treat important diseases.
How does my game playing contribute to curing diseases?
With all the things proteins do to keep our bodies functioning and healthy, they can be involved in disease in many different ways. The more we know about how certain proteins fold, the better new proteins we can design to combat the disease-related proteins and cure the diseases. Below, we list three diseases that represent different ways that proteins can be involved in disease.
- HIV / AIDS: The HIV virus is made up largely of proteins, and once inside a cell it creates other proteins to help itself reproduce. HIV-1 protease and reverse transcriptase are two proteins made by the HIV virus that help it infect the body and replicate itself. HIV-1 protease cuts the "polyprotein" made by the replicating virus into the functional pieces it needs. Reverse transcriptase converts HIV's genes from RNA into a form its host understands, DNA. Both proteins are critical for the virus to replicate inside the body, and both are targeted by anti-HIV drugs. This is an example of a disease producing proteins that do not occur naturally in the body to help it attack our cells.
- Cancer: Cancer is very different from HIV in that it's usually our own proteins to blame, instead of proteins from an outside invader. Cancer arises from the uncontrolled growth of cells in some part of our bodies, such as the lung, breast, or skin. Ordinarily, there are systems of proteins that limit cell growth, but they may be damaged by things like UV rays from the sun or chemicals from cigarette smoke. But other proteins, like p53 tumor suppressor, normally recognize the damage and stop the cell from becoming cancerous -- unless they too are damaged. In fact, damage to the gene for p53 occurs in about half of human cancers (together with damage to various other genes).
- Alzheimer's: In some ways, Alzheimer's is the disease most directly caused by proteins. A protein called amyloid-beta precursor protein is a normal part of healthy, functioning nerve cells in the brain. But to do its job, it gets cut into two pieces, leaving behind a little scrap from the middle -- amyloid-beta peptide. Many copies of this peptide (short protein segment) can come together to form clumps of protein in the brain. Although many things about Alzheimer's are still not understood, it is thought that these clumps of protein are a major part of the disease.
What other good stuff am I contributing to by playing?
Proteins are found in all living things, including plants. Certain types of plants are grown and converted to biofuel, but the conversion process is not as fast and efficient as it could be. A critical step in turning plants into fuel is breaking down the plant material, which is currently done by microbial enzymes (proteins) called "cellulases". Perhaps we can find new proteins to do it better.
Can humans really help computers fold proteins?
We’re collecting data to find out if humans' pattern-recognition and puzzle-solving abilities make them more efficient than existing computer programs at pattern-folding tasks. If this turns out to be true, we can then teach human strategies to computers and fold proteins faster than ever!
You can find more information about the goals of the project in our FAQ.
Foldit Scientific Publications
Robert Kleffner, Jeff Flatten, Andrew Leaver-Fay, David Baker, Justin B. Siegel, Firas Khatib and Seth Cooper. Foldit Standalone: a video game-derived protein structure manipulation interface using Rosetta. Bioinformatics (2017). [link]
Jacqueline Gaston and Seth Cooper. To three or not to three: improving human computation game onboarding with a three-star system. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (2017). [link]
Scott Horowitz, Brian Koepnick, Raoul Martin, Agnes Tymieniecki, Amanda A. Winburn, Seth Cooper, Jeff Flatten, David S. Rogawski, Nicole M. Koropatkin, Tsinatkeab T. Hailu, Neha Jain, Philipp Koldewey, Logan S. Ahlstrom, Matthew R. Chapman, Andrew P. Sikkema, Meredith A. Skiba, Finn P. Maloney, Felix R. M. Beinlich, Foldit Players, University of Michigan students, Zoran Popović, David Baker, Firas Khatib and James C. A. Bardwell. Determining crystal structures through crowdsourcing and coursework. Nature Communications 7, Article number: 12549 (2016), doi:10.1038/ncomms12549
Dun-Yu Hsiao, Min Sun, Christy Ballweber, Seth Cooper and Zoran Popović. Proactive sensing for improving hand pose estimation. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (2016). [link]
Dun-Yu Hsiao, Seth Cooper, Christy Ballweber and Zoran Popović. User behavior transformation through dynamic input mappings. Proceedings of the 9th International Conference on the Foundations of Digital Games (2014). [link]
George A. Khoury, Adam Liwo, Firas Khatib, Hongyi Zhou, Gaurav Chopra, Jaume Bacardit, Leandro O. Bortot, Rodrigo A. Faccioli, Xin Deng, Yi He, Pawel Krupa, Jilong Li, Magdalena A. Mozolewska, Adam K. Sieradzan, James Smadbeck, Tomasz Wirecki, Seth Cooper, Jeff Flatten, Kefan Xu, David Baker, Jianlin Cheng, Alexandre C. B. Delbem, Christodoulos A. Floudas, Chen Keasar, Michael Levitt, Zoran Popović, Harold A. Scheraga, Jeffrey Skolnick, Silvia N. Crivelli and Foldit Players. WeFold: a coopetition for protein structure prediction. Proteins (2014). [link]
Christopher B. Eiben, Justin B. Siegel, Jacob B. Bale, Seth Cooper, Firas Khatib, Betty W. Shen, Foldit Players, Barry L. Stoddard, Zoran Popović and David Baker. Increased Diels-Alderase activity through backbone remodeling guided by Foldit players. Nature Biotechnology (2012). [link]
Erik Andersen, Eleanor O'Rourke, Yun-En Liu, Richard Snider, Jeff Lowdermilk, David Truong, Seth Cooper and Zoran Popović. The impact of tutorials on games of varying complexity. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (2012). [link]
Firas Khatib, Seth Cooper, Michael D. Tyka, Kefan Xu, Ilya Makedon, Zoran Popović, David Baker and Foldit Players. Algorithm discovery by protein folding game players. Proceedings of the National Academy of Sciences of the United States of America (2011). [link]
Miroslaw Gilski, Maciej Kazmierczyk, Szymon Krzywda, Helena Zábranská, Seth Cooper, Zoran Popović, Firas Khatib, Frank DiMaio, James Thompson, David Baker, Iva Pichová and Mariusz Jaskolskia. High-resolution structure of a retroviral protease folded as a monomer. Acta Crystallographica (2011). [link]
Firas Khatib, Frank DiMaio, Foldit Contenders Group, Foldit Void Crushers Group, Seth Cooper, Maciej Kazmierczyk, Miroslaw Gilski, Szymon Krzywda, Helena Zábranská, Iva Pichová, James Thompson, Zoran Popović, Mariusz Jaskolski and David Baker. Crystal structure of a monomeric retroviral protease solved by protein folding game players. Nature Structural and Molecular Biology (2011). [link]
Seth Cooper, Firas Khatib, Ilya Makedon, Hao Lu, Janos Barbero, David Baker, James Fogarty, Zoran Popović and Foldit Players. Analysis of social gameplay macros in the Foldit cookbook. Proceedings of the 6th International Conference on the Foundations of Digital Games (2011). [link]
Seth Cooper, Firas Khatib, Adrien Treuille, Janos Barbero, Jeehyung Lee, Michael Beenen, Andrew Leaver-Fay, David Baker, Zoran Popović and Foldit Players. Predicting protein structures with a multiplayer online game. Nature (2010). [link]
Seth Cooper, Adrien Treuille, Janos Barbero, Andrew Leaver-Fay, Kathleen Tuite, Firas Khatib, Alex Cho Snyder, Michael Beenen, David Salesin, David Baker, Zoran Popović and Foldit players. The challenge of designing scientific discovery games. Proceedings of the 5th International Conference on the Foundations of Digital Games (2010). [link]
News Articles about Foldit
Foldit has been in dozens of publications over the years - to list them all would take a page of their own. For a sampling, please see our Center for Game Science page.
Check out the Rosetta@Home Screensaver to see how computers fold proteins using distributed computing.
Rules for Foldit community members can be found here.
Let's Foldit Podcast
You can listen to all episodes of the Let's Foldit podcast here.
Instructions for Educators
Thank you for using Foldit in your classroom! We have put together a set of instructions to assist you in setting up your students to play Foldit.
Terms of Service and Consent
The Foldit terms of service and consent can be found here.
You can find the researchers and supporters associated with this study on the game's credits page.