Suggestion for SC Score system
Opened on: Tuesday, October 6, 2020 - 17:06
Last modified: Sunday, October 11, 2020 - 21:38
------ Concepts -------
-The Rosetta score is accurate, and works well for finding a protein that is stable on its own.
-Most of the metrics are useful for eliminating bad solutions.
For that purpose the score system is good, for example "Core existence".
For metrics where it is harder to reach the max value (like BUNS or SC), eliminating bad solutions is not enough.
We need a more accurate score system to compare solutions and make sure we keep the best one.
The score system is like a heuristic in artificial intelligence. If it compares solutions accurately,
there is less chance that a recipe misses a very good solution.
------ Playing experience -------
When playing, I often saw these situations:
After using some recipes, the score system gives more points when the protein is a bit detached, because that resolves some BUNS.
The recipes often mutate residues to hydrophobic ones too, to resolve BUNS. This should give fewer points.
If we detach the protein completely, we get the max score for BUNS; I think we should get less.
For SC, if only one sidechain is near the target and it has a very good SC, we get the max score, because the SC can be good with this sidechain alone.
This is a problem when a second sidechain comes into contact and the total SC is lower: it gives a lower score.
So a recipe can consider it a worse solution and go back to the detached position.
I think SC should be low if there are not enough sidechains in the contact zone, just as it is 0 when there is no contact.
------ Suggestion for SC Score -------
In the Metric UI we can show an additional number next to the SC value:
the "number of sidechains included in the SC computation" (so the sidechains near the target).
The more sidechains we have near the target, the more points we can get (algorithm to be defined).
If the protein is detached, we get 0 points.
We can have a max value for the number of sidechains needed.
For example, we can define 14 sidechains as the max; if a player has only 7 sidechains in contact, we give only 50% of the SC score (500 points if the SC is 0.6).
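The example above can be sketched in Python. This is only an illustration under stated assumptions: the function name `scaled_sc_points`, its parameters, and the idea of passing in the unscaled SC bonus as `base_points` are hypothetical and not part of Foldit. The cap of 14 sidechains comes from the example, and the 1000 base points are inferred from it (500 points being 50%).

```python
def scaled_sc_points(base_points: float, contacts: int, max_contacts: int = 14) -> float:
    """Scale the SC bonus by the fraction of needed sidechains that are in contact.

    base_points  -- hypothetical: the SC bonus the puzzle would give without scaling
    contacts     -- number of sidechains included in the SC computation
    max_contacts -- number of sidechains needed for the full bonus (14 in the example)
    """
    if max_contacts <= 0:
        raise ValueError("max_contacts must be positive")
    # Clamp so that a detached protein gets 0 and extra contacts give no extra bonus.
    counted = min(max(contacts, 0), max_contacts)
    return base_points * counted / max_contacts


print(scaled_sc_points(1000, 7))   # half of the needed contacts
print(scaled_sc_points(1000, 0))   # detached protein
print(scaled_sc_points(1000, 20))  # more contacts than needed
```

With this linear rule, 7 contacts out of 14 give half of the base points, a detached protein gives 0, and anything above 14 contacts is capped at the full bonus.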
I think that if we do that, it is a little like the Rosetta score system: each sidechain contributes a part of the score,
but with a limit on the number of sidechains included, because not all of the protein needs to be in contact.
So scientists can find those values to get something more accurate.
Of course, the function that gives the score depending on the number of sidechains can be non-linear.
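One possible non-linear version, as a sketch: raise the fraction of contacts to a power, so that the first few contacts earn little and the later ones earn more. The power-law shape, the name `contact_weight`, and the `exponent` parameter are assumptions chosen only for illustration; the real curve and its values would be for the scientists to pick.

```python
def contact_weight(contacts: int, max_contacts: int = 14, exponent: float = 2.0) -> float:
    """Return a multiplier between 0 and 1 for the SC bonus.

    exponent = 1 gives the linear rule; exponent > 1 penalises
    solutions with few sidechains in contact more strongly.
    All parameter names and defaults here are hypothetical.
    """
    if max_contacts <= 0:
        raise ValueError("max_contacts must be positive")
    # Clamp the count to the range [0, max_contacts] before taking the fraction.
    fraction = min(max(contacts, 0), max_contacts) / max_contacts
    return fraction ** exponent


# Multiplier for each contact count from detached (0) to the cap (14).
for n in range(0, 15, 7):
    print(n, contact_weight(n))
```

The multiplier would then be applied to the SC bonus, so a solution with a single very good sidechain in contact no longer beats one with several sidechains in contact.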
The same idea can be used for BUNS too.
Thanks for reading my suggestion :)