Learn Bioinformatics via Playing online Games

image

(1)The Cure

The Cure is a serious, biology-based card game. Assemble the best hands and you can win - just like poker. The challenge with The Cure is that we don't know the rules yet! Until you play, we can't tell you if you have the equivalent of a royal flush or a pair of 2s. By playing the game, you will be both teaching and learning the rules of nature.
image

Background

Using the latest technology we can now assess the activity level of more than 20,000 known human genes and identify literally millions of variations in every human genome. The challenge now is to make use of all of this data. For example, if a woman has been diagnosed with breast cancer we would like to be able to predict whether the cancer will metastasize and how quickly because we can use this information to make decisions about treatment - e.g. whether or not she should receive chemotherapy. Given a collection of samples from tumors that did and did not end up spreading, we take measurements using all the technology at our disposal and then attempt to identify consistent patterns. If a particular gene is much more active than normal in all the samples that eventually spread, this might be a good signal to look for in future patients.

Challenge

In seeking these patterns we are faced with two key problems, reproducibility and a combinatorial explosion. First, it often happens that patterns observed in one dataset do not appear with the same strength in new datasets. This is partly caused by natural biological variability and partly by the relative immaturity of the measuring technology that is being applied. Aside from reproducibility, the patterns themselves may be very complex. It may take a unique combination of 25 different variables to produce an excellent predictive pattern. The trouble is that it is impossible to test all such potential combinations. If we assume 20,000 human genes, there are on the order of 1082 possible 25 gene combinations! (People estimate that the total number of atoms in the universe is close to 1080.)

The idea

  • One advantage when it comes to addressing these problems is our increasing understanding of biology. Our knowledge of genes, biological mechanisms and disease is growing exponentially and much of this knowledge is available on the Web. When we start looking for predictive patterns in a dataset such as the breast cancer example, we can use this knowledge to guide our search. TheCure was created as a fun way to solicit help in guiding the search for stable patterns that can be used to make biologically and medically important predictions. When people play TheCure they use their knowledge (or their ability to search the Web or their social networks) to make informed decisions about the best combinations of variables (e.g. genes) to use to build predictive patterns. These combos are the ‘hands’ in TheCure card game. Every time a game is played, the hands are evaluated and stored. Eventually predictors will be developed using advanced machine learning algorithms that are informed by the hands played in the game.

Who can play?

  • Anyone is welcome to play. The more you know about biology and disease at the level of gene function, the better you are likely to do, but you can also learn as you go. The game provides a lot of information about the genes as well as links off to related Web resources. We hope that anyone who plays will learn something about gene function.

How do you evaluate the quality of the data provided by game players?

  • The predictors generated using The Cure data are evaluated for accuracy on independent test datasets - just like any other predictor inferred by experts or by statistics would be. By testing on real data, we can tell the good players apart from those that are guessing randomly. Since each player action in the game is associated with their account, it is then very easy to filter out data that is not useful. This approach, while it may seem strange for a scientific project, follows the ‘publish then filter’ approach that has made the Web so successful. We hope that it encourages many people to share their time and their intelligence with the project.
image

(2)Dizeez!

Dizeez game now allows players to select a specific area of biology (for example, by disease or protein family) that best matches their expertise. Also, a “recap” functionality has been added at the end of the game, that shows supporting evidence (in Gene Wiki and PubMed GeneRIFs) for each gene-disease association played. Users can review the game log and even suggest new evidences.
  • After removing all gene-disease links already annotated in Gene Wiki/OMIM/PharmGKB, there were 18 assertions  that were provided by players two or more times. We quickly ranked them using theNormalized Medline Distance (NMD) as a proxy for gene and disease co-occurrence in Pubmed articles.  Results could easily be verified by a quick literature search. The top five associations are summarized here:
image

The Rules

  1. You are shown one gene name
  2. You are also shown five diseases
  3. Pick the disease that is linked to the gene to get points
  4. Get as many points as you can in one minute
image

(3)GenESP

image
image

Other Games:

  1. Science Game Center Science learning games collection!
  2. Phylo Play Tetris with DNA!
  3. Mobianga! Suggest and learn Gene-disease connections.
  4. Foldit Fold proteins!
  5. eteRNA Build RNA molecules!
  6. Eyewire Trace neurons!

The Cure is a serious, biology-based card game. Assemble the best hands and you can win - just like poker. The challenge with The Cure is that we don't know the rules yet! Until you play, we can't tell you if you have the equivalent of a royal flush or a pair of 2s. By playing the game, you will be both teaching and learning the rules of nature.