Maximizing the potential of our increasingly vast base of scientific knowledge
We aren't yet at the stage where we can loose computers upon the stores of human knowledge only to return a week later with discoveries that would supplant those of Einstein or Newton in our scientific pantheon. But computational methods are helpful. Working in concert with people -- we are still needed to sort the wheat from the chaff -- computational programs and automated techniques can connect scientific areas that ought to be speaking to each other yet haven't, stitching together different fields until the interconnectivity between the different areas becomes clear.
In the fall of 2010, a team of scientists in the Netherlands published the first results of a project called CoPub Discovery. Their previous work had involved the creation of a massive database based on the co-occurrence of words in articles. If two papers both have the terms "p53" and "oncogenesis," for example, they would be linked more strongly than words with no two key terms in common. CoPub Discovery involved creating a new program that mines their database for unknown relationships between genes and diseases.
One of the most celebrated examples of drug repurposing is Viagra, which was originally designed to treat hypertension.
Essentially, CoPub Discovery automates the detections of relationships between thousands of genes and thousands of diseases, gene pathways, and even the effectiveness of different drugs. Doing this automatically allows many possible discoveries to be detected. In addition, CoPub Discovery also has a careful system of checks designed to sift out false positives -- instances where the program might say there is an association when there really isn't.
And it works! The program was able to find a number of exciting new associations between genes and the diseases that they may cause, ones that had never before been written about in the literature.
For example, there is a condition known as Graves' disease that normally causes hyperthyroidism, a condition in which the thyroid produces too much hormone. Symptoms include heat intolerance and eyes that stick out more prominently, yielding a somewhat bug-eyed appearance for sufferers. CoPub Discovery, when automatically plowing through the large database, found a number of genes that had never before been implicated in Graves' that might be involved in causing the disease. Specifically, it found a large cluster of genes related to something known as programmed cell death.
Programmed cell death is not nearly as scary as it sounds. Our bodies often require the death of individual cells in order to perform correctly, and there is a set of genes in our cells tailored for this purpose. For example, during embryonic development, our hands initially have webbing between the fingers. But prior to birth the cells in the webbing are given the signal to die, causing us to not have webbed hands. Webbed hands and feet only occur when the signal is given incorrectly, or when these genes don't work properly.
What CoPub Discovery computationally hypothesized is that when these programmed cell death genes don't work properly in other ways, a cascade of effects might follow, eventually leading to the condition known as Graves' disease. CoPub Discovery has also found relationships between drugs and diseases and determined other previously unknown effects of currently used drugs. For example, while a medicine might be used to help treatment for a specific condition, not all of its effects might be known. Using the CoPub Discovery engine and the concept of undiscovered public knowledge, it becomes possible to actually see what the other effects of such a drug might be.