Systems Biology: An Evolving Approach in Drug Discovery and Development
Abstract:
Investments in systems biology approaches by the pharmaceutical
industry have not yet yielded the payoffs envisaged by many. In most
cases, a plethora of novel drug targets arising from genomics led to
many more failed projects in the pipeline, suggesting that target-based
drug discovery may not be an optimal strategy for the industry. Before
high-throughput `-omics' technologies and computer analysis became
commonplace, most drug candidates were laboriously screened in animal
systems to identify compounds that produced useful responses.
Interestingly, the targets of many of the compounds that became drugs
are still uncertain to this day. It is likely that drugs act on
multiple targets in concert over time, the identification of which will
require not only system level cataloguing and measurements, but next
generation multiscale systems modelling. The concept of a
`differentiated drug response' - elucidating and integrating responses
composed of a range of effects on different tissues and, importantly,
different time scales - may eventually prove to be the dominant
paradigm of systems biology research. In this article, we explore key
relevant concepts and technologies that we believe are critical for the
future of systems biology and its place in pharmaceutical research.
Keywords:
Research and development;
Research Tools;
Systems biology
Document Type: Leading article
Affiliations:
1:
1 Rosa & Co., La Jolla, California, USA
2:
2 PRTM, Newport Beach, California, USA
Anti-Bacterial Drug Discovery Using Systems Biology
Abstract:
The pipeline for new antibacterials is bleak despite the fact that infectious diseases account for a quarter of all worldwide deaths due to disease. Bacteria are ideal organisms for a systems biology approach to understanding pathogenesis by combined use of genomic technologies and computer algorithms. This approach can be applied to identify control points in molecular networks, which could be targets for novel drugs.
Keywords: Genomics; systems biology; molecular networks; antibacterials; pathogens
Document Type: Research article
DOI: 10.2174/138955706778742803
Affiliations: 1: Genesis Research and Development Corporation Limited, One Fox Street, Parnell, Auckland, New Zealand.
YEAR OF MIRACLES
Ever since 1900, when Gregor Mendel’s work on peas and inheritance was rediscovered, scientists have regarded the “gene” as the fundamental unit of heredity (just as the atom was regarded as the bedrock of pre-Einsteinian physics). Crick and Watson’s discovery of the DNA double helix as the carrier of hereditary information did little to disturb the status quo. In recent months, however, a perfect storm of new technology and research has blown apart 20th-century dogma. The notion of the Mendelian gene as a unit of heredity, scientists now realise, is a fiction.
Many scientists now believe that heredity is the result of an incredibly complex interplay among the basic components of the genome, scattered among many different genes and even the vast stretches of “junk DNA” once thought to serve no purpose. Biology has been building up to this insight for years, but some big puzzle pieces have now fallen into place. Once scientists abandoned their preconceived notions of genes and looked instead at individual DNA “letters” in the genome—the four bases A, C, T and G— they immediately began to see cause-and-effect connections to myriad diseases and human traits.
The result of this seemingly modest conceptual breakthrough has been a torrent of new discoveries. In five months this year, from April through August, geneticists at the Harvard/MIT Broad Institute, founded by Eric Lander; at deCODE Genetics in Iceland, founded by Kari Stefansson, and several other institutions have published papers suggesting that the key to a deeper understanding of the human genome may finally be in hand. These scientists have identified specific alterations in the sequence of DNA that play causative roles in a broad range of common diseases, including type 1 and type 2 diabetes; schizophrenia; bipolar disorder; glaucoma; inflammatory bowel disease; rheumatoid arthritis; hypertension; restless legs syndrome; susceptibility to gallstone formation; lupus; multiple sclerosis; coronary heart disease; colorectal, prostate and breast cancer, and the pace at which HIV infection causes full-blown AIDS. Unlike so many previous “disease gene” discoveries, these findings are being replicated and validated. “The race to discover disease-linked genes reaches fever pitch,” declared the leading British science journal Nature. Its American counterparts at Science chimed in: “After years of chasing false leads, gene hunters feel they have finally cornered their prey. They are experiencing a rush this spring as they find, time after time, that a new strategy is enabling them to identify genetic variations that likely lie behind common diseases.” That the world’s top two scientific journals still use the old language of “genes” to describe these discoveries shows how new the new thinking really is.
These findings are just a prelude to what’s shaping up as a true conceptual and technological revolution. Just as physics shocked the world in the 20th century, it is now clear that the life sciences will shake up the world in the 21st. In a handful of years, your doctor may be able to run a computer analysis of your personal genome to get a detailed profile of your health. This goes well beyond merely making predictions. A new technology called RNA interference may also allow doctors to control how your DNA is “expressed,” helping you circumvent potential health risks. Many common diseases that have preyed on humans for eons—devastating neurological conditions such as Alzheimer’s, Parkinson’s, cancer—could be eradicated. If this sounds outrageously optimistic, so did the promise of eliminating smallpox and polio to previous generations.
Why is all this happening now?What has changed between this year and last? To answer these questions, we need to trace the story of how mainstream biomedical scientists tried to link the cause of diseases to single genes and, despite early success, hit a brick wall. Meanwhile, a handful of renegade scientists, pursuing their own pet projects, happened to develop exactly the intellectual tools needed to break through that wall. These biologists are now the leaders of the new revolution in biomedical science.
The seeds of our new understanding were first sown in the 1960s, when molecular biologists figured out how genetic information is organised, regulated and reproduced inside single-cell bacteria. In bacteria, a gene is a discrete segment of DNA that contains the “code” that tells the cell how to make a particular type of protein. Bacterial genes are arranged along a single DNA molecule, with only tiny gaps in between. Since all organisms have DNA and work by essentially the same biochemistry, scientists assumed that a human genome would look like a larger version of a bacterium’s.
Clues that something was amiss came quickly with the development of DNA-sequencing methods in the 1970s. The first surprising result was: genes accounted for only 2 per cent of the human genome —the rest of the DNA didn’t seem to have any purpose at all. Biologists Phillip Sharp and Richard Roberts made things worse with a discovery that won them a Nobel Prize in 1993. If the gene were the basic unit of heredity, the DNA required to make any particular protein should be contained in its corresponding gene. But Sharp and Roberts found DNA codes for individual proteins are often split and scattered throughout the genome.
A visionary physician-scientist named Leroy Hood, now at the Institute for Systems Biology in Seattle, grasped the far-reaching implications of a fundamental fact: while even the simplest organism is immensely complicated, the primary structures of its most complicated parts — DNA and proteins — are very simple. The alphabet of DNA contains only the four chemical letters (or bases) A, C, G and T, and proteins are made from just 21 amino acids. Hood saw that this simplicity would make it possible for robots and computers to read and write DNA and proteins more quickly, accurately and cheaply than human beings.
Hood completely transformed the biomedical enterprise. DNA-writing machines give genetic engineers an unlimited capacity to create novel genes that can be studied in test tubes or added to the genomes of living organisms. And protein-writing and -reading machines provided drug firms with the ability to create a new generation of protein-based drugs. The DNA-reading machines suddenly made it conceivable to crack the 3 billion-base sequence of an entire human genome. In 1990 the U.S. government embarked on a 15-year, $3 billion project to do just that.
Eight years later, however, the project — parceled out to many US scientists — was still less than 10 percent complete. Now it was biotech entrepreneur Craig Venter who was frustrated. Convinced that government-funded workers were the problem rather than the solution, Venter enlisted private funding of $200 million to build an enormous lab filled with hundreds of automated machines, working 24/7, overseen by a handful of technicians. Within three years, the first reading of a human genome was essentially complete.
Armed with data from the genome project, scientists figured they’d surely be able to crack the really hard diseases, like cancer and heart disease. But a funny thing happened when they began to look closely at this vast storehouse of genetic information. Geneticists Andrew Fire and Craig Melo galvanised the field by discovering a key mechanism that had been completely overlooked— the cellular process of RNA interference. (They shared a Nobel Prize in 2006 for the work.)
Geneticists had taken for granted that the machinery of cells involved genes directing the production of proteins, and proteins doing the work of the cell. Here was a process that didn’t involve proteins at all. Instead, tens of thousands of hitherto mysterious regions of the human genome — part of the so-called junk DNA — directed the production of specific molecules called microRNAs (consisting of bits of RNA, a well-known component of cells). These microRNAs then oversaw a whole new process, called RNA interference (RNAi), that served to modulate the expression of DNA.
The good news was that RNAi could open up a whole new approach to biomedical therapy (more on that later). But RNAi also made it clear that the fundamental unit of heredity and genetic function is not the gene but the position of each individual DNA letter.
To make it all harder to fathom, each bit of DNA is susceptible to mutation and variation among individuals. Of the 3 billion DNA bases in the human genome, geneticists identified about one tenth of one percent (millions) that differ from one person to another. Variations in these particular letters — called “snips,” or SNPs, for single nucleotide polymorphisms — have replaced genes as the unit of heredity.
Many scientists responded to this devastating realization by going into a funk. Fortunately, another visionary scientist, Kari Stefansson of Iceland, was already blazing a trail out of this thicket. If the genome was far more complex than scientists had thought, they would need to test for many more variables, and to do that they would need more test subjects. To find the cause of diseases would now require the participation of very large groups of genetically related people.
Stefansson decided to solve this problem by taking aim at the largest well-documented extended family that he knew — his own. Nearly all the 300,000 citizens of Iceland can trace their ancestors back, through detailed, public genealogical records, to the Vikings who settled this desolate European island more than 1,000 years ago. He persuaded the Icelandic government to provide his company, decode, with exclusive access to the health records of its citizens in return for bringing investment capital and hi-tech jobs to the capital, Reykjavik. So far, more than 100,000 Icelandic volunteers have donated their DNA to deCODE.
The power of large numbers was soon apparent. In a study of obesity, Stefannson directed his software to look for SNPs associated with subsets of the population who were either extremely overweight or very thin. Within just a few hours, it began finding evidence that variations among particular DNA letters indeed played a causative role, confirming SNPs as the new unit of inheritance.
As of September, deCODE has made progress in identifying SNPs that may play a role in 28 common diseases, including glaucoma, schizophrenia, diabetes, heart disease, prostate cancer, hypertension and stroke. In some cases, such as glaucoma and prostate cancer, deCODE’s findings could lead to diagnostic tests for identifying people at risk of developing the disease. In other instances, such as schizophrenia, links to particular proteins have led to insight about the cause of the disease, which could lead to therapies.
Buoyed by Stefansson’s success, other geneticists were eager to perform large-scale family studies, yet few had similar access to ancient genealogical records. But serendipity would deliver an epiphany: it’s possible to study the entire human population as a single extended family, provided scientists collect enormous amounts of data. Eric Lander, an MIT professor and the intellectual leader of the US government effort to sequence the first human genome, realised scaling up would require a new approach. In 2004, Lander persuaded MIT and Harvard to combine their enormous resources toward the creation of the Broad Institute. Backed by $200 million from billionaire philanthropists Eli and Edythe Broad, the institute is driving the development of ever more advanced genetic technologies. One technology, based on computer-chip fabrication, can identify DNA base letters present at 500,000 SNPs in the genomes of 40,000 or more people.
Think of this as a spreadsheet with 500,000 columns (each representing a specific SNP) and 40,000 rows (one for each person). To hunt for a genetic basis for, say, bipolar disease, the computer searches rows of people who have the disorder, checking column by column for an unusually high frequency of particular letters in comparison with people without the disease. As it turns out, a collaboration of American and German researchers has done this work—and found that variations of DNA letters in 20 different positions are influential in bipolar disease.
Incredibly, most disease-causing variants are the most common ones present in the human population: the strongest-acting one, for instance, exists in 80 percent of people without bipolar disease and 85 percent of people with the disease. The implication is that these variants are beneficial in some way, and cause problems only when their number exceeds a threshold.
To make sense of this complexity, scientists would like ultimately to build a vast international database that contains the complete sequence of DNA bases in the genomes of hundreds of millions of people. Ideally, such a database would be available for analysis by all biomedical researchers and would provide the foundation for understanding the genetic components of all human traits. That sounds like a lot of data — think of a spreadsheet with 3 billion columns and 100 million rows — but computing power is getting cheaper by the year.
The explosion of genetic discoveries shows no sign of letting up any time soon. New diseases are being added to the list every month, and biologists are rapidly parlaying gene- and SNP-disease links into a deeper understanding of how proteins and other molecules can misbehave to cause different medical problems in different people. Other scientists are working to advance the biology revolution (accompanying interviews). As a result of their efforts, many children born this year could very well be alive and healthy at the dawn of the next century, when they may look back in awe at the annus mirabilis of biomedical genetics in 2007.
-LEE SILVER (Newsweek)
Since January, Berci over at Scienceroll has been writing about how Web 2.0 is changing medicine. He’s written a number of interesting articles, including Medical wikis: the future of medicine? and Medical Web 2.0 Sites.
In Web 3.0 and medicine, Berci writes about WikiProteins, a new site that plans to use Web 3.0 technologies to incorporate real time community annotation into a semantic framework. The article Meet the uber-wiki is a great review of the up-and-coming resource.
According to Nova Spivack, founder of Radar Networks, a San Francisco based startup that is developing a new web-based online service that will bring the power of the Intelligent Web to consumers, Web 3.0 is closer than you think (1). His company plans to launch their first product later this year.
What is Web 3.0 and what does it have to do with health?
Web 3.0 will bring together advanced technologies that include the semantic web and adaptive data mining, and move towards making content accessible by applications other than a web browser. Everyone will build the next layer of intelligence into the web using integrated tools for social networking, allowing for both interaction and collaboration. Web 2.0-style tagging will be formalized and expanded so that documents and other web data that now must be interpreted by humans can be read and understood by computers.
According to Wikipedia:
“The semantic web is an evolving extension of the World Wide Web in which web content can be expressed not only in natural language, but also in a form that can be understood, interpreted and used by software agents, thus permitting them to find, share and integrate information more easily.”
The medical industry was one of the first groups involved in the development of the semantic Web. The World Wide Web Consortium (W3C) launched the Health Care and Life Sciences Interest Group, chartered to:
” … develop and support the use of Semantic Web technologies and practices to improve collaboration, research and development, and innovation adoption in the of Health Care and Life Science domains. Success in these domains depends on a foundation of semantically rich system, process and information interoperability.”
Personal medicine and systems biology
Perhaps one of the greatest challenges for 21st century medicine is to provide effective therapies that are customized to a patient’s genetic and environmental profile to better manage their disease or predisposition toward a disease. Using new methods of molecular analysis, the goal of personalized medicine is to achieve optimal medical outcomes by assisting physicians and patients in the identification of the disease management approaches most likely to work best in the context of a patient’s unique biological state. Thus, personalized medicine is dependent upon an understanding of systems biology and on systems-based developed intervention (2).
In contrast to the reductionist “divide and conquer” approach that has been the paradigm of science since the time of the renaissance, systems biology is centered on the idea one can study complex biological systems by evaluating, in parallel, the interactions of DNA, RNA and proteins that network together in terms of perturbations and model organisms. Systems biology is a comprehensive understanding of how large numbers of interrelated components of a system comprise modules or networks whose functional properties emerge as a phenotype or disease state. Semantic web technologies as recommended by the W3C expand the current data standard technology for biological data representation and management and are of considerable importance to realize the promise of systems biology (3).
Phenotypes and disease states are influenced by genetic variation. Indeed, genetic variation greatly influences how the body processes medication. An example of this is the recent finding by two research groups that the cancer drug gefitinib (brand name Iressa), which produces rapid clinical response in approximately 10% of lung cancer patients, is due to a genetic mutation in their tumors (4-5). Environmental factors are also closely connected. The microbiome (meaning the collective genome of the human intestinal microbiota) (6) is the exact point where host genetics meets environment and is considered to be a necessary part of future personalized health-care paradigms (7). Many diseases such as heart disease, diabetes, obesity and cancer may be related to changes in the activities or composition of gut microbiota, and has likely been affected by antibiotic use over the last 50 years (8).
Systems biology also allows researchers to study the underlying mechanisms of human health in relation to diet. As our understanding of systems biology evolves, personalized nutrition will become central to disease prevention. Nutrigenomics, which studies how nutrients interact with humans, taking predetermined genetic factors into account, will bring about new insights into human health that will have a significant positive impact on our quality of life (9).
Much work needs to be done to move personalized medicine from promise to practice. Keith over at Omics! Omics! writes about the long slog, detailing the realities of a clinical trial by Millennium Pharmaceuticals that had a personalized medicine component for bortezomib (brand name Velcade) in multiple myeloma (10). Genes predictive of response or survival were identified, but an interpretation of the results is limited.
Personalized medicine in 2007
The Personalized Medicine Coalition (PMC) is a non-governmental, non-profit group established to foster discussion and advance the understanding and adoption of personalized medicine for the benefit of patients. The PMC website lists numerous research studies that reflect the evolution of personalized medicine.
There are a number of genetic tests currently available that can help predict likely responses or bad reactions to certain medications (11). Some of the more common tests are:
- Cytochrome P450 genotyping test, useful for certain antidepressant medications, anticoagulants, proton pump inhibitors and many other medications.
- Thiopurine methyltransferase test, used to test patients prior to chemotherapy for some leukemias.
- UGT1A1 TA repeat genotype test, used to test patients prior to chemotherapy for colorectal cancer.
- Dihydropyrimidine dehydrogenase test, used to test patients prior to chemotherapy.
In January 2007, about 1,000 patients with atrial fibrillation started participating in a study that matches their warfarin (brand name Coumadin) dose to their specific genetic needs using DNA fingerprinting (12). Those patients whose bodies break down the drug faster or slower than normal can be readily identified so that their doctors can adjust their dosage accordingly. Medco Health Solutions, a leading pharmacy benefit manager company based in Franklin Lakes, New Jersey, is collaborating in the effort with Mayo Clinic.
Edward Abrahams, Ph.D., Executive Director of the PMC predicts that if the study is successful, patients will start demanding personalized medicine. If the study proves to save money and protect patients, insurers will too.
Get ready … personalized medicine coming, perhaps sooner than you think.
References
- Ready for Web 3.0?. MSNBC. March 26, 2007.
- van der Greef et al. Metabolomics-based systems biology and personalized medicine: moving towards n = 1 clinical trials? Pharmacogenomics. 2006 Oct;7(7):1087-94.
View abstract
- Wang et al. From XML to RDF: how semantic web technologies will change the design of ‘omic’ standards. Nat Biotechnol. 2005 Sep;23(9):1099-103.
View abstract
- Paez et al. EGFR mutations in lung cancer: correlation with clinical response to gefitinib therapy. Science. 2004 Jun 4;304(5676):1497-500. Epub 2004 Apr 29.
View abstract
- Lynch et al. Activating mutations in the epidermal growth factor receptor underlying responsiveness of non-small-cell lung cancer to gefitinib. N Engl J Med. 2004 May 20;350(21):2129-39. Epub 2004 Apr 29.
View abstract
- Gill et al. Metagenomic analysis of the human distal gut microbiome. Science. 2006 Jun 2;312(5778):1355-9.
View abstract
- Nicholson et al. Gut microorganisms, mammalian metabolism and personalized health care. Nat Rev Microbiol. 2005 May;3(5):431-8.
View abstract
- Nicholson JK. Global systems biology, personalized medicine and molecular epidemiology. Mol Syst Biol. 2006;2:52. Epub 2006 Oct 3.
View abstract
- Desiere F. Towards a systems biology understanding of human health: interplay between genotype, environment and nutrition. Biotechnol Annu Rev. 2004;10:51-84.
View abstract
- Mulligan et al. Gene expression profiling and correlation with outcome in clinical trials of the proteasome inhibitor bortezomib. Blood. 2007 Apr 15;109(8):3177-88. Epub 2006 Dec 21.
View abstract
- Personalized medicine: Tailoring treatment to your genetic profile. Mayo Clinic. June 30, 2006.
- DNA-tailored medicine moves into mainstream. MSNBC. January 12, 2007.