sexta-feira, 29 de fevereiro de 2008

Genomics

Reconstruction of the Genomes

Drew Endy*

"I am the family face; flesh perishes, I live on, projecting trait and trace through time to times anon, and leaping from place to place over oblivion." So starts the poem Heredity by Thomas Hardy, whose protagonist personifies the observation that all life exists through a process of direct descent from one generation to the next. Scientifically, the replication and propagation of genetic material, as DNA or RNA, is the primary mechanism by which each generation transmits the instructions underlying the traits and traces of their offspring. On page 1215 of this issue, Gibson et al. (1) bypass nature's constraint of direct descent by combining information and raw chemicals to construct the entire set of genetic material, or genome, encoding a bacterium (see the figure). This first construction of a genome encoding a self-reproducing organism heralds important opportunities in both genetics and biotechnology, highlights the need for improved DNA construction technology, and reinforces the value of ongoing public discussion of the impacts of making organisms easier to engineer.

Figure 1 Genome construction. DNA sequencing technology decodes the genome of an organism. DNA synthesis and genome construction technologies enable the opposite process. Bacterial genomes can be built from DNA sequence information and raw chemicals.

CREDIT: ADAPTED FROM DREW ENDY

Gibson et al. used a multistage process to construct the genome of Mycoplasma genitalium. First, information defining the 582,970-base pair (bp) DNA sequence of the genome to be synthesized was obtained from a computer database and divided into shorter sections, or cassettes of DNA up to ~7000 bp long. Commercial DNA suppliers then constructed these cassettes. Raw chemicals derived from sugar cane were combined to synthesize specific oligonucleotides, short fragments of DNA up to several hundred base pairs long (2). The suppliers then combined subsets of oligonucleotides to produce the requested cassettes (3). Gibson et al. used a hierarchical scheme to assemble, check, and, as needed, repair ever-longer DNA fragments, eventually producing the full-length genome.

Given that all life is encoded by genetic material, ongoing and future advances in DNA synthesis and genome construction technology will be important. For example, the U.S. National Institutes of Health is estimated to spend ~$1.5 billion annually supporting the manual manipulation of DNA (4). Such work consumes most of the experimental effort for many biologists and biological engineers, a hidden opportunity cost that is harder to quantify. Moreover, the required slavish mastery of ad hoc methods and tedious tools for DNA manipulation discourages most students and researchers in fields such as physics, electrical engineering, and computer science from exploring biomedical and biotechnology research. Thus, an improved ability to provide any DNA molecule quickly, reliably, and economically would enhance and expand life sciences and engineering research (5), and might well become the goal of well-coordinated public research programs. Unfortunately, no such programs exist today.

Meanwhile, consider that most early discoveries of genetically encoded functions depended on analysis of the linkage between natural or randomly generated mutations and phenotypes (6), a powerful approach akin to blindly smashing many cars with a hammer and then determining which broken parts matter by attempting to drive each machine. Over the past 30 years, the invention (7) and development (8) of DNA sequencing technology have provided a complementary approach for discovering genetic functions. By comparing DNA sequence information from different organisms, researchers can now identify sequences that have remained relatively constant throughout millions of years of evolution (9). The presence of a DNA sequence across distantly related organisms implies that disruption of the sequence via an evolutionary "hammer" would have produced a deleterious effect on the organism, and thus the conserved sequence likely encodes an important function.

However, two additional approaches are needed to confirm and exhaustively identify all functions encoded by a natural DNA sequence. Specific DNA sequences thought to affect phenotypes must be purposefully changed and the expected effect confirmed. Also, seemingly irrelevant DNA sequences must be removed, disrupted, or otherwise modified and shown to be unnecessary. To date, the application of these additional approaches has been limited to short DNA sequences (10) or well-studied organisms (11). In developing their genome construction methods, Gibson et al. are hoping to more readily explore whether genes that can be individually disrupted (12) might also be disrupted in combination. Going forward, the ability to implement many simultaneous and directed changes to natural DNA sequences (13) and to build and test synthetic systems (14) will give researchers a powerful new "hammer" for constructing how life works.

The 582,970-bp "synthetic" genome produced by Gibson et al. also unequivocally demonstrates that it is now possible to construct the genomes for all known human viruses, including strictly regulated pathogens (such as smallpox), from publicly available DNA sequence data, methods, and materials. For now, the process of genome construction, as well as the production of an infectious agent given a newly synthesized but inert genome, requires highly skilled experts and considerable resources (Gibson et al. must still demonstrate that their synthesized genome will encode a living bacterium). In the meantime, recent international efforts to establish and coordinate best safety and security practices among competing DNA suppliers can be celebrated and improved (15). And, new efforts might focus on developing professional societies and improved standards of practice among biological engineers.

References

  1. D. G. Gibson et al., Science 319, 1215 (2008); published online 24 January 2008 (10.1126/science.1151721).
  2. Y. Sanghvi, A Roadmap to the Assembly of Synthetic DNA from Raw Materials, http://hdl.handle.net/1721.1/39657 (2007).
  3. S. J. Kodumal et al., Proc. Natl. Acad. Sci. U.S.A. 101, 15573 (2004).
  4. H. Bügl et al., A Practical Perspective on DNA Synthesis and Biological Security, http://dsspace.mit.edu/bitstream/1721.1/40280/1/PPDS.pdf (2006).
  5. D. Baker et al., Sci. Am. 294, 44 (June, 2006)
  6. F. W. Studier, R. Hausmann, Virology 39, 587 (1969).
  7. F. Sanger et al., Proc. Natl. Acad. Sci. U.S.A. 74, 5463 (1977).
  8. R. Carlson, Biosecur. Bioterror. 1, 203 (2003).
  9. G. Bejerano et al., Science 304, 1321 (2004).
  10. T. D. Schneider, G. D. Stormo, Nucleic Acids Res. 17, 659 (1989).
  11. A. W. Murray, J. W. Szostak, Nature 306, 189 (1983).
  12. C. A. Hutchison et al., Science 286, 2165 (1999).
  13. L. Y. Chan et al., Mol. Syst. Biol. 1, 2005.0018 (2005).
  14. M. B. Elowitz, S. Leibler, Nature 403, 335 (2000).
  15. H. Bügl et al., Nat. Biotechnol. 25, 627 (2007).

Nenhum comentário: