Researchers at the University of Toronto have discovered a fundamentally new view of how living cells use a limited number of genes to generate enormously complex organs such as the brain.

In a paper published on May 6 in the journal Nature entitled "Deciphering the Splicing Code," a research team led by Professors Brendan Frey and Benjamin Blencowe of the University of Toronto describes how a hidden code within DNA explains one of the central mysteries of genetic research -- namely how a limited number of human genes can produce a vastly greater number of genetic messages. The discovery bridges a decade-old gap between our understanding of the genome and the activity of complex processes within cells, and could one day help predict or prevent diseases such as cancers and neurodegenerative disorders.

When the human genome was fully sequenced in 2004, approximately 20,000 genes were found. However, it was discovered that living cells use those genes to generate a much richer and more dynamic source of instructions, consisting of hundreds of thousands of genetic messages that direct most cellular activities. Frey, who has appointments in Engineering and Medicine, likens this discovery to "hearing a full orchestra playing behind a locked door, and then when you pry the door open, you discover only three or four musicians generating all that music."

To figure out how living cells generate vast diversity in their genetic information, Frey and postdoctoral fellow Yoseph Barash developed a new computer-assisted biological analysis method that finds 'codewords' hidden within the genome that constitute what is referred to as a 'splicing code'. This code contains the biological rules that are used to govern how separate parts of a genetic message copied from a gene can be spliced together in different ways to produce different genetic messages (messenger RNAs). "For example, three neurexin genes can generate over 3,000 genetic messages that help control the wiring of the brain," says Frey.

"Previously, researchers couldn't predict how the genetic messages would be rearranged, or spliced, within a living cell," Frey said. "The splicing code that we discovered has been successfully used to predict how thousands of genetic messages are rearranged differently in many different tissues." Blencowe's group, including graduate student John Calarco, generated experimental data used to derive and test predictions from the code. "That the splicing code can make accurate predictions on such a large scale is a major step forward for the field," says Blencowe.

Frey and Blencowe attribute the success of their project to the close collaboration between their team of talented computational and experimental biologists. "Understanding a complex biological system is like understanding a complex electronic circuit. Our team 'reverse-engineered' the splicing code using large-scale experimental data generated by the group," Frey said.

Prof. Frey has appointments to the Canadian Institute for Advanced Research and the U of T's Department of Electrical and Computer Engineering, the Banting & Best Department of Medical Research (BBDMR) and the Department of Computer Science. Prof. Blencowe works in the University's Donnelly Centre for Cellular & Biomolecular Research and has appointments in the BBDMR and Department of Molecular Genetics

The research was supported by the Government of Canada through Genome Canada and the Ontario Genomics Institute, the Canadian Institutes of Health Research, National Cancer Institute of Canada, and Microsoft Research. Frey is an NSERC EWR Steacie Fellow and said that the fellowship was critical in freeing up resources so he could complete the project. The authors of the study are: Yoseph Barash, John A. Calarco, Weijun Gao, Qun Pan, Xinchen Wang Ofer Shai, Benjamin J. Blencowe & Brendan J. Frey.