One of the leading challenges in contemporary biology is to discover and understand the regulation mechanisms of genes. Biological experiments for this purpose are typically costly and time consuming. As an example, it usually takes a year for biologist to verify a regulatory pathway between two genes and one knockout experiment to find the effect of deleting one gene from DNA is 100 dollars, but there are 6000 genes in the even simplest organism - yeast. On the other hand, a rich and diverse collection of high-throughput one biological data sources is currently available for elucidating transcriptional regulatory mechanisms from, for example, expression microarrays experiments. Thus, the corresponding challenge in the biocomputing sphere is (I) how to make use of existing data to infer documented and un-documented protein-protein, protein-gene causal relationships, (II) to provide biologist with hints on designing new experiments.
(1) Data Pre-processing: solving naming conventiona and normalization for different datasets
(2) Representation: using Regulatory Network
(3) Inference: clustering and combining different networks built from different datasets
(4) Verification: checking knock-out experiment result or annotated knowledge
(5) Demonstration: developing a visualization tool for resulting regulatory networks