Abstract
We propose a method for finding cis-regulatory modules (CRMs) in a set of co-regulated genes. Each CRM consists of a set of transcription factor binding sites. We wish to find CRMs that involve the same transcription factors in multiple sequences. Finding such a combination of transcription factors is inherently a combinatorial problem, which we solve by combining the principles of itemset mining and constraint programming. The constraints involve the putative binding sites of the transcription factors, the number of sequences in which they co-occur, and the proximity of the binding sites. Genomic background sequences are used to assess the significance of the modules. We experimentally validate our approach and compare it with state-of-the-art techniques.
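To make the combinatorial formulation concrete, the following is a minimal sketch of the kind of search the abstract describes: it enumerates sets of transcription factors and keeps those whose putative binding sites fall within a proximity window in a minimum number of sequences. It is an illustration only. It uses brute-force enumeration rather than the constraint programming solver of the paper, it omits the significance assessment against genomic background sequences, and the function names, the parameters `max_span`, `min_support` and `max_size`, and the toy data are all assumptions introduced here.

```python
from itertools import combinations, product


def within_window(hits, tf_set, max_span):
    """Return True if one site per TF can be chosen so that all chosen
    sites lie within max_span base pairs of each other (proximity constraint)."""
    position_lists = [hits.get(tf, []) for tf in tf_set]
    if any(not positions for positions in position_lists):
        return False  # at least one TF has no putative site in this sequence
    return any(max(choice) - min(choice) <= max_span
               for choice in product(*position_lists))


def frequent_modules(sequences, max_span, min_support, max_size=3):
    """Enumerate TF sets (the 'itemsets') that satisfy the proximity
    constraint in at least min_support sequences (frequency constraint)."""
    all_tfs = sorted({tf for hits in sequences.values() for tf in hits})
    results = {}
    for size in range(2, max_size + 1):
        for tf_set in combinations(all_tfs, size):
            support = [name for name, hits in sequences.items()
                       if within_window(hits, tf_set, max_span)]
            if len(support) >= min_support:
                results[tf_set] = support
    return results


# Hypothetical input: sequence name -> TF name -> positions of putative sites.
sequences = {
    "gene1": {"TF_A": [100, 900], "TF_B": [130], "TF_C": [700]},
    "gene2": {"TF_A": [50], "TF_B": [90], "TF_C": [60]},
    "gene3": {"TF_A": [400], "TF_B": [1200]},
}

modules = frequent_modules(sequences, max_span=50, min_support=2)
for tf_set, support in modules.items():
    print(tf_set, "co-occur within 50 bp in", support)
```

Exhaustive enumeration like this grows exponentially with the number of transcription factors, which is presumably why the abstract turns to itemset mining and constraint programming to prune the search.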
Original language | English |
---|---|
Title of host publication | Proceedings - 2010 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2010 |
Pages | 363-368 |
Number of pages | 6 |
DOIs | |
Publication status | Published - 1 Dec 2010 |
Event | 2010 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2010 - Hong Kong, China. Duration: 18 Dec 2010 → 21 Dec 2010 |
Conference
Conference | 2010 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2010 |
---|---|
Country/Territory | China |
City | Hong Kong |
Period | 18/12/10 → 21/12/10 |
Keywords
- Cis-regulatory module
- Constraint programming
- Itemset mining