Limits...
Identifying TF-MiRNA Regulatory Relationships Using Multiple Features.

Shao M, Sun Y, Zhou S - PLoS ONE (2015)

Bottom Line: To further improve the accuracy of our inference, we also apply a mean reciprocal rank (MRR)-based method to the candidate peaks.We apply our pipeline to infer TF-miRNA regulatory relationships in mouse embryonic stem cells.The experimental results show that our approach provides very specific findings of TF-miRNA regulatory relationships.

View Article: PubMed Central - PubMed

Affiliation: School of Computer Science and Shanghai Key Lab of Intelligent Information Processing, Fudan University, 220 Handan Road, Shanghai 200433, China; Department of Computer Science and Engineering, Michigan State University, 428 S. Shaw Lane, East Lansing, 48824, USA.

ABSTRACT
MicroRNAs are known to play important roles in the transcriptional and post-transcriptional regulation of gene expression. While intensive research has been conducted to identify miRNAs and their target genes in various genomes, there is only limited knowledge about how microRNAs are regulated. In this study, we construct a pipeline that can infer the regulatory relationships between transcription factors and microRNAs from ChIP-Seq data with high confidence. In particular, after identifying candidate peaks from ChIP-Seq data, we formulate the inference as a PU learning (learning from only positive and unlabeled examples) problem. Multiple features including the statistical significance of the peaks, the location of the peaks, the transcription factor binding site motifs, and the evolutionary conservation are derived from peaks for training and prediction. To further improve the accuracy of our inference, we also apply a mean reciprocal rank (MRR)-based method to the candidate peaks. We apply our pipeline to infer TF-miRNA regulatory relationships in mouse embryonic stem cells. The experimental results show that our approach provides very specific findings of TF-miRNA regulatory relationships.

Show MeSH
The recall and removal rate of prediction on Oct4-miRNA relationships using known regulation between Oct4 and protein-coding genes or miRNA genes.In each panel, the x-axis denotes the parameter p of SVMlight. The y-axis denotes the recall (left panel) and the removal rate (right panel) of the prediction, respectively. f10 means 10% of the known positive examples are put into the unlabeled data,and f30, ⋯, f90 mean 30%, ⋯, 90% of the known positive examples are put into the unlabeled data, respectively.
© Copyright Policy
Related In: Results  -  Collection

License
getmorefigures.php?uid=PMC4414601&req=5

pone.0125156.g003: The recall and removal rate of prediction on Oct4-miRNA relationships using known regulation between Oct4 and protein-coding genes or miRNA genes.In each panel, the x-axis denotes the parameter p of SVMlight. The y-axis denotes the recall (left panel) and the removal rate (right panel) of the prediction, respectively. f10 means 10% of the known positive examples are put into the unlabeled data,and f30, ⋯, f90 mean 30%, ⋯, 90% of the known positive examples are put into the unlabeled data, respectively.

Mentions: We use recall and removal rate to evaluate the prediction performance of SVMlight throughout the study. Recall is defined as the fraction of predicted positive examples in the set of known positive examples. Removal rate means the fraction of examples predicted as negative in the unlabelled data. A sensitive and specific classifier should achieve a high recall rate while being able to remove a large number of negative peaks (i.e. high removal rate). Fig 3 shows the prediction performance using the positive training data that includes the peaks derived from known regulatory relationships between Oct4 and protein-coding genes or miRNA genes.


Identifying TF-MiRNA Regulatory Relationships Using Multiple Features.

Shao M, Sun Y, Zhou S - PLoS ONE (2015)

The recall and removal rate of prediction on Oct4-miRNA relationships using known regulation between Oct4 and protein-coding genes or miRNA genes.In each panel, the x-axis denotes the parameter p of SVMlight. The y-axis denotes the recall (left panel) and the removal rate (right panel) of the prediction, respectively. f10 means 10% of the known positive examples are put into the unlabeled data,and f30, ⋯, f90 mean 30%, ⋯, 90% of the known positive examples are put into the unlabeled data, respectively.
© Copyright Policy
Related In: Results  -  Collection

License
Show All Figures
getmorefigures.php?uid=PMC4414601&req=5

pone.0125156.g003: The recall and removal rate of prediction on Oct4-miRNA relationships using known regulation between Oct4 and protein-coding genes or miRNA genes.In each panel, the x-axis denotes the parameter p of SVMlight. The y-axis denotes the recall (left panel) and the removal rate (right panel) of the prediction, respectively. f10 means 10% of the known positive examples are put into the unlabeled data,and f30, ⋯, f90 mean 30%, ⋯, 90% of the known positive examples are put into the unlabeled data, respectively.
Mentions: We use recall and removal rate to evaluate the prediction performance of SVMlight throughout the study. Recall is defined as the fraction of predicted positive examples in the set of known positive examples. Removal rate means the fraction of examples predicted as negative in the unlabelled data. A sensitive and specific classifier should achieve a high recall rate while being able to remove a large number of negative peaks (i.e. high removal rate). Fig 3 shows the prediction performance using the positive training data that includes the peaks derived from known regulatory relationships between Oct4 and protein-coding genes or miRNA genes.

Bottom Line: To further improve the accuracy of our inference, we also apply a mean reciprocal rank (MRR)-based method to the candidate peaks.We apply our pipeline to infer TF-miRNA regulatory relationships in mouse embryonic stem cells.The experimental results show that our approach provides very specific findings of TF-miRNA regulatory relationships.

View Article: PubMed Central - PubMed

Affiliation: School of Computer Science and Shanghai Key Lab of Intelligent Information Processing, Fudan University, 220 Handan Road, Shanghai 200433, China; Department of Computer Science and Engineering, Michigan State University, 428 S. Shaw Lane, East Lansing, 48824, USA.

ABSTRACT
MicroRNAs are known to play important roles in the transcriptional and post-transcriptional regulation of gene expression. While intensive research has been conducted to identify miRNAs and their target genes in various genomes, there is only limited knowledge about how microRNAs are regulated. In this study, we construct a pipeline that can infer the regulatory relationships between transcription factors and microRNAs from ChIP-Seq data with high confidence. In particular, after identifying candidate peaks from ChIP-Seq data, we formulate the inference as a PU learning (learning from only positive and unlabeled examples) problem. Multiple features including the statistical significance of the peaks, the location of the peaks, the transcription factor binding site motifs, and the evolutionary conservation are derived from peaks for training and prediction. To further improve the accuracy of our inference, we also apply a mean reciprocal rank (MRR)-based method to the candidate peaks. We apply our pipeline to infer TF-miRNA regulatory relationships in mouse embryonic stem cells. The experimental results show that our approach provides very specific findings of TF-miRNA regulatory relationships.

Show MeSH