Knowledge Discovery in Multi-label Phenotype Data

A. Clare; R. King

DOI:10.1007/3-540-44794-6_4
Corpus ID: 15136247

@inproceedings{Clare2001KnowledgeDI,
  title={Knowledge Discovery in Multi-label Phenotype Data},
  author={Amanda Clare and Ross D. King},
  booktitle={European Conference on Principles of Data Mining and Knowledge Discovery},
  year={2001},
  url={https://api.semanticscholar.org/CorpusID:15136247}
}

Published in European Conference on… 3 September 2001
Biology, Computer Science

This work uses KDD to analyse data from mutant phenotype growth experiments with the yeast S. cerevisiae to predict novel gene functions, and learns rules which are accurate and biologically meaningful.

753 Citations

Background Citations: 242

Methods Citations: 291

Topics: Missing Values, Knowledge Discovery

Hierarchical multi-label classification for protein function prediction going beyond traditional approaches

F. Fotouhi, Chandan K. Reddy, Noor Alaydie

Computer Science, Biology
2012

The author proposed HiBLADE (Hierarchical multi-label Boosting with LAbel DEpendency), a novel algorithm that not only takes advantage of the pre-established hierarchical taxonomy of the classes but also exploits the hidden correlation among the classes that is not shown through the class hierarchy, thereby improving the quality of the predictions.

3 Citations

A Randomized Clustering Forest Approach for Efficient Prediction of Protein Functions

Hong Tang, Yuanyuan Wang, Shaomin Tang, Dianhui Chu, Chunshan Li

Computer Science, Biology
IEEE Access
2019

A novel ensemble MIML algorithm called multi-instance multi-label randomized clustering forest (MIMLRC-Forest) is proposed for protein function prediction; it builds a set of hierarchical clustering trees and applies a label transfer mechanism to identify the relevant function labels in the learning process.

Knowledge-based analysis of microarray gene expression data by using support vector machines.

M. S. Brown, W. Grundy, D. Haussler

Computer Science, Biology
Proceedings of the National Academy of Sciences…
2000

A method of functionally classifying genes by using gene expression data from DNA microarray hybridization experiments, based on the theory of support vector machines (SVMs), to predict functional roles for uncharacterized yeast ORFs based on their expression data is introduced.

A functional genomics strategy that uses metabolome data to reveal the phenotype of silent mutations

L. Raamsdonk, B. Teusink, Stephen G. Oliver

Biology, Chemistry
Nature Biotechnology
2001

It is demonstrated how the intracellular concentrations of metabolites can reveal phenotypes for proteins active in metabolic regulation, and this approach to functional analysis, using comparative metabolomics, is called FANCY—an abbreviation for functional analysis by co-responses in yeast.

1,004 Citations

Cluster analysis and display of genome-wide expression patterns.

M. Eisen, P. Spellman, P. Brown, D. Botstein

Biology, Computer Science
Proceedings of the National Academy of Sciences…
1998

A system of cluster analysis for genome-wide expression data from DNA microarray hybridization is described that uses standard statistical algorithms to arrange genes according to similarity in pattern of gene expression, finding in the budding yeast Saccharomyces cerevisiae that clustering gene expression data groups together efficiently genes of known similar function.

TRIPLES: a database of gene function in Saccharomyces cerevisiae

Anuj Kumar, K. Cheung, P. Ross-Macdonald, P. Coelho, P. Miller, M. Snyder

Biology, Computer Science
Nucleic Acids Res.
2000

Using a novel multipurpose mini-transposon, a collection of defined mutant alleles for the analysis of disruption phenotypes, protein localization, and gene expression in Saccharomyces cerevisiae is generated and cataloged in TRIPLES, a Web-accessible database of TRansposon-Insertion Phenotypes, Localization and Expression in Saccharomyces.

753 Citations

Feature selection for gene function prediction using multi-labelled lazy learning

A multi-label approach using binary relevance and decision trees applied to functional genomics

Multi-label Classification of Gene Function using MLPs

Hierarchical multi-label classification for protein function prediction going beyond traditional approaches

A Randomized Clustering Forest Approach for Efficient Prediction of Protein Functions

Diagnosis labeling with disease-specific characteristics mining.

Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization

An Adaptation of Binary Relevance for Multi-Label Classification applied to Functional Genomics

Comparing Several Approaches for Hierarchical Classification of Proteins with Decision Trees

Multi-label Classification with ART Neural Networks

37 References

Genome scale prediction of protein functional class from sequence using data mining

On the optimization of classes for the assignment of unidentified reading frames in functional genomics programmes: the need for machine learning.

Knowledge-based analysis of microarray gene expression data by using support vector machines.

A functional genomics strategy that uses metabolome data to reveal the phenotype of silent mutations

Prediction of Enzyme Classification from Protein Sequence without the Use of Sequence Similarity

Cluster analysis and display of genome-wide expression patterns.

Analysis of gene expression data using self‐organizing maps

TRIPLES: a database of gene function in Saccharomyces cerevisiae

A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection

C4.5: Programs for Machine Learning
