CrMP-Sol database: classification, bioinformatic analyses and comparison of cancer-related membrane proteins and their water-soluble variant designs
Keywords: Membrane protein; Protein design; QTY code; Machine learning; Protein
(_; this indicates that the protein does not have multiple chains) and domain name (_). The SCOP database has been linked from the PDB so that the structural class information of each protein can be obtained directly. Alternatively, one can search the SCOP database to obtain the str...
Big data classification has recently become an active research topic in domains such as healthcare, e-commerce, and finance. Including a feature selection step helps to improve the big data classification process and can be done
The WES data, CNA data, RNA sequencing data and metabolome data for this study have been deposited into the Genome Sequence Archive (GSA) database under accession codes PRJCA017539 (https://ngdc.cncb.ac.cn/bioproject/browse/PRJCA017539). TMT-based mass spectrometry (MS)-quantified protein ...
The importance and remarkable versatility of these enzymes, as well as the difficulties in their functional classification, create a need for an integrated source of information about them. Description - The B6 database http://bioinformatics.unipr.it/B6db contains documented B6-dependent activities ...
ONCOMINE: a cancer microarray database and integrated data-mining platform. Neoplasia. 2004;6(1):1-6. Nagi, S., Bhattacharyya, D.: Classification of microarray cancer data using ensemble approach. Network Modeling Analysis in Health Informatics and Bioinformatics, 1–15 (2013)...
[10–13]. In the bioinformatics domain, the hierarchical structure of GO was utilized to classify proteins based on various biological data, e.g., gene sequences and microarray data [10,14,15]. With respect to literature-based GO annotation, reports from text mining workshops have explored ...
Each has unique advantages: sketching has been shown to bound error better than sampling [9, 10], while systematic sampling (such as uniform sampling) can provide bounds on the number of samples from specific sections of the original data included in the generated subset. Both sketching and ...
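The per-section bound mentioned for systematic sampling can be illustrated with a minimal sketch. It assumes that "systematic sampling" means taking every k-th item from a random starting offset, which is one common reading and not necessarily the definition used in the source. The function names, the data size, and the step are hypothetical. Under that assumption, any contiguous section of length m that lies inside the data contains between floor(m/k) and ceil(m/k) sampled items.

```python
import math
import random


def systematic_sample(n_items, step, seed=None):
    """Return indices start, start + step, ... below n_items.

    The start offset is drawn uniformly from [0, step). This is an
    illustrative reading of 'systematic sampling', not the source's code.
    """
    rng = random.Random(seed)
    start = rng.randrange(step)
    return list(range(start, n_items, step))


def count_in_section(indices, lo, hi):
    """Count sampled indices i with lo <= i < hi."""
    return sum(1 for i in indices if lo <= i < hi)


# Hypothetical sizes chosen only for illustration.
N_ITEMS, STEP, SECTION_LEN = 1000, 10, 95

indices = systematic_sample(N_ITEMS, STEP, seed=0)

# Slide a window of SECTION_LEN over every position that fits in the data
# and record how many sampled items fall inside it.
counts = [
    count_in_section(indices, lo, lo + SECTION_LEN)
    for lo in range(0, N_ITEMS - SECTION_LEN + 1)
]

lower = SECTION_LEN // STEP            # floor(m / k)
upper = math.ceil(SECTION_LEN / STEP)  # ceil(m / k)
print(f"observed range: {min(counts)}..{max(counts)}; bound: {lower}..{upper}")
```

A sketch-based summary (for example, a random projection) gives no comparable per-section count, because it mixes contributions from all rows rather than selecting some of them.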
Overview of Datasets: The currently selected datasets are divided into three categories. There is a group of datasets focused on human regulatory functional elements, either produced from mining the Ensembl database, or from published datasets used in multiple articles. For promoters, we have imported...
Microarray data for both H1 and H2 are available in the ArrayExpress database [18] under accession numbers E-MTAB-5278 and E-MTAB-5279, respectively. The distribution of training and testing labels and their categories is depicted in Fig. 2. For the human samples, 18604 gene expression data...