Comput. Biol. Med. - Forest classification trees and forest support vector machines algorithms: Demonstration using microarray data.

Topics

{ structur(1116) can(940) graph(676) }
{ featur(3375) classif(2383) classifi(1994) }
{ data(3008) multipl(1320) sourc(1022) }
{ error(1145) method(1030) estim(1020) }
{ state(1844) use(1261) util(961) }
{ sampl(1606) size(1419) use(1276) }
{ model(3404) distribut(989) bayesian(671) }
{ problem(2511) optim(1539) algorithm(950) }
{ high(1669) rate(1365) level(1280) }
{ imag(1057) registr(996) error(939) }
{ data(1714) softwar(1251) tool(1186) }
{ implement(1333) system(1263) develop(1122) }
{ studi(2440) review(1878) systemat(933) }
{ learn(2355) train(1041) set(1003) }
{ group(2977) signific(1463) compar(1072) }
{ gene(2352) biolog(1181) express(1162) }
{ can(981) present(881) function(850) }
{ cancer(2502) breast(956) screen(824) }
{ data(1737) use(1416) pattern(1282) }
{ system(1976) rule(880) can(841) }
{ measur(2081) correl(1212) valu(896) }
{ patient(2315) diseas(1263) diabet(1191) }
{ take(945) account(800) differ(722) }
{ assess(1506) score(1403) qualiti(1306) }
{ chang(1828) time(1643) increas(1301) }
{ algorithm(1844) comput(1787) effici(935) }
{ method(984) reconstruct(947) comput(926) }
{ search(2224) databas(1162) retriev(909) }
{ import(1318) role(1303) understand(862) }
{ visual(1396) interact(850) tool(830) }
{ model(2656) set(1616) predict(1553) }
{ decis(3086) make(1611) patient(1517) }
{ activ(1452) weight(1219) physic(1104) }
{ method(2212) result(1239) propos(1039) }
{ can(774) often(719) complex(702) }
{ imag(1947) propos(1133) code(1026) }
{ inform(2794) health(2639) internet(1427) }
{ bind(1733) structur(1185) ligand(1036) }
{ sequenc(1873) structur(1644) protein(1328) }
{ method(1219) similar(1157) match(930) }
{ imag(2830) propos(1344) filter(1198) }
{ network(2748) neural(1063) input(814) }
{ imag(2675) segment(2577) method(1081) }
{ motion(1329) object(1292) video(1091) }
{ treatment(1704) effect(941) patient(846) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ framework(1458) process(801) describ(734) }
{ concept(1167) ontolog(924) domain(897) }
{ clinic(1479) use(1117) guidelin(835) }
{ extract(1171) text(1153) clinic(932) }
{ method(1557) propos(1049) approach(1037) }
{ design(1359) user(1324) use(1319) }
{ control(1307) perform(991) simul(935) }
{ model(2220) cell(1177) simul(1124) }
{ care(1570) inform(1187) nurs(1089) }
{ general(901) number(790) one(736) }
{ featur(1941) imag(1645) propos(1176) }
{ case(1353) use(1143) diagnosi(1136) }
{ howev(809) still(633) remain(590) }
{ data(3963) clinic(1234) research(1004) }
{ studi(1410) differ(1259) use(1210) }
{ risk(3053) factor(974) diseas(938) }
{ perform(999) metric(946) measur(919) }
{ research(1085) discuss(1038) issu(1018) }
{ system(1050) medic(1026) inform(1018) }
{ model(2341) predict(2261) use(1141) }
{ compound(1573) activ(1297) structur(1058) }
{ perform(1367) use(1326) method(1137) }
{ studi(1119) effect(1106) posit(819) }
{ blood(1257) pressur(1144) flow(957) }
{ spatial(1525) area(1432) region(1030) }
{ record(1888) medic(1808) patient(1693) }
{ health(3367) inform(1360) care(1135) }
{ model(3480) simul(1196) paramet(876) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ research(1218) medic(880) student(794) }
{ patient(2837) hospit(1953) medic(668) }
{ data(2317) use(1299) case(1017) }
{ age(1611) year(1155) adult(843) }
{ medic(1828) order(1363) alert(1069) }
{ signal(2180) analysi(812) frequenc(800) }
{ cost(1906) reduc(1198) effect(832) }
{ first(2504) two(1366) second(1323) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ use(2086) technolog(871) perceiv(783) }
{ analysi(2126) use(1163) compon(1037) }
{ health(1844) social(1437) communiti(874) }
{ use(976) code(926) identifi(902) }
{ use(1733) differ(960) four(931) }
{ drug(1928) target(777) effect(648) }
{ result(1111) use(1088) new(759) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ process(1125) use(805) approach(778) }
{ method(1969) cluster(1462) data(1082) }
{ detect(2391) sensit(1101) algorithm(908) }

Abstract

Classification into multiple classes when the measured variables outnumber the samples is a major methodological challenge in -omics studies. Two algorithms that overcome this dimensionality problem are presented: the forest classification tree (FCT) and the forest support vector machines (FSVM). In FCT, a set of variables is chosen at random and a classification tree (CT) is grown using a forward classification algorithm. The process is repeated, yielding a forest of CTs. Finally, the most frequent variables from the trees with the smallest apparent misclassification rate (AMR) are used to construct a productive tree. In FSVM, the CTs are replaced by SVMs. The methods are demonstrated on prostate gene expression data by classifying tissue samples into four tumor types. With a split threshold of 0.001 and 100 markers, the productive CT consisted of 29 terminal nodes and achieved perfect classification (AMR=0). With the threshold set to 0.01, a tree with 17 terminal nodes was constructed from 15 markers (AMR=7%). In FSVM, reducing the fraction of the forest used to construct the best classifier from the top 80% to the top 20% reduced the misclassification rate to 25% (when using 200 markers). The proposed methodologies may be used to identify important variables in high-dimensional data. Furthermore, the FCT allows the data structure to be explored and provides a decision rule.
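
The FCT procedure described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not specify the split criterion, so the sketch assumes Gini impurity gain compared against the split threshold, and it caps tree depth so that the apparent misclassification rate can actually distinguish trees. All function names, parameter defaults, and the synthetic data are hypothetical.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a non-empty list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def grow_tree(X, y, features, threshold, depth):
    """Greedy forward tree: split only while the Gini gain exceeds `threshold`
    (assumed criterion) and fewer than `depth` levels have been used."""
    majority = Counter(y).most_common(1)[0][0]
    best = None
    if depth > 0:
        for f in features:
            values = sorted({row[f] for row in X})
            for cut in values[:-1]:  # both sides of each cut are non-empty
                left = [i for i, row in enumerate(X) if row[f] <= cut]
                right = [i for i, row in enumerate(X) if row[f] > cut]
                child = (len(left) * gini([y[i] for i in left])
                         + len(right) * gini([y[i] for i in right])) / len(y)
                gain = gini(y) - child
                if gain > threshold and (best is None or gain > best[0]):
                    best = (gain, f, cut, left, right)
    if best is None:
        return {"leaf": majority}
    _, f, cut, left, right = best
    return {"feature": f, "cut": cut,
            "left": grow_tree([X[i] for i in left], [y[i] for i in left],
                              features, threshold, depth - 1),
            "right": grow_tree([X[i] for i in right], [y[i] for i in right],
                               features, threshold, depth - 1)}

def predict(tree, row):
    while "leaf" not in tree:
        tree = tree["left"] if row[tree["feature"]] <= tree["cut"] else tree["right"]
    return tree["leaf"]

def amr(tree, X, y):
    """Apparent misclassification rate: error on the data used to grow the tree."""
    return sum(predict(tree, r) != t for r, t in zip(X, y)) / len(y)

def used_features(tree):
    if "leaf" in tree:
        return set()
    return ({tree["feature"]} | used_features(tree["left"])
            | used_features(tree["right"]))

def forest_classification_tree(X, y, n_trees=200, subset_size=5, top_fraction=0.2,
                               n_markers=3, threshold=0.001, depth=2, seed=0):
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        # Random variable subset, then one tree grown on that subset.
        subset = rng.sample(range(len(X[0])), subset_size)
        tree = grow_tree(X, y, subset, threshold, depth)
        forest.append((amr(tree, X, y), tree))
    # Keep the trees with the smallest AMR and count how often each variable is used.
    forest.sort(key=lambda pair: pair[0])
    keep = forest[:max(1, int(top_fraction * n_trees))]
    counts = Counter(f for _, tree in keep for f in used_features(tree))
    markers = [f for f, _ in counts.most_common(n_markers)]
    # Final tree built only from the most frequent variables.
    final_tree = grow_tree(X, y, markers, threshold, depth)
    return markers, final_tree, amr(final_tree, X, y)

def make_demo_data(n_per_class=10, n_vars=30, seed=1):
    """Synthetic four-class data; only variables 0 and 1 carry class signal."""
    rng = random.Random(seed)
    X, y = [], []
    for label in range(4):
        for _ in range(n_per_class):
            row = [rng.gauss(0, 1) for _ in range(n_vars)]
            row[0] += 4 * (label % 2)
            row[1] += 4 * (label // 2)
            X.append(row)
            y.append(label)
    return X, y

X, y = make_demo_data()
markers, tree, rate = forest_classification_tree(X, y)
print("selected markers:", markers)
print("apparent misclassification rate:", rate)
```

Replacing `grow_tree` with an SVM fit on each random subset would give the FSVM variant in the same outer loop. Because the AMR is computed on the training samples themselves, it is optimistic; a held-out or cross-validated error would be needed to estimate real predictive performance.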

Cleaned Abstract

classif multipl class measur variabl outnumb major methodolog challeng omic studi two algorithm overcom dimension problem present forest classif tree fct forest support vector machin fsvm fct set variabl random chosen classif tree ct grown use forward classif algorithm process repeat forest cts deriv final frequent variabl tree smallest appar misclassif rate amr use construct product tree fsvm cts replac svms method demonstr use prostat gene express data classifi tissu sampl four tumor type threshold split valu util marker product ct consist termin node achiev perfect classif amr threshold valu set tree termin node construct base marker amr fsvm reduc fraction forest use construct best classifi top top reduc misclassif use marker propos methodolog may use identifi import variabl high dimension data furthermor fct allow explor data structur provid decis rule

Similar Abstracts

IEEE Trans Pattern Anal Mach Intell - Free Energy Score Spaces: Using Generative Information in Discriminative Classifiers. ( 0,730471811030832 )
J. Comput. Biol. - The approximability of shortest path-based graph orientations of protein-protein interaction networks. ( 0,695793211053565 )
IEEE Trans Vis Comput Graph - Drawing Contour Trees in the Plane. ( 0,685414722087922 )
J Chem Inf Model - Generative topographic mapping-based classification models and their applicability domain: application to the biopharmaceutics Drug Disposition Classification System (BDDCS). ( 0,674005753024772 )
IEEE Trans Image Process - Constrained and dimensionality-independent path openings. ( 0,663857102704728 )
Comput Math Methods Med - Using the K-nearest neighbor algorithm for the classification of lymph node metastasis in gastric cancer. ( 0,660116742680481 )
J Chem Inf Model - Characterization of heterocyclic rings through quantum chemical topology. ( 0,657110672918472 )
Comput Methods Programs Biomed - Automatic detection and characterisation of retinal vessel tree bifurcations and crossovers in eye fundus images. ( 0,653652765551839 )
IEEE Trans Pattern Anal Mach Intell - Trinary-Projection Trees for Approximate Nearest Neighbor Search. ( 0,650812182281108 )
IEEE Trans Vis Comput Graph - Flow Visualization with Quantified Spatial and Temporal Errors Using Edge Maps. ( 0,650724007869192 )
IEEE Trans Vis Comput Graph - Output-Sensitive Construction of Reeb Graphs. ( 0,648567578029025 )
Artif Intell Med - Combining image, voice, and the patient's questionnaire data to categorize laryngeal disorders. ( 0,648375384479116 )
Comput Biol Chem - On topological indices for small RNA graphs. ( 0,648308528579546 )
AMIA Annu Symp Proc - Automatic annotation of radiological observations in liver CT images. ( 0,642525262194005 )
Brief. Bioinformatics - Structural mapping: how to study the genetic architecture of a phenotypic trait through its formation mechanism. ( 0,640687137741679 )
J Chem Inf Model - Choosing feature selection and learning algorithms in QSAR. ( 0,63622528140247 )
IEEE Trans Pattern Anal Mach Intell - The Sum-over-Forests Density Index: Identifying Dense Regions in a Graph. ( 0,619309363316474 )
IEEE Trans Image Process - 3-D curvilinear structure detection filter via structure-ball analysis. ( 0,616926190173573 )
IEEE Trans Image Process - A speed-up scheme based on multiple-instance pruning for pedestrian detection using a support vector machine. ( 0,615784378043997 )
AMIA Annu Symp Proc - Synergism between the mapping projects from SNOMED CT to ICD-10 and ICD-10-CM. ( 0,610180045649775 )
Comput Math Methods Med - NIM: a node influence based method for cancer classification. ( 0,607117519567896 )
Comput Methods Programs Biomed - A modular framework for the automatic classification of chromosomes in Q-band images. ( 0,607097724970849 )
J Chem Inf Model - Time-averaged distributions of solute and solvent motions: exploring proton wires of GFP and PfM2DH. ( 0,603072348737622 )
Neural Comput - Incremental slow feature analysis: adaptive low-complexity slow feature updating from high-dimensional input streams. ( 0,602672649417241 )
Neural Comput - Intrinsic graph structure estimation using graph Laplacian. ( 0,601828214025065 )
Comput. Biol. Med. - SVM-based feature selection to optimize sensitivity-specificity balance applied to weaning. ( 0,600834059719382 )
Comput Methods Programs Biomed - TreeVis: a MATLAB-based tool for tree visualization. ( 0,600533118146621 )
Comput Math Methods Med - Discrimination between Alzheimer's disease and mild cognitive impairment using SOM and PSO-SVM. ( 0,595091118787497 )
IEEE Trans Pattern Anal Mach Intell - C^4: Exploring Multiple Solutions in Graphical Models by Cluster Sampling. ( 0,594121926301855 )
BMC Med Inform Decis Mak - Efficient techniques for genotype-phenotype correlational analysis. ( 0,593848938270571 )
Comput. Biol. Med. - A hybrid feature selection method for DNA microarray data. ( 0,593051129194507 )
J. Comput. Biol. - Counting RNA pseudoknotted structures. ( 0,592869932622208 )
IEEE Trans Image Process - Probabilistic graphlet transfer for photo cropping. ( 0,591300162487907 )
Artif Intell Med - An intelligent classifier for prognosis of cardiac resynchronization therapy based on speckle-tracking echocardiograms. ( 0,589212325659475 )
Neural Comput - Simple neural-like p systems for maximal independent set selection. ( 0,589127538007389 )
IEEE Trans Vis Comput Graph - The Design Space of Implicit Hierarchy Visualization: A Survey. ( 0,587481083874829 )
J Biomed Inform - Tree kernel-based protein-protein interaction extraction from biomedical literature. ( 0,58658269306618 )
Comput. Biol. Med. - Fast and efficient lung disease classification using hierarchical one-against-all support vector machine and cost-sensitive feature selection. ( 0,586223237110421 )
IEEE Trans Vis Comput Graph - Image-Based Modeling of Unwrappable Façades. ( 0,584247494800746 )
IEEE Trans Image Process - Retina verification system based on biometric graph matching. ( 0,57998377678646 )
J. Comput. Biol. - Algorithms for MDC-based multi-locus phylogeny inference: beyond rooted binary gene trees on single alleles. ( 0,578293932045198 )
IEEE Trans Image Process - Topology preserving warping of 3-D binary images according to continuous one-to-one mappings. ( 0,577057249649269 )
J. Comput. Biol. - Phylogenetic stochastic mapping without matrix exponentiation. ( 0,572392210738583 )
IEEE Trans Neural Netw Learn Syst - FREL: A Stable Feature Selection Algorithm. ( 0,570084918296451 )
J. Comput. Biol. - Pathset graphs: a novel approach for comprehensive utilization of paired reads in genome assembly. ( 0,568289729252842 )
IEEE Trans Pattern Anal Mach Intell - A Robust O(n) Solution to the Perspective-n-Point Problem. ( 0,566694316891946 )
Comput. Biol. Med. - Decision forest for classification of gene expression data. ( 0,566594586468482 )
IEEE Trans Image Process - Hyperspectral image representation and processing with binary partition trees. ( 0,566393860323831 )
Neural Comput - The support feature machine: classification with the least number of features and application to neuroimaging data. ( 0,566133781473093 )
J. Comput. Biol. - A Bayesian sampler for optimization of protein domain hierarchies. ( 0,56580896099759 )
IEEE Trans Vis Comput Graph - Visual Analysis of Large Graphs Using (X,Y)-clustering and Hybrid Visualizations. ( 0,56429566603067 )
Comput Methods Programs Biomed - Operator functional state classification using least-square support vector machine based recursive feature elimination technique. ( 0,563983759901106 )
J. Comput. Biol. - Shapes of RNA pseudoknot structures. ( 0,560316715523247 )
Comput. Biol. Med. - Computerized system for recognition of autism on the basis of gene expression microarray data. ( 0,559827248234424 )
Int J Med Inform - Use of order sets in inpatient computerized provider order entry systems: a comparative analysis of usage patterns at seven sites. ( 0,557225386109001 )
Int J Comput Assist Radiol Surg - Building an ensemble system for diagnosing masses in mammograms. ( 0,55583133395703 )
J Med Syst - Computer-assisted diagnosis of tuberculosis: a first order statistical approach to chest radiograph. ( 0,555394874263531 )
Comput. Biol. Med. - Effect of bunching of cilia and their interplay on muco-ciliary transport. ( 0,554331453117121 )
Comput Biol Chem - Compact cancer biomarkers discovery using a swarm intelligence feature selection algorithm. ( 0,554267364430599 )
IEEE Trans Neural Netw Learn Syst - Kernel reconstruction ICA for sparse representation. ( 0,55350634315206 )
IEEE Trans Vis Comput Graph - Grouper: A Compact, Streamable Triangle Mesh Data Structure. ( 0,553451841192589 )
Comput. Biol. Med. - Hyperbolic Dirac Nets for medical decision support. Theory, methods, and comparison with Bayes Nets. ( 0,553273342358422 )
J Chem Inf Model - Classifier ensemble based on feature selection and diversity measures for predicting the affinity of A(2B) adenosine receptor antagonists. ( 0,552627342885404 )
Comput Biol Chem - Quick path finding--quick algorithmic solution for unambiguous labeling of phylogenetic tree nodes. ( 0,55203950461055 )
Comput Math Methods Med - Comparison of different EHG feature selection methods for the detection of preterm labor. ( 0,551797253333531 )
IEEE Trans Pattern Anal Mach Intell - Spatial and Anatomical Regularization of SVM: A General Framework for Neuroimaging Data. ( 0,551767517159433 )
IEEE Trans Image Process - Unified structured learning for simultaneous human pose estimation and garment attribute classification. ( 0,551490979250732 )
IEEE Trans Image Process - W-tree indexing for fast visual word generation. ( 0,551141537542287 )
J. Comput. Biol. - Simultaneous folding of alternative RNA structures with mutual constraints: an application to next-generation sequencing-based RNA structure probing. ( 0,550377654240261 )
Comput Biol Chem - A novel divide-and-merge classification for high dimensional datasets. ( 0,54952696564316 )
Neural Comput - Parametric inference in the large data limit using maximally informative models. ( 0,549373678990014 )
Methods Inf Med - Correlation-based gene selection and classification using Taguchi-BPSO. ( 0,548994622010159 )
Comput Biol Chem - Information-theoretic approaches to SVM feature selection for metagenome read classification. ( 0,548883442282568 )
Neural Comput - A network of spiking neurons for computing sparse representations in an energy-efficient way. ( 0,547450697885612 )
J Biomed Inform - A fast gene selection method for multi-cancer classification using multiple support vector data description. ( 0,546155604884656 )
IEEE Trans Vis Comput Graph - A Structure-Based Distance Metric for High-Dimensional Space Exploration with Multi-Dimensional Scaling. ( 0,542528440355223 )
Int J Comput Assist Radiol Surg - Complete fully automatic model-based segmentation of normal and pathological lymph nodes in CT data. ( 0,541159556508615 )
IEEE Trans Image Process - A novel technique for subpixel image classification based on support vector machine. ( 0,541075177290489 )
IEEE Trans Image Process - Stereo matching and view interpolation based on image domain triangulation. ( 0,540977687107633 )
Artif Intell Med - Computer-aided diagnosis of pulmonary nodules using a two-step approach for feature selection and classifier ensemble construction. ( 0,538906207454153 )
J. Comput. Biol. - Loops in canonical RNA pseudoknot structures. ( 0,537612921980549 )
IEEE Trans Image Process - Optimized block-based connected components labeling with decision trees. ( 0,537331905327623 )
J Med Syst - A robust multi-class feature selection strategy based on Rotation Forest Ensemble algorithm for diagnosis of Erythemato-Squamous diseases. ( 0,537205201466479 )
IEEE Trans Neural Netw Learn Syst - MTC: A Fast and Robust Graph-Based Transductive Learning Method. ( 0,536585621218679 )
Comput. Biol. Med. - A modular approach to computer-aided auscultation: analysis and parametric characterization of murmur acoustic qualities. ( 0,534788704614892 )
Comput. Biol. Med. - Using phase space reconstruction for patient independent heartbeat classification in comparison with some benchmark methods. ( 0,534498785007769 )
Comput Biol Chem - CE-PLoc: an ensemble classifier for predicting protein subcellular locations by fusing different modes of pseudo amino acid composition. ( 0,534179640569686 )
IEEE Trans Image Process - Smile detection by boosting pixel differences. ( 0,533591186546736 )
IEEE Trans Pattern Anal Mach Intell - Building Development Monitoring in Multitemporal Remotely Sensed Image Pairs with Stochastic Birth-Death Dynamics. ( 0,533588376560892 )
IEEE Trans Pattern Anal Mach Intell - Consistent Latent Position Estimation and Vertex Classification for Random Dot Product Graphs. ( 0,533380747062665 )
IEEE Trans Vis Comput Graph - Graph Drawing Aesthetics — Created by Users not Algorithms. ( 0,532413249274792 )
J Med Syst - Statistical analysis of textural features for improved classification of oral histopathological images. ( 0,532198009005013 )
J Chem Inf Model - ThermoData Engine (TDE) software implementation of the dynamic data evaluation concept. 7. Ternary mixtures. ( 0,531876370693713 )
IEEE Trans Neural Netw Learn Syst - On recursive edit distance kernels with application to time series classification. ( 0,53187141380217 )
IEEE Trans Image Process - Human detection in images via piecewise linear support vector machines. ( 0,530264039495923 )
J Chem Inf Model - Beyond terrestrial biology: charting the chemical universe of α-amino acid structures. ( 0,529154029787952 )
IEEE Trans Vis Comput Graph - Dynamic Network Visualization with Extended Massive Sequence Views. ( 0,528913756676742 )
Comput. Biol. Med. - Averaging of diffusion tensor imaging direction-encoded color maps for localizing substantia nigra. ( 0,528778610067201 )
J. Comput. Biol. - A polynomial-time algorithm computing lower and upper bounds of the rooted subtree prune and regraft distance. ( 0,528625884168574 )
IEEE Trans Neural Netw Learn Syst - Complex support vector machines for regression and quaternary classification. ( 0,528591872093251 )