J Chem Inf Model - Note on naive Bayes based on binary descriptors in cheminformatics.

Tópicos

{ learn(2355) train(1041) set(1003) }
{ compound(1573) activ(1297) structur(1058) }
{ data(3008) multipl(1320) sourc(1022) }
{ featur(1941) imag(1645) propos(1176) }
{ structur(1116) can(940) graph(676) }
{ howev(809) still(633) remain(590) }
{ featur(3375) classif(2383) classifi(1994) }
{ method(1557) propos(1049) approach(1037) }
{ process(1125) use(805) approach(778) }
{ can(774) often(719) complex(702) }
{ inform(2794) health(2639) internet(1427) }
{ bind(1733) structur(1185) ligand(1036) }
{ take(945) account(800) differ(722) }
{ clinic(1479) use(1117) guidelin(835) }
{ design(1359) user(1324) use(1319) }
{ method(984) reconstruct(947) comput(926) }
{ search(2224) databas(1162) retriev(909) }
{ import(1318) role(1303) understand(862) }
{ record(1888) medic(1808) patient(1693) }
{ model(3480) simul(1196) paramet(876) }
{ use(1733) differ(960) four(931) }
{ drug(1928) target(777) effect(648) }
{ result(1111) use(1088) new(759) }
{ implement(1333) system(1263) develop(1122) }
{ model(3404) distribut(989) bayesian(671) }
{ imag(1947) propos(1133) code(1026) }
{ data(1737) use(1416) pattern(1282) }
{ system(1976) rule(880) can(841) }
{ measur(2081) correl(1212) valu(896) }
{ imag(1057) registr(996) error(939) }
{ sequenc(1873) structur(1644) protein(1328) }
{ method(1219) similar(1157) match(930) }
{ imag(2830) propos(1344) filter(1198) }
{ network(2748) neural(1063) input(814) }
{ imag(2675) segment(2577) method(1081) }
{ patient(2315) diseas(1263) diabet(1191) }
{ studi(2440) review(1878) systemat(933) }
{ motion(1329) object(1292) video(1091) }
{ assess(1506) score(1403) qualiti(1306) }
{ treatment(1704) effect(941) patient(846) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ framework(1458) process(801) describ(734) }
{ problem(2511) optim(1539) algorithm(950) }
{ error(1145) method(1030) estim(1020) }
{ chang(1828) time(1643) increas(1301) }
{ concept(1167) ontolog(924) domain(897) }
{ algorithm(1844) comput(1787) effici(935) }
{ extract(1171) text(1153) clinic(932) }
{ data(1714) softwar(1251) tool(1186) }
{ control(1307) perform(991) simul(935) }
{ model(2220) cell(1177) simul(1124) }
{ care(1570) inform(1187) nurs(1089) }
{ general(901) number(790) one(736) }
{ case(1353) use(1143) diagnosi(1136) }
{ data(3963) clinic(1234) research(1004) }
{ studi(1410) differ(1259) use(1210) }
{ risk(3053) factor(974) diseas(938) }
{ perform(999) metric(946) measur(919) }
{ research(1085) discuss(1038) issu(1018) }
{ system(1050) medic(1026) inform(1018) }
{ model(2341) predict(2261) use(1141) }
{ visual(1396) interact(850) tool(830) }
{ perform(1367) use(1326) method(1137) }
{ studi(1119) effect(1106) posit(819) }
{ blood(1257) pressur(1144) flow(957) }
{ spatial(1525) area(1432) region(1030) }
{ health(3367) inform(1360) care(1135) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ state(1844) use(1261) util(961) }
{ research(1218) medic(880) student(794) }
{ patient(2837) hospit(1953) medic(668) }
{ model(2656) set(1616) predict(1553) }
{ data(2317) use(1299) case(1017) }
{ age(1611) year(1155) adult(843) }
{ medic(1828) order(1363) alert(1069) }
{ signal(2180) analysi(812) frequenc(800) }
{ cost(1906) reduc(1198) effect(832) }
{ group(2977) signific(1463) compar(1072) }
{ sampl(1606) size(1419) use(1276) }
{ gene(2352) biolog(1181) express(1162) }
{ first(2504) two(1366) second(1323) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ use(2086) technolog(871) perceiv(783) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ health(1844) social(1437) communiti(874) }
{ high(1669) rate(1365) level(1280) }
{ cancer(2502) breast(956) screen(824) }
{ use(976) code(926) identifi(902) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ decis(3086) make(1611) patient(1517) }
{ activ(1452) weight(1219) physic(1104) }
{ method(1969) cluster(1462) data(1082) }
{ method(2212) result(1239) propos(1039) }
{ detect(2391) sensit(1101) algorithm(908) }

Resumo

A plethora of articles on naive Bayes classifiers, where the chemical compounds to be classified are represented by binary-valued (absent or present type) descriptors, have appeared in the cheminformatics literature over the past decade. The principal goal of this paper is to describe how a naive Bayes classifier based on binary descriptors (NBCBBD) can be employed as a feature selector in an efficient manner suitable for cheminformatics. In the process, we point out a fact well documented in other disciplines that NBCBBD is a linear classifier and is therefore intrinsically suboptimal for classifying compounds that are nonlinearly separable in their binary descriptor space. We investigate the performance of the proposed algorithm on classifying a subset of the MDDR data set, a standard molecular benchmark data set, into active and inactive compounds.

Resumo Limpo

plethora articl naiv bay classifi chemic compound classifi repres binaryvalu absent present type descriptor appear cheminformat literatur past decad princip goal paper describ naiv bay classifi base binari descriptor nbcbbd can employ featur selector effici manner suitabl cheminformat process point fact well document disciplin nbcbbd linear classifi therefor intrins suboptim classifi compound nonlinear separ binari descriptor space investig perform propos algorithm classifi subset mddr data set standard molecular benchmark data set activ inact compound

Resumos Similares

IEEE Trans Image Process - Task-specific image partitioning. ( 0,804269999846613 )
IEEE Trans Image Process - Structured max-margin learning for inter-related classifier training and multilabel image annotation. ( 0,800847745609768 )
J Chem Inf Model - Modeling and benchmark data set for the inhibition of c-Jun N-terminal kinase-3. ( 0,756013649871485 )
IEEE Trans Image Process - A linear support higher-order tensor machine for classification. ( 0,752818352005759 )
J. Comput. Biol. - Imbalanced class learning in epigenetics. ( 0,743871663160344 )
IEEE Trans Image Process - Random forest construction with robust semisupervised node splitting. ( 0,739744975318035 )
Comput Math Methods Med - On multilabel classification methods of incompletely labeled biomedical text data. ( 0,734432204489671 )
J Biomed Inform - Learning classification models from multiple experts. ( 0,731153426815702 )
IEEE Trans Neural Netw Learn Syst - An efficient topological distance-based tree kernel. ( 0,730943545657049 )
IEEE Trans Image Process - Multiview Hessian regularization for image annotation. ( 0,730001400409409 )
Comput. Biol. Med. - Robust prediction of protein subcellular localization combining PCA and WSVMs. ( 0,726005294375355 )
IEEE Trans Image Process - Active learning for solving the incomplete data problem in facial age classification by the furthest nearest-neighbor criterion. ( 0,725887414244671 )
Neural Comput - Online learning with (multiple) kernels: a review. ( 0,725112346756308 )
IEEE Trans Pattern Anal Mach Intell - Trainable Convolution Filters and Their Application to Face Recognition. ( 0,720090839832075 )
J Am Med Inform Assoc - Learning classification models with soft-label information. ( 0,719938679663576 )
IEEE Trans Image Process - Geodesic propagation for semantic labeling. ( 0,719406369669223 )
J Am Med Inform Assoc - Active learning for clinical text classification: is it better than random sampling? ( 0,71771635885314 )
Int J Neural Syst - Aggregation of sparse linear discriminant analyses for event-related potential classification in brain-computer interface. ( 0,716804405019051 )
IEEE Trans Pattern Anal Mach Intell - Feature Selection with Conjunctions of Decision Stumps and Learning from Microarray Data. ( 0,710238289813736 )
IEEE Trans Neural Netw Learn Syst - ML-Tree: a tree-structure-based approach to multilabel learning. ( 0,709145980315835 )
J Biomed Inform - Class proximity measures--dissimilarity-based classification and display of high-dimensional data. ( 0,708081772962629 )
IEEE Trans Pattern Anal Mach Intell - Facial Age Estimation by Learning from Label Distributions. ( 0,70673171037848 )
J Chem Inf Model - An unbiased method to build benchmarking sets for ligand-based virtual screening and its application to GPCRs. ( 0,706289526548288 )
Neural Comput - Reduction from cost-sensitive ordinal ranking to weighted binary classification. ( 0,70507029420167 )
J Biomed Inform - Semi-supervised clinical text classification with Laplacian SVMs: an application to cancer case management. ( 0,703484363593909 )
Int J Neural Syst - Structurally enhanced incremental neural learning for image classification with subgraph extraction. ( 0,70003970091977 )
IEEE Trans Image Process - Manifold regularized multitask learning for semi-supervised multilabel image classification. ( 0,697909628825916 )
IEEE Trans Neural Netw Learn Syst - A Kernel Classification Framework for Metric Learning. ( 0,689221299082234 )
Int J Comput Assist Radiol Surg - Statistical shape model of a liver for autopsy imaging. ( 0,688822263867245 )
IEEE J Biomed Health Inform - Systematic Poisoning Attacks on and Defenses for Machine Learning in Healthcare. ( 0,688112046811606 )
IEEE Trans Image Process - Incremental training of a detector using online sparse eigendecomposition. ( 0,687352431348089 )
Neural Comput - Computing sparse representations of multidimensional signals using Kronecker bases. ( 0,687132468492858 )
J Chem Inf Model - Atom environment kernels on molecules. ( 0,686265030734878 )
AMIA Annu Symp Proc - Learning medical diagnosis models from multiple experts. ( 0,684870766855187 )
Neural Comput - Adaptive metric learning vector quantization for ordinal classification. ( 0,683936123725733 )
IEEE Trans Neural Netw Learn Syst - Semi-supervised domain adaptation on manifolds. ( 0,68237585186425 )
IEEE Trans Pattern Anal Mach Intell - Weakly Supervised Recognition of Daily Life Activities with Wearable Sensors. ( 0,680084034426789 )
IEEE Trans Pattern Anal Mach Intell - Feature Selection and Kernel Learning for Local Learning-Based Clustering. ( 0,679187600847405 )
IEEE Trans Neural Netw Learn Syst - Ordinal Distance Metric Learning for Image Ranking. ( 0,677620698710388 )
IEEE Trans Pattern Anal Mach Intell - Label Consistent K-SVD: Learning A Discriminative Dictionary for Recognition. ( 0,676739854612093 )
IEEE Trans Image Process - Joint segmentation of images and scanned point cloud in large-scale street scenes with low-annotation cost. ( 0,674349805875374 )
Neural Comput - Adaptive multiclass classification for brain computer interfaces. ( 0,674349805875374 )
Int J Neural Syst - Linear time relational prototype based learning. ( 0,674338525255664 )
Neural Comput - Multiple spectral kernel learning and a gaussian complexity computation. ( 0,673247535868148 )
IEEE Trans Image Process - Hyperspectral image classification through bilayer graph-based learning. ( 0,671967316304583 )
Neural Comput - Representing objects, relations, and sequences. ( 0,671825013741323 )
J Med Syst - 3D similarity-dissimilarity plot for high dimensional data visualization in the context of biomedical pattern classification. ( 0,669748864639485 )
IEEE Trans Image Process - Stable orthogonal local discriminant embedding for linear dimensionality reduction. ( 0,669553498080962 )
IEEE Trans Pattern Anal Mach Intell - Unsupervised Adaptation Across Domain Shifts By Generating Intermediate Data Representations. ( 0,669419320479737 )
IEEE Trans Image Process - Learning discriminative dictionary for group sparse representation. ( 0,668094604344803 )
J Biomed Inform - Multi-label classification of chronically ill patients with bag of words and supervised dimensionality reduction algorithms. ( 0,665193638662111 )
Comput. Biol. Med. - Sparse Manifold Clustering and Embedding to discriminate gene expression profiles of glioblastoma and meningioma tumors. ( 0,66351803536663 )
IEEE Trans Pattern Anal Mach Intell - Good Practice in Large-Scale Learning for Image Classification. ( 0,659224777224229 )
Int J Neural Syst - Online semi-supervised growing neural gas. ( 0,657148598632108 )
IEEE Trans Pattern Anal Mach Intell - Distance-Based Image Classification: Generalizing to New Classes at Near Zero Cost. ( 0,656199829009147 )
IEEE Trans Image Process - Improving Web image search by bag-based reranking. ( 0,655158930240135 )
Comput Methods Programs Biomed - Modified CC-LR algorithm with three diverse feature sets for motor imagery tasks classification in EEG based brain-computer interface. ( 0,65387101894686 )
J Chem Inf Model - Classifying large chemical data sets: using a regularized potential function method. ( 0,653476301452148 )
IEEE Trans Image Process - Grassmannian regularized structured multi-view embedding for image classification. ( 0,652221517159139 )
IEEE Trans Pattern Anal Mach Intell - Representation Learning: A Review and New Perspectives. ( 0,649700633427762 )
Comput Math Methods Med - A novel multiinstance learning approach for liver cancer recognition on abdominal CT images based on CPSO-SVM and IO. ( 0,649563685339205 )
J Chem Inf Model - Training based on ligand efficiency improves prediction of bioactivities of ligands and drug target proteins in a machine learning approach. ( 0,64767199832776 )
BMC Med Inform Decis Mak - Learning to improve medical decision making from imbalanced data without a priori cost. ( 0,647253010531557 )
J Biomed Inform - Portable automatic text classification for adverse drug reaction detection via multi-corpus training. ( 0,646157898360959 )
Comput Math Methods Med - Correlation kernels for support vector machines classification with applications in cancer data. ( 0,645652985949357 )
IEEE Trans Image Process - Unsupervised amplitude and texture classification of SAR images with multinomial latent model. ( 0,645015289717424 )
AMIA Annu Symp Proc - Comparison and combination of several MeSH indexing approaches. ( 0,644864661239816 )
IEEE Trans Image Process - Visual classification with multitask joint sparse representation. ( 0,64339152148741 )
IEEE Trans Image Process - CSMMI: Class-Specific Maximization of Mutual Information for Action and Gesture Recognition. ( 0,642127875669525 )
J Biomed Inform - Incremental Gaussian Discriminant Analysis based on Graybill and Deal weighted combination of estimators for brain tumour diagnosis. ( 0,64208829380903 )
Neural Comput - Incremental learning by message passing in hierarchical temporal memory. ( 0,640171041183728 )
IEEE Trans Image Process - Self-supervised online metric learning with low rank constraint for scene categorization. ( 0,638914071835619 )
IEEE Trans Image Process - Multiple-kernel, multiple-instance similarity features for efficient visual object detection. ( 0,63620081691787 )
IEEE Trans Image Process - Object detection with DoG scale-space: a multiple kernel learning approach. ( 0,636081780266227 )
Neural Comput - Metacognitive learning in a fully complex-valued radial basis function neural network. ( 0,635448890055704 )
IEEE Trans Neural Netw Learn Syst - Adaptive Batch Mode Active Learning. ( 0,630001708832705 )
Neural Comput - Mismatched training and test distributions can outperform matched ones. ( 0,629527625642069 )
J. Comput. Biol. - Locally learning biomedical data using diffusion frames. ( 0,627935209007316 )
Neural Comput - Large margin low rank tensor analysis. ( 0,626962328219399 )
Neural Comput - Unsupervised learning of generative and discriminative weights encoding elementary image components in a predictive coding model of cortical function. ( 0,625712152841413 )
J Chem Inf Model - Introduction of a methodology for visualization and graphical interpretation of Bayesian classification models. ( 0,62569065819228 )
Int J Neural Syst - Span: spike pattern association neuron for learning spatio-temporal spike patterns. ( 0,623830959550924 )
IEEE Trans Image Process - Real-time probabilistic covariance tracking with efficient model update. ( 0,623406665623551 )
J Biomed Inform - Applying active learning to assertion classification of concepts in clinical text. ( 0,622717962082727 )
IEEE Trans Image Process - Saliency and gist features for target detection in satellite images. ( 0,622554709900785 )
IEEE Trans Image Process - Cooperative sparse representation in two opposite directions for semi-supervised image annotation. ( 0,621154213141801 )
IEEE Trans Image Process - Robust face recognition with structurally incoherent low-rank matrix decomposition. ( 0,616544701039118 )
IEEE Trans Pattern Anal Mach Intell - A Bag-of-Features Framework to Classify Time Series. ( 0,616364623437676 )
J Chem Inf Model - Prediction of activity cliffs using support vector machines. ( 0,615891579826013 )
IEEE Trans Pattern Anal Mach Intell - Latent Dirichlet Allocation Models for Image Classification. ( 0,615868218300417 )
IEEE Trans Pattern Anal Mach Intell - Learning Categories from Few Examples with Multi Model Knowledge Transfer. ( 0,607216897789323 )
Artif Intell Med - Screening nonrandomized studies for medical systematic reviews: a comparative study of classifiers. ( 0,606141163459957 )
Comput. Biol. Med. - Relabeling algorithm for retrieval of noisy instances and improving prediction quality. ( 0,604834673116489 )
Comput Math Methods Med - Dimensionality reduction by supervised neighbor embedding using laplacian search. ( 0,603827429321319 )
J Am Med Inform Assoc - Supervised machine learning and active learning in classification of radiology reports. ( 0,602599406119922 )
IEEE Trans Pattern Anal Mach Intell - Scene-Specific Pedestrian Detection for Static Video Surveillance. ( 0,602548715066235 )
Neural Comput - Divergence-based vector quantization. ( 0,601082924775646 )
IEEE Trans Pattern Anal Mach Intell - Exemplar-Based Colour Constancy and Multiple Illumination. ( 0,596803589890286 )
IEEE Trans Image Process - Image search reranking with query-dependent click-based relevance feedback. ( 0,596489285593438 )
IEEE Trans Image Process - Design of non-linear kernel dictionaries for object recognition. ( 0,593036730580813 )