Artif Intell Med - Screening nonrandomized studies for medical systematic reviews: a comparative study of classifiers.

Topics

{ learn(2355) train(1041) set(1003) }
{ featur(3375) classif(2383) classifi(1994) }
{ studi(2440) review(1878) systemat(933) }
{ extract(1171) text(1153) clinic(932) }
{ imag(1947) propos(1133) code(1026) }
{ model(3480) simul(1196) paramet(876) }
{ cost(1906) reduc(1198) effect(832) }
{ first(2504) two(1366) second(1323) }
{ compound(1573) activ(1297) structur(1058) }
{ system(1976) rule(880) can(841) }
{ error(1145) method(1030) estim(1020) }
{ care(1570) inform(1187) nurs(1089) }
{ model(2656) set(1616) predict(1553) }
{ measur(2081) correl(1212) valu(896) }
{ data(3963) clinic(1234) research(1004) }
{ can(981) present(881) function(850) }
{ take(945) account(800) differ(722) }
{ design(1359) user(1324) use(1319) }
{ import(1318) role(1303) understand(862) }
{ implement(1333) system(1263) develop(1122) }
{ can(774) often(719) complex(702) }
{ imag(1057) registr(996) error(939) }
{ motion(1329) object(1292) video(1091) }
{ problem(2511) optim(1539) algorithm(950) }
{ data(1714) softwar(1251) tool(1186) }
{ general(901) number(790) one(736) }
{ search(2224) databas(1162) retriev(909) }
{ studi(1410) differ(1259) use(1210) }
{ perform(999) metric(946) measur(919) }
{ blood(1257) pressur(1144) flow(957) }
{ spatial(1525) area(1432) region(1030) }
{ medic(1828) order(1363) alert(1069) }
{ signal(2180) analysi(812) frequenc(800) }
{ group(2977) signific(1463) compar(1072) }
{ use(2086) technolog(871) perceiv(783) }
{ use(976) code(926) identifi(902) }
{ decis(3086) make(1611) patient(1517) }
{ process(1125) use(805) approach(778) }
{ model(3404) distribut(989) bayesian(671) }
{ data(1737) use(1416) pattern(1282) }
{ inform(2794) health(2639) internet(1427) }
{ bind(1733) structur(1185) ligand(1036) }
{ sequenc(1873) structur(1644) protein(1328) }
{ method(1219) similar(1157) match(930) }
{ imag(2830) propos(1344) filter(1198) }
{ network(2748) neural(1063) input(814) }
{ imag(2675) segment(2577) method(1081) }
{ patient(2315) diseas(1263) diabet(1191) }
{ assess(1506) score(1403) qualiti(1306) }
{ treatment(1704) effect(941) patient(846) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ framework(1458) process(801) describ(734) }
{ chang(1828) time(1643) increas(1301) }
{ concept(1167) ontolog(924) domain(897) }
{ clinic(1479) use(1117) guidelin(835) }
{ algorithm(1844) comput(1787) effici(935) }
{ method(1557) propos(1049) approach(1037) }
{ control(1307) perform(991) simul(935) }
{ model(2220) cell(1177) simul(1124) }
{ method(984) reconstruct(947) comput(926) }
{ featur(1941) imag(1645) propos(1176) }
{ case(1353) use(1143) diagnosi(1136) }
{ howev(809) still(633) remain(590) }
{ risk(3053) factor(974) diseas(938) }
{ research(1085) discuss(1038) issu(1018) }
{ system(1050) medic(1026) inform(1018) }
{ model(2341) predict(2261) use(1141) }
{ visual(1396) interact(850) tool(830) }
{ perform(1367) use(1326) method(1137) }
{ studi(1119) effect(1106) posit(819) }
{ record(1888) medic(1808) patient(1693) }
{ health(3367) inform(1360) care(1135) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ state(1844) use(1261) util(961) }
{ research(1218) medic(880) student(794) }
{ patient(2837) hospit(1953) medic(668) }
{ data(2317) use(1299) case(1017) }
{ age(1611) year(1155) adult(843) }
{ sampl(1606) size(1419) use(1276) }
{ gene(2352) biolog(1181) express(1162) }
{ data(3008) multipl(1320) sourc(1022) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ analysi(2126) use(1163) compon(1037) }
{ health(1844) social(1437) communiti(874) }
{ structur(1116) can(940) graph(676) }
{ high(1669) rate(1365) level(1280) }
{ cancer(2502) breast(956) screen(824) }
{ use(1733) differ(960) four(931) }
{ drug(1928) target(777) effect(648) }
{ result(1111) use(1088) new(759) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ activ(1452) weight(1219) physic(1104) }
{ method(1969) cluster(1462) data(1082) }
{ method(2212) result(1239) propos(1039) }
{ detect(2391) sensit(1101) algorithm(908) }

Abstract

OBJECTIVES: To investigate whether (1) machine learning classifiers can help identify nonrandomized studies eligible for full-text screening by systematic reviewers; (2) classifier performance varies with optimization; and (3) the number of citations to screen can be reduced.
METHODS: We used an open-source data-mining suite to process and classify biomedical citations that point to mostly nonrandomized studies from 2 systematic reviews. We built training and test sets for citation portions and compared classifier performance by considering the value of indexing, various feature sets, and optimization. We conducted our experiments in 2 phases. The design of phase I with no optimization was: 4 classifiers × 3 feature sets × 3 citation portions. Classifiers included k-nearest neighbor, naïve Bayes, complement naïve Bayes, and evolutionary support vector machine. Feature sets included bag of words, and 2- and 3-term n-grams. Citation portions included titles, titles and abstracts, and full citations with metadata. Phase II with optimization involved a subset of the classifiers, as well as features extracted from full citations, and full citations with overweighted titles. We optimized features and classifier parameters by manually setting information gain thresholds outside of a process for iterative grid optimization with 10-fold cross-validations. We independently tested models on data reserved for that purpose and statistically compared classifier performance on 2 types of feature sets. We estimated the number of citations needed to screen by reviewers during a second pass through a reduced set of citations.
RESULTS: In phase I, the evolutionary support vector machine returned the best recall for bag of words extracted from full citations; the best classifier with respect to overall performance was k-nearest neighbor. No classifier attained good enough recall for this task without optimization. In phase II, we boosted performance with optimization for evolutionary support vector machine and complement naïve Bayes classifiers. Generalization performance was better for the latter in the independent tests. For evolutionary support vector machine and complement naïve Bayes classifiers, the initial retrieval set was reduced by 46% and 35%, respectively.
CONCLUSIONS: Machine learning classifiers can help identify nonrandomized studies eligible for full-text screening by systematic reviewers. Optimization can markedly improve performance of classifiers. However, generalizability varies with the classifier. The number of citations to screen during a second independent pass through the citations can be substantially reduced.
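
The phase I design in the abstract (4 classifiers, 3 feature sets, 3 citation portions) is a full factorial grid, and the reported reductions of 46% and 35% translate directly into a second-pass screening workload. The following is a minimal sketch of both calculations in Python. The string labels, the function names, and the initial set size of 1000 citations are illustrative assumptions; the abstract does not report the actual size of the retrieval set or name the software's identifiers.

```python
from itertools import product

# Labels taken from the abstract; the identifiers themselves are hypothetical.
CLASSIFIERS = [
    "k-nearest neighbor",
    "naive Bayes",
    "complement naive Bayes",
    "evolutionary SVM",
]
FEATURE_SETS = ["bag of words", "2-term n-grams", "3-term n-grams"]
CITATION_PORTIONS = ["title", "title + abstract", "full citation + metadata"]


def phase_one_design():
    """Enumerate every (classifier, feature set, citation portion) combination."""
    return list(product(CLASSIFIERS, FEATURE_SETS, CITATION_PORTIONS))


def remaining_after_reduction(initial_count, reduction_fraction):
    """Citations left to screen after the set shrinks by the given fraction."""
    return round(initial_count * (1 - reduction_fraction))


if __name__ == "__main__":
    design = phase_one_design()
    print(f"Phase I configurations: {len(design)}")

    # Hypothetical initial retrieval set; the abstract gives only percentages.
    initial = 1000
    for name, reduction in [("evolutionary SVM", 0.46),
                            ("complement naive Bayes", 0.35)]:
        left = remaining_after_reduction(initial, reduction)
        print(f"{name}: {left} of {initial} citations remain")
```

Under the assumed set of 1000 citations, reviewers would screen 540 citations after the evolutionary support vector machine and 650 after complement naïve Bayes. The real counts depend on the retrieval sets of the 2 systematic reviews, which the abstract does not give.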

Cleaned Abstract

jectiv investig whether machin learn classifi can help identifi nonrandom studi elig fulltext screen systemat review classifi perform vari optim number citat screen can reducedmethod use opensourc datamin suit process classifi biomed citat point most nonrandom studi systemat review built train test set citat portion compar classifi perform consid valu index various featur set optim conduct experi phase design phase optim classifi featur set citat portion classifi includ knearest neighbor nave bay complement nave bay evolutionari support vector machin featur set includ bag word term ngram citat portion includ titl titl abstract full citat metadata phase ii optim involv subset classifi well featur extract full citat full citat overweight titl optim featur classifi paramet manual set inform gain threshold outsid process iter grid optim fold crossvalid independ test model data reserv purpos statist compar classifi perform type featur set estim number citat need screen review second pass reduc set citationsresult phase evolutionari support vector machin return best recal bag word extract full citat best classifi respect overal perform knearest neighbor classifi attain good enough recal task without optim phase ii boost perform optim evolutionari support vector machin complement nave bay classifi general perform better latter independ test evolutionari support vector machin complement nave bay classifi initi retriev set reduc respectivelyconclus machin learn classifi can help identifi nonrandom studi elig fulltext screen systemat review optim can mark improv perform classifi howev generaliz vari classifi number citat screen second independ pass citat can substanti reduc

Similar Abstracts

Neural Comput - Online learning with (multiple) kernels: a review. ( 0,751363366995837 )
J Chem Inf Model - Classifying large chemical data sets: using a regularized potential function method. ( 0,747624184095223 )
J Biomed Inform - Dynamic categorization of clinical research eligibility criteria by hierarchical clustering. ( 0,728819309461331 )
J Biomed Inform - An SVM-based high-quality article classifier for systematic reviews. ( 0,725658275073019 )
IEEE Trans Neural Netw Learn Syst - ML-Tree: a tree-structure-based approach to multilabel learning. ( 0,724622276463903 )
J Med Syst - 3D similarity-dissimilarity plot for high dimensional data visualization in the context of biomedical pattern classification. ( 0,721818650345889 )
Comput. Biol. Med. - Relabeling algorithm for retrieval of noisy instances and improving prediction quality. ( 0,716004774047304 )
IEEE Trans Image Process - A linear support higher-order tensor machine for classification. ( 0,709617521752144 )
IEEE Trans Image Process - Multiple-kernel, multiple-instance similarity features for efficient visual object detection. ( 0,696277169838228 )
IEEE Trans Image Process - Learning discriminative dictionary for group sparse representation. ( 0,691174406933011 )
Comput. Biol. Med. - Sparse Manifold Clustering and Embedding to discriminate gene expression profiles of glioblastoma and meningioma tumors. ( 0,690794335319998 )
Neural Comput - Divergence-based vector quantization. ( 0,690767034987135 )
Neural Comput - Extended robust support vector machine based on financial risk minimization. ( 0,687219322720709 )
J Am Med Inform Assoc - A sequence labeling approach to link medications and their attributes in clinical notes and clinical trial announcements for information extraction. ( 0,686980457335091 )
Comput Math Methods Med - On multilabel classification methods of incompletely labeled biomedical text data. ( 0,683707976240606 )
Neural Comput - Adaptive metric learning vector quantization for ordinal classification. ( 0,683597340058377 )
Comput. Biol. Med. - Robust prediction of protein subcellular localization combining PCA and WSVMs. ( 0,679434945090822 )
IEEE Trans Pattern Anal Mach Intell - Learning Hierarchical Features for Scene Labeling. ( 0,678064104740671 )
J Chem Inf Model - Classifying molecules using a sparse probabilistic kernel binary classifier. ( 0,677063260828835 )
IEEE Trans Image Process - Manifold regularized multitask learning for semi-supervised multilabel image classification. ( 0,67196134486279 )
J Biomed Inform - Semi-supervised clinical text classification with Laplacian SVMs: an application to cancer case management. ( 0,671897692715202 )
Neural Comput - Computing sparse representations of multidimensional signals using Kronecker bases. ( 0,671074913093081 )
Int J Neural Syst - Aggregation of sparse linear discriminant analyses for event-related potential classification in brain-computer interface. ( 0,67103717117637 )
J Am Med Inform Assoc - Applying active learning to supervised word sense disambiguation in MEDLINE. ( 0,670659152974926 )
Neural Comput - Reduction from cost-sensitive ordinal ranking to weighted binary classification. ( 0,670259910918532 )
IEEE J Biomed Health Inform - Automatic detection of atrial fibrillation in cardiac vibration signals. ( 0,669287391673108 )
IEEE J Biomed Health Inform - Systematic Poisoning Attacks on and Defenses for Machine Learning in Healthcare. ( 0,667589311380717 )
IEEE Trans Image Process - Hyperspectral image classification through bilayer graph-based learning. ( 0,66567028651651 )
IEEE Trans Image Process - Geodesic propagation for semantic labeling. ( 0,664770874981435 )
J Chem Inf Model - Modeling and benchmark data set for the inhibition of c-Jun N-terminal kinase-3. ( 0,664220484106072 )
J Am Med Inform Assoc - Learning classification models with soft-label information. ( 0,663915941992621 )
Int J Neural Syst - Structurally enhanced incremental neural learning for image classification with subgraph extraction. ( 0,662403579875397 )
IEEE Trans Image Process - Task-specific image partitioning. ( 0,660968774546021 )
J Biomed Inform - Portable automatic text classification for adverse drug reaction detection via multi-corpus training. ( 0,657429061974309 )
J. Comput. Biol. - Imbalanced class learning in epigenetics. ( 0,656933370329838 )
J Integr Bioinform - On the parameter optimization of Support Vector Machines for binary classification. ( 0,655870111707239 )
Comput Methods Programs Biomed - Modified CC-LR algorithm with three diverse feature sets for motor imagery tasks classification in EEG based brain-computer interface. ( 0,65495814750477 )
J Am Med Inform Assoc - Evaluating the utility of syndromic surveillance algorithms for screening to detect potentially clonal hospital infection outbreaks. ( 0,654895747695421 )
Comput Math Methods Med - Correlation kernels for support vector machines classification with applications in cancer data. ( 0,649725341303879 )
IEEE Trans Neural Netw Learn Syst - A Kernel Classification Framework for Metric Learning. ( 0,648891301144808 )
J Am Med Inform Assoc - Active learning for clinical text classification: is it better than random sampling? ( 0,646575086220687 )
J Biomed Inform - Reducing systematic review workload through certainty-based screening. ( 0,641838809400909 )
IEEE Trans Pattern Anal Mach Intell - Distance-Based Image Classification: Generalizing to New Classes at Near Zero Cost. ( 0,638912992098529 )
J Biomed Inform - Temporal relation discovery between events and temporal expressions identified in clinical narrative. ( 0,635425851240798 )
Int J Neural Syst - Span: spike pattern association neuron for learning spatio-temporal spike patterns. ( 0,634348516824811 )
J Biomed Inform - Applying active learning to assertion classification of concepts in clinical text. ( 0,632863419931187 )
IEEE Trans Image Process - A Probabilistic Associative Model for Segmenting Weakly-Supervised Images. ( 0,63213696603036 )
Comput Methods Programs Biomed - An attribute weight assignment and particle swarm optimization algorithm for medical database classifications. ( 0,63209560377431 )
J Am Med Inform Assoc - Missing values in deduplication of electronic patient data. ( 0,631614354227305 )
IEEE Trans Pattern Anal Mach Intell - Feature Selection with Conjunctions of Decision Stumps and Learning from Microarray Data. ( 0,63120321195483 )
J Med Syst - A software framework for building biomedical machine learning classifiers through grid computing resources. ( 0,630441881647472 )
J Chem Inf Model - Training based on ligand efficiency improves prediction of bioactivities of ligands and drug target proteins in a machine learning approach. ( 0,630325256061614 )
AMIA Annu Symp Proc - Comparison and combination of several MeSH indexing approaches. ( 0,629962760089301 )
IEEE Trans Image Process - Saliency and gist features for target detection in satellite images. ( 0,626250628282233 )
IEEE J Biomed Health Inform - Multiple kernel learning in the primal for multimodal Alzheimer's disease classification. ( 0,625534743115605 )
Artif Intell Med - A fuzzy-based data transformation for feature extraction to increase classification performance with small medical data sets. ( 0,623884089306649 )
IEEE Trans Image Process - Multiview Hessian regularization for image annotation. ( 0,62369186536328 )
Artif Intell Med - Exploiting the systematic review protocol for classification of medical abstracts. ( 0,622283167498464 )
IEEE Trans Image Process - Active learning for solving the incomplete data problem in facial age classification by the furthest nearest-neighbor criterion. ( 0,622113341226372 )
IEEE Trans Neural Netw Learn Syst - Adaptive Batch Mode Active Learning. ( 0,621262932760347 )
J Chem Inf Model - A binary ant colony optimization classifier for molecular activities. ( 0,621141611691206 )
Artif Intell Med - Suppressed fuzzy-soft learning vector quantization for MRI segmentation. ( 0,61859034482254 )
Comput Methods Programs Biomed - Complex extreme learning machine applications in terahertz pulsed signals feature sets. ( 0,617799957896371 )
Comput Methods Programs Biomed - A machine learning approach to multi-level ECG signal quality classification. ( 0,617387730577393 )
AMIA Annu Symp Proc - Improving predictions in imbalanced data using Pairwise Expanded Logistic Regression. ( 0,616333482603603 )
Comput Methods Programs Biomed - Hepatitis disease diagnosis using a novel hybrid method based on support vector machine and simulated annealing (SVM-SA). ( 0,614764014980166 )
Comput Methods Programs Biomed - Machine learning algorithms and forced oscillation measurements applied to the automatic identification of chronic obstructive pulmonary disease. ( 0,613008613737376 )
BMC Med Inform Decis Mak - Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features. ( 0,611348488977008 )
Comput Methods Programs Biomed - Multistage approach for clustering and classification of ECG data. ( 0,611285859308425 )
IEEE Trans Pattern Anal Mach Intell - Latent Dirichlet Allocation Models for Image Classification. ( 0,610835511264816 )
J Biomed Inform - Incremental Gaussian Discriminant Analysis based on Graybill and Deal weighted combination of estimators for brain tumour diagnosis. ( 0,610522725413126 )
IEEE Trans Image Process - Design of non-linear kernel dictionaries for object recognition. ( 0,606627972377708 )
J Chem Inf Model - Note on naive Bayes based on binary descriptors in cheminformatics. ( 0,606141163459957 )
BMC Med Inform Decis Mak - Learning to improve medical decision making from imbalanced data without a priori cost. ( 0,605669127284696 )
Artif Intell Med - Prediction of intraoperative complexity from preoperative patient data for laparoscopic cholecystectomy. ( 0,605057922832178 )
IEEE Trans Neural Netw Learn Syst - Ordinal Distance Metric Learning for Image Ranking. ( 0,604853498077936 )
Comput. Biol. Med. - EEG-based emotion estimation using Bayesian weighted-log-posterior function and perceptron convergence algorithm. ( 0,602953442452748 )
IEEE Trans Image Process - Joint segmentation of images and scanned point cloud in large-scale street scenes with low-annotation cost. ( 0,602889025696242 )
IEEE Trans Image Process - Self-supervised online metric learning with low rank constraint for scene categorization. ( 0,602355721398589 )
J Am Med Inform Assoc - Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010. ( 0,602008711521202 )
Neural Comput - Multiple spectral kernel learning and a gaussian complexity computation. ( 0,601701072443645 )
IEEE Trans Image Process - Incremental training of a detector using online sparse eigendecomposition. ( 0,59975682372224 )
Comput Math Methods Med - Comparison of two methods forecasting binding rate of plasma protein. ( 0,599260551474313 )
J Biomed Inform - Classifying temporal relations in clinical data: a hybrid, knowledge-rich approach. ( 0,598457898675651 )
IEEE Trans Pattern Anal Mach Intell - A Bag-of-Features Framework to Classify Time Series. ( 0,598312351625333 )
Int J Neural Syst - Online semi-supervised growing neural gas. ( 0,597851700128545 )
J Am Med Inform Assoc - Supervised machine learning and active learning in classification of radiology reports. ( 0,597169015300945 )
Artif Intell Med - A classifier ensemble approach for the missing feature problem. ( 0,596071703797985 )
Brief. Bioinformatics - Class-imbalanced classifiers for high-dimensional data. ( 0,596052247832676 )
Comput Biol Chem - CE-PLoc: an ensemble classifier for predicting protein subcellular locations by fusing different modes of pseudo amino acid composition. ( 0,595789188692578 )
Comput. Biol. Med. - Identification of epilepsy stages from ECoG using genetic programming classifiers. ( 0,593434587115353 )
Neural Comput - Feature selection for ordinal text classification. ( 0,592665073898358 )
AMIA Annu Symp Proc - Sample-efficient learning with auxiliary class-label information. ( 0,592644017268173 )
J Biomed Inform - Learning classification models from multiple experts. ( 0,592270550588791 )
AMIA Annu Symp Proc - Classification of medication status change in clinical narratives. ( 0,591934688728864 )
Int J Med Inform - An exploratory study of a text classification framework for Internet-based surveillance of emerging epidemics. ( 0,591093592106339 )
Artif Intell Med - Multi-objective evolutionary algorithms for fuzzy classification in survival prediction. ( 0,590417155836042 )
Neural Comput - Metacognitive learning in a fully complex-valued radial basis function neural network. ( 0,59022163611419 )
IEEE Trans Pattern Anal Mach Intell - Learning Categories from Few Examples with Multi Model Knowledge Transfer. ( 0,589162052604731 )
Comput Biol Chem - Classification of splice-junction sequences via weighted position specific scoring approach. ( 0,589140576115171 )