J Biomed Inform - Boosting performance of gene mention tagging system by hybrid methods.

Tópicos

{ featur(3375) classif(2383) classifi(1994) }
{ method(2212) result(1239) propos(1039) }
{ method(1219) similar(1157) match(930) }
{ extract(1171) text(1153) clinic(932) }
{ model(2220) cell(1177) simul(1124) }
{ visual(1396) interact(850) tool(830) }
{ first(2504) two(1366) second(1323) }
{ general(901) number(790) one(736) }
{ time(1939) patient(1703) rate(768) }
{ network(2748) neural(1063) input(814) }
{ framework(1458) process(801) describ(734) }
{ research(1085) discuss(1038) issu(1018) }
{ perform(1367) use(1326) method(1137) }
{ data(3008) multipl(1320) sourc(1022) }
{ analysi(2126) use(1163) compon(1037) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ search(2224) databas(1162) retriev(909) }
{ case(1353) use(1143) diagnosi(1136) }
{ system(1050) medic(1026) inform(1018) }
{ import(1318) role(1303) understand(862) }
{ blood(1257) pressur(1144) flow(957) }
{ monitor(1329) mobil(1314) devic(1160) }
{ medic(1828) order(1363) alert(1069) }
{ signal(2180) analysi(812) frequenc(800) }
{ gene(2352) biolog(1181) express(1162) }
{ health(1844) social(1437) communiti(874) }
{ structur(1116) can(940) graph(676) }
{ cancer(2502) breast(956) screen(824) }
{ use(1733) differ(960) four(931) }
{ activ(1452) weight(1219) physic(1104) }
{ model(3404) distribut(989) bayesian(671) }
{ can(774) often(719) complex(702) }
{ imag(1947) propos(1133) code(1026) }
{ data(1737) use(1416) pattern(1282) }
{ inform(2794) health(2639) internet(1427) }
{ system(1976) rule(880) can(841) }
{ measur(2081) correl(1212) valu(896) }
{ imag(1057) registr(996) error(939) }
{ bind(1733) structur(1185) ligand(1036) }
{ sequenc(1873) structur(1644) protein(1328) }
{ imag(2830) propos(1344) filter(1198) }
{ imag(2675) segment(2577) method(1081) }
{ patient(2315) diseas(1263) diabet(1191) }
{ take(945) account(800) differ(722) }
{ studi(2440) review(1878) systemat(933) }
{ motion(1329) object(1292) video(1091) }
{ assess(1506) score(1403) qualiti(1306) }
{ treatment(1704) effect(941) patient(846) }
{ problem(2511) optim(1539) algorithm(950) }
{ error(1145) method(1030) estim(1020) }
{ chang(1828) time(1643) increas(1301) }
{ learn(2355) train(1041) set(1003) }
{ concept(1167) ontolog(924) domain(897) }
{ clinic(1479) use(1117) guidelin(835) }
{ algorithm(1844) comput(1787) effici(935) }
{ method(1557) propos(1049) approach(1037) }
{ data(1714) softwar(1251) tool(1186) }
{ design(1359) user(1324) use(1319) }
{ control(1307) perform(991) simul(935) }
{ care(1570) inform(1187) nurs(1089) }
{ method(984) reconstruct(947) comput(926) }
{ featur(1941) imag(1645) propos(1176) }
{ howev(809) still(633) remain(590) }
{ data(3963) clinic(1234) research(1004) }
{ studi(1410) differ(1259) use(1210) }
{ risk(3053) factor(974) diseas(938) }
{ perform(999) metric(946) measur(919) }
{ model(2341) predict(2261) use(1141) }
{ compound(1573) activ(1297) structur(1058) }
{ studi(1119) effect(1106) posit(819) }
{ spatial(1525) area(1432) region(1030) }
{ record(1888) medic(1808) patient(1693) }
{ health(3367) inform(1360) care(1135) }
{ model(3480) simul(1196) paramet(876) }
{ ehr(2073) health(1662) electron(1139) }
{ state(1844) use(1261) util(961) }
{ research(1218) medic(880) student(794) }
{ patient(2837) hospit(1953) medic(668) }
{ model(2656) set(1616) predict(1553) }
{ data(2317) use(1299) case(1017) }
{ age(1611) year(1155) adult(843) }
{ cost(1906) reduc(1198) effect(832) }
{ group(2977) signific(1463) compar(1072) }
{ sampl(1606) size(1419) use(1276) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ patient(1821) servic(1111) care(1106) }
{ use(2086) technolog(871) perceiv(783) }
{ can(981) present(881) function(850) }
{ high(1669) rate(1365) level(1280) }
{ use(976) code(926) identifi(902) }
{ drug(1928) target(777) effect(648) }
{ result(1111) use(1088) new(759) }
{ implement(1333) system(1263) develop(1122) }
{ survey(1388) particip(1329) question(1065) }
{ estim(2440) model(1874) function(577) }
{ decis(3086) make(1611) patient(1517) }
{ process(1125) use(805) approach(778) }
{ method(1969) cluster(1462) data(1082) }
{ detect(2391) sensit(1101) algorithm(908) }

Resumo

NER (Named Entity Recognition) in biomedical literature is presently one of the internationally concerned NLP (Natural Language Processing) research questions. In order to get higher performance, a hybrid experimental framework is presented for the gene mention tagging task. Six classifiers are firstly constructed by four toolkits (CRF++, YamCha, Maximum Entropy (ME) and MALLET) with different training methods and features sets, and then combined with three different hybrid methods respectively: simple set operation method, voting method and two layer stacking method. Experiments carried out on the corpus of BioCreative II GM task show that the three hybrid methods get the F-measure of 87.40%, 87.31% and 87.70% separately without any post-processing, which are all higher than those of any single ones. Our best hybrid method (two layer stacking method) achieves an F-measure of 88.42% after post-processing, which outperforms most of the state-of-the-art systems. We also discuss the influence on the performance of the ensemble system by the number, performance and divergence of single classifiers in each hybrid method, and give the corresponding analysis why our hybrid models can improve the performance.

Resumo Limpo

ner name entiti recognit biomed literatur present one intern concern nlp natur languag process research question order get higher perform hybrid experiment framework present gene mention tag task six classifi first construct four toolkit crf yamcha maximum entropi mallet differ train method featur set combin three differ hybrid method respect simpl set oper method vote method two layer stack method experi carri corpus biocreat ii gm task show three hybrid method get fmeasur separ without postprocess higher singl one best hybrid method two layer stack method achiev fmeasur postprocess outperform stateoftheart system also discuss influenc perform ensembl system number perform diverg singl classifi hybrid method give correspond analysi hybrid model can improv perform

Resumos Similares

Comput. Biol. Med. - A classification system based on a new wrapper feature selection algorithm for the diagnosis of primary and secondary polycythemia. ( 0,834896256342959 )
Comput. Biol. Med. - An ensemble system for automatic sleep stage classification using single channel EEG signal. ( 0,816771186718961 )
J Biomed Inform - A biological continuum based approach for efficient clinical classification. ( 0,816102850247377 )
J Biomed Inform - Automatic figure classification in bioscience literature. ( 0,813810708691084 )
Comput. Biol. Med. - Contourlet-based mammography mass classification using the SVM family. ( 0,809107617227407 )
Comput Math Methods Med - Discrimination between Alzheimer's disease and mild cognitive impairment using SOM and PSO-SVM. ( 0,796710770803785 )
Neural Comput - An Infomax algorithm can perform both familiarity discrimination and feature extraction in a single network. ( 0,792028059304281 )
J Med Syst - A comparative study on classification of sleep stage based on EEG signals using feature selection and classification algorithms. ( 0,788493291378402 )
J Med Syst - A three-stage expert system based on support vector machines for thyroid disease diagnosis. ( 0,786910288385061 )
Comput Math Methods Med - SVM versus MAP on accelerometer data to distinguish among locomotor activities executed at different speeds. ( 0,783957646380969 )
J Am Med Inform Assoc - Influenza detection from emergency department reports using natural language processing and Bayesian network classifiers. ( 0,783678353619348 )
Comput. Biol. Med. - Disulfide connectivity prediction based on structural information without a prior knowledge of the bonding state of cysteines. ( 0,782642770054816 )
J Med Syst - Enhanced cancer recognition system based on random forests feature elimination algorithm. ( 0,777973904291941 )
Comput Biol Chem - Derivation of an artificial gene to improve classification accuracy upon gene selection. ( 0,770491209586131 )
Comput. Biol. Med. - Fast and efficient lung disease classification using hierarchical one-against-all support vector machine and cost-sensitive feature selection. ( 0,769708337172433 )
Comput Biol Chem - newDNA-Prot: Prediction of DNA-binding proteins by employing support vector machine and a comprehensive sequence representation. ( 0,765717595065632 )
Comput. Biol. Med. - A novel class dependent feature selection method for cancer biomarker discovery. ( 0,76541709440937 )
Comput. Biol. Med. - Classification of EMG signals using PSO optimized SVM for diagnosis of neuromuscular disorders. ( 0,765067391876291 )
Comput Biol Chem - Information-theoretic approaches to SVM feature selection for metagenome read classification. ( 0,764320891800764 )
J Med Syst - Detection of carotid artery disease by using Learning Vector Quantization Neural Network. ( 0,763782064238884 )
Comput Methods Programs Biomed - A new hybrid intelligent system for accurate detection of Parkinson's disease. ( 0,761596323576428 )
Artif Intell Med - Computer-aided diagnosis of pulmonary nodules using a two-step approach for feature selection and classifier ensemble construction. ( 0,760823843809834 )
Comput. Biol. Med. - Gene expression microarray classification using PCA-BEL. ( 0,758841328105457 )
Comput. Biol. Med. - Pairwise FCM based feature weighting for improved classification of vertebral column disorders. ( 0,757353026343336 )
J Med Syst - SVM feature selection based rotation forest ensemble classifiers to improve computer-aided diagnosis of Parkinson disease. ( 0,757300794761827 )
Comput Methods Programs Biomed - Supervised hybrid feature selection based on PSO and rough sets for medical diagnosis. ( 0,756921650661289 )
Artif Intell Med - An intelligent classifier for prognosis of cardiac resynchronization therapy based on speckle-tracking echocardiograms. ( 0,753211863130606 )
J Med Syst - A robust multi-class feature selection strategy based on Rotation Forest Ensemble algorithm for diagnosis of Erythemato-Squamous diseases. ( 0,752336674703592 )
Int J Comput Assist Radiol Surg - Building an ensemble system for diagnosing masses in mammograms. ( 0,751366300414311 )
Comput Math Methods Med - Comparison of different EHG feature selection methods for the detection of preterm labor. ( 0,750812342255831 )
Int J Neural Syst - Single-trial motor imagery classification using asymmetry ratio, phase relation, wavelet-based fractal, and their selected combination. ( 0,746950275239161 )
Comput Methods Programs Biomed - A random forest classifier for lymph diseases. ( 0,746146178737643 )
J Med Syst - Application of higher order spectra to identify epileptic EEG. ( 0,741657892427575 )
J Biomed Inform - A fast gene selection method for multi-cancer classification using multiple support vector data description. ( 0,736115911734704 )
Artif Intell Med - Improving the Mann-Whitney statistical test for feature selection: an approach in breast cancer diagnosis on mammography. ( 0,735005520416356 )
Int J Neural Syst - Assessment of feature selection and classification approaches to enhance information from overnight oximetry in the context of apnea diagnosis. ( 0,734450416780778 )
IEEE J Biomed Health Inform - Support vector machine classification based on correlation prototypes applied to bone age assessment. ( 0,733522371082218 )
Int J Neural Syst - Combination of heterogeneous EEG feature extraction methods and stacked sequential learning for sleep stage classification. ( 0,733451378820539 )
J Am Med Inform Assoc - Learning regular expressions for clinical text classification. ( 0,732276253068762 )
Comput Methods Programs Biomed - Automatic cervical cell segmentation and classification in Pap smears. ( 0,73213138281534 )
Int J Comput Assist Radiol Surg - Multimodality GPU-based computer-assisted diagnosis of breast cancer using ultrasound and digital mammography images. ( 0,730618202049691 )
J Med Syst - A new expert system for diagnosis of lung cancer: GDA-LS_SVM. ( 0,726639903815375 )
Comput. Biol. Med. - Heartbeat classification using disease-specific feature selection. ( 0,726553886536556 )
Comput Biol Chem - A novel divide-and-merge classification for high dimensional datasets. ( 0,725688918191053 )
Neural Comput - The support feature machine: classification with the least number of features and application to neuroimaging data. ( 0,724761917811707 )
Artif Intell Med - Texture feature ranking with relevance learning to classify interstitial lung disease patterns. ( 0,723603775212706 )
Comput Methods Programs Biomed - Performance comparison of machine learning methods for prognosis of hormone receptor status in breast cancer tissue samples. ( 0,72258957586248 )
Artif Intell Med - Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features. ( 0,721111515753794 )
Comput. Biol. Med. - Extracting predictive SNPs in Crohn's disease using a vacillating genetic algorithm and a neural classifier in case-control association studies. ( 0,720954951405909 )
Comput Math Methods Med - Feature selection in classification of eye movements using electrooculography for activity recognition. ( 0,720569859259491 )
Comput Biol Chem - Compact cancer biomarkers discovery using a swarm intelligence feature selection algorithm. ( 0,720195477555826 )
Comput Math Methods Med - Determination of fetal state from cardiotocogram using LS-SVM with particle swarm optimization and binary decision tree. ( 0,719256539540235 )
J Med Syst - Symptomatic vs. asymptomatic plaque classification in carotid ultrasound. ( 0,718598864693974 )
Comput. Biol. Med. - SVM-based feature selection to optimize sensitivity-specificity balance applied to weaning. ( 0,71812595851986 )
J Med Syst - Classification of speech dysfluencies using LPC based parameterization techniques. ( 0,717346993021084 )
Comput Methods Programs Biomed - Hepatitis disease diagnosis using a novel hybrid method based on support vector machine and simulated annealing (SVM-SA). ( 0,716004034441054 )
Comput. Biol. Med. - Ant colony optimization-based feature selection method for surface electromyography signals classification. ( 0,715379714569756 )
Med Biol Eng Comput - Evaluation of feature extraction methods for EEG-based brain-computer interfaces in terms of robustness to slight changes in electrode locations. ( 0,711802585484644 )
AMIA Annu Symp Proc - Word Sense Disambiguation of clinical abbreviations with hyperdimensional computing. ( 0,71097128570715 )
J Med Syst - Luminance sticker based facial expression recognition using discrete wavelet transform for physically disabled persons. ( 0,710177612298808 )
J Med Syst - Classification of normal and diseased liver shapes based on Spherical Harmonics coefficients. ( 0,708677525200461 )
J Chem Inf Model - Classifying molecules using a sparse probabilistic kernel binary classifier. ( 0,708062167123613 )
Comput. Biol. Med. - Automatic sleep staging from ventilator signals in non-invasive ventilation. ( 0,707503659476288 )
Int J Neural Syst - Improved adaptive splitting and selection: the hybrid training method of a classifier based on a feature space partitioning. ( 0,705415815452678 )
Comput Math Methods Med - Knee joint vibration signal analysis with matching pursuit decomposition and dynamic weighted classifier fusion. ( 0,702200859325609 )
Med Biol Eng Comput - SEMG-based hand motion recognition using cumulative residual entropy and extreme learning machine. ( 0,701582312485809 )
J Am Med Inform Assoc - A comparative analysis of methods for predicting clinical outcomes using high-dimensional genomic datasets. ( 0,701299367902772 )
Comput. Biol. Med. - A new dataset evaluation method based on category overlap. ( 0,700076494018229 )
Comput Math Methods Med - Comparison of the data classification approaches to diagnose spinal cord injury. ( 0,699572932891156 )
J Med Syst - An intelligent system for lung cancer diagnosis using a new genetic algorithm based feature selection method. ( 0,698412839186276 )
J Am Med Inform Assoc - A sequence labeling approach to link medications and their attributes in clinical notes and clinical trial announcements for information extraction. ( 0,697857998574272 )
Artif Intell Med - Document classification for mining host pathogen protein-protein interactions. ( 0,696513522179524 )
Comput. Biol. Med. - Ensemble classification of colon biopsy images based on information rich hybrid features. ( 0,694421157114013 )
Artif Intell Med - Instance-based classifiers applied to medical databases: diagnosis and knowledge extraction. ( 0,693207402422135 )
Med Biol Eng Comput - Wavelet-based sparse functional linear model with applications to EEGs seizure detection and epilepsy diagnosis. ( 0,69244708819013 )
IEEE Trans Image Process - A novel technique for subpixel image classification based on support vector machine. ( 0,691955108284843 )
J Med Syst - Statistical analysis of textural features for improved classification of oral histopathological images. ( 0,691816277321251 )
Comput Methods Programs Biomed - Functional activity maps based on significance measures and Independent Component Analysis. ( 0,691757307102974 )
J Med Syst - Similarity-dissimilarity plot for visualization of high dimensional data in biomedical pattern classification. ( 0,691727997518993 )
IEEE J Biomed Health Inform - Extracting and Selecting Distinctive EEG Features for Efficient Epileptic Seizure Prediction. ( 0,691427664189062 )
IEEE Trans Neural Netw Learn Syst - FREL: A Stable Feature Selection Algorithm. ( 0,691088760382059 )
Comput Methods Programs Biomed - An improved method of early diagnosis of smoking-induced respiratory changes using machine learning algorithms. ( 0,691048872142427 )
Int J Neural Syst - Extraction of neural control commands using myoelectric pattern recognition: a novel application in adults with cerebral palsy. ( 0,690768221306653 )
Comput Math Methods Med - An ensemble-of-classifiers based approach for early diagnosis of Alzheimer's disease: classification using structural features of brain images. ( 0,689214824212023 )
Brief. Bioinformatics - Class-imbalanced classifiers for high-dimensional data. ( 0,688335151196026 )
Comput Methods Programs Biomed - Computer-supported diagnosis for endotension cases in endovascular aortic aneurysm repair evolution. ( 0,687604676101904 )
IEEE Trans Image Process - Smile detection by boosting pixel differences. ( 0,686954599141722 )
Comput Biol Chem - CE-PLoc: an ensemble classifier for predicting protein subcellular locations by fusing different modes of pseudo amino acid composition. ( 0,686406429142232 )
Comput Methods Programs Biomed - A hybrid system based on information gain and principal component analysis for the classification of transcranial Doppler signals. ( 0,68434337608458 )
Comput. Biol. Med. - Ensemble selection for feature-based classification of diabetic maculopathy images. ( 0,682371380722852 )
IEEE J Biomed Health Inform - Recognizing common CT imaging signs of lung diseases through a new feature selection method based on Fisher criterion and genetic optimization. ( 0,681386285672865 )
J Biomed Inform - A genetic algorithm-support vector machine method with parameter optimization for selecting the tag SNPs. ( 0,680307866338754 )
IEEE Trans Image Process - Walsh-Hadamard transform kernel-based feature vector for shot boundary detection. ( 0,680233590447912 )
Comput. Biol. Med. - A hybrid feature selection method for DNA microarray data. ( 0,679407499392886 )
J Chem Inf Model - Classifier ensemble based on feature selection and diversity measures for predicting the affinity of A(2B) adenosine receptor antagonists. ( 0,679289569769942 )
Comput Math Methods Med - An expert system based on Fisher score and LS-SVM for cardiac arrhythmia diagnosis. ( 0,679085108957112 )
J Med Syst - Usage of case-based reasoning, neural network and adaptive neuro-fuzzy inference system classification techniques in breast cancer dataset classification diagnosis. ( 0,678592608892633 )
Comput Methods Programs Biomed - Classification of normal and epileptic seizure EEG signals using wavelet transform, phase-space reconstruction, and Euclidean distance. ( 0,676674344079973 )
Artif Intell Med - A supervised method to assist the diagnosis and monitor progression of Alzheimer's disease using data from an fMRI experiment. ( 0,674771541395085 )
BMC Med Inform Decis Mak - Application of support vector machine modeling for prediction of common diseases: the case of diabetes and pre-diabetes. ( 0,674683560593507 )