Artif Intell Med - Predicting malaria interactome classifications from time-course transcriptomic data along the intraerythrocytic developmental cycle.

Topics

{ sequenc(1873) structur(1644) protein(1328) }
{ gene(2352) biolog(1181) express(1162) }
{ imag(2675) segment(2577) method(1081) }
{ extract(1171) text(1153) clinic(932) }
{ structur(1116) can(940) graph(676) }
{ method(1969) cluster(1462) data(1082) }
{ howev(809) still(633) remain(590) }
{ can(774) often(719) complex(702) }
{ data(3008) multipl(1320) sourc(1022) }
{ chang(1828) time(1643) increas(1301) }
{ risk(3053) factor(974) diseas(938) }
{ compound(1573) activ(1297) structur(1058) }
{ age(1611) year(1155) adult(843) }
{ high(1669) rate(1365) level(1280) }
{ use(976) code(926) identifi(902) }
{ model(3404) distribut(989) bayesian(671) }
{ studi(2440) review(1878) systemat(933) }
{ algorithm(1844) comput(1787) effici(935) }
{ design(1359) user(1324) use(1319) }
{ studi(1119) effect(1106) posit(819) }
{ research(1218) medic(880) student(794) }
{ signal(2180) analysi(812) frequenc(800) }
{ drug(1928) target(777) effect(648) }
{ method(2212) result(1239) propos(1039) }
{ data(1737) use(1416) pattern(1282) }
{ imag(1057) registr(996) error(939) }
{ featur(3375) classif(2383) classifi(1994) }
{ imag(2830) propos(1344) filter(1198) }
{ take(945) account(800) differ(722) }
{ surgeri(1148) surgic(1085) robot(1054) }
{ framework(1458) process(801) describ(734) }
{ concept(1167) ontolog(924) domain(897) }
{ method(1557) propos(1049) approach(1037) }
{ method(984) reconstruct(947) comput(926) }
{ data(3963) clinic(1234) research(1004) }
{ studi(1410) differ(1259) use(1210) }
{ state(1844) use(1261) util(961) }
{ patient(2837) hospit(1953) medic(668) }
{ data(2317) use(1299) case(1017) }
{ medic(1828) order(1363) alert(1069) }
{ cost(1906) reduc(1198) effect(832) }
{ group(2977) signific(1463) compar(1072) }
{ use(1733) differ(960) four(931) }
{ implement(1333) system(1263) develop(1122) }
{ survey(1388) particip(1329) question(1065) }
{ decis(3086) make(1611) patient(1517) }
{ activ(1452) weight(1219) physic(1104) }
{ imag(1947) propos(1133) code(1026) }
{ inform(2794) health(2639) internet(1427) }
{ system(1976) rule(880) can(841) }
{ measur(2081) correl(1212) valu(896) }
{ bind(1733) structur(1185) ligand(1036) }
{ method(1219) similar(1157) match(930) }
{ network(2748) neural(1063) input(814) }
{ patient(2315) diseas(1263) diabet(1191) }
{ motion(1329) object(1292) video(1091) }
{ assess(1506) score(1403) qualiti(1306) }
{ treatment(1704) effect(941) patient(846) }
{ problem(2511) optim(1539) algorithm(950) }
{ error(1145) method(1030) estim(1020) }
{ learn(2355) train(1041) set(1003) }
{ clinic(1479) use(1117) guidelin(835) }
{ data(1714) softwar(1251) tool(1186) }
{ control(1307) perform(991) simul(935) }
{ model(2220) cell(1177) simul(1124) }
{ care(1570) inform(1187) nurs(1089) }
{ general(901) number(790) one(736) }
{ search(2224) databas(1162) retriev(909) }
{ featur(1941) imag(1645) propos(1176) }
{ case(1353) use(1143) diagnosi(1136) }
{ perform(999) metric(946) measur(919) }
{ research(1085) discuss(1038) issu(1018) }
{ system(1050) medic(1026) inform(1018) }
{ import(1318) role(1303) understand(862) }
{ model(2341) predict(2261) use(1141) }
{ visual(1396) interact(850) tool(830) }
{ perform(1367) use(1326) method(1137) }
{ blood(1257) pressur(1144) flow(957) }
{ spatial(1525) area(1432) region(1030) }
{ record(1888) medic(1808) patient(1693) }
{ health(3367) inform(1360) care(1135) }
{ model(3480) simul(1196) paramet(876) }
{ monitor(1329) mobil(1314) devic(1160) }
{ ehr(2073) health(1662) electron(1139) }
{ model(2656) set(1616) predict(1553) }
{ sampl(1606) size(1419) use(1276) }
{ first(2504) two(1366) second(1323) }
{ intervent(3218) particip(2042) group(1664) }
{ activ(1138) subject(705) human(624) }
{ time(1939) patient(1703) rate(768) }
{ patient(1821) servic(1111) care(1106) }
{ use(2086) technolog(871) perceiv(783) }
{ can(981) present(881) function(850) }
{ analysi(2126) use(1163) compon(1037) }
{ health(1844) social(1437) communiti(874) }
{ cancer(2502) breast(956) screen(824) }
{ result(1111) use(1088) new(759) }
{ estim(2440) model(1874) function(577) }
{ process(1125) use(805) approach(778) }
{ detect(2391) sensit(1101) algorithm(908) }

Abstract

OBJECTIVE: Even though a vaccine for malaria infections has been under intense study for many years, it has resisted several different lines of attack attempted by biologists. More than half of Plasmodium proteins remain uncharacterized and therefore cannot be used in clinical trials. The task is further complicated by the metamorphic life cycle of the parasite, which allows for rapid evolutionary changes and diversity among related strains, making precise targeting of the appropriate proteins for vaccination a technical challenge. We propose an automated method for predicting protein functions for the malaria parasite, which capitalizes on intraerythrocytic developmental cycle data and on expression changes during the cycle's five phases, as determined computationally by our segmentation algorithm.
MATERIALS AND METHODS: Our method combines temporal gene expression profiles with protein-protein interaction data, sequence similarity scores, and metabolic pathway information to produce a set of predicted protein functions that can be used as targets for vaccine development. We use a Bayesian approach, which assigns to each protein a probability of having (or not having) a particular function, given the various sources of evidence. In our method, each data source is represented by either a functional linkage graph or a categorical feature vector.
RESULTS AND CONCLUSIONS: The methods are tested on Plasmodium falciparum, the species responsible for the deadliest malaria infections. The algorithm was able to assign meaningful functions to 628 out of 1439 previously unannotated proteins, which are first-choice candidates for experimental vaccine research. We conclude that analyzing time-course gene expression profiles in separate phases leads to much higher prediction accuracy than using Pearson correlation coefficients computed across the time course as a whole. Additionally, we demonstrate that temporal expression profiles alone can improve the predictive power of the integrated data.
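
The contrast between phase-wise analysis and a single whole-course Pearson coefficient can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how the phases are scored or how the segmentation algorithm sets their boundaries, so the function names (`pearson`, `phase_correlations`), the toy profiles, and the two hand-picked phase boundaries below are all assumptions made for illustration.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length profiles.

    Returns 0.0 when either profile is constant (an assumed convention,
    chosen here only to avoid dividing by zero).
    """
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("profiles must have equal length of at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def phase_correlations(x, y, boundaries):
    """Correlate two profiles separately within each phase.

    `boundaries` is a list of (start, end) index pairs, end exclusive.
    In the paper these would come from the segmentation algorithm;
    here they are supplied by hand (hypothetical values).
    """
    return [pearson(x[start:end], y[start:end]) for start, end in boundaries]


# Toy expression profiles for two hypothetical genes over six time points.
gene_a = [1, 2, 3, 3, 2, 1]
gene_b = [1, 2, 3, 1, 2, 3]

# Two illustrative phases (the paper uses five).
phases = [(0, 3), (3, 6)]

whole_course = pearson(gene_a, gene_b)
per_phase = phase_correlations(gene_a, gene_b, phases)

print("whole course:", whole_course)
print("per phase:   ", per_phase)
```

In this toy case the two genes rise together in the first phase and move in opposite directions in the second, so the per-phase coefficients are strongly positive and strongly negative while the whole-course coefficient cancels out to zero. That is one plausible reason a phase-aware comparison could carry more information than a single coefficient, though the abstract itself does not spell out the mechanism.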

Cleaned Abstract

jectiv even though vaccin malaria infect intens studi mani year resist sever differ line attack attempt biologist half plasmodium protein still remain uncharacter therefor use clinic trial task complic metamorph lifecycl parasit allow rapid evolutionari chang divers among relat strain thus make precis target appropri protein vaccin technic challeng propos autom method predict function malaria parasit capit import intraerythrocyt development cycl data express chang five phase determin comput segment algorithmmateri method method combin tempor gene express profil proteinprotein interact data sequenc similar score metabol pathway inform produc set predict protein function can use target vaccin develop use bayesian approach assign probabl particular function protein given various sourc evid method data sourc repres either function linkag graph categor featur vectorresult conclus method test plasmodium falciparum speci respons deadliest malaria infect algorithm abl assign meaning function previous unannot protein firstchoic candid experiment vaccin research conclud analyz timecours gene express profil separ phase lead much higher predict accuraci compar pearson correl coeffici comput across time cours whole addit demonstr tempor express profil alon abl improv predict power integr data

Similar Abstracts

Comput Biol Chem - Large replication skew domains delimit GC-poor gene deserts in human. ( 0,790962071847336 )
Comput Biol Chem - Gene expression regulation of the PF00480 or PF14340 domain proteins suggests their involvement in sulfur metabolism. ( 0,785803767360748 )
Wiley Interdiscip Rev Syst Biol Med - Mass spectrometry-based proteomics: qualitative identification to activity-based protein profiling. ( 0,747045562963983 )
Brief. Bioinformatics - Application of second-generation sequencing to cancer genomics. ( 0,738531254871987 )
Brief. Bioinformatics - New developments of alignment-free sequence comparison: measures, statistics and next-generation sequencing. ( 0,734840743596257 )
Brief. Bioinformatics - The genomic and functional characteristics of disease genes. ( 0,731076938464363 )
Brief. Bioinformatics - Experimental evidence validating the computational inference of functional associations from gene fusion events: a critical survey. ( 0,730619018249421 )
Comput Biol Chem - Identification of potential drug targets by subtractive genome analysis of Bacillus anthracis A0248: An in silico approach. ( 0,723481582624007 )
BMC Med Inform Decis Mak - Improved method for protein complex detection using bottleneck proteins. ( 0,720034299136456 )
Comput Biol Chem - In silico characterization and evolutionary analyses of CCAAT binding proteins in the lycophyte plant Selaginella moellendorffii genome: a growing comparative genomics resource. ( 0,719645760592697 )
Comput Methods Programs Biomed - Pinda: a web service for detection and analysis of intraspecies gene duplication events. ( 0,711856801530635 )
Comput Biol Chem - Identical sequence patterns in the ends of exons and introns of human protein-coding genes. ( 0,708826513863829 )
Comput Biol Chem - Global expression analysis of miRNA gene cluster and family based on isomiRs from deep sequencing data. ( 0,705753492851469 )
Comput Biol Chem - In silico identification of conserved microRNAs and their target transcripts from expressed sequence tags of three earthworm species. ( 0,70095364445289 )
Brief. Bioinformatics - Positional orthology: putting genomic evolutionary relationships into context. ( 0,70041009507609 )
Brief. Bioinformatics - Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium. ( 0,695990471343416 )
J Integr Bioinform - Construction of coffee transcriptome networks based on gene annotation semantics. ( 0,694970649013266 )
Comput Biol Chem - Molecular phylogenetic study and expression analysis of ATP-binding cassette transporter gene family in Oryza sativa in response to salt stress. ( 0,692334143301974 )
Brief. Bioinformatics - Rich annotation of DNA sequencing variants by leveraging the Ensembl Variant Effect Predictor with plugins. ( 0,687718504245296 )
J Integr Bioinform - Identification of common carp innate immune genes with whole-genome sequencing and RNA-Seq data. ( 0,679965563632295 )
Comput Biol Chem - Menzerath-Altmann law in mammalian exons reflects the dynamics of gene structure evolution. ( 0,676599276408748 )
Comput. Biol. Med. - Evolution of the mir-181 microRNA family. ( 0,675421128572524 )
Brief. Bioinformatics - Motif discovery and transcription factor binding sites before and after the next-generation sequencing era. ( 0,67414522015933 )
Brief. Bioinformatics - Comparative analysis of methods for genome-wide nucleosome cartography. ( 0,671459727122835 )
J. Comput. Biol. - Detection of structural variants involving repetitive regions in the reference genome. ( 0,667705416757818 )
Brief. Bioinformatics - Sequencing technologies and tools for short tandem repeat variation detection. ( 0,664453759994622 )
Comput Biol Chem - Gene cloning, homology comparison and analysis of the main functional structure domains of beta estrogen receptor in Jining Gray goat. ( 0,664083496360379 )
J. Comput. Biol. - Evaluating, comparing, and interpreting protein domain hierarchies. ( 0,661525171256338 )
J. Comput. Biol. - ImagePlane: an automated image analysis pipeline for high-throughput screens using the planarian Schmidtea mediterranea. ( 0,659526104952751 )
Comput Biol Chem - Characterizing regions in the human genome unmappable by next-generation-sequencing at the read length of 1000 bases. ( 0,657427975244897 )
Comput. Biol. Med. - Impact of TGF-b on breast cancer from a quantitative proteomic analysis. ( 0,657372965074667 )
Comput Biol Chem - Identification of putative and potential cross-reactive chickpea (Cicer arietinum) allergens through an in silico approach. ( 0,656900908214575 )
Comput. Biol. Med. - A content and structural assessment of oxidative motifs across a diverse set of life forms. ( 0,655995139003948 )
J. Comput. Biol. - A probabilistic model of neutral and selective dynamics of protein network evolution. ( 0,655990460890539 )
J. Comput. Biol. - Nonparametric combinatorial sequence models. ( 0,655836068246137 )
Wiley Interdiscip Rev Syst Biol Med - Postgenomic technologies targeting the Wnt signaling network. ( 0,652996976068907 )
Brief. Bioinformatics - Systematic identification of Class I HDAC substrates. ( 0,648390488868957 )
J. Comput. Biol. - Node fingerprinting: an efficient heuristic for aligning biological networks. ( 0,648000988300407 )
J. Comput. Biol. - Modeling alternative splicing variants from RNA-Seq data with isoform graphs. ( 0,646702363496532 )
J. Comput. Biol. - Parallel continuous flow: a parallel suffix tree construction tool for whole genomes. ( 0,646612358659324 )
Brief. Bioinformatics - Ultrafast clustering algorithms for metagenomic sequence analysis. ( 0,645152890349794 )
Comput Biol Chem - Structural characteristics of genomic islands associated with GMP synthases as integration hotspot among sequenced microbial genomes. ( 0,644763294427515 )
Comput Biol Chem - Computational insight into nitration of human myoglobin. ( 0,64344023272889 )
J Integr Bioinform - The topology of the growing human interactome data. ( 0,643381386694866 )
Brief. Bioinformatics - Computational methods for Gene Orthology inference. ( 0,642654614944149 )
J. Comput. Biol. - Triplex DNA:RNA, 3'-to-5' inverted RNA and protein coding in mitochondrial genomes. ( 0,642271959510812 )
Comput Biol Chem - Genes under positive selection in Mycobacterium tuberculosis. ( 0,639543100365885 )
Comput Biol Chem - lncRNAMap: a map of putative regulatory functions in the long non-coding transcriptome. ( 0,63941166367826 )
J. Comput. Biol. - AREM: aligning short reads from ChIP-sequencing by expectation maximization. ( 0,639300193852129 )
Comput. Biol. Med. - A protein mapping method based on physicochemical properties and dimension reduction. ( 0,639223351475176 )
Sci Data - Genomes of diverse isolates of the marine cyanobacterium Prochlorococcus. ( 0,639153314662043 )
J. Comput. Biol. - Simultaneous alignment and folding of protein sequences. ( 0,638714867224435 )
Comput Methods Programs Biomed - GREMET: an integrative tool for the prediction of mutation effects on gene regulation. ( 0,638246592098185 )
J Integr Bioinform - BacillusRegNet: a transcriptional regulation database and analysis platform for Bacillus species. ( 0,638193098874663 )
Comput. Biol. Med. - Improving protein secondary structure prediction using a multi-modal BP method. ( 0,637888915175192 )
J. Comput. Biol. - In silico prediction of escherichia coli proteins targeting the host cell nucleus, with special reference to their role in colon cancer etiology. ( 0,637010077483334 )
Brief. Bioinformatics - Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes). ( 0,636891383255177 )
Wiley Interdiscip Rev Syst Biol Med - The zebrafish: scalable in vivo modeling for systems biology. ( 0,636575033419405 )
Comput Biol Chem - Self-organizing approach for meta-genomes. ( 0,635516749001833 )
J Integr Bioinform - Prediction of thioredoxin and glutaredoxin target proteins by identifying reversibly oxidized cysteinyl residues. ( 0,635243805630082 )
Wiley Interdiscip Rev Syst Biol Med - Genome network medicine: innovation to overcome huge challenges in cancer therapy. ( 0,634041208889055 )
J. Comput. Biol. - A novel technique for detecting putative horizontal gene transfer in the sequence space. ( 0,633983672909553 )
J Chem Inf Model - Improved helix and kink characterization in membrane proteins allows evaluation of kink sequence predictors. ( 0,633767565682594 )
J Chem Inf Model - Protein secondary structure classification revisited: processing DSSP information with PSSC. ( 0,632271281129023 )
Brief. Bioinformatics - Next generation sequencing in functional genomics. ( 0,628446121342509 )
J. Comput. Biol. - Statistical significance of optical map alignments. ( 0,628072982002177 )
Comput Biol Chem - ISDTool: a computational model for predicting immunosuppressive domain of HERVs. ( 0,627137592392632 )
J. Comput. Biol. - Catching the genomic wave in oligonucleotide single-nucleotide polymorphism arrays by modeling sequence binding. ( 0,624782711846647 )
Comput Biol Chem - A local average connectivity-based method for identifying essential proteins from the network level. ( 0,62445817068272 )
Comput Biol Chem - Bacterial protein structures reveal phylum dependent divergence. ( 0,62445817068272 )
Brief. Bioinformatics - Pattern recognition and probabilistic measures in alignment-free sequence analysis. ( 0,624398234574767 )
Brief. Bioinformatics - Computational challenges of sequence classification in microbiomic data. ( 0,623628312390101 )
Comput Biol Chem - The challenge of annotating protein sequences: The tale of eight domains of unknown function in Pfam. ( 0,623500877476767 )
Comput Biol Chem - Identification and characterization of lysine-methylated sites on histones and non-histone proteins. ( 0,622127562865742 )
Comput Biol Chem - Identification of novel splice variants and exons of human endothelial cell-specific chemotaxic regulator (ECSCR) by bioinformatics analysis. ( 0,621690907053664 )
Sci Data - Long-read, whole-genome shotgun sequence data for five model organisms. ( 0,621411608442473 )
J. Comput. Biol. - Learning cellular sorting pathways using protein interactions and sequence motifs. ( 0,621074728758845 )
Comput. Biol. Med. - Prediction of methylation CpGs and their methylation degrees in human DNA sequences. ( 0,620098898045866 )
Wiley Interdiscip Rev Syst Biol Med - Using a systems biology approach to understand and study the mechanisms of metastasis. ( 0,618573528714679 )
J Integr Bioinform - Efficient online transcription factor binding site adjustment by integrating transitive graph projection with MoRAine 2.0. ( 0,617030904359116 )
J Biomed Inform - Comparative analysis of a novel disease phenotype network based on clinical manifestations. ( 0,616959214649654 )
J. Comput. Biol. - A theoretical model for whole genome alignment. ( 0,616177916628037 )
Med Biol Eng Comput - Ataxin active site determination using spectral distribution of electron ion interaction potentials of amino acids. ( 0,614959004099544 )
Comput Biol Chem - Analysis of the relationships between evolvability, thermodynamics, and the functions of intrinsically disordered proteins/regions. ( 0,614699910953102 )
Comput Biol Chem - Subtle discrepancies of SF2/ASF ESE sequence motif among human tissues: A computational approach. ( 0,614386910207957 )
J. Comput. Biol. - Reconstructing the history of large-scale genomic changes: biological questions and computational challenges. ( 0,614269505765995 )
Brief. Bioinformatics - Taxonomic binning of metagenome samples generated by next-generation sequencing technologies. ( 0,614251758597378 )
Brief. Bioinformatics - Applications of alignment-free methods in epigenomics. ( 0,614072381898386 )
J Chem Inf Model - Parallel and antiparallel β-strands differ in amino acid composition and availability of short constituent sequences. ( 0,612921278056167 )
J. Comput. Biol. - Efficient traversal of beta-sheet protein folding pathways using ensemble models. ( 0,611922716656433 )
Comput Biol Chem - Multi-nucleation and vectorial folding pathways of large helix protein. ( 0,611576898771896 )
J. Comput. Biol. - Vavien: an algorithm for prioritizing candidate disease genes based on topological similarity of proteins in interaction networks. ( 0,611361464733616 )
J Am Med Inform Assoc - Integrated morphologic analysis for the identification and characterization of disease subtypes. ( 0,610586578804609 )
Comput Biol Chem - The frequency of poly(G) tracts in the human genome and their use as a sensor of DNA damage. ( 0,610448305565568 )
Comput Biol Chem - Conserved patterns in bacterial genomes: a conundrum physically tailored by evolutionary tinkering. ( 0,609372058615809 )
J. Comput. Biol. - ComB: SNP calling and mapping analysis for color and nucleotide space platforms. ( 0,608951088359157 )
Wiley Interdiscip Rev Syst Biol Med - Hierarchical approaches for systems modeling in cardiac development. ( 0,608747099282608 )
Comput Biol Chem - A novel feature representation method based on Chou's pseudo amino acid composition for protein structural class prediction. ( 0,607719356102326 )
J Chem Inf Model - Modules identification in protein structures: the topological and geometrical solutions. ( 0,607236275791227 )
Comput Biol Chem - Circular code motifs in transfer and 16S ribosomal RNAs: a possible translation code in genes. ( 0,607132329194873 )