bitscore colors: <40, 40-50, 50-80, 80-200, >200
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-18_CDS_annotation_glimmer3.pl_2_3

Length=680
                                                              Score     E
Sequences producing significant alignments:                   (Bits)  Value

gi|492501782|ref|WP_005867318.1| hypothetical protein         95.1    3e-17
gi|649557305|gb|KDS63784.1| capsid family protein             90.1    1e-16
gi|649569140|gb|KDS75238.1| capsid family protein             90.1    4e-16
gi|547920049|ref|WP_022322420.1| capsid protein VP1           91.7    5e-16
gi|649555287|gb|KDS61824.1| capsid family protein             90.5    1e-15
gi|494610271|ref|WP_007368517.1| capsid protein               81.3    9e-13
gi|647452987|ref|WP_025792807.1| hypothetical protein         77.0    2e-11
gi|599087863|gb|AHN52857.1| major capsid protein              60.5    5e-07
gi|599087807|gb|AHN52829.1| major capsid protein              59.7    9e-07
gi|609718276|emb|CDN73650.1| conserved hypothetical protein   61.2    2e-06

>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
 gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis CL09T03C24]
Length=538

 Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
 Identities = 77/264 (29%), Positives = 126/264 (48%), Gaps = 17/264 (6%)

Query  420  VDVSDGKLTMDALILQKKIFNMLNRVAITDGTYQAWREATYGIRSTTLP-ESPIFCGGMQ  478
            V+V + ++++ L + R A + Y + +G+RS+ + P F GG +
Sbjct  289  VNVDELGVSINDLRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGR  348

Query  479  SEIAFDEIVSNAATDE-EPLGTLAGRGVATMYKSGRGLKIKCTEPCMIMALGSITPRIDY  537
            + I+ E++ +ATD P +AG G++ G K E I+ + SI PR Y
Sbjct  349  TPISVSEVLQTSATDSTSPQANMAGHGISA--GVNHGFKRYFEEHGYIIGIMSIRPRTGY  406

Query  538  SQG-NKWWTRLQNMDDFHKPTLDAIGFQELIaeeaaawSteatGNHELVYQSLGKQPSWI  596
            QG K + + NMD F+ P +G QE+ EE T A+ N + G P +
Sbjct  407  QQGVPKDFRKFDNMD-FYFPEFAHLGEQEIKNEEVYLQQTPASNN-----GTFGYTPRYA  460

Query  597  EYTTDVNETYGDFAAGMPLAFMCLNRVYEENSDHTIANASTYIDPTIYNKIFAESRLSSQ  656
            EY +NE +GDF M AF LNR++ E+ + +T+++ N++FA + S
Sbjct  461  EYKYSMNEVHGDFRGNM--AFWHLNRIFSESPNLN----TTFVECNPSNRVFATAETSDD  514

Query  657  NFWVQVAFDVTARRVMSAKQIPNL  680
            +W+Q+ DV A R+M P L
Sbjct  515  KYWIQLYQDVKALRLMPKYGTPML  538

>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 4]
 gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 6]
Length=245

 Score = 90.1 bits (222), Expect = 1e-16, Method: Compositional matrix adjust.
 Identities = 68/226 (30%), Positives = 109/226 (48%), Gaps = 17/226 (8%)

Query  458  ATYGIRSTTLP-ESPIFCGGMQSEIAFDEIVSNAATDE-EPLGTLAGRGVATMYKSGRGL  515
            + +G+RS+ + P F GG ++ I+ E++ ++TD P +AG G++ G
Sbjct  34   SHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISA--GVNHGF  91

Query  516  KIKCTEPCMIMALGSITPRIDYSQG-NKWWTRLQNMDDFHKPTLDAIGFQELIaeeaaaw  574
            E IM + SI PR Y QG K + + NMD F+ P +G QE+ EE
Sbjct  92   TRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMD-FYFPEFAHLGEQEIKNEELYLN  150

Query  575  SteatGNHELVYQSLGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLNRVYEENSDHTIAN  634
            ++A + G P + EY NE +GDF M AF LNR+++E +
Sbjct  151  ESDAANE-----GTFGYTPRYAEYKYSQNEVHGDFRGNM--AFWHLNRIFKEKPNLN---  200

Query  635  ASTYIDPTIYNKIFAESRLSSQNFWVQVAFDVTARRVMSAKQIPNL  680
            +T+++ N++FA + S +WVQ+ D+ A R+M P L
Sbjct  201  -TTFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLMPKYGTPML  245

>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str. 3999B T(B) 6]
Length=390

 Score = 90.1 bits (222), Expect = 4e-16, Method: Compositional matrix adjust.
 Identities = 68/226 (30%), Positives = 107/226 (47%), Gaps = 17/226 (8%)

Query  458  ATYGIRSTTLP-ESPIFCGGMQSEIAFDEIVSNAATDE-EPLGTLAGRGVATMYKSGRGL  515
            + +G+RS+ + P F GG ++ I+ E++ ++TD P +AG G++ G
Sbjct  179  SHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISA--GVNHGF  236

Query  516  KIKCTEPCMIMALGSITPRIDYSQG-NKWWTRLQNMDDFHKPTLDAIGFQELIaeeaaaw  574
            E IM + SI PR Y QG K + + NMD F+ P +G QE I E
Sbjct  237  TRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMD-FYFPEFAHLGEQE-IKNEELYL  294

Query  575  SteatGNHELVYQSLGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLNRVYEENSDHTIAN  634
            + N + G P + EY NE +GDF M AF LNR+++E +
Sbjct  295  NESDAANE----GTFGYTPRYAEYKYSQNEVHGDFRGNM--AFWHLNRIFKEKPNLN---  345

Query  635  ASTYIDPTIYNKIFAESRLSSQNFWVQVAFDVTARRVMSAKQIPNL  680
            +T+++ N++FA + S +WVQ+ D+ A R+M P L
Sbjct  346  -TTFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLMPKYGTPML  390

>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
 gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553

 Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
 Identities = 76/269 (28%), Positives = 123/269 (46%), Gaps = 17/269 (6%)

Query  415  NAITAVDVSDGKLTMDALILQKKIFNMLNRVAITDGTYQAWREATYGIRSTTLP-ESPIF  473
            N V+V + + ++ L + R A Y + +G+RS+ + P F
Sbjct  299  NGTLKVNVDEMGININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQF  358

Query  474  CGGMQSEIAFDEIVSNAATDE-EPLGTLAGRGVATMYKSGRGLKIKCTEPCMIMALGSIT  532
            GG + I+ E++ ++TDE P +AG G++ +G K E I+ + SIT
Sbjct  359  LGGGRMPISVSEVLQTSSTDETSPQANMAGHGISAGINNG--FKHYFEEHGYIIGIMSIT  416

Query  533  PRIDYSQG-NKWWTRLQNMDDFHKPTLDAIGFQELIaeeaaawSteatGNHELVYQSLGK  591
            PR Y QG + +T+ NMD F+ P + QE+ +E A N + G
Sbjct  417  PRSGYQQGVPRDFTKFDNMD-FYFPEFAHLSEQEIKNQELFVSEDAAYNN-----GTFGY  470

Query  592  QPSWIEYTTDVNETYGDFAAGMPLAFMCLNRVYEENSDHTIANASTYIDPTIYNKIFAES  651
            P + EY +E +GDF L+F LNR++E+ + +T+++ N++FA S
Sbjct  471  TPRYAEYKYHPSEAHGDFRGN--LSFWHLNRIFEDKPNLN----TTFVECKPSNRVFATS  524

Query  652  RLSSQNFWVQVAFDVTARRVMSAKQIPNL  680
            FWVQ+ DV A R+M P L
Sbjct  525  ETEDDKFWVQMYQDVKALRLMPKYGTPML  553

>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 4]
 gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 4]
 gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 4]
 gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 6]
Length=541

 Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
 Identities = 68/234 (29%), Positives = 107/234 (46%), Gaps = 33/234 (14%)

Query  458  ATYGIRSTTLP-ESPIFCGGMQSEIAFDEIVSNAATDE-EPLGTLAGRGVATMYKSGRGL  515
            + +G+RS+ + P F GG ++ I+ E++ ++TD P +AG G++ G
Sbjct  330  SHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISA--GVNHGF  387

Query  516  KIKCTEPCMIMALGSITPRIDYSQG-NKWWTRLQNMDDFHKPTLDAIGFQELIaeeaaaw  574
            E IM + SI PR Y QG K + + NMD F+ P +G QE+
Sbjct  388  TRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMD-FYFPEFAHLGEQEI--------  438

Query  575  SteatGNHELVYQ--------SLGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLNRVYEE  626
            N EL + G P + EY NE +GDF M AF LNR+++E
Sbjct  439  -----KNEELYLNESDAANEGTFGYTPRYAEYKYSQNEVHGDFRGNM--AFWHLNRIFKE  491

Query  627  NSDHTIANASTYIDPTIYNKIFAESRLSSQNFWVQVAFDVTARRVMSAKQIPNL  680
            + +T+++ N++FA + S +WVQ+ D+ A R+M P L
Sbjct  492  KPNLN----TTFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLMPKYGTPML  541

>gi|494610271|ref|WP_007368517.1| capsid protein [Prevotella multiformis]
 gi|324988543|gb|EGC20506.1| putative capsid protein (F protein) [Prevotella multiformis DSM 16608]
Length=531

 Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 137/314 (44%), Gaps = 55/314 (18%)

Query  407  IDGTTGGINAITAVDVSD--GKLTMDALILQKKIFNMLNRVAITDGTYQAWREATYGIRS  464
            + G + IN ++ + V+D +D ++ + N L+ Y + EA +G R
Sbjct  233  VSGASTFINGVSVLSVNDLRAAFALDKMLEATRRANGLD--------YSSQIEAHFGFR-  283

Query  465  TTLPESPI----FCGGMQSEIAFDEIVSNA----ATDEEP-LGTLAGRGVATMYKSGRGL  515
            +PES F GG + + E+V+ + DE P LG L G+GV ++ S
Sbjct  284  --VPESRAGDARFIGGFDNPVVISEVVNQSEFDRGADESPCLGDLGGKGVGSLNSSSIDF  341

Query  516  KIKCTEPCMIMALGSITPRIDYSQGNKW--WTRLQNMDDFHKPTLDAIGFQELIae----  569
            +K E +IM + S+ P+ +Y G + + R +DF +P +G+Q ++
Sbjct  342  DVK--EHGIIMCIYSVVPQTEY-NGTYFDPFNRKLRREDFFQPEFADLGYQPVVTSDLIS  398

Query  570  -------------eaaawSteatGNHELVYQSLGKQPSWIEYTTDVNETYGDFAAGMPLA  616
            + + + E + LG Q + EY T + +G+F +G+ L+
Sbjct  399  TYLDNPVPDGPEKQKRLAAGYPLSSIEANNRLLGWQVRYNEYKTSRDLVFGEFESGLSLS  458

Query  617  FMCLNRVYE-----ENSDHTIAN-----ASTYIDPTIYNKIFAESRLSSQNFWVQVAFDV  666
            + C R Y+ + D + N A Y++P+I N IF S + + +F V FDV
Sbjct  459  YWCSPR-YDFGFDGKAGDKKLVNSPWSPAHFYVNPSILNTIFLVSAVKADHFLVNSFFDV  517

Query  667  TARRVMSAKQIPNL  680
            A R MS + L
Sbjct  518  KAVRPMSVSGLAGL  531

>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584

 Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
 Identities = 73/270 (27%), Positives = 118/270 (44%), Gaps = 50/270 (19%)

Query  452  YQAWREATYGIRSTTLPESPI----FCGGMQSEIAFDEIVS---NAATD--EEPLGTLAG  502
            Y + EA +G + +PES F GG + I E+VS NAA+D +G L G
Sbjct  324  YASQIEAHFGFK---VPESRANDARFLGGFDNSIVVSEVVSTNGNAASDGSHASIGDLGG  380

Query  503  RGVATMYKSGRGLKIKCTEPCMIMALGSITPRIDYSQG-----NKWWTRLQNMDDFHKPT  557
            +G+ +M S ++ TE +IM + S+ P+ +Y+ N+ TR Q F++P
Sbjct  381  KGIGSM--SSGTIEFDSTEHGIIMCIYSVAPQSEYNASYLDPFNRKLTREQ----FYQPE  434

Query  558  LDAIGFQELI------aeeaaawSteatGNHELVYQSLGKQPSWIEYTTDVNETYGDFAA  611
            +G+Q LI + + EL LG Q + EY T + +GDF +
Sbjct  435  FADLGYQALIGSDLICSTLGMNEKQAGFSDIELNNNLLGYQVRYNEYKTARDLVFGDFES  494

Query  612  GMPLAFMCLNRV--------------------YEENSDHTI-ANASTYIDPTIYNKIFAE  650
            G L++ C R Y + + + ++ + YI+P + N IF
Sbjct  495  GKSLSYWCTPRFDFGYGDTEKKIAPENKGGADYRKKGNRSHWSSRNFYINPNLVNPIFLT  554

Query  651  SRLSSQNFWVQVAFDVTARRVMSAKQIPNL  680
            S + + +F V DV A R MS + +L
Sbjct  555  SAVQADHFIVNSFLDVKAVRPMSVTGLSSL  584

>gi|599087863|gb|AHN52857.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=219

 Score = 60.5 bits (145), Expect = 5e-07, Method: Compositional matrix adjust.
 Identities = 46/167 (28%), Positives = 80/167 (48%), Gaps = 9/167 (5%)

Query  407  IDGTTGGINAITAVDVSDG-KLTMDALILQKKIFNMLNRVAITDGTYQAWREATYGIRST  465
            + G T ++ + D+++ T++ L +I ML R A Y ++ +G+ S
Sbjct  51   VSGDTSAVSNVMYADLTEATAATINQLRQAFQIQKMLERDARGGTRYTEIIKSHFGVTSP  110

Query  466  TLP-ESPIFCGGMQSEIAFDEIVSNAATDEE---PLGTLAGRGVATMYKSGRGLKIKCTE  521
            + P + GG + + + + + TD++ P GTLA G A + G G TE
Sbjct  111  DARLQRPEYLGGGSTPVIINPVAQTSGTDQQSDTPQGTLAAIGTAQV--RGHGFTKSFTE  168

Query  522  PCMIMALGSITPRIDYSQG-NKWWTRLQNMDDFHKPTLDAIGFQELI  567
            C+I+ L S+ + Y QG N+ W R Q D++ P L +G QE++
Sbjct  169  HCIILGLVSVRADLTYQQGLNRMWNR-QTRYDYYFPALSHLGEQEIL  214

>gi|599087807|gb|AHN52829.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=224

 Score = 59.7 bits (143), Expect = 9e-07, Method: Compositional matrix adjust.
 Identities = 48/145 (33%), Positives = 71/145 (49%), Gaps = 8/145 (6%)

Query  428  TMDALILQKKIFNMLNRVAITDGTYQAWREATYGIRSTTLP-ESPIFCGGMQSEIAFDEI  486
            T++AL ++ +L R A Y +A +G+ S + P + GG S + I
Sbjct  78   TINALRTGFQVQRLLERDARGGTRYTEVIKAHFGVTSPDARLQRPEYLGGGSSPVNITPI  137

Query  487  VSNAATD---EEPLGTLAGRGVATMYKSGRGLKIKCTEPCMIMALGSITPRIDYSQG-NK  542
            S TD EPLGTLAG V T + S G TE C+I+ L ++ + Y QG N+
Sbjct  138  GSTVPTDLDPGEPLGTLAG--VGTAHISNHGFTKSFTEHCVIIGLVNVRADLTYQQGLNR  195

Query  543  WWTRLQNMDDFHKPTLDAIGFQELI  567
            W+R Q D++ P L IG Q ++
Sbjct  196  MWSR-QTRYDYYWPALSHIGEQGVL  219

>gi|609718276|emb|CDN73650.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=537

 Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
 Identities = 53/206 (26%), Positives = 90/206 (44%), Gaps = 10/206 (5%)

Query  469  ESPIFCGGMQSEIAFDEIVSNAATDE-EPLGTLAGRGVATMYKSGRGLKIKCTEPCMIMA  527
            + P F GG +S I E++ +ATD P G +AG G+ + K G G E ++
Sbjct  332  QRPEFLGGNKSPIMISEVLQQSATDSTTPQGNMAGHGIG-IGKDG-GFSRFFEEHGYVIG  389

Query  528  LGSITPRIDYSQGNKWWTRLQNMDDFHKPTLDAIGFQELIaeeaaawSteatGNHELVYQ  587
            L S+ P+ YSQG + D+ P + IG ++ + + + E V+
Sbjct  390  LMSVIPKTSYSQGIPRHFSKSDKFDYFWPQFEHIG-EQPVYNKEIFAKNIDAFDSEAVF-  447

Query  588  SLGKQPSWIEYTTDVNETYGDFAAGMPLAFMCLNRVYEENSDHTIANASTYIDPTIYNKI  647
            G P + EY + +GDF L F L R+++ + + + D ++I
Sbjct  448  --GYLPRYSEYKFSPSTVHGDFKDD--LYFWHLGRIFDTDKPPVLNQSFIECDKNALSRI  503

Query  648  FAESRLSSQNFWVQVAFDVTARRVMS  673
            FA + F+ + +TA+R MS
Sbjct  504  FAVED-DTDKFYCHLYQKITAKRKMS  528

Lambda      K        H        a         alpha
   0.317    0.133    0.399    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6

Effective search space used: 5232445671600