Bit score colors: <40, 40-50, 50-80, 80-200, >200
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-34_CDS_annotation_glimmer3.pl_2_1

Length=299

                                                                    Score   E
Sequences producing significant alignments:                         (Bits)  Value

gi|575094321|emb|CDL65708.1|  unnamed protein product               246     1e-72
gi|547226430|ref|WP_021963493.1|  putative uncharacterized protein  155     1e-39
gi|490418709|ref|WP_004291032.1|  hypothetical protein              119     1e-26
gi|575094354|emb|CDL65742.1|  unnamed protein product               119     1e-26
gi|496050829|ref|WP_008775336.1|  hypothetical protein              116     1e-25
gi|494822885|ref|WP_007558293.1|  hypothetical protein              104     1e-21
gi|647452987|ref|WP_025792807.1|  hypothetical protein              93.6    6e-18
gi|565841287|ref|WP_023924568.1|  hypothetical protein              88.6    3e-16
gi|649557305|gb|KDS63784.1|  capsid family protein                  76.6    3e-13
gi|575094297|emb|CDL65693.1|  unnamed protein product               78.2    6e-13

>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642

 Score = 246 bits (628), Expect = 1e-72, Method: Compositional matrix adjust.
 Identities = 135/292 (46%), Positives = 179/292 (61%), Gaps = 17/292 (6%)

Query  16   FSGKLTADASMK----ISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIKPKVDSRVSV  71
            FSG + + S+ I ALR+A A QKYKEIQ +ND DF +QV AHFGIKP + S+
Sbjct  360  FSGNVAINTSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQVEAHFGIKPDEKNENSL  419

Query  72   FIGGDDKTLSINPQVNTNFQNGGEPEIKAIGIGDLSAGCKFTSTTYGMIIGIYRAIPQLD  131
            FIGG ++IN Q+N N + A G+ SA KFT+ TYG++IGIYR P LD
Sbjct  420  FIGGSSSMININEQINQNLSGDNKATYGAAPQGNGSASIKFTAKTYGVVIGIYRCTPVLD  479

Query  132  YSHVGIDRNLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPY--GYQTSPLD  189
            ++H+GIDR LFKTDASDF IPE+DSIGMQ +RCE++AP ND + G +SP D
Sbjct  480  FAHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEVAAP-APYNDEFKAFRVGDGSSP-D  537

Query  190  MSVTYGYSPRYAELKSARDYYEGGFCGTYSTWVTG--YDESFLSGWRRNRGSVSVTDYDS  247
            MS TYGY+PRY+E K++ D Y G FC + +WVTG +D + W G +
Sbjct  538  MSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAIQNNVWNTWAGI-------N  590

Query  248  IEDLFKCRASLLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGLPYSK  299
            ++F CR ++ +F+ + +DD+L +G VN C A R S YGLPYS
Sbjct  591  APNMFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLPYSN  642

>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
 gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573

 Score = 155 bits (393), Expect = 1e-39, Method: Compositional matrix adjust.
 Identities = 99/277 (36%), Positives = 142/277 (51%), Gaps = 15/277 (5%)

Query  24   ASMKISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIKPKVD-SRVSVFIGGDDKTLSI  82
            A + + ALR A LQK++EI S D+ Q+ HF + P S ++GG L I
Sbjct  309  AGLSVLALRQAECLQKWREIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDI  368

Query  83   NPQVNTNFQNGGEPEIKAIGIGDLSAG-CKFTSTTYGMIIGIYRAIPQLDYSHVGIDRNL  141
            + VNTN + +I+ G G L+ F S+ +G+I+ IY +P LD+S I R
Sbjct  369  SEVVNTNLTGDNQADIQGKGTGTLNGNKVDFESSEHGIIMCIYHCLPLLDWSINRIARQN  428

Query  142  FKTDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPYGYQTSPLD-MSVTYGYSPRY  200
            FKT +D+ IPE DS+GMQ Y E+ + GL DL P D S+ GY PRY
Sbjct  429  FKTTFTDYAIPEFDSVGMQQLYPSEM---IFGLEDL---------PSDPSSINMGYVPRY  476

Query  201  AELKSARDYYEGGFCGTYSTWVTGYDESFLSGWRRNRGSVSVTDYDSIEDLFKCRASLLY  260
            A+LK++ D G F T +WV+ +S++S +R+ +D + FK ++
Sbjct  477  ADLKTSIDEIHGSFIDTLVSWVSPLTDSYISAYRQACKDAGFSDITMTYNFFKVNPHIVD  536

Query  261  PIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGLPY  297
            IF + T+N D+LLI S AVR F GLPY
Sbjct  537  NIFGVKADSTINTDQLLINSYFDIKAVRNFDYNGLPY  573

>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
 gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 20697]
Length=578

 Score = 119 bits (298), Expect = 1e-26, Method: Compositional matrix adjust.
 Identities = 88/294 (30%), Positives = 126/294 (43%), Gaps = 46/294 (16%)

Query  26   MKISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIKPKVD-SRVSVFIGGDDKTLSINP  84
            + I LR A LQK+KEI S + D+ Q+ H+G+ S + ++GG ++ IN
Sbjct  309  LSILVLRQAEFLQKWKEITQSGNKDYKDQLEKHWGVSVGDGFSELCTYLGGVSSSIDINE  368

Query  85   QVNTNFQNGGEPEIKAIGIGDLSAGCKFTST-TYGMIIGIYRAIPQLDYSHVGIDRNLFK  143
            +NTN +I G+G + F S YG+I+ IY +P LDY+ +D K
Sbjct  369  VINTNITGSAAADIAGKGVGVANGEINFNSNGRYGLIMCIYHCLPLLDYTTDMLDPAFLK  428

Query  144  TDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPYGYQTSPLDMSVTYGYSPRYAEL  203
            +++D+ IPE D +GMQ+ +L PL + + GY PRY +
Sbjct  429  VNSTDYAIPEFDRVGMQSMPLVQLMNPLRSFANA------------SGLVLGYVPRYIDY  476

Query  204  KSARDYYEGGFCGTYSTWVTGYDESFLSGWRRNRGSVSV-------TDYDSIE-------  249
            K++ D GGF T ++WV Y G++SV D IE
Sbjct  477  KTSVDQSVGGFKRTLNSWVISY------------GNISVLKQVTLPNDAPPIEPSEPVPS  524

Query  250  ------DLFKCRASLLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGLPY  297
            FK L PIF Q N D+ L S AVR GLPY
Sbjct  525  VAPMNFTFFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY  578

>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615

 Score = 119 bits (298), Expect = 1e-26, Method: Compositional matrix adjust.
 Identities = 90/276 (33%), Positives = 129/276 (47%), Gaps = 15/276 (5%)

Query  26   MKISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIK-PKVDSRVSVFIGGDDKTLSINP  84
            + I ALR A LQK+KE+ S + D+ +Q+ H+GIK S + ++GG +L IN
Sbjct  351  VPILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARYLGGCATSLDINE  410

Query  85   QVNTNFQNGGEPEIKAIGIGDLSAGCKFTST-TYGMIIGIYRAIPQLDYSHVGIDRNLFK  143
            +N N +I G + +F S YG+I+ IY +P +DY G+D +
Sbjct  411  VINNNITGDNAADIAGKGTFTGNGSIRFESKGEYGIIMCIYHVLPIVDYVGSGVDHSCTL  470

Query  144  TDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPYGYQTSPLDMSVTYGYSPRYAEL  203
            DA+ FPIPELD IGM+ S PL+ + P +P GY+PRY +
Sbjct  471  VDATSFPIPELDQIGME-------SVPLV---RAMNPVKESDTP-SADTFLGYAPRYIDW  519

Query  204  KSARDYYEGGFCGTYSTWVTGY-DESFLSGWRRNRGSVSVTDYDSI-EDLFKCRASLLYP  261
            K++ D G F + TW D+ S N S + DSI FK S++ P
Sbjct  520  KTSVDRSVGDFADSLRTWCLPVGDKELTSANSLNFPSNPNVEPDSIAAGFFKVNPSIVDP  579

Query  262  IFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGLPY  297
            +F TV D+ L S VR + GLPY
Sbjct  580  LFAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY  615

>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580

 Score = 116 bits (290), Expect = 1e-25, Method: Compositional matrix adjust.
 Identities = 93/302 (31%), Positives = 140/302 (46%), Gaps = 33/302 (11%)

Query  6    LASTG--LQLVGFSGKLTADASMKISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIKP  63
            +STG LQ V SG T + ALR A LQK+KEI S + D+ Q+ H+ +
Sbjct  302  FSSTGVNLQTVNGSGTFT------VLALRQAEFLQKWKEITQSGNKDYKDQIEKHWNVSV  355

Query  64   -KVDSRVSVFIGGDDKTLSINPQVNTNFQNGGEPEIKAIGIGDLSAGCKFTS-TTYGMII  121
            + S +S+++GG +L IN VN N +I G+ + F + YG+I+
Sbjct  356  GEAYSEMSLYLGGTTASLDINEVVNNNITGSNAADIAGKGVVVGNGRISFDAGERYGLIM  415

Query  122  GIYRAIPQLDYSHVGIDRNLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGL-NDLLVP  180
            IY ++P LDY+ ++ K +++DF IPE D +GM+ S PL+ L N L
Sbjct  416  CIYHSLPLLDYTTDLVNPAFTKINSTDFAIPEFDRVGME-------SVPLVSLMNPLQSS  468

Query  181  YGYQTSPLDMSVTYGYSPRYAELKSARDYYEGGFCGTYSTWVTGYDESFLSGWRR-----  235
            Y +S L GY+PRY K+ D G F T +WV YD +
Sbjct  469  YNVGSSIL------GYAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNYQDDP  522

Query  236  NRGSVSVTDYDSIEDLFKCRASLLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGL  295
            N ++ +Y + FK + + P+F S +++ D+ L S VR GL
Sbjct  523  NNSPGTLVNYTN----FKVNPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGL  578

Query  296  PY  297
            PY
Sbjct  579  PY  580

>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
 gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 17135]
Length=613

 Score = 104 bits (260), Expect = 1e-21, Method: Compositional matrix adjust.
 Identities = 88/307 (29%), Positives = 141/307 (46%), Gaps = 29/307 (9%)

Query  2    HLLKLASTGLQLVGFSGKLTADASMKIS--ALRSATALQKYKEIQNSNDPDFAAQVLAHF  59
            +L+ +T L+ + D+S +S ALR A A QK+KE+ +++ D+ +Q+ AH+
Sbjct  325  RILRFNNTNSGLI-----VEGDSSFGVSILALRRAEAAQKWKEVALASEEDYPSQIEAHW  379

Query  60   GIK-PKVDSRVSVFIGGDDKTLSINPQVNTNFQNGGEPEIKAIGIGDLSAGCKF-TSTTY  117
            G K S + ++G + LSIN VN N +I G + F Y
Sbjct  380  GQSVNKAYSDMCQWLGSINIDLSINEVVNNNITGENAADIAGKGTMSGNGSINFNVGGQY  439

Query  118  GMIIGIYRAIPQLDYSHVGIDRNLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGLNDL  177
            G+++ ++ +PQLDY T+ DFPIPE D IGM E + GLN +
Sbjct  440  GIVMCVFHVLPQLDYITSAPHFGTTLTNVLDFPIPEFDKIGM------EQVPVIRGLNPV  493

Query  178  LVPYG-YQTSPLDMSVTYGYSPRYAELKSARDYYEGGFCGTYSTWVTGYDESFLSGWRRN  236
            G ++ SP ++ +GY+P+Y K+ D G F + TW+ +D+ L
Sbjct  494  KPKDGDFKVSP---NLYFGYAPQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLA----  546

Query  237  RGSVSVTDYDSIE------DLFKCRASLLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPF  290
            SV D ++E FK S+L +F + + +N D+ L ++ VR
Sbjct  547  ADSVDFPDNPNVEADSVKAGFFKVSPSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSL  606

Query  291  SMYGLPY  297
            GLPY
Sbjct  607  DPNGLPY  613

>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584

 Score = 93.6 bits (231), Expect = 6e-18, Method: Compositional matrix adjust.
 Identities = 86/292 (29%), Positives = 137/292 (47%), Gaps = 26/292 (9%)

Query  24   ASMKISALRSATALQKYKE-IQNSNDPDFAAQVLAHFGIK-PKVDSRVSVFIGGDDKTLS  81
            +S ++ LR+A AL K E + +N D+A+Q+ AHFG K P+ + + F+GG D ++
Sbjct  296  SSFSVNDLRAAFALDKMLEATRRANGLDYASQIEAHFGFKVPESRANDARFLGGFDNSIV  355

Query  82   INPQVNTNFQNGGEPEIKAIG------IGDLSAGC-KFTSTTYGMIIGIYRAIPQLDYSH  134
            ++ V+TN + +IG IG +S+G +F ST +G+I+ IY PQ +Y+
Sbjct  356  VSEVVSTNGNAASDGSHASIGDLGGKGIGSMSSGTIEFDSTEHGIIMCIYSVAPQSEYNA  415

Query  135  VGIDRNLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPYGYQTSPLDMSVTY  194
            +D K F PE +G Q +L +G+N+ G+ L+ ++
Sbjct  416  SYLDPFNRKLTREQFYQPEFADLGYQALIGSDLICSTLGMNEKQA--GFSDIELNNNLL-  472

Query  195  GYSPRYAELKSARDYYEGGFCG--TYSTWVT-----GYDESFLSGWRRNRGSVSVTDYDS  247
            GY RY E K+ARD G F + S W T GY ++ N+G +
Sbjct  473  GYQVRYNEYKTARDLVFGDFESGKSLSYWCTPRFDFGYGDTEKKIAPENKGGADYRKKGN  532

Query  248  IEDL----FKCRASLLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGL  295
            F +L+ PIF+ + V D ++ S AVRP S+ GL
Sbjct  533  RSHWSSRNFYINPNLVNPIFL---TSAVQADHFIVNSFLDVKAVRPMSVTGL  581

>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
 gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens CC14M]
Length=656

 Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust.
 Identities = 78/277 (28%), Positives = 127/277 (46%), Gaps = 37/277 (13%)

Query  31   LRSATALQKYKE-IQNSNDPDFAAQVLAHFGIK-PKVDSRVSVFIGGDDKTLSINPQVNT  88
            +R+ AL+K E + +N D++ Q+ AHFG K P+ + FIGG D +SI+ V T
Sbjct  396  IRAMFALEKMLERTRAANGLDYSNQIAAHFGFKVPESRKNCASFIGGFDNQISISEVVTT  455

Query  89   NFQNGGEP----------EIKAIGIGDLSAG-CKFTSTTYGMIIGIYRAIPQLDYSHVGI  137
            + NG ++ GIG +++G + +G+I+ IY PQ+DY +
Sbjct  456  S--NGSVDGTASTGSVVGQVFGKGIGAMNSGHISYDVKEHGLIMCIYSIAPQVDYDAREL  513

Query  138  DRNLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPYGYQTSPLDMSVTYGYS  197
            D K D+ PE +++GMQ + +L L + S + GYS
Sbjct  514  DPFNRKFSREDYFQPEFENLGMQPVIQSDLC--------LCINSAKSDSSDQHNNVLGYS  565

Query  198  PRYAELKSARDYYEGGFC--GTYSTWVTGYDESFLSGWRRNRGSVSVTDYDSIEDLFKCR  255
            RY E K+ARD G F G+ S W T + + G +S+ D
Sbjct  566  ARYLEYKTARDIIFGEFMSGGSLSAWATPKN-----NYTFEFGKLSLPD-------LLVD  613

Query  256  ASLLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSM  292
            +L PIF +++G+++ D+ L+ S A+RP +
Sbjct  614  PKVLEPIFAVKYNGSMSTDQFLVNSYFDVKAIRPMQV  650

>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 4]
 gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 6]
Length=245

 Score = 76.6 bits (187), Expect = 3e-13, Method: Compositional matrix adjust.
 Identities = 58/202 (29%), Positives = 97/202 (48%), Gaps = 23/202 (11%)

Query  26   MKISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIKPKVDSRVS--VFIGGDDKTLSIN  83
            + I+ +R++ ALQ++ E + + Q+L+HFG++ D+R+ F+GG +S++
Sbjct  3    VNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSS-DARLQRPQFLGGGRTPISVS  61

Query  84   PQVNTNFQNGGEPEIKAIGIGDLSAGCKFTSTTY----GMIIGIYRAIPQLDYSHVGIDR  139
            + T+ + P+ G G +SAG T Y G I+GI P+ Y G+ +
Sbjct  62   EVLQTSSTDSTSPQANMAGHG-ISAGVNHGFTRYFEEHGYIMGIMSIRPRTGYQQ-GVPK  119

Query  140  NLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGLNDLLVPYGYQTSPLDMSVTYGYSPR  199
            + K D DF PE +G Q E+ + LN+ S T+GY+PR
Sbjct  120  DFRKFDNMDFYFPEFAHLGEQ-----EIKNEELYLNE---------SDAANEGTFGYTPR  165

Query  200  YAELKSARDYYEGGFCGTYSTW  221
            YAE K +++ G F G + W
Sbjct  166  YAEYKYSQNEVHGDFRGNMAFW  187

>gi|575094297|emb|CDL65693.1| unnamed protein product [uncultured bacterium]
Length=630

 Score = 78.2 bits (191), Expect = 6e-13, Method: Compositional matrix adjust.
 Identities = 82/292 (28%), Positives = 120/292 (41%), Gaps = 26/292 (9%)

Query  25   SMKISALRSATALQKYKEIQNSNDPDFAAQVLAHFGIK-PKVDSRVSVFIGGDDKTLSIN  83
            ++ +S LR+ A K I + AQ LAHFG K P+ S ++GG + L I+
Sbjct  343  TLNVSQLRALYATDKLLRITQFAGKHYDAQTLAHFGKKVPQGVSGEVYYLGGQSQRLQIS  402

Query  84   PQVNTNFQNG----------GEPEIKAIGIGDLSAGCKFTSTTYGMIIGIYRAIPQLDYS  133
            P T +G GE +A + F + +G+++ IY A+P+ +YS
Sbjct  403  PI--TALSSGQTSDGSDTVFGEQGARAASVTQGQKPFTFEAPCHGILMAIYSAVPEANYS  460

Query  134  HVGIDRNLFKTDASDFPIPELDSIGMQTQYRCELSAPLMGL-NDLLVPYGYQTSPLDMSV  192
            IDR ++DF PELD+IGM Y E S P L + PY S D +
Sbjct  461  CDAIDRINTLAYSNDFYKPELDNIGMSPLYSYEFSVPGYTLFRNPPTPY----SSDDAAQ  516

Query  193  TYGYSPRYAELKSARDYYEGGFCGTYSTWVTGYDESFLSGWRR----NRGSVSVTDYDSI  248
            + G+ RY+ K+ D G T +W D L R N S+ +
Sbjct  517  SLGWQFRYSWFKTKVDRTCGALNRTLRSWCPKRDYLALGLQSRPQLFNYASLYYVSPSYL  576

Query  249  EDLFKCRAS----LLYPIFVNQWSGTVNDDKLLIGSVNTCVAVRPFSMYGLP  296
            + LF + YP+ + S D L+ TC S YGLP
Sbjct  577  DGLFYLNFAPPIDYRYPVDSDMSSIAFETDPLIHDMQITCYKTSVMSTYGLP  628

Lambda      K        H        a         alpha
   0.318    0.137    0.409    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6

Effective search space used: 1574515238976