bitscore colors: <40, 40-50 , 50-80, 80-200, >200
BLASTP 2.2.30+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 49,011,213 sequences; 17,563,301,199 total letters Query= Contig-31_CDS_annotation_glimmer3.pl_2_1 Length=174 Score E Sequences producing significant alignments: (Bits) Value gi|494822885|ref|WP_007558293.1| hypothetical protein 144 6e-37 gi|575094354|emb|CDL65742.1| unnamed protein product 141 4e-36 gi|496050829|ref|WP_008775336.1| hypothetical protein 141 5e-36 gi|547226430|ref|WP_021963493.1| putative uncharacterized protein 134 1e-33 gi|490418709|ref|WP_004291032.1| hypothetical protein 129 1e-31 gi|575094321|emb|CDL65708.1| unnamed protein product 85.1 6e-16 gi|565841287|ref|WP_023924568.1| hypothetical protein 82.4 4e-15 gi|494306153|ref|WP_007173049.1| hypothetical protein 79.0 4e-14 gi|517172762|ref|WP_018361580.1| hypothetical protein 78.2 1e-13 gi|494308783|ref|WP_007173938.1| hypothetical protein 77.8 2e-13 >gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius] gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 17135] Length=613 Score = 144 bits (362), Expect = 6e-37, Method: Compositional matrix adjust. Identities = 77/183 (42%), Positives = 103/183 (56%), Gaps = 9/183 (5%) Query 1 MTYNTGSKYCIIMCIYHAIPLLDYAISGQDGQLLVTNIEDLPIPEFDNIGMEAVPVVQLV 60 + +N G +Y I+MC++H +P LDY S +TN+ D PIPEFD IGME VPV++ + Sbjct 431 INFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLDFPIPEFDKIGMEQVPVIRGL 490 Query 61 NSSLLSTNIGKYS---FFGYNPRYYNWKTKIDRVHGAFTTDLKDWVAPIDDSYLQQWLPL 117 N K S +FGY P+YYNWKT +D+ G F LK W+ P DD L + Sbjct 491 NPVKPKDGDFKVSPNLYFGYAPQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLAADSV 550 Query 118 ------TSSQPSLLYPFFKVNPNTLDTIFAVKADSTWATDQLLVNCNIDCKVVRPLSQDG 171 S+ FFKV+P+ LD +FAVKA+S TDQ L + D VVR L +G Sbjct 551 DFPDNPNVEADSVKAGFFKVSPSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSLDPNG 610 Query 172 IPY 174 +PY Sbjct 611 LPY 613 >gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium] Length=615 Score = 141 bits (356), Expect = 4e-36, Method: Compositional matrix adjust. Identities = 76/180 (42%), Positives = 102/180 (57%), Gaps = 6/180 (3%) Query 1 MTYNTGSKYCIIMCIYHAIPLLDYAISGQDGQLLVTNIEDLPIPEFDNIGMEAVPVVQLV 60 + + + +Y IIMCIYH +P++DY SG D + + PIPE D IGME+VP+V+ + Sbjct 436 IRFESKGEYGIIMCIYHVLPIVDYVGSGVDHSCTLVDATSFPIPELDQIGMESVPLVRAM 495 Query 61 NSSLLSTNIGKYSFFGYNPRYYNWKTKIDRVHGAFTTDLKDWVAPIDDSYLQQWLPLT-S 119 N S +F GY PRY +WKT +DR G F L+ W P+ D L L Sbjct 496 NPVKESDTPSADTFLGYAPRYIDWKTSVDRSVGDFADSLRTWCLPVGDKELTSANSLNFP 555 Query 120 SQP-----SLLYPFFKVNPNTLDTIFAVKADSTWATDQLLVNCNIDCKVVRPLSQDGIPY 174 S P S+ FFKVNP+ +D +FAV ADST TD+ L + D KVVR L +G+PY Sbjct 556 SNPNVEPDSIAAGFFKVNPSIVDPLFAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY 615 >gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4] gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4] Length=580 Score = 141 bits (355), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 78/179 (44%), Positives = 104/179 (58%), Gaps = 6/179 (3%) Query 1 MTYNTGSKYCIIMCIYHAIPLLDYAISGQDGQLLVTNIEDLPIPEFDNIGMEAVPVVQLV 60 ++++ G +Y +IMCIYH++PLLDY + N D IPEFD +GME+VP+V L+ Sbjct 403 ISFDAGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVPLVSLM 462 Query 61 NSSLLSTNIGKYSFFGYNPRYYNWKTKIDRVHGAFTTDLKDWVAPIDDSYLQQWL----- 115 N S N+G S GY PRY ++KT +D GAF T LK WV D+ + L Sbjct 463 NPLQSSYNVGS-SILGYAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNYQDD 521 Query 116 PLTSSQPSLLYPFFKVNPNTLDTIFAVKADSTWATDQLLVNCNIDCKVVRPLSQDGIPY 174 P S + Y FKVNPN +D +FAV A ++ TDQ L + D KVVR L DG+PY Sbjct 522 PNNSPGTLVNYTNFKVNPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGLPY 580 >gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185] gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185] Length=573 Score = 134 bits (337), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 74/173 (43%), Positives = 102/173 (59%), Gaps = 6/173 (3%) Query 7 SKYCIIMCIYHAIPLLDYAISGQDGQLLVTNIEDLPIPEFDNIGMEAV-PVVQLVNSSLL 65 S++ IIMCIYH +PLLD++I+ Q T D IPEFD++GM+ + P + L Sbjct 402 SEHGIIMCIYHCLPLLDWSINRIARQNFKTTFTDYAIPEFDSVGMQQLYPSEMIFGLEDL 461 Query 66 STNIGKYSFFGYNPRYYNWKTKIDRVHGAFTTDLKDWVAPIDDSYLQQWLPLTS----SQ 121 ++ + GY PRY + KT ID +HG+F L WV+P+ DSY+ + S Sbjct 462 PSDPSSINM-GYVPRYADLKTSIDEIHGSFIDTLVSWVSPLTDSYISAYRQACKDAGFSD 520 Query 122 PSLLYPFFKVNPNTLDTIFAVKADSTWATDQLLVNCNIDCKVVRPLSQDGIPY 174 ++ Y FFKVNP+ +D IF VKADST TDQLL+N D K VR +G+PY Sbjct 521 ITMTYNFFKVNPHIVDNIFGVKADSTINTDQLLINSYFDIKAVRNFDYNGLPY 573 >gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii] gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 20697] Length=578 Score = 129 bits (323), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 76/186 (41%), Positives = 102/186 (55%), Gaps = 13/186 (7%) Query 1 MTYNTGSKYCIIMCIYHAIPLLDYAISGQDGQLLVTNIEDLPIPEFDNIGMEAVPVVQLV 60 + +N+ +Y +IMCIYH +PLLDY D L N D IPEFD +GM+++P+VQL+ Sbjct 394 INFNSNGRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMPLVQLM 453 Query 61 NSSLLSTNIGKYSFFGYNPRYYNWKTKIDRVHGAFTTDLKDWVAPIDD-SYLQQWL---- 115 N L S GY PRY ++KT +D+ G F L WV + S L+Q Sbjct 454 N-PLRSFANASGLVLGYVPRYIDYKTSVDQSVGGFKRTLNSWVISYGNISVLKQVTLPND 512 Query 116 --PLTSSQP-----SLLYPFFKVNPNTLDTIFAVKADSTWATDQLLVNCNIDCKVVRPLS 168 P+ S+P + + FFKVNP+ LD IFAV+A TDQ L + D K VR L Sbjct 513 APPIEPSEPVPSVAPMNFTFFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLD 572 Query 169 QDGIPY 174 DG+PY Sbjct 573 TDGLPY 578 >gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium] Length=642 Score = 85.1 bits (209), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 58/186 (31%), Positives = 86/186 (46%), Gaps = 22/186 (12%) Query 5 TGSKYCIIMCIYHAIPLLDYAISGQDGQLLVTNIEDLPIPEFDNIGMEAVPVVQLV---- 60 T Y +++ IY P+LD+A G D L T+ D IPE D+IGM+ ++ Sbjct 461 TAKTYGVVIGIYRCTPVLDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEVAAPAP 520 Query 61 -NSSLLSTNIGKYS------FFGYNPRYYNWKTKIDRVHGAFTTDLKDWVAPIDDSYLQQ 113 N + +G S +GY PRY +KT DR +GAF LK WV I+ +Q Sbjct 521 YNDEFKAFRVGDGSSPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAIQN 580 Query 114 -----WLPLTSSQPSLLYPFFKVNPNTLDTIFAVKADSTWATDQLLVNCNIDCKVVRPLS 168 W + + P++ F P+ + +F V + + DQL V C R LS Sbjct 581 NVWNTWAGINA--PNM----FACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLS 634 Query 169 QDGIPY 174 + G+PY Sbjct 635 RYGLPY 640 >gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens] gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens CC14M] Length=656 Score = 82.4 bits (202), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 58/167 (35%), Positives = 87/167 (52%), Gaps = 17/167 (10%) Query 8 KYCIIMCIYHAIPLLDYAISGQDGQLLVTNIEDLPIPEFDNIGMEAVPVVQ-----LVNS 62 ++ +IMCIY P +DY D + ED PEF+N+GM+ PV+Q +NS Sbjct 492 EHGLIMCIYSIAPQVDYDARELDPFNRKFSREDYFQPEFENLGMQ--PVIQSDLCLCINS 549 Query 63 SLLSTNIGKYSFFGYNPRYYNWKTKIDRVHGAFTT--DLKDWVAPIDDSYLQQWLPLTSS 120 + ++ + GY+ RY +KT D + G F + L W P ++Y ++ L S Sbjct 550 AKSDSSDQHNNVLGYSARYLEYKTARDIIFGEFMSGGSLSAWATP-KNNYTFEFGKL--S 606 Query 121 QPSLLYPFFKVNPNTLDTIFAVKADSTWATDQLLVNCNIDCKVVRPL 167 P LL V+P L+ IFAVK + + +TDQ LVN D K +RP+ Sbjct 607 LPDLL-----VDPKVLEPIFAVKYNGSMSTDQFLVNSYFDVKAIRPM 648 >gi|494306153|ref|WP_007173049.1| hypothetical protein [Prevotella bergensis] gi|270333881|gb|EFA44667.1| putative capsid protein (F protein) [Prevotella bergensis DSM 17361] Length=519 Score = 79.0 bits (193), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 54/173 (31%), Positives = 84/173 (49%), Gaps = 19/173 (11%) Query 6 GSKYCIIMCIYHAIPLLDYAISGQDGQLLVTNIEDLPIPEFDNIGMEAVPVVQLVNSSLL 65 ++ ++MCIY +P + Y + D + + D PEF+N+GM Q +NSS + Sbjct 359 AKEHGVLMCIYSLVPQIQYDCTRLDPMVDKLDRFDFFTPEFENLGM------QPLNSSYI 412 Query 66 S---TNIGKYSFFGYNPRYYNWKTKIDRVHGAFTTD--LKDWVAPIDDSYLQQWLPLTSS 120 S T K GY PRY +KT +D HG F + L W S ++W Sbjct 413 SSFCTPDPKNPVLGYQPRYSEYKTALDINHGQFAQNDALSSWSV----SRFRRWTTF--- 465 Query 121 QPSLLYPFFKVNPNTLDTIFAVKADSTWATDQLLVNCNIDCKVVRPLSQDGIP 173 P L FK++P L+++F V+ + T +TD + CN + V +S DG+P Sbjct 466 -PQLEIADFKIDPGCLNSVFPVEFNGTESTDCVFGGCNFNIVKVSDMSVDGMP 517 >gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis] Length=568 Score = 78.2 bits (191), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 52/169 (31%), Positives = 79/169 (47%), Gaps = 10/169 (6%) Query 8 KYCIIMCIYHAIPLLDYAISGQDGQLLVTNIEDLPIPEFDNIGME---AVPVVQLVNSSL 64 ++ I+MCIY +P + Y D + D +PEF+N+GM+ A + N++ Sbjct 405 EHGILMCIYSLVPDVQYDSKRVDPFVQKIERGDFFVPEFENLGMQPLFAKNISYKYNNNT 464 Query 65 LSTNIGKYSFFGYNPRYYNWKTKIDRVHGAFTTDLKDWVAPIDDSYLQQWLPLTSSQPSL 124 ++ I FG+ PRY +KT +D HG F P+ SY S + Sbjct 465 ANSRIKNLGAFGWQPRYSEYKTALDINHGQFVHQ-----EPL--SYWTVARARGESMSNF 517 Query 125 LYPFFKVNPNTLDTIFAVKADSTWATDQLLVNCNIDCKVVRPLSQDGIP 173 FK+NP LD +FAV + T TDQ+ C + V +S DG+P Sbjct 518 NISTFKINPKWLDDVFAVNYNGTELTDQVFGGCYFNIVKVSDMSIDGMP 566 >gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis] gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM 17361] Length=553 Score = 77.8 bits (190), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 54/169 (32%), Positives = 82/169 (49%), Gaps = 15/169 (9%) Query 8 KYCIIMCIYHAIPLLDYAISGQDGQLLVTNIEDLPIPEFDNIGMEAVPVVQLVNSSLLS- 66 ++ ++MCIY +P + Y + D + + D PEF+N+GM Q +NSS +S Sbjct 395 EHGVLMCIYSLVPQIQYDCTRLDPMVDKLDRFDYFTPEFENLGM------QPLNSSYISS 448 Query 67 --TNIGKYSFFGYNPRYYNWKTKIDRVHGAFTTDLKDWVAPIDDSYLQQWLPLTSSQPSL 124 T K GY PRY +KT +D HG F D ++ S ++W P L Sbjct 449 FCTTDPKNPVLGYQPRYSEYKTALDVNHGQFAQ--SDALSSWSVSRFRRWTTF----PQL 502 Query 125 LYPFFKVNPNTLDTIFAVKADSTWATDQLLVNCNIDCKVVRPLSQDGIP 173 FK++P L++IF V + T A D + CN + V +S DG+P Sbjct 503 EIADFKIDPGCLNSIFPVDYNGTEANDCVYGGCNFNIVKVSDMSVDGMP 551 Lambda K H a alpha 0.321 0.139 0.439 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 439831946280