bitscore colors: <40, 40-50, 50-80, 80-200, >200
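The legend above assigns each hit to one of five bit-score ranges. A minimal sketch of that binning follows; the names `EDGES`, `LABELS` and `score_bin` are hypothetical, and the legend does not say which range a score of exactly 40, 50, 80 or 200 belongs to, so placing it in the higher range is an assumption.

```python
from bisect import bisect_right

# Bin edges copied from the legend. Assigning a score that sits exactly on
# an edge (40, 50, 80, 200) to the higher range is an assumption.
EDGES = [40, 50, 80, 200]
LABELS = ["<40", "40-50", "50-80", "80-200", ">200"]


def score_bin(bits: float) -> str:
    """Return the legend range that a bit score falls into."""
    return LABELS[bisect_right(EDGES, bits)]


if __name__ == "__main__":
    # Bit scores of the best, a middle, and the weakest hit in this report.
    for bits in (112, 72.4, 50.1):
        print(bits, score_bin(bits))
```

With these edges, the five strongest hits in this report (94.7 to 112 bits) land in the 80-200 range and the remaining five (50.1 to 72.4 bits) in the 50-80 range.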
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-9_CDS_annotation_glimmer3.pl_2_7
Length=157

                                                                    Score   E
Sequences producing significant alignments:                        (Bits)  Value

gi|547226430|ref|WP_021963493.1| putative uncharacterized protein   112     6e-26
gi|575094354|emb|CDL65742.1| unnamed protein product                111     2e-25
gi|496050829|ref|WP_008775336.1| hypothetical protein               110     2e-25
gi|490418709|ref|WP_004291032.1| hypothetical protein               101     4e-22
gi|494822885|ref|WP_007558293.1| hypothetical protein               94.7    1e-19
gi|575094321|emb|CDL65708.1| unnamed protein product                72.4    8e-12
gi|565841287|ref|WP_023924568.1| hypothetical protein               65.5    1e-09
gi|517172762|ref|WP_018361580.1| hypothetical protein               56.2    2e-06
gi|494610271|ref|WP_007368517.1| capsid protein                     51.6    5e-05
gi|496521299|ref|WP_009229582.1| capsid protein                     50.1    2e-04

>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
 gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573

 Score = 112 bits (280), Expect = 6e-26, Method: Compositional matrix adjust.
 Identities = 62/161 (39%), Positives = 95/161 (59%), Gaps = 8/161 (5%)

Query  1    LDYQLTGPDLQLLNTYATDLPQPELDNLGLEAL--PYFTFVNDAVATQPNNVTVKSIIGY  58
            LD+ + Q T TD PE D++G++ L F + + + P+++ +GY
Sbjct  417  LDWSINRIARQNFKTTFTDYAIPEFDSVGMQQLYPSEMIFGLEDLPSDPSSIN----MGY  472

Query  59   VPRYIAYKTDIDCVDGAFLTSLTSWVTPLTIDEIVT--KISLGSGTGPFTPNYGLFKVSP  116
            VPRY KT ID + G+F+ +L SWV+PLT I + +G T Y FKV+P
Sbjct  473  VPRYADLKTSIDEIHGSFIDTLVSWVSPLTDSYISAYRQACKDAGFSDITMTYNFFKVNP  532

Query  117  YVLDSIFVSQCDSTVDTDQFLVESFFDVKLVQNLDYNGMPY  157
            +++D+IF + DST++TDQ L+ S+FD+K V+N DYNG+PY
Sbjct  533  HIVDNIFGVKADSTINTDQLLINSYFDIKAVRNFDYNGLPY  573

>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615

 Score = 111 bits (277), Expect = 2e-25, Method: Compositional matrix adjust.
 Identities = 65/164 (40%), Positives = 91/164 (55%), Gaps = 12/164 (7%)

Query  1    LDYQLTGPDLQLLNTYATDLPQPELDNLGLEALPYFTFVNDAVATQPNNVTVKSIIGYVP  60
            +DY +G D AT P PELD +G+E++P +N + + + + +GY P
Sbjct  457  VDYVGSGVDHSCTLVDATSFPIPELDQIGMESVPLVRAMNPV--KESDTPSADTFLGYAP  514

Query  61   RYIAYKTDIDCVDGAFLTSLTSWVTPLTIDEIVTKISLGSGTGPFTPN-------YGLFK  113
            RYI +KT +D G F SL +W P+ E+ + SL P PN G FK
Sbjct  515  RYIDWKTSVDRSVGDFADSLRTWCLPVGDKELTSANSLNF---PSNPNVEPDSIAAGFFK  571

Query  114  VSPYVLDSIFVSQCDSTVDTDQFLVESFFDVKLVQNLDYNGMPY  157
            V+P ++D +F DSTV TD+FL SFFDVK+V+NLD NG+PY
Sbjct  572  VNPSIVDPLFAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY  615

>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580

 Score = 110 bits (276), Expect = 2e-25, Method: Compositional matrix adjust.
 Identities = 66/162 (41%), Positives = 94/162 (58%), Gaps = 10/162 (6%)

Query  1    LDY--QLTGPDLQLLNTYATDLPQPELDNLGLEALPYFTFVNDAVATQPNNVTVKSIIGY  58
            LDY L P +N+ TD PE D +G+E++P + +N Q + SI+GY
Sbjct  424  LDYTTDLVNPAFTKINS--TDFAIPEFDRVGMESVPLVSLMN---PLQSSYNVGSSILGY  478

Query  59   VPRYIAYKTDIDCVDGAFLTSLTSWVTPLTIDEIVTKISLGS--GTGPFT-PNYGLFKVS  115
            PRYI+YKTD+D GAF T+L SWV ++ +++ P T NY FKV+
Sbjct  479  APRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNYQDDPNNSPGTLVNYTNFKVN  538

Query  116  PYVLDSIFVSQCDSTVDTDQFLVESFFDVKLVQNLDYNGMPY  157
            P +D +F +++DTDQFL SFFDVK+V+NLD +G+PY
Sbjct  539  PNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGLPY  580

>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
 gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 20697]
Length=578

 Score = 101 bits (252), Expect = 4e-22, Method: Compositional matrix adjust.
 Identities = 61/167 (37%), Positives = 88/167 (53%), Gaps = 13/167 (8%)

Query  1    LDYQLTGPDLQLLNTYATDLPQPELDNLGLEALPYFTFVNDAVATQPNNVTVKSIIGYVP  60
            LDY D L +TD PE D +G++++P +N + + N + ++GYVP
Sbjct  415  LDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMPLVQLMN-PLRSFANASGL--VLGYVP  471

Query  61   RYIAYKTDIDCVDGAFLTSLTSWVTPLTIDEIVTKISLGSGTGPFTP----------NYG  110
            RYI YKT +D G F +L SWV ++ +++L + P P N+
Sbjct  472  RYIDYKTSVDQSVGGFKRTLNSWVISYGNISVLKQVTLPNDAPPIEPSEPVPSVAPMNFT  531

Query  111  LFKVSPYVLDSIFVSQCDSTVDTDQFLVESFFDVKLVQNLDYNGMPY  157
            FKV+P LD IF Q +TDQFL SFFD+K V+NLD +G+PY
Sbjct  532  FFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY  578

>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
 gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 17135]
Length=613

 Score = 94.7 bits (234), Expect = 1e-19, Method: Compositional matrix adjust.
 Identities = 58/165 (35%), Positives = 85/165 (52%), Gaps = 11/165 (7%)

Query  1    LDYQLTGPDLQLLNTYATDLPQPELDNLGLEALPYFTFVNDAVATQPN-NVTVKSIIGYV  59
            LDY + P T D P PE D +G+E +P +N + V+ GY
Sbjct  452  LDYITSAPHFGTTLTNVLDFPIPEFDKIGMEQVPVIRGLNPVKPKDGDFKVSPNLYFGYA  511

Query  60   PRYIAYKTDIDCVDGAFLTSLTSWVTPLTIDEIVTKISLGSGTGPFTPN-------YGLF  112
            P+Y +KT +D G F SL +W+ P + ++ S+ P PN G F
Sbjct  512  PQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLAADSVDF---PDNPNVEADSVKAGFF  568

Query  113  KVSPYVLDSIFVSQCDSTVDTDQFLVESFFDVKLVQNLDYNGMPY  157
            KVSP VLD++F + +S ++TDQFL + FDV +V++LD NG+PY
Sbjct  569  KVSPSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSLDPNGLPY  613

>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642

 Score = 72.4 bits (176), Expect = 8e-12, Method: Compositional matrix adjust.
 Identities = 52/170 (31%), Positives = 75/170 (44%), Gaps = 20/170 (12%)

Query  1    LDYQLTGPDLQLLNTYATDLPQPELDNLGLEALPYFTFVNDAVATQPNNVTVKSI-----  55
            LD+ G D L T A+D PE+D++G++ TF + A P N K+
Sbjct  478  LDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQ----TFRCEVAAPAPYNDEFKAFRVGDG  533

Query  56   --------IGYVPRYIAYKTDIDCVDGAFLTSLTSWVTPLTIDEIVTKISLGSGTGPFTP  107
            GY PRY +KT D +GAF SL SWVT + D I + + G P
Sbjct  534  SSPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAIQNNV-WNTWAGINAP  592

Query  108  NYGLFKVSPYVLDSIFVSQCDSTVDTDQFLVESFFDVKLVQNLDYNGMPY  157
            N +F P ++ ++F+ + D DQ V +NL G+PY
Sbjct  593  N--MFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLPY  640

>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
 gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens CC14M]
Length=656

 Score = 65.5 bits (158), Expect = 1e-09, Method: Compositional matrix adjust.
 Identities = 49/150 (33%), Positives = 73/150 (49%), Gaps = 33/150 (22%)

Query  19   DLPQPELDNLGLEALPYFTF---VNDA---VATQPNNVTVKSIIGYVPRYIAYKTDIDCV  72
            D QPE +NLG++ + +N A + Q NNV +GY RY+ YKT D +
Sbjct  524  DYFQPEFENLGMQPVIQSDLCLCINSAKSDSSDQHNNV-----LGYSARYLEYKTARDII  578

Query  73   DGAFLT--SLTSWVTPLTIDEIVTKISLGSGTGPFTPNYGLFK-----VSPYVLDSIFVS  125
            G F++ SL++W TP +T +G V P VL+ IF
Sbjct  579  FGEFMSGGSLSAWATP---------------KNNYTFEFGKLSLPDLLVDPKVLEPIFAV  623

Query  126  QCDSTVDTDQFLVESFFDVKLVQNLDYNGM  155
            + + ++ TDQFLV S+FDVK ++ + N M
Sbjct  624  KYNGSMSTDQFLVNSYFDVKAIRPMQVNDM  653

>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568

 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust.
 Identities = 40/142 (28%), Positives = 66/142 (46%), Gaps = 24/142 (17%)

Query  23   PELDNLGLEALPYFTFVNDAVATQPNNVTVKSII------GYVPRYIAYKTDIDCVDGAF  76
            PE +NLG++ L F + ++ + NN T S I G+ PRY YKT +D G F
Sbjct  441  PEFENLGMQPL----FAKN-ISYKYNNNTANSRIKNLGAFGWQPRYSEYKTALDINHGQF  495

Query  77   LTS--LTSWVTPLTIDEIVTKISLGSGTGPFTPNYGLFKVSPYVLDSIFVSQCDSTVDTD  134
            + L+ W E ++ ++ + FK++P LD +F + T TD
Sbjct  496  VHQEPLSYWTVARARGESMSNFNIST-----------FKINPKWLDDVFAVNYNGTELTD  544

Query  135  QFLVESFFDVKLVQNLDYNGMP  156
            Q +F++ V ++ +GMP
Sbjct  545  QVFGGCYFNIVKVSDMSIDGMP  566

>gi|494610271|ref|WP_007368517.1| capsid protein [Prevotella multiformis]
 gi|324988543|gb|EGC20506.1| putative capsid protein (F protein) [Prevotella multiformis DSM 16608]
Length=531

 Score = 51.6 bits (122), Expect = 5e-05, Method: Compositional matrix adjust.
 Identities = 38/106 (36%), Positives = 56/106 (53%), Gaps = 12/106 (11%)

Query  55   IIGYVPRYIAYKTDIDCVDGAFLT--SLTSWVTP---LTIDEIVTKISLGSGTGPFTPNY  109
            ++G+ RY YKT D V G F + SL+ W +P D L P++P +
Sbjct  430  LLGWQVRYNEYKTSRDLVFGEFESGLSLSYWCSPRYDFGFDGKAGDKKLV--NSPWSPAH  487

Query  110  GLFKVSPYVLDSIFVSQCDSTVDTDQFLVESFFDVKLVQNLDYNGM  155
            F V+P +L++IF+ S V D FLV SFFDVK V+ + +G+
Sbjct  488  --FYVNPSILNTIFLV---SAVKADHFLVNSFFDVKAVRPMSVSGL  528

>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
 gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon 317 str. F0108]
Length=541

 Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 60/138 (43%), Gaps = 24/138 (17%)

Query  23   PELDNLGLEAL-PYFTFVNDAVATQPNNVTVKSIIGYVPRYIAYKTDIDCVDGAFLTS--  79
            PE +NLG++ + P F +N A G+ PRY YKT D G F
Sbjct  422  PEFENLGMQPIVPAFVSLNRAKDNS---------YGWQPRYSEYKTAFDINHGQFANGEP  472

Query  80   LTSWVTPLTIDEIVTKISLGSGTGPF-TPNYGLFKVSPYVLDSIFVSQCDSTVDTDQFLV  138
            L+ W I+ G+ T N K++P+ LDS+F + T TD
Sbjct  473  LSYW-----------SIARARGSDTLNTFNVAALKINPHWLDSVFAVNYNGTEVTDCMFG  521

Query  139  ESFFDVKLVQNLDYNGMP  156
            + F+++ V ++ +GMP
Sbjct  522  YAHFNIEKVSDMTEDGMP  539

Lambda      K        H        a         alpha
   0.319    0.139    0.416    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6

Effective search space used: 432232358643
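Each hit reports a bit score followed by a raw score in parentheses, for example "112 bits (280)". The two are related by the standard Karlin-Altschul normalization S' = (lambda * S - ln K) / ln 2. The sketch below applies that formula with the gapped Lambda and K values printed in the statistics above. The function name `bit_score` is hypothetical, and because the printed parameters are rounded, the results should only be expected to match the reported bit scores approximately.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the statistics footer.
# They are rounded in the report, so computed scores are approximate.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score: float,
              lam: float = GAPPED_LAMBDA,
              k: float = GAPPED_K) -> float:
    """Convert a raw alignment score to a bit score: (lam*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores of the first, fifth, and last hit in this report.
    for raw in (280, 234, 118):
        print(raw, f"{bit_score(raw):.2f}")
```

For the raw scores 280, 234 and 118 this gives values close to the 112, 94.7 and 50.1 bits shown in the report. The reported E-values are not reproduced here, since they also depend on the search space and on the composition-based adjustment named in the "Method" field.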