bitscore colors: <40, 40-50, 50-80, 80-200, >200
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-2_CDS_annotation_glimmer3.pl_2_7
Length=155

                                                                     Score   E
Sequences producing significant alignments:                          (Bits)  Value

gi|547226430|ref|WP_021963493.1| putative uncharacterized protein    111     1e-25
gi|575094354|emb|CDL65742.1| unnamed protein product                 108     3e-24
gi|494822885|ref|WP_007558293.1| hypothetical protein                102     3e-22
gi|496050829|ref|WP_008775336.1| hypothetical protein                101     5e-22
gi|490418709|ref|WP_004291032.1| hypothetical protein                87.4    4e-17
gi|575094321|emb|CDL65708.1| unnamed protein product                 73.6    3e-12
gi|517172762|ref|WP_018361580.1| hypothetical protein                65.1    2e-09
gi|565841287|ref|WP_023924568.1| hypothetical protein                64.3    4e-09
gi|496521299|ref|WP_009229582.1| capsid protein                      57.4    6e-07
gi|490477384|ref|WP_004347761.1| capsid protein                      55.1    3e-06

>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
 gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573

 Score = 111 bits (278), Expect = 1e-25, Method: Compositional matrix adjust.
 Identities = 63/159 (40%), Positives = 95/159 (60%), Gaps = 6/159 (4%)

Query 1 LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAV-PAITLFNSNAFDNDLESDFDFLGYNP 59
LD++I+ Q T+ D IPEFD++GM+ + P+ +F +D S +GY P
Sbjct 417 LDWSINRIARQNFKTTFTDYAIPEFDSVGMQQLYPSEMIFGLEDLPSDPSSIN--MGYVP 474

Query 60 RYWPWKSKIDRVHGAFLTTLKDWVAPIDDFYLNRWFAS---GGSSQASISWPFFKVNPNT 116
RY K+ ID +HG+F+ TL WV+P+ D Y++ + + G S ++++ FFKVNP+
Sbjct 475 RYADLKTSIDEIHGSFIDTLVSWVSPLTDSYISAYRQACKDAGFSDITMTYNFFKVNPHI 534

Query 117 LDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY 155
+D+IF V ADST +DQLLIN K VR +G+PY
Sbjct 535 VDNIFGVKADSTINTDQLLINSYFDIKAVRNFDYNGLPY 573

>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615

 Score = 108 bits (269), Expect = 3e-24, Method: Compositional matrix adjust.
 Identities = 65/160 (41%), Positives = 87/160 (54%), Gaps = 6/160 (4%)

Query 1 LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAVPAITLFNSNAFDNDLESDFDFLGYNPR 60
+DY SG D PIPE D IGME+VP + N ++D S FLGY PR
Sbjct 457 VDYVGSGVDHSCTLVDATSFPIPELDQIGMESVPLVRAMNP-VKESDTPSADTFLGYAPR 515

Query 61 YWPWKSKIDRVHGAFLTTLKDWVAPIDDFYLNRW----FASGGSSQA-SISWPFFKVNPN 115
Y WK+ +DR G F +L+ W P+ D L F S + + SI+ FFKVNP+
Sbjct 516 YIDWKTSVDRSVGDFADSLRTWCLPVGDKELTSANSLNFPSNPNVEPDSIAAGFFKVNPS 575

Query 116 TLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY 155
+D +FAV ADST ++D+ L + KVVR L +G+PY
Sbjct 576 IVDPLFAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY 615

>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
 gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 17135]
Length=613

 Score = 102 bits (254), Expect = 3e-22, Method: Compositional matrix adjust.
 Identities = 64/162 (40%), Positives = 88/162 (54%), Gaps = 7/162 (4%)

Query 1 LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAVPAITLFNS-NAFDNDLESDFD-FLGYN 58
LDY S T+V D PIPEFD IGME VP I N D D + + + GY
Sbjct 452 LDYITSAPHFGTTLTNVLDFPIPEFDKIGMEQVPVIRGLNPVKPKDGDFKVSPNLYFGYA 511

Query 59 PRYWPWKSKIDRVHGAFLTTLKDWVAPIDDFYL----NRWFASGGSSQA-SISWPFFKVN 113
P+Y+ WK+ +D+ G F +LK W+ P DD L + F + +A S+ FFKV+
Sbjct 512 PQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLAADSVDFPDNPNVEADSVKAGFFKVS 571

Query 114 PNTLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY 155
P+ LD++FAV A+S +DQ L + VVR L +G+PY
Sbjct 572 PSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSLDPNGLPY 613

>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580

 Score = 101 bits (252), Expect = 5e-22, Method: Compositional matrix adjust.
 Identities = 63/163 (39%), Positives = 87/163 (53%), Gaps = 14/163 (9%)

Query 1 LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAVPAITLFNSNAFDNDLESDFD----FLG 56
LDY + + D IPEFD +GME+VP ++L N L+S ++ LG
Sbjct 424 LDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVPLVSLMNP------LQSSYNVGSSILG 477

Query 57 YNPRYWPWKSKIDRVHGAFLTTLKDWVAPIDDF----YLNRWFASGGSSQASISWPFFKV 112
Y PRY +K+ +D GAF TTLK WV D+ LN S +++ FKV
Sbjct 478 YAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNYQDDPNNSPGTLVNYTNFKV 537

Query 113 NPNTLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY 155
NPN +D +FAVAA ++ ++DQ L + KVVR L DG+PY
Sbjct 538 NPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGLPY 580

>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
 gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 20697]
Length=578

 Score = 87.4 bits (215), Expect = 4e-17, Method: Compositional matrix adjust.
 Identities = 58/167 (35%), Positives = 81/167 (49%), Gaps = 15/167 (9%)

Query 1 LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAVPAITLFNS-NAFDNDLESDFDFLGYNP 59
LDY D L + D IPEFD +GM+++P + L N +F N + LGY P
Sbjct 415 LDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMPLVQLMNPLRSFAN---ASGLVLGYVP 471

Query 60 RYWPWKSKIDRVHGAFLTTLKDWVAPIDDFYLNRWF-----------ASGGSSQASISWP 108
RY +K+ +D+ G F TL WV + + + + S A +++
Sbjct 472 RYIDYKTSVDQSVGGFKRTLNSWVISYGNISVLKQVTLPNDAPPIEPSEPVPSVAPMNFT 531

Query 109 FFKVNPNTLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY 155
FFKVNP+ LD IFAV A +DQ L + K VR L DG+PY
Sbjct 532 FFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY 578

>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642

 Score = 73.6 bits (179), Expect = 3e-12, Method: Compositional matrix adjust.
 Identities = 58/168 (35%), Positives = 79/168 (47%), Gaps = 18/168 (11%)

Query 1 LDYAISGQDSQLLCTSVEDLPIPEFDNIGME-------AVPAITLFNSNAFDNDLESDFD 53
LD+A G D L T D IPE D+IGM+ A PA AF S D
Sbjct 478 LDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEVAAPAPYNDEFKAFRVGDGSSPD 537

Query 54 F---LGYNPRYWPWKSKIDRVHGAFLTTLKDWVAPI--DDFYLNRWFASGGSSQASISWP 108
GY PRY +K+ DR +GAF +LK WV I D N W ++ A I+ P
Sbjct 538 MSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAIQNNVW-----NTWAGINAP 592

Query 109 -FFKVNPNTLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY 155
F P+ + ++F V++ + + DQL + C R LS+ G+PY
Sbjct 593 NMFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLPY 640

>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568

 Score = 65.1 bits (157), Expect = 2e-09, Method: Composition-based stats.
 Identities = 42/139 (30%), Positives = 74/139 (53%), Gaps = 12/139 (9%)

Query 19 DLPIPEFDNIGMEAVPAITL---FNSNAFDNDLESDFDFLGYNPRYWPWKSKIDRVHGAF 75
D +PEF+N+GM+ + A + +N+N ++ ++ + G+ PRY +K+ +D HG F
Sbjct 437 DFFVPEFENLGMQPLFAKNISYKYNNNTANSRIK-NLGAFGWQPRYSEYKTALDINHGQF 495

Query 76 LTTLKDWVAPIDDFYLNRWFASGGSSQASISWPFFKVNPNTLDSIFAVAADSTWESDQLL 135
+ P+ + + R + G S ++ + FK+NP LD +FAV + T +DQ+
Sbjct 496 VHQ-----EPLSYWTVAR---ARGESMSNFNISTFKINPKWLDDVFAVNYNGTELTDQVF 547

Query 136 INCDVSCKVVRPLSQDGMP 154
C + V +S DGMP
Sbjct 548 GGCYFNIVKVSDMSIDGMP 566

>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
 gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens CC14M]
Length=656

 Score = 64.3 bits (155), Expect = 4e-09, Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 72/143 (50%), Gaps = 15/143 (10%)

Query 16 SVEDLPIPEFDNIGMEAVPAITL---FNSNAFDNDLESDFDFLGYNPRYWPWKSKIDRVH 72
S ED PEF+N+GM+ V L NS D+ + + + LGY+ RY +K+ D +
Sbjct 521 SREDYFQPEFENLGMQPVIQSDLCLCINSAKSDSSDQHN-NVLGYSARYLEYKTARDIIF 579

Query 73 GAFLT--TLKDWVAPIDDFYLNRWFASGGSSQASISWPFFKVNPNTLDSIFAVAADSTWE 130
G F++ +L W P +++ F G +S P V+P L+ IFAV + +
Sbjct 580 GEFMSGGSLSAWATPKNNY----TFEFG-----KLSLPDLLVDPKVLEPIFAVKYNGSMS 630

Query 131 SDQLLINCDVSCKVVRPLSQDGM 153
+DQ L+N K +RP+ + M
Sbjct 631 TDQFLVNSYFDVKAIRPMQVNDM 653

>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
 gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon 317 str. F0108]
Length=541

 Score = 57.4 bits (137), Expect = 6e-07, Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 67/137 (49%), Gaps = 16/137 (12%)

Query 19 DLPIPEFDNIGME-AVPAITLFNSNAFDNDLESDFDFLGYNPRYWPWKSKIDRVHGAFLT 77
D IPEF+N+GM+ VPA N A DN G+ PRY +K+ D HG F
Sbjct 418 DYFIPEFENLGMQPIVPAFVSLN-RAKDNSY-------GWQPRYSEYKTAFDINHGQFAN 469

Query 78 TLKDWVAPIDDFYLNRWFASGGSSQASISWPFFKVNPNTLDSIFAVAADSTWESDQLLIN 137
P+ + + R A G + + + K+NP+ LDS+FAV + T +D +
Sbjct 470 G-----EPLSYWSIAR--ARGSDTLNTFNVAALKINPHWLDSVFAVNYNGTEVTDCMFGY 522

Query 138 CDVSCKVVRPLSQDGMP 154
+ + V +++DGMP
Sbjct 523 AHFNIEKVSDMTEDGMP 539

>gi|490477384|ref|WP_004347761.1| capsid protein [Prevotella buccalis]
 gi|281300712|gb|EFA93043.1| putative capsid protein (F protein) [Prevotella buccalis ATCC 35310]
Length=552

 Score = 55.1 bits (131), Expect = 3e-06, Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 69/140 (49%), Gaps = 15/140 (11%)

Query 16 SVEDLPIPEFDNIGMEAVPAITLFNSNAFDNDLESDFD-FLGYNPRYWPWKSKIDRVHGA 74
S D +PEF+++GM+ P T + S D+ + + F G+ PRY +K+ +D HG
Sbjct 425 SRGDFFMPEFEDLGMQ--PLQTRYIS-----DIRTQTEKFKGWQPRYSEYKTSLDINHGQ 477

Query 75 FLTTLKDWVAPIDDFYLNRWFASGGSSQASISWPFFKVNPNTLDSIFAVAADSTWESDQL 134
F P+ + + R A G + + K+NP LDSIFAV + T +D +
Sbjct 478 FANG-----QPLSYWTVGRGRA--GETLETFDIASLKINPKWLDSIFAVNYNGTQITDCV 530

Query 135 LINCDVSCKVVRPLSQDGMP 154
C + + V +S++G P
Sbjct 531 FGGCQFNVQKVSDMSENGEP 550

Lambda      K        H        a         alpha
   0.320    0.137    0.444    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6

Effective search space used: 435859188405