bitscore colors: <40, 40-50, 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-2_CDS_annotation_glimmer3.pl_2_7

Length=155
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|547226430|ref|WP_021963493.1|  putative uncharacterized protein      111   1e-25
gi|575094354|emb|CDL65742.1|  unnamed protein product                   108   3e-24
gi|494822885|ref|WP_007558293.1|  hypothetical protein                  102   3e-22
gi|496050829|ref|WP_008775336.1|  hypothetical protein                  101   5e-22
gi|490418709|ref|WP_004291032.1|  hypothetical protein                87.4    4e-17
gi|575094321|emb|CDL65708.1|  unnamed protein product                 73.6    3e-12
gi|517172762|ref|WP_018361580.1|  hypothetical protein                65.1    2e-09
gi|565841287|ref|WP_023924568.1|  hypothetical protein                64.3    4e-09
gi|496521299|ref|WP_009229582.1|  capsid protein                      57.4    6e-07
gi|490477384|ref|WP_004347761.1|  capsid protein                      55.1    3e-06
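
The summary rows above can be read programmatically and grouped by the score ranges named in the "bitscore colors" legend. The following is a minimal sketch, not part of the BLAST output: the names `ROW`, `parse_row` and `score_bin` are hypothetical, the regular expression assumes the description itself does not end in a bare number, and treating each lower range edge as inclusive is an assumption because the legend does not say which side the boundaries fall on.

```python
import re
from bisect import bisect_right

# Range edges copied from the "bitscore colors" legend.
# Assumption: a score equal to an edge falls in the higher range.
EDGES = [40, 50, 80, 200]
LABELS = ["<40", "40-50", "50-80", "80-200", ">200"]

# One summary row: identifier, free-text description, bit score, E-value.
ROW = re.compile(r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\d\S*)\s*$")


def score_bin(bits):
    """Return the legend range that a bit score falls into."""
    return LABELS[bisect_right(EDGES, bits)]


def parse_row(line):
    """Parse one summary row into a dict, or return None if it does not match."""
    m = ROW.match(line)
    if m is None:
        return None
    ident, desc, bits, evalue = m.groups()
    bits = float(bits)
    return {
        "id": ident,
        "description": desc,
        "bits": bits,
        "evalue": float(evalue),
        "bin": score_bin(bits),
    }


if __name__ == "__main__":
    rows = [
        "gi|547226430|ref|WP_021963493.1|  putative uncharacterized protein      111   1e-25",
        "gi|496521299|ref|WP_009229582.1|  capsid protein                      57.4    6e-07",
    ]
    for row in rows:
        print(parse_row(row))
```

Blank lines and the column header do not match the pattern, so `parse_row` returns None for them and they can be skipped.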


>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
 gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573

 Score =   111 bits (278),  Expect = 1e-25, Method: Compositional matrix adjust.
 Identities = 63/159 (40%), Positives = 95/159 (60%), Gaps = 6/159 (4%)

Query  1    LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAV-PAITLFNSNAFDNDLESDFDFLGYNP  59
            LD++I+    Q   T+  D  IPEFD++GM+ + P+  +F      +D  S    +GY P
Sbjct  417  LDWSINRIARQNFKTTFTDYAIPEFDSVGMQQLYPSEMIFGLEDLPSDPSSIN--MGYVP  474

Query  60   RYWPWKSKIDRVHGAFLTTLKDWVAPIDDFYLNRWFAS---GGSSQASISWPFFKVNPNT  116
            RY   K+ ID +HG+F+ TL  WV+P+ D Y++ +  +    G S  ++++ FFKVNP+ 
Sbjct  475  RYADLKTSIDEIHGSFIDTLVSWVSPLTDSYISAYRQACKDAGFSDITMTYNFFKVNPHI  534

Query  117  LDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY  155
            +D+IF V ADST  +DQLLIN     K VR    +G+PY
Sbjct  535  VDNIFGVKADSTINTDQLLINSYFDIKAVRNFDYNGLPY  573
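
The Identities, Positives and Gaps figures in the alignment header above are counts over the alignment length, followed by a whole-number percentage. The sketch below, which is not part of the BLAST output, parses such a line and recomputes the percentages. The names `STAT`, `parse_stats` and `percent` are hypothetical, and rounding to the nearest whole percent is an assumption that happens to reproduce the percentages printed in this report.

```python
import math
import re

# Matches e.g. "Identities = 63/159 (40%)" anywhere in the line.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Map each statistic name to (count, alignment_length, reported_percent)."""
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in STAT.findall(line)
    }


def percent(count, length):
    """Whole-number percentage, rounding halves up (assumed rounding rule)."""
    return math.floor(100 * count / length + 0.5)


if __name__ == "__main__":
    line = " Identities = 63/159 (40%), Positives = 95/159 (60%), Gaps = 6/159 (4%)"
    for name, (count, length, reported) in parse_stats(line).items():
        print(name, count, length, reported, percent(count, length))
```

Comparing the recomputed value with the reported one is a cheap sanity check when a report has been copied or reflowed by hand.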


>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615

 Score =   108 bits (269),  Expect = 3e-24, Method: Compositional matrix adjust.
 Identities = 65/160 (41%), Positives = 87/160 (54%), Gaps = 6/160 (4%)

Query  1    LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAVPAITLFNSNAFDNDLESDFDFLGYNPR  60
            +DY  SG D           PIPE D IGME+VP +   N    ++D  S   FLGY PR
Sbjct  457  VDYVGSGVDHSCTLVDATSFPIPELDQIGMESVPLVRAMNP-VKESDTPSADTFLGYAPR  515

Query  61   YWPWKSKIDRVHGAFLTTLKDWVAPIDDFYLNRW----FASGGSSQA-SISWPFFKVNPN  115
            Y  WK+ +DR  G F  +L+ W  P+ D  L       F S  + +  SI+  FFKVNP+
Sbjct  516  YIDWKTSVDRSVGDFADSLRTWCLPVGDKELTSANSLNFPSNPNVEPDSIAAGFFKVNPS  575

Query  116  TLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY  155
             +D +FAV ADST ++D+ L +     KVVR L  +G+PY
Sbjct  576  IVDPLFAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY  615


>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
 gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 
17135]
Length=613

 Score =   102 bits (254),  Expect = 3e-22, Method: Compositional matrix adjust.
 Identities = 64/162 (40%), Positives = 88/162 (54%), Gaps = 7/162 (4%)

Query  1    LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAVPAITLFNS-NAFDNDLESDFD-FLGYN  58
            LDY  S        T+V D PIPEFD IGME VP I   N     D D +   + + GY 
Sbjct  452  LDYITSAPHFGTTLTNVLDFPIPEFDKIGMEQVPVIRGLNPVKPKDGDFKVSPNLYFGYA  511

Query  59   PRYWPWKSKIDRVHGAFLTTLKDWVAPIDDFYL----NRWFASGGSSQA-SISWPFFKVN  113
            P+Y+ WK+ +D+  G F  +LK W+ P DD  L    +  F    + +A S+   FFKV+
Sbjct  512  PQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLAADSVDFPDNPNVEADSVKAGFFKVS  571

Query  114  PNTLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY  155
            P+ LD++FAV A+S   +DQ L +      VVR L  +G+PY
Sbjct  572  PSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSLDPNGLPY  613


>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580

 Score =   101 bits (252),  Expect = 5e-22, Method: Compositional matrix adjust.
 Identities = 63/163 (39%), Positives = 87/163 (53%), Gaps = 14/163 (9%)

Query  1    LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAVPAITLFNSNAFDNDLESDFD----FLG  56
            LDY     +      +  D  IPEFD +GME+VP ++L N       L+S ++     LG
Sbjct  424  LDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVPLVSLMNP------LQSSYNVGSSILG  477

Query  57   YNPRYWPWKSKIDRVHGAFLTTLKDWVAPIDDF----YLNRWFASGGSSQASISWPFFKV  112
            Y PRY  +K+ +D   GAF TTLK WV   D+      LN       S    +++  FKV
Sbjct  478  YAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNYQDDPNNSPGTLVNYTNFKV  537

Query  113  NPNTLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY  155
            NPN +D +FAVAA ++ ++DQ L +     KVVR L  DG+PY
Sbjct  538  NPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGLPY  580


>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
 gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 
20697]
Length=578

 Score = 87.4 bits (215),  Expect = 4e-17, Method: Compositional matrix adjust.
 Identities = 58/167 (35%), Positives = 81/167 (49%), Gaps = 15/167 (9%)

Query  1    LDYAISGQDSQLLCTSVEDLPIPEFDNIGMEAVPAITLFNS-NAFDNDLESDFDFLGYNP  59
            LDY     D   L  +  D  IPEFD +GM+++P + L N   +F N   +    LGY P
Sbjct  415  LDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMPLVQLMNPLRSFAN---ASGLVLGYVP  471

Query  60   RYWPWKSKIDRVHGAFLTTLKDWVAPIDDFYLNRWF-----------ASGGSSQASISWP  108
            RY  +K+ +D+  G F  TL  WV    +  + +             +    S A +++ 
Sbjct  472  RYIDYKTSVDQSVGGFKRTLNSWVISYGNISVLKQVTLPNDAPPIEPSEPVPSVAPMNFT  531

Query  109  FFKVNPNTLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY  155
            FFKVNP+ LD IFAV A     +DQ L +     K VR L  DG+PY
Sbjct  532  FFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY  578


>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642

 Score = 73.6 bits (179),  Expect = 3e-12, Method: Compositional matrix adjust.
 Identities = 58/168 (35%), Positives = 79/168 (47%), Gaps = 18/168 (11%)

Query  1    LDYAISGQDSQLLCTSVEDLPIPEFDNIGME-------AVPAITLFNSNAFDNDLESDFD  53
            LD+A  G D  L  T   D  IPE D+IGM+       A PA       AF     S  D
Sbjct  478  LDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEVAAPAPYNDEFKAFRVGDGSSPD  537

Query  54   F---LGYNPRYWPWKSKIDRVHGAFLTTLKDWVAPI--DDFYLNRWFASGGSSQASISWP  108
                 GY PRY  +K+  DR +GAF  +LK WV  I  D    N W     ++ A I+ P
Sbjct  538  MSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAIQNNVW-----NTWAGINAP  592

Query  109  -FFKVNPNTLDSIFAVAADSTWESDQLLINCDVSCKVVRPLSQDGMPY  155
              F   P+ + ++F V++ +  + DQL +     C   R LS+ G+PY
Sbjct  593  NMFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLPY  640


>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568

 Score = 65.1 bits (157),  Expect = 2e-09, Method: Composition-based stats.
 Identities = 42/139 (30%), Positives = 74/139 (53%), Gaps = 12/139 (9%)

Query  19   DLPIPEFDNIGMEAVPAITL---FNSNAFDNDLESDFDFLGYNPRYWPWKSKIDRVHGAF  75
            D  +PEF+N+GM+ + A  +   +N+N  ++ ++ +    G+ PRY  +K+ +D  HG F
Sbjct  437  DFFVPEFENLGMQPLFAKNISYKYNNNTANSRIK-NLGAFGWQPRYSEYKTALDINHGQF  495

Query  76   LTTLKDWVAPIDDFYLNRWFASGGSSQASISWPFFKVNPNTLDSIFAVAADSTWESDQLL  135
            +        P+  + + R   + G S ++ +   FK+NP  LD +FAV  + T  +DQ+ 
Sbjct  496  VHQ-----EPLSYWTVAR---ARGESMSNFNISTFKINPKWLDDVFAVNYNGTELTDQVF  547

Query  136  INCDVSCKVVRPLSQDGMP  154
              C  +   V  +S DGMP
Sbjct  548  GGCYFNIVKVSDMSIDGMP  566


>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
 gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens 
CC14M]
Length=656

 Score = 64.3 bits (155),  Expect = 4e-09, Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 72/143 (50%), Gaps = 15/143 (10%)

Query  16   SVEDLPIPEFDNIGMEAVPAITL---FNSNAFDNDLESDFDFLGYNPRYWPWKSKIDRVH  72
            S ED   PEF+N+GM+ V    L    NS   D+  + + + LGY+ RY  +K+  D + 
Sbjct  521  SREDYFQPEFENLGMQPVIQSDLCLCINSAKSDSSDQHN-NVLGYSARYLEYKTARDIIF  579

Query  73   GAFLT--TLKDWVAPIDDFYLNRWFASGGSSQASISWPFFKVNPNTLDSIFAVAADSTWE  130
            G F++  +L  W  P +++     F  G      +S P   V+P  L+ IFAV  + +  
Sbjct  580  GEFMSGGSLSAWATPKNNY----TFEFG-----KLSLPDLLVDPKVLEPIFAVKYNGSMS  630

Query  131  SDQLLINCDVSCKVVRPLSQDGM  153
            +DQ L+N     K +RP+  + M
Sbjct  631  TDQFLVNSYFDVKAIRPMQVNDM  653


>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
 gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon 
317 str. F0108]
Length=541

 Score = 57.4 bits (137),  Expect = 6e-07, Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 67/137 (49%), Gaps = 16/137 (12%)

Query  19   DLPIPEFDNIGME-AVPAITLFNSNAFDNDLESDFDFLGYNPRYWPWKSKIDRVHGAFLT  77
            D  IPEF+N+GM+  VPA    N  A DN         G+ PRY  +K+  D  HG F  
Sbjct  418  DYFIPEFENLGMQPIVPAFVSLN-RAKDNSY-------GWQPRYSEYKTAFDINHGQFAN  469

Query  78   TLKDWVAPIDDFYLNRWFASGGSSQASISWPFFKVNPNTLDSIFAVAADSTWESDQLLIN  137
                   P+  + + R  A G  +  + +    K+NP+ LDS+FAV  + T  +D +   
Sbjct  470  G-----EPLSYWSIAR--ARGSDTLNTFNVAALKINPHWLDSVFAVNYNGTEVTDCMFGY  522

Query  138  CDVSCKVVRPLSQDGMP  154
               + + V  +++DGMP
Sbjct  523  AHFNIEKVSDMTEDGMP  539


>gi|490477384|ref|WP_004347761.1| capsid protein [Prevotella buccalis]
 gi|281300712|gb|EFA93043.1| putative capsid protein (F protein) [Prevotella buccalis ATCC 
35310]
Length=552

 Score = 55.1 bits (131),  Expect = 3e-06, Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 69/140 (49%), Gaps = 15/140 (11%)

Query  16   SVEDLPIPEFDNIGMEAVPAITLFNSNAFDNDLESDFD-FLGYNPRYWPWKSKIDRVHGA  74
            S  D  +PEF+++GM+  P  T + S     D+ +  + F G+ PRY  +K+ +D  HG 
Sbjct  425  SRGDFFMPEFEDLGMQ--PLQTRYIS-----DIRTQTEKFKGWQPRYSEYKTSLDINHGQ  477

Query  75   FLTTLKDWVAPIDDFYLNRWFASGGSSQASISWPFFKVNPNTLDSIFAVAADSTWESDQL  134
            F         P+  + + R  A  G +  +      K+NP  LDSIFAV  + T  +D +
Sbjct  478  FANG-----QPLSYWTVGRGRA--GETLETFDIASLKINPKWLDSIFAVNYNGTQITDCV  530

Query  135  LINCDVSCKVVRPLSQDGMP  154
               C  + + V  +S++G P
Sbjct  531  FGGCQFNVQKVSDMSENGEP  550



Lambda      K        H        a         alpha
   0.320    0.137    0.444    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 435859188405
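
The gapped Lambda and K values above relate each raw score (the number in parentheses after "bits") to its bit score through the standard conversion S' = (Lambda * S - ln K) / ln 2. The sketch below applies that conversion; it is not part of the BLAST output, the function name `bit_score` is hypothetical, and the constants are the rounded values printed in this report, so results can differ slightly from what the program computed internally. For scores of 100 and above the report appears to show only the integer part (raw 278 gives about 111.7 and is listed as 111), which is an observation from this report rather than documented behaviour.

```python
import math

# Gapped Karlin-Altschul parameters as printed at the end of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from the alignment headers in this report.
    for raw in (278, 215, 179, 131):
        print(raw, round(bit_score(raw), 1))
```

The E-values in the report are not reproduced by simply multiplying the effective search space by 2 to the power of minus the bit score, so this sketch deliberately stops at bit scores.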