bitscore colors: <40, 40-50, 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-31_CDS_annotation_glimmer3.pl_2_6

Length=236
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|496050828|ref|WP_008775335.1|  hypothetical protein                90.1    3e-17
gi|490418708|ref|WP_004291031.1|  hypothetical protein                79.7    6e-14
gi|547226431|ref|WP_021963494.1|  predicted protein                   79.7    8e-14
gi|517172763|ref|WP_018361581.1|  hypothetical protein                69.7    2e-10
gi|575094322|emb|CDL65709.1|  unnamed protein product                 68.6    3e-10
gi|575094355|emb|CDL65737.1|  unnamed protein product                 68.2    5e-10
gi|575094340|emb|CDL65724.1|  unnamed protein product                 67.8    6e-10
gi|565841285|ref|WP_023924566.1|  hypothetical protein                65.9    3e-09
gi|647452984|ref|WP_025792805.1|  hypothetical protein                61.2    9e-08
gi|496521300|ref|WP_009229583.1|  hypothetical protein                61.2    1e-07


>gi|496050828|ref|WP_008775335.1| hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229448892|gb|EEO54683.1| hypothetical protein BSCG_01608 [Bacteroides sp. 2_2_4]
Length=497

 Score = 90.1 bits (222),  Expect = 3e-17, Method: Compositional matrix adjust.
 Identities = 52/121 (43%), Positives = 76/121 (63%), Gaps = 2/121 (2%)

Query  35   QIPVIRNRDFELFMKRLRSNLKQQGLPHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLT  94
             +P +R  D +LF+KRLR  + +Q  P E++RY+ V EYGP   RPH+HLLLF  S +  
Sbjct  116  DVPYLRKTDLQLFLKRLRYYVTKQK-PSEKVRYFAVGEYGPVHFRPHYHLLLFLQSDEAL  174

Query  95   QAIRENVCKAWSYGNCDVSLSRGaaasyvasyvnsvasLPYLYTGHKEIRPRCFHSKGFG  154
            Q   EN+ KAW++G  D  +S+G  ++YVASYVNS  ++P ++     + P   HS+  G
Sbjct  175  QICSENISKAWTFGRVDCQVSKGQCSNYVASYVNSSCTIPKVFKA-SSVCPFSVHSQKLG  233

Query  155  Q  155
            Q
Sbjct  234  Q  234


>gi|490418708|ref|WP_004291031.1| hypothetical protein [Bacteroides eggerthii]
 gi|217986635|gb|EEC52969.1| hypothetical protein BACEGG_02720 [Bacteroides eggerthii DSM 
20697]
Length=422

 Score = 79.7 bits (195),  Expect = 6e-14, Method: Compositional matrix adjust.
 Identities = 49/121 (40%), Positives = 71/121 (59%), Gaps = 2/121 (2%)

Query  36   IPVIRNRDFELFMKRLRSNLKQQGLPHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLTQ  95
            +P +R  D +LF KR R  + ++  P E++RY+ + EYGP   RPH+H+LLF  S +  Q
Sbjct  41   LPYLRKFDLQLFFKRFRYYVAKR-FPKEKVRYFAIGEYGPVHFRPHYHILLFLQSDEALQ  99

Query  96   AIRENVCKAWSYGNCDVSLSRGaaasyvasyvnsvasLPYLYTGHKEIRPRCFHSKGFGQ  155
               + V +AW +G  D  LS+G  +SYVA YVNS   +P + T    + P C HS+  GQ
Sbjct  100  VCSKVVSEAWPFGRVDCQLSKGKCSSYVAGYVNSSVLVPKVLT-LPTLCPFCVHSQKLGQ  158

Query  156  N  156
             
Sbjct  159  G  159


>gi|547226431|ref|WP_021963494.1| predicted protein [Prevotella sp. CAG:1185]
 gi|524103383|emb|CCY83995.1| predicted protein [Prevotella sp. CAG:1185]
Length=498

 Score = 79.7 bits (195),  Expect = 8e-14, Method: Compositional matrix adjust.
 Identities = 50/113 (44%), Positives = 68/113 (60%), Gaps = 3/113 (3%)

Query  42   RDFELFMKRLRSNLKQQGLPHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLTQAIRENV  101
            RD +LF+KR+R NL +     E+IRYY VSEYGP+T R H+H+L F+D  +  + + + +
Sbjct  136  RDAQLFLKRVRKNLSK--YSDEKIRYYIVSEYGPKTFRAHYHVLFFYDEVKTQKVMSKVI  193

Query  102  CKAWSYGNCDVSLSRGaaasyvasyvnsvasLPYLYTGHKEIRPRCFHSKGFG  154
             +AW +G  D SLSRG   SYVA YVN    LP  + G    +P   HS  F 
Sbjct  194  RQAWQFGRVDCSLSRGKCNSYVARYVNCNYCLP-RFLGDMSTKPFSCHSIRFA  245


>gi|517172763|ref|WP_018361581.1| hypothetical protein [Prevotella nanceiensis]
Length=598

 Score = 69.7 bits (169),  Expect = 2e-10, Method: Compositional matrix adjust.
 Identities = 30/74 (41%), Positives = 46/74 (62%), Gaps = 2/74 (3%)

Query  34   LQIPVIRNRDFELFMKRLRSNLKQQGLPHEE--IRYYCVSEYGPQTLRPHWHLLLFFDSP  91
            LQ   +  +D + F+KRLR  + +  +P  E  IRY+  SEYGP+T RPH+H +LF DSP
Sbjct  146  LQFATVSKKDIQNFLKRLRKKIDKLNIPQNEKKIRYFIASEYGPKTYRPHYHGVLFIDSP  205

Query  92   QLTQAIRENVCKAW  105
             +   I+  + ++W
Sbjct  206  TVLSKIKAFIVESW  219


>gi|575094322|emb|CDL65709.1| unnamed protein product [uncultured bacterium]
Length=499

 Score = 68.6 bits (166),  Expect = 3e-10, Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 52/92 (57%), Gaps = 19/92 (21%)

Query  42   RDFELFMKRLRSNL-KQQGLPHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLTQAIREN  100
            RD +LF+KRLR ++ K  G   E+IR+Y + EYG ++LRPHWH LLFF+S  L+QA  + 
Sbjct  145  RDIQLFLKRLRKHIYKYYG---EKIRFYIIGEYGTKSLRPHWHCLLFFNSSSLSQAFEDC  201

Query  101  V--------CKA-------WSYGNCDVSLSRG  117
            V        C         W +G CD   + G
Sbjct  202  VNVGTTSRPCSCPRFLRPFWQFGICDSKRTNG  233


>gi|575094355|emb|CDL65737.1| unnamed protein product [uncultured bacterium]
Length=517

 Score = 68.2 bits (165),  Expect = 5e-10, Method: Compositional matrix adjust.
 Identities = 52/149 (35%), Positives = 71/149 (48%), Gaps = 31/149 (21%)

Query  35   QIPVIRNRDFELFMKRLRSNLKQQGLPHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLT  94
             +P +R RD +LF+KRLR NL +      ++RY+ + EYGP   RPH+H LLFFD  + T
Sbjct  122  DVPYLRKRDLQLFIKRLRKNLSKYS--DAKVRYFAMGEYGPVHFRPHYHFLLFFDEIKFT  179

Query  95   QAI--------------RENVC--------------KAWSYGNCDVSLSRGaaasyvasy  126
                              +N C               +W +G  D   S+G AA YV+SY
Sbjct  180  APSGHTLGEFPDWAWYDSQNKCSRSDILSVVEYCIRSSWKFGRVDAQYSKGDAAQYVSSY  239

Query  127  vnsvasLPYLYTGHKEIRPRCFHSKGFGQ  155
            V+   SLP +Y      RP   HS+  GQ
Sbjct  240  VSGSGSLPKVYQV-SSARPFSLHSRFLGQ  267


>gi|575094340|emb|CDL65724.1| unnamed protein product [uncultured bacterium]
Length=486

 Score = 67.8 bits (164),  Expect = 6e-10, Method: Compositional matrix adjust.
 Identities = 57/182 (31%), Positives = 87/182 (48%), Gaps = 14/182 (8%)

Query  1    MTDDVIKDILRRANGKYSYSLRKVVYPPASEWKLQIPVIRNRDFELFMKRLRSNLKQQGL  60
            M  +V ++ L    G  + S   VV         ++ ++ ++DF  F+KRLR NL +   
Sbjct  119  MPKEVFRNYLCNTTGIVTKSRNGVVLERDDN---KVGILYDKDFVNFVKRLRINLTRNYN  175

Query  61   PHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLT-QAIRENVCKAWSYGNCD-----VSL  114
               +I Y+  SEYGP T RPH+H + +FDS  L+  + R  V ++W   + D     V +
Sbjct  176  YEGKITYFKCSEYGPTTNRPHFHGIFWFDSRALSFDSFRSAVVESWKMCDKDKQYENVEI  235

Query  115  SRGaaasyvasyvnsvasLP-YLYTGHKEIRPRCFHSKGFG-QNKSFVNRPVFQKFEKYP  172
            +R  A    +      +  P +L+ G   +RP+  HSKGFG  N  F    VF  F    
Sbjct  236  AREPATYVASYVNCLTSVPPLFLFKG---LRPKHSHSKGFGFANNLFSFSAVFTNFMAQR  292

Query  173  LT  174
            LT
Sbjct  293  LT  294


>gi|565841285|ref|WP_023924566.1| hypothetical protein [Prevotella nigrescens]
 gi|564729906|gb|ETD29850.1| hypothetical protein HMPREF1173_00032 [Prevotella nigrescens 
CC14M]
Length=484

 Score = 65.9 bits (159),  Expect = 3e-09, Method: Compositional matrix adjust.
 Identities = 32/74 (43%), Positives = 47/74 (64%), Gaps = 5/74 (7%)

Query  43   DFELFMKRLRSNL-----KQQGLPHEEIRYYCVSEYGPQTLRPHWHLLLFFDSPQLTQAI  97
            D   F KRLRS L     K   + +E+IRY+  SEYGP+TLRPH+H +++FDS ++ + I
Sbjct  119  DIVKFFKRLRSKLSYYFKKHHIITNEKIRYFVCSEYGPKTLRPHYHAIIWFDSEEVARVI  178

Query  98   RENVCKAWSYGNCD  111
             + +  +WS G  D
Sbjct  179  EKMLSSSWSNGFTD  192


>gi|647452984|ref|WP_025792805.1| hypothetical protein [Prevotella histicola]
Length=480

 Score = 61.2 bits (147),  Expect = 9e-08, Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 54/105 (51%), Gaps = 4/105 (4%)

Query  14   NGKY-SYSLRKVVYPPASEWKLQIPVIRNRDFELFMKRLRSNLKQQGLPHEE---IRYYC  69
            NGK+ S  + + + P   E  +       +D + F KRLRS +  +  P      IRY+ 
Sbjct  96   NGKFVSSDIARPIPPVGMEDTVCFAYPCKKDVQDFFKRLRSKIDYKLKPRGNEYRIRYFI  155

Query  70   VSEYGPQTLRPHWHLLLFFDSPQLTQAIRENVCKAWSYGNCDVSL  114
             SEYGP T RPH+H +L++DS  L   +   + + W  GN D SL
Sbjct  156  CSEYGPNTFRPHYHAILWYDSEILHNELNVLIRETWKNGNTDFSL  200


>gi|496521300|ref|WP_009229583.1| hypothetical protein [Prevotella sp. oral taxon 317]
 gi|288330571|gb|EFC69155.1| hypothetical protein HMPREF0670_00478 [Prevotella sp. oral taxon 
317 str. F0108]
Length=569

 Score = 61.2 bits (147),  Expect = 1e-07, Method: Compositional matrix adjust.
 Identities = 29/68 (43%), Positives = 42/68 (62%), Gaps = 2/68 (3%)

Query  42   RDFELFMKRLRSNLKQQGLPHE--EIRYYCVSEYGPQTLRPHWHLLLFFDSPQLTQAIRE  99
            +D + F+KRLR N+ +     E  +IRYY  SEYGP TLRPH+H ++FFD   L   I  
Sbjct  136  KDIQNFLKRLRFNISKLYGKAESRKIRYYVASEYGPTTLRPHYHGIIFFDDASLLSEISS  195

Query  100  NVCKAWSY  107
             + ++W +
Sbjct  196  LIVRSWGF  203



Lambda      K        H        a         alpha
   0.326    0.141    0.441    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 981586889820