bitscore colors: <40, 40-50, 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-13_CDS_annotation_glimmer3.pl_2_4

Length=438
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|547226431|ref|WP_021963494.1|  predicted protein                     102   2e-20
gi|496050828|ref|WP_008775335.1|  hypothetical protein                  102   2e-20
gi|490418708|ref|WP_004291031.1|  hypothetical protein                98.6    3e-19
gi|575094340|emb|CDL65724.1|  unnamed protein product                 79.7    8e-13
gi|494822887|ref|WP_007558295.1|  hypothetical protein                78.2    3e-12
gi|575094322|emb|CDL65709.1|  unnamed protein product                 77.0    7e-12
gi|647452984|ref|WP_025792805.1|  hypothetical protein                75.1    3e-11
gi|565841285|ref|WP_023924566.1|  hypothetical protein                72.8    1e-10
gi|494610270|ref|WP_007368516.1|  hypothetical protein                71.2    4e-10
gi|546189465|ref|WP_021825245.1|  hypothetical protein                65.1    4e-08


>gi|547226431|ref|WP_021963494.1| predicted protein [Prevotella sp. CAG:1185]
 gi|524103383|emb|CCY83995.1| predicted protein [Prevotella sp. CAG:1185]
Length=498

 Score =   102 bits (255),  Expect = 2e-20, Method: Compositional matrix adjust.
 Identities = 61/141 (43%), Positives = 79/141 (56%), Gaps = 10/141 (7%)

Query  74   AFPQFKGLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFF  133
            A     G L Y + RD QLF KR+RK LSK     EKI  Y+VSEY PKTFR H+H+LFF
Sbjct  122  AKCNLDGYLSYTSKRDAQLFLKRVRKNLSKYSD--EKIRYYIVSEYGPKTFRAHYHVLFF  179

Query  134  FDSDEIAKNFRQAVYQSWRLGRVDTQLAREQANSYVANYLNSVVSIPFVYKAKKSIRPRS  193
            +D  +  K   + + Q+W+ GRVD  L+R + NSYVA Y+N    +P  +    S +P S
Sbjct  180  YDEVKTQKVMSKVIRQAWQFGRVDCSLSRGKCNSYVARYVNCNYCLP-RFLGDMSTKPFS  238

Query  194  RFSNLFGY-------EEVKKG  207
              S  F         EE+ KG
Sbjct  239  CHSIRFALGIHQSQKEEIYKG  259


>gi|496050828|ref|WP_008775335.1| hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229448892|gb|EEO54683.1| hypothetical protein BSCG_01608 [Bacteroides sp. 2_2_4]
Length=497

 Score =   102 bits (255),  Expect = 2e-20, Method: Compositional matrix adjust.
 Identities = 105/395 (27%), Positives = 167/395 (42%), Gaps = 51/395 (13%)

Query  80   GLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEI  139
            G + Y+   D QLF KRLR Y++K+    EK+  + V EY P  FRPH+H+L F  SDE 
Sbjct  115  GDVPYLRKTDLQLFLKRLRYYVTKQKPS-EKVRYFAVGEYGPVHFRPHYHLLLFLQSDEA  173

Query  140  AKNFRQAVYQSWRLGRVDTQLAREQANSYVANYLNSVVSIPFVYKAKKSIRPRSRFSNLF  199
             +   + + ++W  GRVD Q+++ Q ++YVA+Y+NS  +IP V+KA  S+ P       F
Sbjct  174  LQICSENISKAWTFGRVDCQVSKGQCSNYVASYVNSSCTIPKVFKA-SSVCP-------F  225

Query  200  GYEEVKKGIQHASDKRSALFDGVP-------YISNQKFVRYVPSGSHIDRLFPRFTHYDG  252
                 K G      +R  ++   P        + N K+  +    S     +PR   +  
Sbjct  226  SVHSQKLGQGFLDCQREKIYSLTPENFIRSSIVLNGKYKEFDVWRSCYSFFYPRCKGFVT  285

Query  253  SFLRRSSQIYEVVQRVLRLFARNEPFKKATPRNVSEFVCWWCEYNFRRGCQIKDFPDYMQ  312
               R  +  Y +      LF    P  K T     E   +   ++  +   + D   Y  
Sbjct  286  KSSRERAYSYSIYDTARLLF----PDAKTTFSLAKEIAIYIYYFHNPKETYLLDLYGYCS  341

Query  313  EFLHIVRLDGYSFLNWDVPI-----GKISRFFYR----------FNRFEAMKGSL---RS  354
            +   +  L  Y F + DV +     G+ SR+ +R          F  F     +L   +S
Sbjct  342  DQSKLYELSQY-FYDSDVLLHSFNSGEFSRYVHRIYTELLISKHFLYFVCTHNTLAERKS  400

Query  355  KLKAVSLFYDYRDYQSLKNQLSLQELVFAE--LGYSDELFDSFYVKPHIKVLKNAYID--  410
            K + +  FY   DY  L      Q+L +    +G  D   D++    +     N Y D  
Sbjct  401  KQRLIEEFYSRLDYMHLTKFFEAQQLFYESDLIGDDDLCTDNWDNSYYPYFYNNVYTDTN  460

Query  411  --------KWKDVNYKEVHYFRVKHKVLNDENNIF  437
                    +    + K++   R+KHK LND N +F
Sbjct  461  LFEKTPVYRLYSSDVKKLFNDRIKHKKLNDANKVF  495


>gi|490418708|ref|WP_004291031.1| hypothetical protein [Bacteroides eggerthii]
 gi|217986635|gb|EEC52969.1| hypothetical protein BACEGG_02720 [Bacteroides eggerthii DSM 
20697]
Length=422

 Score = 98.6 bits (244),  Expect = 3e-19, Method: Compositional matrix adjust.
 Identities = 104/395 (26%), Positives = 171/395 (43%), Gaps = 49/395 (12%)

Query  80   GLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEI  139
            G L Y+   D QLF KR R Y++K+  K EK+  + + EY P  FRPH+HIL F  SDE 
Sbjct  39   GYLPYLRKFDLQLFFKRFRYYVAKRFPK-EKVRYFAIGEYGPVHFRPHYHILLFLQSDEA  97

Query  140  AKNFRQAVYQSWRLGRVDTQLAREQANSYVANYLNSVVSIPFVYKAKKSIRPRSRFSNLF  199
             +   + V ++W  GRVD QL++ + +SYVA Y+NS V +P V     ++ P       F
Sbjct  98   LQVCSKVVSEAWPFGRVDCQLSKGKCSSYVAGYVNSSVLVPKVLTL-PTLCP-------F  149

Query  200  GYEEVKKGIQHASDKRSALFDGVP-------YISNQKFVRYVPSGSHIDRLFPRFTHYDG  252
                 K G      +R+ ++   P        + N ++  +    S     FP+   +  
Sbjct  150  CVHSQKLGQGFLQSERAKVYSLTPEQFVKRSIVINGRYKEFDVWRSAYAYFFPKCKGFAD  209

Query  253  SFLRRSSQIYEVVQRVLRLFARNEPFKKATPRNVSEFVCWWCEYNFRRGCQIKDFPDYMQ  312
               R  +  Y +     RLF    P  + T     E V +   ++ ++     D    + 
Sbjct  210  KSSRERAYSYGLYDTARRLF----PSAETTFALAKEIVGYIYYFHNKKDTYCLDIFGEVS  265

Query  313  EFLHIVRLDGYSF----LNWDVPIGKISRFFYR----------FNRFEAMKGSL---RSK  355
            +   + +   Y F    +N+ +   ++ R+ +R          F  F   + +L   + K
Sbjct  266  DQSDLYQFSQYFFEPEIVNYSLDSIEMCRYVHRVYTELLLSKHFLYFVCDRPTLSEQKRK  325

Query  356  LKAVSLFYDYRDYQSLKNQLSLQELVFAE--LGYSDELFDS--------FYVKPHI--KV  403
            LK +  FY   DY  LK     Q+L +    +G  D + D+        FY   +   +V
Sbjct  326  LKLIEEFYSRLDYMHLKTFFENQQLFYESDLVGDLDLMSDAWENSYYPFFYDNVYFSSEV  385

Query  404  LKNAYIDKWKDVNYKEVHYFRVKHKVLNDENNIFL  438
             K   + +  D+   ++   R+KHK LND N IF+
Sbjct  386  YKKTPVYRLYDMQISKLFSDRIKHKKLNDLNKIFV  420


>gi|575094340|emb|CDL65724.1| unnamed protein product [uncultured bacterium]
Length=486

 Score = 79.7 bits (195),  Expect = 8e-13, Method: Compositional matrix adjust.
 Identities = 49/122 (40%), Positives = 73/122 (60%), Gaps = 12/122 (10%)

Query  88   RDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEIA-KNFRQA  146
            +D+  F KRLR  L++      KI  +  SEY P T RPHFH +F+FDS  ++  +FR A
Sbjct  157  KDFVNFVKRLRINLTRNYNYEGKITYFKCSEYGPTTNRPHFHGIFWFDSRALSFDSFRSA  216

Query  147  VYQSWRLGRVDTQ-----LAREQANSYVANYLNSVVSIP--FVYKAKKSIRPRSRFSNLF  199
            V +SW++   D Q     +ARE A +YVA+Y+N + S+P  F++K    +RP+   S  F
Sbjct  217  VVESWKMCDKDKQYENVEIAREPA-TYVASYVNCLTSVPPLFLFKG---LRPKHSHSKGF  272

Query  200  GY  201
            G+
Sbjct  273  GF  274


>gi|494822887|ref|WP_007558295.1| hypothetical protein [Bacteroides plebeius]
 gi|198272100|gb|EDY96369.1| hypothetical protein BACPLE_00805 [Bacteroides plebeius DSM 17135]
Length=545

 Score = 78.2 bits (191),  Expect = 3e-12, Method: Compositional matrix adjust.
 Identities = 59/184 (32%), Positives = 82/184 (45%), Gaps = 41/184 (22%)

Query  55   MLPEIAESLKKKNNTDASGAFPQFKGLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSY  114
            M P++    +K+ N   +     +KG   Y++ R+ QLF KRLRKYL K  G  +KI  +
Sbjct  96   MTPQLMNEYQKRVNYRIN-----YKGRFPYLSKRELQLFMKRLRKYLDKYEG--QKIRFF  148

Query  115  VVSEYSPKTFRPHFHILFFFDS-----------------------------DEIAKNFRQ  145
               EY P +FRPHFHIL F D                                +      
Sbjct  149  ATGEYGPLSFRPHFHILLFVDDPSLFLPSVHTLGEYPYPYWSKYQKAHCGKGTLLSKLEY  208

Query  146  AVYQSWRLGRVDTQ-LAREQANSYVANYLNSVVSIPFVYK--AKKSIRPRSRF--SNLFG  200
             + +SW  G +D Q + +   +SYVA Y+NS V +P   K  A KS    SRF    +FG
Sbjct  209  YIRESWPFGGIDAQSVEQGSCSSYVAGYVNSSVPLPSCLKVDAVKSFSQHSRFLGRKIFG  268

Query  201  YEEV  204
             E +
Sbjct  269  TELI  272


>gi|575094322|emb|CDL65709.1| unnamed protein product [uncultured bacterium]
Length=499

 Score = 77.0 bits (188),  Expect = 7e-12, Method: Compositional matrix adjust.
 Identities = 80/318 (25%), Positives = 130/318 (41%), Gaps = 59/318 (19%)

Query  18   LRYPNFISKFRPFILRSIPRVSKLQNFKDEYFEELVWMLPEIAESLKKKNNTDASGAFPQ  77
            LR  +FIS F           S L NF +++ +++ +    +     K + +   G    
Sbjct  90   LRNDSFISDF----------CSDLHNFDNDFVDKMDYYSDYVINYESKYHKSCVYG----  135

Query  78   FKGLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSD  137
              GL   +  RD QLF KRLRK++ K  G  EKI  Y++ EY  K+ RPH+H L FF+S 
Sbjct  136  -HGLYALLYYRDIQLFLKRLRKHIYKYYG--EKIRFYIIGEYGTKSLRPHWHCLLFFNSS  192

Query  138  EIAKNFRQAVYQS---------------WRLGRVDTQLAREQANSYVANYLNSVVSIP--  180
             +++ F   V                  W+ G  D++    +A +YV++Y+N   + P  
Sbjct  193  SLSQAFEDCVNVGTTSRPCSCPRFLRPFWQFGICDSKRTNGEAYNYVSSYVNQSANFPKL  252

Query  181  FVYKAKKSIRPRSRFSNLFGYEEVKKGIQHAS---DKRSALFDGVPYISNQKFVRYVPSG  237
             V  + +      +   +   + +   IQ       +R    D     ++    R     
Sbjct  253  LVLLSNQKAYHSIQLGQILSEQSIVSAIQKGDFSFFERQFYLDTFGAANSYSVWR-----  307

Query  238  SHIDRLFPRFTHYDGSFLRRSSQI-YEVVQRVLRLFARNEPFKKATPRNVSEFVCWWCEY  296
            S+  R FP+FT         SSQ+ YE   RVL  +   E  +     +    +C    Y
Sbjct  308  SYYSRFFPKFTC--------SSQLTYEQTYRVLTCY---ETLRDLFDTDSVGVICRRLFY  356

Query  297  NFRRGCQIKDFPDYMQEF  314
            ++  G     +PDY   F
Sbjct  357  HYHFG-----YPDYHDIF  369


>gi|647452984|ref|WP_025792805.1| hypothetical protein [Prevotella histicola]
Length=480

 Score = 75.1 bits (183),  Expect = 3e-11, Method: Compositional matrix adjust.
 Identities = 41/138 (30%), Positives = 66/138 (48%), Gaps = 11/138 (8%)

Query  88   RDYQLFAKRLRKYLSKKI---GKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEIAKNFR  144
            +D Q F KRLR  +  K+   G   +I  ++ SEY P TFRPH+H + ++DS+ +     
Sbjct  125  KDVQDFFKRLRSKIDYKLKPRGNEYRIRYFICSEYGPNTFRPHYHAILWYDSEILHNELN  184

Query  145  QAVYQSWRLGRVDTQLAREQANSYVANYLNSVVSIPFVYKAKKSIRPRSRFSNLFGYEEV  204
              + ++W+ G  D  L    A+ YVA Y+N    +P           R+ F++ F     
Sbjct  185  VLIRETWKNGNTDFSLVNSSASQYVAKYVNGDCDLPSFL--------RTEFTSTFHLASK  236

Query  205  KKGIQHASDKRSALFDGV  222
               I +  D   AL++ V
Sbjct  237  HPCIGYGKDDEEALYENV  254


>gi|565841285|ref|WP_023924566.1| hypothetical protein [Prevotella nigrescens]
 gi|564729906|gb|ETD29850.1| hypothetical protein HMPREF1173_00032 [Prevotella nigrescens 
CC14M]
Length=484

 Score = 72.8 bits (177),  Expect = 1e-10, Method: Compositional matrix adjust.
 Identities = 64/237 (27%), Positives = 103/237 (43%), Gaps = 12/237 (5%)

Query  51   ELVWMLPEIAESLKKKNNTDASGAFPQFKGLLKYVNIRDYQLFAKRLRKYLSKKIGKY--  108
            E+VW    + +      N D           + Y    D   F KRLR  LS    K+  
Sbjct  81   EMVWTSNRLCDEKVIVGNYDFIKVSNSDVQAVAYCCKSDIVKFFKRLRSKLSYYFKKHHI  140

Query  109  ---EKIHSYVVSEYSPKTFRPHFHILFFFDSDEIAKNFRQAVYQSWRLGRVDTQLAREQA  165
               EKI  +V SEY PKT RPH+H + +FDS+E+A+   + +  SW  G  D +     A
Sbjct  141  ITNEKIRYFVCSEYGPKTLRPHYHAIIWFDSEEVARVIEKMLSSSWSNGFTDFEYVNSTA  200

Query  166  NSYVANYL--NSVVSIPFVYKAKKSIRPRSRFSNLFGY--EEVKKGIQHASDKRSALFDG  221
              YVA Y+  NSV+     + A ++   +S+  ++ GY  ++ +K  +   D     F+ 
Sbjct  201  PQYVAKYVSGNSVLPEILQHDACRTFHLQSQAPSV-GYRSDDYEKFEKEVIDGCYGHFEY  259

Query  222  VPYISNQKFVRYVPSGSHIDRLFPRFTHYDGSFLRRSSQIYEVVQRVLRLFARNEPF  278
                 +  FV+  P G+   R FP+   Y         +IY   + +  ++  + P 
Sbjct  260  DSSSQSSVFVQ--PPGTLETRCFPKCREYRSLSRIEKLRIYAYKRDICSIYGIDTPI  314


>gi|494610270|ref|WP_007368516.1| hypothetical protein [Prevotella multiformis]
 gi|324988542|gb|EGC20505.1| hypothetical protein HMPREF9141_0984 [Prevotella multiformis 
DSM 16608]
Length=479

 Score = 71.2 bits (173),  Expect = 4e-10, Method: Compositional matrix adjust.
 Identities = 40/135 (30%), Positives = 71/135 (53%), Gaps = 7/135 (5%)

Query  53   VWMLPEIAESLKKKNNTDASGAFPQ---FKGLLKYVNIRDYQLFAKRLRKYLSKKIGKYE  109
            VW    ++ES K  +++      PQ    +    Y   +D Q + KRLR  +  ++ K +
Sbjct  84   VWFSNRLSESGKFLSDSVCRSLPPQKMEDEVCFAYPCKKDVQDWFKRLRSAVDYQLNKNK  143

Query  110  ----KIHSYVVSEYSPKTFRPHFHILFFFDSDEIAKNFRQAVYQSWRLGRVDTQLAREQA  165
                +I  ++ SEY P+TFRPH+H + ++DS+E+ +N  + + ++W+ G     L    A
Sbjct  144  SNEFRIRYFICSEYGPRTFRPHYHAILWYDSEELQRNIGRLIRETWKNGNSVFSLVNNSA  203

Query  166  NSYVANYLNSVVSIP  180
            + YVA Y+N    +P
Sbjct  204  SQYVAKYVNGDTRLP  218


>gi|546189465|ref|WP_021825245.1| hypothetical protein [Prevotella salivae]
 gi|544001993|gb|ERK01417.1| hypothetical protein HMPREF9145_2741 [Prevotella salivae F0493]
Length=586

 Score = 65.1 bits (157),  Expect = 4e-08, Method: Compositional matrix adjust.
 Identities = 54/190 (28%), Positives = 89/190 (47%), Gaps = 15/190 (8%)

Query  73   GAFPQFKGLLKYVNIRDYQLFAKRLRKYLS----KKIGKYEKIHSYVVSEYSPKTFRPHF  128
            G+ P FK  L  ++   Y L+    + YL+    KK    + +  ++ SEY+P TFRPHF
Sbjct  177  GSIP-FKEWLDDLDTETYDLYYSVYQYYLTDYEKKKESCKQSVRYFICSEYTPTTFRPHF  235

Query  129  HILFFFDSDEIAKNFRQAVYQSWRLG---RVDTQLAREQANSYVANYLNSVVSIPFVYKA  185
            H LF+FD ++      + ++++W++     ++ Q     A++YV+ Y+    ++P V +A
Sbjct  236  HGLFWFDDEKAFSYAPRCIFKAWKMCAEININVQPVSGDASAYVSKYVTGNSNLPPVLQA  295

Query  186  KKSIRPRSRFSN--LFGYEEVKKGIQHASDKRSALFDGVPYISNQ---KFVRYVPSGSHI  240
             KS R     S     GY+            R  +F     IS +     V  VPS S +
Sbjct  296  -KSTRTFCLASKGPAIGYKSFSDKEVLEMFTRRCIFRSYETISKKGKLSGVSAVPS-SAV  353

Query  241  DRLFPRFTHY  250
             R FP+   Y
Sbjct  354  GRYFPKCYQY  363



Lambda      K        H        a         alpha
   0.325    0.140    0.426    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 2916668506332