bitscore colors: <40, 40-50 , 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters



Query= Contig-8_CDS_annotation_glimmer3.pl_2_1

Length=461
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|547226431|ref|WP_021963494.1|  predicted protein                     103   1e-20
gi|496050828|ref|WP_008775335.1|  hypothetical protein                  103   2e-20
gi|490418708|ref|WP_004291031.1|  hypothetical protein                99.0    3e-19
gi|575094340|emb|CDL65724.1|  unnamed protein product                 81.3    3e-13
gi|494822887|ref|WP_007558295.1|  hypothetical protein                78.2    4e-12
gi|575094322|emb|CDL65709.1|  unnamed protein product                 77.4    6e-12
gi|647452984|ref|WP_025792805.1|  hypothetical protein                75.9    2e-11
gi|565841285|ref|WP_023924566.1|  hypothetical protein                72.0    3e-10
gi|494610270|ref|WP_007368516.1|  hypothetical protein                71.2    4e-10
gi|546189465|ref|WP_021825245.1|  hypothetical protein                65.1    5e-08


>gi|547226431|ref|WP_021963494.1| predicted protein [Prevotella sp. CAG:1185]
 gi|524103383|emb|CCY83995.1| predicted protein [Prevotella sp. CAG:1185]
Length=498

 Score =   103 bits (258),  Expect = 1e-20, Method: Compositional matrix adjust.
 Identities = 60/135 (44%), Positives = 78/135 (58%), Gaps = 10/135 (7%)

Query  103  GLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEI  162
            G L Y + RD QLF KR+RK LSK     EKI  Y+VSEY PKTFR H+H+LFF+D  + 
Sbjct  128  GYLSYTSKRDAQLFLKRVRKNLSKYSD--EKIRYYIVSEYGPKTFRAHYHVLFFYDEVKT  185

Query  163  AKNFRQAVYQSWRLGRVDTQLAREQANSYVANYLNSVVSIPFVYKAKKSIRPRSRFSNLF  222
             K   + + Q+W+ GRVD  L+R + NSYVA Y+N    +P  +    S +P S  S  F
Sbjct  186  QKVMSKVIRQAWQFGRVDCSLSRGKCNSYVARYVNCNYCLP-RFLGDMSTKPFSCHSIRF  244

Query  223  GF-------EEVKKG  230
                     EE+ KG
Sbjct  245  ALGIHQSQKEEIYKG  259


>gi|496050828|ref|WP_008775335.1| hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229448892|gb|EEO54683.1| hypothetical protein BSCG_01608 [Bacteroides sp. 2_2_4]
Length=497

 Score =   103 bits (257),  Expect = 2e-20, Method: Compositional matrix adjust.
 Identities = 105/395 (27%), Positives = 167/395 (42%), Gaps = 51/395 (13%)

Query  103  GLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEI  162
            G + Y+   D QLF KRLR Y++K+    EK+  + V EY P  FRPH+H+L F  SDE 
Sbjct  115  GDVPYLRKTDLQLFLKRLRYYVTKQKPS-EKVRYFAVGEYGPVHFRPHYHLLLFLQSDEA  173

Query  163  AKNFRQAVYQSWRLGRVDTQLAREQANSYVANYLNSVVSIPFVYKAKKSIRPRSRFSNLF  222
             +   + + ++W  GRVD Q+++ Q ++YVA+Y+NS  +IP V+KA  S+ P       F
Sbjct  174  LQICSENISKAWTFGRVDCQVSKGQCSNYVASYVNSSCTIPKVFKA-SSVCP-------F  225

Query  223  GFEEVKKGIQHASDKRSALFDGVP-------YISNQKFVRYVPSGSHIDRLFPRFTHYDG  275
                 K G      +R  ++   P        + N K+  +    S     +PR   +  
Sbjct  226  SVHSQKLGQGFLDCQREKIYSLTPENFIRSSIVLNGKYKEFDVWRSCYSFFYPRCKGFVT  285

Query  276  SFLRRSSQIYEVVQRVLRLFARNEPFKKATPRNVSEFVCWWCEYNFRRGCQIKDFPDYMQ  335
               R  +  Y +      LF    P  K T     E   +   ++  +   + D   Y  
Sbjct  286  KSSRERAYSYSIYDTARLLF----PDAKTTFSLAKEIAIYIYYFHNPKETYLLDLYGYCS  341

Query  336  EFLHIVRLDGYSFLNWDVPI-----GKISRFFYR----------FNRFEAMKGSL---RS  377
            +   +  L  Y F + DV +     G+ SR+ +R          F  F     +L   +S
Sbjct  342  DQSKLYELSQY-FYDSDVLLHSFNSGEFSRYVHRIYTELLISKHFLYFVCTHNTLAERKS  400

Query  378  KLKAVSLFYDYRDYQSLKNQLSLQELVFAE--LGYSDELFDSFYVKPHIKVLKNAYID--  433
            K + +  FY   DY  L      Q+L +    +G  D   D++    +     N Y D  
Sbjct  401  KQRLIEEFYSRLDYMHLTKFFEAQQLFYESDLIGDDDLCTDNWDNSYYPYFYNNVYTDTN  460

Query  434  --------KWKDVNYKEVHYFRVKHKVLNDENNIF  460
                    +    + K++   R+KHK LND N +F
Sbjct  461  LFEKTPVYRLYSSDVKKLFNDRIKHKKLNDANKVF  495


>gi|490418708|ref|WP_004291031.1| hypothetical protein [Bacteroides eggerthii]
 gi|217986635|gb|EEC52969.1| hypothetical protein BACEGG_02720 [Bacteroides eggerthii DSM 
20697]
Length=422

 Score = 99.0 bits (245),  Expect = 3e-19, Method: Compositional matrix adjust.
 Identities = 104/395 (26%), Positives = 171/395 (43%), Gaps = 49/395 (12%)

Query  103  GLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEI  162
            G L Y+   D QLF KR R Y++K+  K EK+  + + EY P  FRPH+HIL F  SDE 
Sbjct  39   GYLPYLRKFDLQLFFKRFRYYVAKRFPK-EKVRYFAIGEYGPVHFRPHYHILLFLQSDEA  97

Query  163  AKNFRQAVYQSWRLGRVDTQLAREQANSYVANYLNSVVSIPFVYKAKKSIRPRSRFSNLF  222
             +   + V ++W  GRVD QL++ + +SYVA Y+NS V +P V     ++ P       F
Sbjct  98   LQVCSKVVSEAWPFGRVDCQLSKGKCSSYVAGYVNSSVLVPKVLTL-PTLCP-------F  149

Query  223  GFEEVKKGIQHASDKRSALFDGVP-------YISNQKFVRYVPSGSHIDRLFPRFTHYDG  275
                 K G      +R+ ++   P        + N ++  +    S     FP+   +  
Sbjct  150  CVHSQKLGQGFLQSERAKVYSLTPEQFVKRSIVINGRYKEFDVWRSAYAYFFPKCKGFAD  209

Query  276  SFLRRSSQIYEVVQRVLRLFARNEPFKKATPRNVSEFVCWWCEYNFRRGCQIKDFPDYMQ  335
               R  +  Y +     RLF    P  + T     E V +   ++ ++     D    + 
Sbjct  210  KSSRERAYSYGLYDTARRLF----PSAETTFALAKEIVGYIYYFHNKKDTYCLDIFGEVS  265

Query  336  EFLHIVRLDGYSF----LNWDVPIGKISRFFYR----------FNRFEAMKGSL---RSK  378
            +   + +   Y F    +N+ +   ++ R+ +R          F  F   + +L   + K
Sbjct  266  DQSDLYQFSQYFFEPEIVNYSLDSIEMCRYVHRVYTELLLSKHFLYFVCDRPTLSEQKRK  325

Query  379  LKAVSLFYDYRDYQSLKNQLSLQELVFAE--LGYSDELFDS--------FYVKPHI--KV  426
            LK +  FY   DY  LK     Q+L +    +G  D + D+        FY   +   +V
Sbjct  326  LKLIEEFYSRLDYMHLKTFFENQQLFYESDLVGDLDLMSDAWENSYYPFFYDNVYFSSEV  385

Query  427  LKNAYIDKWKDVNYKEVHYFRVKHKVLNDENNIFL  461
             K   + +  D+   ++   R+KHK LND N IF+
Sbjct  386  YKKTPVYRLYDMQISKLFSDRIKHKKLNDLNKIFV  420


>gi|575094340|emb|CDL65724.1| unnamed protein product [uncultured bacterium]
Length=486

 Score = 81.3 bits (199),  Expect = 3e-13, Method: Compositional matrix adjust.
 Identities = 50/122 (41%), Positives = 73/122 (60%), Gaps = 12/122 (10%)

Query  111  RDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEIA-KNFRQA  169
            +D+  F KRLR  L++      KI  +  SEY P T RPHFH +F+FDS  ++  +FR A
Sbjct  157  KDFVNFVKRLRINLTRNYNYEGKITYFKCSEYGPTTNRPHFHGIFWFDSRALSFDSFRSA  216

Query  170  VYQSWRLGRVDTQ-----LAREQANSYVANYLNSVVSIP--FVYKAKKSIRPRSRFSNLF  222
            V +SW++   D Q     +ARE A +YVA+Y+N + S+P  F++K    +RP+   S  F
Sbjct  217  VVESWKMCDKDKQYENVEIAREPA-TYVASYVNCLTSVPPLFLFKG---LRPKHSHSKGF  272

Query  223  GF  224
            GF
Sbjct  273  GF  274


>gi|494822887|ref|WP_007558295.1| hypothetical protein [Bacteroides plebeius]
 gi|198272100|gb|EDY96369.1| hypothetical protein BACPLE_00805 [Bacteroides plebeius DSM 17135]
Length=545

 Score = 78.2 bits (191),  Expect = 4e-12, Method: Compositional matrix adjust.
 Identities = 59/184 (32%), Positives = 82/184 (45%), Gaps = 41/184 (22%)

Query  78   MLPEIAESLKKKNNTDASGAFPQFKGLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSY  137
            M P++    +K+ N   +     +KG   Y++ R+ QLF KRLRKYL K  G  +KI  +
Sbjct  96   MTPQLMNEYQKRVNYRIN-----YKGRFPYLSKRELQLFMKRLRKYLDKYEG--QKIRFF  148

Query  138  VVSEYSPKTFRPHFHILFFFDS-----------------------------DEIAKNFRQ  168
               EY P +FRPHFHIL F D                                +      
Sbjct  149  ATGEYGPLSFRPHFHILLFVDDPSLFLPSVHTLGEYPYPYWSKYQKAHCGKGTLLSKLEY  208

Query  169  AVYQSWRLGRVDTQ-LAREQANSYVANYLNSVVSIPFVYK--AKKSIRPRSRF--SNLFG  223
             + +SW  G +D Q + +   +SYVA Y+NS V +P   K  A KS    SRF    +FG
Sbjct  209  YIRESWPFGGIDAQSVEQGSCSSYVAGYVNSSVPLPSCLKVDAVKSFSQHSRFLGRKIFG  268

Query  224  FEEV  227
             E +
Sbjct  269  TELI  272


>gi|575094322|emb|CDL65709.1| unnamed protein product [uncultured bacterium]
Length=499

 Score = 77.4 bits (189),  Expect = 6e-12, Method: Compositional matrix adjust.
 Identities = 84/319 (26%), Positives = 133/319 (42%), Gaps = 61/319 (19%)

Query  41   LRYPNFISKFRPFILRSIPRVSKLQNFKDEYFEELVWMLPEIAESLKKKNNTDASGAFPQ  100
            LR  +FIS F           S L NF +++ +++ +    +     K + +   G    
Sbjct  90   LRNDSFISDF----------CSDLHNFDNDFVDKMDYYSDYVINYESKYHKSCVYG----  135

Query  101  FKGLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSD  160
              GL   +  RD QLF KRLRK++ K  G  EKI  Y++ EY  K+ RPH+H L FF+S 
Sbjct  136  -HGLYALLYYRDIQLFLKRLRKHIYKYYG--EKIRFYIIGEYGTKSLRPHWHCLLFFNSS  192

Query  161  EIAKNFRQAVYQS---------------WRLGRVDTQLAREQANSYVANYLNSVVSIPFV  205
             +++ F   V                  W+ G  D++    +A +YV++Y+N   + P +
Sbjct  193  SLSQAFEDCVNVGTTSRPCSCPRFLRPFWQFGICDSKRTNGEAYNYVSSYVNQSANFPKL  252

Query  206  Y------KAKKSIRPRSRFSNLFGFEEVKKGIQHASDKRSALFDGVPYISNQKFVRYVPS  259
                   KA  SI+     S       ++KG   +  +R    D     ++    R    
Sbjct  253  LVLLSNQKAYHSIQLGQILSEQSIVSAIQKG-DFSFFERQFYLDTFGAANSYSVWR----  307

Query  260  GSHIDRLFPRFTHYDGSFLRRSSQI-YEVVQRVLRLFARNEPFKKATPRNVSEFVCWWCE  318
             S+  R FP+FT         SSQ+ YE   RVL  +   E  +     +    +C    
Sbjct  308  -SYYSRFFPKFTC--------SSQLTYEQTYRVLTCY---ETLRDLFDTDSVGVICRRLF  355

Query  319  YNFRRGCQIKDFPDYMQEF  337
            Y++  G     +PDY   F
Sbjct  356  YHYHFG-----YPDYHDIF  369


>gi|647452984|ref|WP_025792805.1| hypothetical protein [Prevotella histicola]
Length=480

 Score = 75.9 bits (185),  Expect = 2e-11, Method: Compositional matrix adjust.
 Identities = 41/138 (30%), Positives = 66/138 (48%), Gaps = 11/138 (8%)

Query  111  RDYQLFAKRLRKYLSKKI---GKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEIAKNFR  167
            +D Q F KRLR  +  K+   G   +I  ++ SEY P TFRPH+H + ++DS+ +     
Sbjct  125  KDVQDFFKRLRSKIDYKLKPRGNEYRIRYFICSEYGPNTFRPHYHAILWYDSEILHNELN  184

Query  168  QAVYQSWRLGRVDTQLAREQANSYVANYLNSVVSIPFVYKAKKSIRPRSRFSNLFGFEEV  227
              + ++W+ G  D  L    A+ YVA Y+N    +P           R+ F++ F     
Sbjct  185  VLIRETWKNGNTDFSLVNSSASQYVAKYVNGDCDLPSFL--------RTEFTSTFHLASK  236

Query  228  KKGIQHASDKRSALFDGV  245
               I +  D   AL++ V
Sbjct  237  HPCIGYGKDDEEALYENV  254


>gi|565841285|ref|WP_023924566.1| hypothetical protein [Prevotella nigrescens]
 gi|564729906|gb|ETD29850.1| hypothetical protein HMPREF1173_00032 [Prevotella nigrescens 
CC14M]
Length=484

 Score = 72.0 bits (175),  Expect = 3e-10, Method: Compositional matrix adjust.
 Identities = 62/236 (26%), Positives = 102/236 (43%), Gaps = 10/236 (4%)

Query  74   ELVWMLPEIAESLKKKNNTDASGAFPQFKGLLKYVNIRDYQLFAKRLRKYLSKKIGKY--  131
            E+VW    + +      N D           + Y    D   F KRLR  LS    K+  
Sbjct  81   EMVWTSNRLCDEKVIVGNYDFIKVSNSDVQAVAYCCKSDIVKFFKRLRSKLSYYFKKHHI  140

Query  132  ---EKIHSYVVSEYSPKTFRPHFHILFFFDSDEIAKNFRQAVYQSWRLGRVDTQLAREQA  188
               EKI  +V SEY PKT RPH+H + +FDS+E+A+   + +  SW  G  D +     A
Sbjct  141  ITNEKIRYFVCSEYGPKTLRPHYHAIIWFDSEEVARVIEKMLSSSWSNGFTDFEYVNSTA  200

Query  189  NSYVANYL--NSVVSIPFVYKAKKSIRPRSRFSNL-FGFEEVKKGIQHASDKRSALFDGV  245
              YVA Y+  NSV+     + A ++   +S+  ++ +  ++ +K  +   D     F+  
Sbjct  201  PQYVAKYVSGNSVLPEILQHDACRTFHLQSQAPSVGYRSDDYEKFEKEVIDGCYGHFEYD  260

Query  246  PYISNQKFVRYVPSGSHIDRLFPRFTHYDGSFLRRSSQIYEVVQRVLRLFARNEPF  301
                +  FV+  P G+   R FP+   Y         +IY   + +  ++  + P 
Sbjct  261  SSSQSSVFVQ--PPGTLETRCFPKCREYRSLSRIEKLRIYAYKRDICSIYGIDTPI  314


>gi|494610270|ref|WP_007368516.1| hypothetical protein [Prevotella multiformis]
 gi|324988542|gb|EGC20505.1| hypothetical protein HMPREF9141_0984 [Prevotella multiformis 
DSM 16608]
Length=479

 Score = 71.2 bits (173),  Expect = 4e-10, Method: Compositional matrix adjust.
 Identities = 40/135 (30%), Positives = 71/135 (53%), Gaps = 7/135 (5%)

Query  76   VWMLPEIAESLKKKNNTDASGAFPQ---FKGLLKYVNIRDYQLFAKRLRKYLSKKIGKYE  132
            VW    ++ES K  +++      PQ    +    Y   +D Q + KRLR  +  ++ K +
Sbjct  84   VWFSNRLSESGKFLSDSVCRSLPPQKMEDEVCFAYPCKKDVQDWFKRLRSAVDYQLNKNK  143

Query  133  ----KIHSYVVSEYSPKTFRPHFHILFFFDSDEIAKNFRQAVYQSWRLGRVDTQLAREQA  188
                +I  ++ SEY P+TFRPH+H + ++DS+E+ +N  + + ++W+ G     L    A
Sbjct  144  SNEFRIRYFICSEYGPRTFRPHYHAILWYDSEELQRNIGRLIRETWKNGNSVFSLVNNSA  203

Query  189  NSYVANYLNSVVSIP  203
            + YVA Y+N    +P
Sbjct  204  SQYVAKYVNGDTRLP  218


>gi|546189465|ref|WP_021825245.1| hypothetical protein [Prevotella salivae]
 gi|544001993|gb|ERK01417.1| hypothetical protein HMPREF9145_2741 [Prevotella salivae F0493]
Length=586

 Score = 65.1 bits (157),  Expect = 5e-08, Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 67/123 (54%), Gaps = 8/123 (7%)

Query  96   GAFPQFKGLLKYVNIRDYQLFAKRLRKYLS----KKIGKYEKIHSYVVSEYSPKTFRPHF  151
            G+ P FK  L  ++   Y L+    + YL+    KK    + +  ++ SEY+P TFRPHF
Sbjct  177  GSIP-FKEWLDDLDTETYDLYYSVYQYYLTDYEKKKESCKQSVRYFICSEYTPTTFRPHF  235

Query  152  HILFFFDSDEIAKNFRQAVYQSWRLG---RVDTQLAREQANSYVANYLNSVVSIPFVYKA  208
            H LF+FD ++      + ++++W++     ++ Q     A++YV+ Y+    ++P V +A
Sbjct  236  HGLFWFDDEKAFSYAPRCIFKAWKMCAEININVQPVSGDASAYVSKYVTGNSNLPPVLQA  295

Query  209  KKS  211
            K +
Sbjct  296  KST  298



Lambda      K        H        a         alpha
   0.325    0.140    0.424    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 3125101418307