bitscore colors: <40, 40-50, 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-13_CDS_annotation_glimmer3.pl_2_4

Length=559
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|496050828|ref|WP_008775335.1|  hypothetical protein                  105   7e-21
gi|575094298|emb|CDL65688.1|  unnamed protein product                 89.7    1e-15
gi|547226431|ref|WP_021963494.1|  predicted protein                   89.0    2e-15
gi|575094340|emb|CDL65724.1|  unnamed protein product                 85.9    2e-14
gi|575094355|emb|CDL65737.1|  unnamed protein product                 84.0    8e-14
gi|490418708|ref|WP_004291031.1|  hypothetical protein                81.6    3e-13
gi|575094322|emb|CDL65709.1|  unnamed protein product                 79.7    2e-12
gi|575095229|emb|CDL66433.1|  unnamed protein product                 73.6    1e-10
gi|490477382|ref|WP_004347759.1|  hypothetical protein                72.8    3e-10
gi|517172763|ref|WP_018361581.1|  hypothetical protein                70.9    1e-09


>gi|496050828|ref|WP_008775335.1| hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229448892|gb|EEO54683.1| hypothetical protein BSCG_01608 [Bacteroides sp. 2_2_4]
Length=497

 Score =   105 bits (262),  Expect = 7e-21, Method: Compositional matrix adjust.
 Identities = 94/328 (29%), Positives = 141/328 (43%), Gaps = 73/328 (22%)

Query  14   CQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFVTLTYDNEHI  73
            C H   I N YT   + V CG C  C   +  + + +        K++ F+TLTY N  I
Sbjct  9    CLHPKRIMNPYTKESMVVPCGHCQACTLAKNSRYAFQCDLESYTAKHTLFITLTYANRFI  68

Query  74   PLMRCKVLHSEYEDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQ---GTV  130
            P                                            R +F + ++   G  
Sbjct  69   P--------------------------------------------RAMFVDSIERPYGCD  84

Query  131  PYDREIKEYVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQN  190
              D+E  E +   D   L+ D   + ++K    G               +P+L   D+Q 
Sbjct  85   LIDKETGEILGPAD---LTEDERTNLLNKFYLFGD--------------VPYLRKTDLQL  127

Query  191  YIKRLRKYLFKVLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVAEVLRQCHDKSWK  250
            ++KRLR Y+ K   S E + ++AVGEYGPVHFRPH+HLLLF  SDE  ++  +   K+W 
Sbjct  128  FLKRLRYYVTKQKPS-EKVRYFAVGEYGPVHFRPHYHLLLFLQSDEALQICSENISKAWT  186

Query  251  FGRSDFQRsaggsasyvssyvNSLCSAPLLYRS---CRAFRPKSRASVGFFEKGCDFVED  307
            FGR D Q S G  ++YV+SYVNS C+ P ++++   C       +   GF +   + +  
Sbjct  187  FGRVDCQVSKGQCSNYVASYVNSSCTIPKVFKASSVCPFSVHSQKLGQGFLDCQREKIYS  246

Query  308  EDPYAQIEKKIDSVVNGRCYNFNGVSVW  335
              P   I   I  V+NG+   F+   VW
Sbjct  247  LTPENFIRSSI--VLNGKYKEFD---VW  269


>gi|575094298|emb|CDL65688.1| unnamed protein product [uncultured bacterium]
Length=478

 Score = 89.7 bits (221),  Expect = 1e-15, Method: Compositional matrix adjust.
 Identities = 65/242 (27%), Positives = 111/242 (46%), Gaps = 29/242 (12%)

Query  14   CQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFVTLTYDNEHI  73
            C ++  I N+YTG ++ V CG+C  C+ ++A  ++ +++   S+    +FVTL YDN HI
Sbjct  2    CINKREIRNKYTGQKLYVSCGKCPACLQEKANASAYKIRNNQSSELSCFFVTLNYDNNHI  61

Query  74   PLMRCKVLHSEYEDVVGISGDIHFGDEYHHY-IPVSEYQCDDSSALRHIFFEQVQGTVP-  131
            P++     H  Y      S   HF +E     +PV  Y                +G  P 
Sbjct  62   PVI---FKHDVYN--YNSSDVYHFDEERKELCLPVDLY----------------RGVCPA  100

Query  132  YDREIKEY-VPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQN  190
            +  +I  +  P+     LS D + S  +    + KT       +  + +       D+Q 
Sbjct  101  FSNKIDTFNFPLNR---LSTDVVSSLDNHCGVVVKTKNHKPVLFNEE-IFSVCYTKDIQL  156

Query  191  YIKRLRKYLFKVLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVA-EVLRQCHDKSW  249
            + KRLR+ L++  G    + ++   EYGP  +R HFHL +F    E++ +  R+   K+W
Sbjct  157  FFKRLRQSLYRKFGFRPFIQYFQTSEYGPTTYRAHFHLCIFVKRSEISFDSFRKACVKAW  216

Query  250  KF  251
             F
Sbjct  217  PF  218


>gi|547226431|ref|WP_021963494.1| predicted protein [Prevotella sp. CAG:1185]
 gi|524103383|emb|CCY83995.1| predicted protein [Prevotella sp. CAG:1185]
Length=498

 Score = 89.0 bits (219),  Expect = 2e-15, Method: Compositional matrix adjust.
 Identities = 67/242 (28%), Positives = 103/242 (43%), Gaps = 49/242 (20%)

Query  14   CQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFVTLTYDNEHI  73
            C H   + N+YTG  I V CG C  C+ +RA K S        + KY  F TLTY N+++
Sbjct  11   CYHPRHVQNKYTGEVIQVGCGVCKACLKRRADKMSFLCAIEEQSHKYCMFATLTYSNDYV  70

Query  74   PLMRCKVLHSEYEDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQGTVPYD  133
            P M        Y +V     ++     Y +        CD  +    +       TV YD
Sbjct  71   PRM--------YPEV---DNELRLVRWYSY--------CDRLNEKGKLM------TVDYD  105

Query  134  REIKEYVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQNYIK  193
                                  + HK  +L      +  +   D  + + +  D Q ++K
Sbjct  106  ----------------------YWHKCPSLDTYVLMLTAKCNLDGYLSYTSKRDAQLFLK  143

Query  194  RLRKYLFKVLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVAEVLRQCHDKSWKFGR  253
            R+RK L K   S E + +Y V EYGP  FR H+H+L F +  +  +V+ +   ++W+FGR
Sbjct  144  RVRKNLSKY--SDEKIRYYIVSEYGPKTFRAHYHVLFFYDEVKTQKVMSKVIRQAWQFGR  201

Query  254  SD  255
             D
Sbjct  202  VD  203


>gi|575094340|emb|CDL65724.1| unnamed protein product [uncultured bacterium]
Length=486

 Score = 85.9 bits (211),  Expect = 2e-14, Method: Compositional matrix adjust.
 Identities = 82/290 (28%), Positives = 125/290 (43%), Gaps = 29/290 (10%)

Query  14   CQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMR-VKTAGSAFKYSYFVTLTYDNEH  72
            C +R  +TN+Y G    VDCG C  C+ ++A K+  + +   G  + +  FVTLTYDNEH
Sbjct  6    CTNRIKVTNKYVGRSFYVDCGHCPSCLQRKANKSCCKIINEYGRPYSFMCFVTLTYDNEH  65

Query  73   IPLMRCKVLHSEYEDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQGTVPY  132
            IP                    IH   +Y H      Y    S        E +   V  
Sbjct  66   IPY-------------------IHPDTDYSHLYVGKSYYVRHSRIFDKDGVENLPLGVYR  106

Query  133  DREIKEYVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQNYI  192
            + ++ + V + +   +  +  R+++  T  +             DN +  L   D  N++
Sbjct  107  NGKLIDTVFLPE---MPKEVFRNYLCNTTGIVTKSRNGVVLERDDNKVGILYDKDFVNFV  163

Query  193  KRLRKYLFKVLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVA-EVLRQCHDKSWKF  251
            KRLR  L +       + ++   EYGP   RPHFH + + +S  ++ +  R    +SWK 
Sbjct  164  KRLRINLTRNYNYEGKITYFKCSEYGPTTNRPHFHGIFWFDSRALSFDSFRSAVVESWKM  223

Query  252  GRSDFQ----RsaggsasyvssyvNSLCSAPLLYRSCRAFRPKSRASVGF  297
               D Q      A   A+YV+SYVN L S P L+   +  RPK   S GF
Sbjct  224  CDKDKQYENVEIAREPATYVASYVNCLTSVPPLFLF-KGLRPKHSHSKGF  272


>gi|575094355|emb|CDL65737.1| unnamed protein product [uncultured bacterium]
Length=517

 Score = 84.0 bits (206),  Expect = 8e-14, Method: Compositional matrix adjust.
 Identities = 97/348 (28%), Positives = 143/348 (41%), Gaps = 89/348 (26%)

Query  14   CQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFVTLTYDNEHI  73
            C     + N Y    + V CG+C  C   +A +  ++++   S  K+  F TLTY N +I
Sbjct  11   CLEPKRVFNPYLNDWLLVPCGKCRACQCSKASRYKLQIQLEASQHKFCIFGTLTYANTYI  70

Query  74   PLMRCKVLHSEYEDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQGTVPYD  133
            P +                            +P ++             F  V G    D
Sbjct  71   PRLSL--------------------------VPYNDKT-----------FGVVNGYEMCD  93

Query  134  REIKEYVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQNYIK  193
            +E  EY+   D+   S D + S + K    G               +P+L   D+Q +IK
Sbjct  94   KETGEYLGYLDS--PSYD-VESLLDKLHLFGD--------------VPYLRKRDLQLFIK  136

Query  194  RLRKYLFKVLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNS-------------------  234
            RLRK L K   S   + ++A+GEYGPVHFRPH+H LLF +                    
Sbjct  137  RLRKNLSKY--SDAKVRYFAMGEYGPVHFRPHYHFLLFFDEIKFTAPSGHTLGEFPDWAW  194

Query  235  ---------DEVAEVLRQCHDKSWKFGRSDFQRsaggsasyvssyvNSLCSAPLLYR--S  283
                      ++  V+  C   SWKFGR D Q S G +A YVSSYV+   S P +Y+  S
Sbjct  195  YDSQNKCSRSDILSVVEYCIRSSWKFGRVDAQYSKGDAAQYVSSYVSGSGSLPKVYQVSS  254

Query  284  CRAFRPKSR-ASVGFFEKGCDFVEDEDPYAQIEKKIDSVVNGRCYNFN  330
             R F   SR    GF    C+ V +      +++ ++  +NG   +FN
Sbjct  255  ARPFSLHSRFLGQGFLAHECEKVYETPVRDFVKRSVE--LNGSNKDFN  300


>gi|490418708|ref|WP_004291031.1| hypothetical protein [Bacteroides eggerthii]
 gi|217986635|gb|EEC52969.1| hypothetical protein BACEGG_02720 [Bacteroides eggerthii DSM 
20697]
Length=422

 Score = 81.6 bits (200),  Expect = 3e-13, Method: Compositional matrix adjust.
 Identities = 64/190 (34%), Positives = 95/190 (50%), Gaps = 15/190 (8%)

Query  157  IHKTQALGKTDYPVAE------QYGRDNLIPFLNYVDVQNYIKRLRKYLFKVLGSYESLH  210
            +   + LG+ D  + E      ++     +P+L   D+Q + KR R Y+ K     E + 
Sbjct  12   VETGEYLGEADLSIKEIERLQEKFHLFGYLPYLRKFDLQLFFKRFRYYVAKRFPK-EKVR  70

Query  211  FYAVGEYGPVHFRPHFHLLLFTNSDEVAEVLRQCHDKSWKFGRSDFQRsaggsasyvssy  270
            ++A+GEYGPVHFRPH+H+LLF  SDE  +V  +   ++W FGR D Q S G  +SYV+ Y
Sbjct  71   YFAIGEYGPVHFRPHYHILLFLQSDEALQVCSKVVSEAWPFGRVDCQLSKGKCSSYVAGY  130

Query  271  vNSLCSAP---LLYRSCRAFRPKSRASVGFFEKGCDFVEDEDPYAQIEKKIDSVVNGRCY  327
            VNS    P    L   C       +   GF +     V    P   +++ I  V+NGR  
Sbjct  131  VNSSVLVPKVLTLPTLCPFCVHSQKLGQGFLQSERAKVYSLTPEQFVKRSI--VINGRYK  188

Query  328  NFNGVSVWST  337
             F+   VW +
Sbjct  189  EFD---VWRS  195


>gi|575094322|emb|CDL65709.1| unnamed protein product [uncultured bacterium]
Length=499

 Score = 79.7 bits (195),  Expect = 2e-12, Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 106/248 (43%), Gaps = 51/248 (21%)

Query  26   GARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFVTLTYDNEHIPLMRCKVLHSEY  85
            G    V CG+C  C + +    S++++      KY YF+TLTYD++++PL          
Sbjct  19   GYPYQVPCGKCIACHNNKRSSLSLKLRLEEYTSKYCYFLTLTYDDDNLPLFS--------  70

Query  86   EDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQGTVPYDREIKEYVPVKDN  145
               VG+        E+    P SE   +DS        +       +D +  + +    +
Sbjct  71   ---VGLDT---CATEFVRIYPYSERLRNDS-----FISDFCSDLHNFDNDFVDKMDYYSD  119

Query  146  WFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQNYIKRLRKYLFKVLGS  205
            + ++ +   S  HK+   G              L   L Y D+Q ++KRLRK+++K  G 
Sbjct  120  YVINYE---SKYHKSCVYGH------------GLYALLYYRDIQLFLKRLRKHIYKYYG-  163

Query  206  YESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVAEVLRQCHDKS---------------WK  250
             E + FY +GEYG    RPH+H LLF NS  +++    C +                 W+
Sbjct  164  -EKIRFYIIGEYGTKSLRPHWHCLLFFNSSSLSQAFEDCVNVGTTSRPCSCPRFLRPFWQ  222

Query  251  FGRSDFQR  258
            FG  D +R
Sbjct  223  FGICDSKR  230


>gi|575095229|emb|CDL66433.1| unnamed protein product [uncultured bacterium]
Length=510

 Score = 73.6 bits (179),  Expect = 1e-10, Method: Compositional matrix adjust.
 Identities = 69/278 (25%), Positives = 124/278 (45%), Gaps = 35/278 (13%)

Query  20   ITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGS-AFKYSYFVTLTYDNEHIPLMRC  78
            I NRY        CG+C  C+  ++ K    +    S A     F+ LTYD EH+PL+R 
Sbjct  20   IGNRYFA------CGRCSACLLAKSNKNRYNLTLELSNATTKCCFIMLTYDKEHLPLVR-  72

Query  79   KVLHSEYEDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQGTVPYDREIKE  138
                        IS   H  D  ++  P+++ + +  +     FF Q+     Y++++ +
Sbjct  73   ------------ISK--HDFDAMYYKKPINKPEYEKRN-----FFCQLS----YEKQLSK  109

Query  139  YVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQNYIKRLRKY  198
               + +             +    L ++ Y  +       ++P L YVDV  ++KRLR  
Sbjct  110  ITSLSNRKVFKSAYSSQSGYSMSTLFESGYNNSVHTDCYYMLPTLRYVDVSGFLKRLRTR  169

Query  199  LFKVLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVAEVLRQCHDKSWKFGRSDFQR  258
            + + +G   ++ F A GEYGP  FRPH+H+++   S+   + + + +   W +G S   +
Sbjct  170  VQREIGE-SNIRFAACGEYGPRGFRPHYHIIVICQSEAARQSVMRNYRTCWLYGLSS-AK  227

Query  259  saggsasyvssyvNSLCSAPLLYR--SCRAFRPKSRAS  294
                S +      N +  +PLL +  + + FRP  R+S
Sbjct  228  LYIKSKNSADYVSNYVTCSPLLPKLYTYKPFRPFFRSS  265


>gi|490477382|ref|WP_004347759.1| hypothetical protein [Prevotella buccalis]
 gi|281300711|gb|EFA93042.1| hypothetical protein HMPREF0650_1078 [Prevotella buccalis ATCC 
35310]
Length=582

 Score = 72.8 bits (177),  Expect = 3e-10, Method: Compositional matrix adjust.
 Identities = 68/258 (26%), Positives = 111/258 (43%), Gaps = 45/258 (17%)

Query  5    PDLLKAADHCQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFV  64
            P  LK   +C +   + N      +   C +C  C++++A   S R +      KYS F 
Sbjct  4    PSHLKIVGNCLNPRKVYNPSLHGWMYCSCDKCTACLNQKATTLSNRARAEIEQHKYSVFF  63

Query  65   TLTYDNEHIPLMRCKVLHSEYEDVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFE  124
            TLTYDNEH+P         +YE    +  D    +E   Y P+     DDSS+      +
Sbjct  64   TLTYDNEHLP---------KYE----VFQD---SNEVIQYRPIGRL-VDDSSS------D  100

Query  125  QVQGTVPYDREIKEYVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLN  184
             +  + P                  I+   ++ +  Q    T  P  E Y        + 
Sbjct  101  MLSNSCP------------------INKYNNYENLYQFDESTFIPPIENYEDIYHFGVVC  142

Query  185  YVDVQNYIKRLRKYLFKV--LGSYES-LHFYAVGEYGPVHFRPHFHLLLFTNSDEVAEVL  241
              D+QN++KRLR  + K+  +   ES + +Y   EYGP  +RPH+H +LF +S ++ + +
Sbjct  143  KKDIQNFLKRLRWRISKIPNITKDESKIRYYISSEYGPTTYRPHYHGILFFDSKKILDKI  202

Query  242  RQCHDKSW-KFGRSDFQR  258
            +     SW K+ R   +R
Sbjct  203  KSLIVMSWGKYERQQGER  220


>gi|517172763|ref|WP_018361581.1| hypothetical protein [Prevotella nanceiensis]
Length=598

 Score = 70.9 bits (172),  Expect = 1e-09, Method: Compositional matrix adjust.
 Identities = 59/240 (25%), Positives = 98/240 (41%), Gaps = 50/240 (21%)

Query  14   CQHRSFITNRYTGARIAVDCGQCDYCIHKRAQKASMRVKTAGSAFKYSYFVTLTYDNEHI  73
            C   S+I N+ TG    V C  C YC++  A K S RV+       +S   TLTYDN +I
Sbjct  26   CLSPSYIYNKNTGHYETVPCHNCTYCVNVEASKQSRRVREEIKQHLFSVMFTLTYDNVYI  85

Query  74   PLMRCKV-LHSEYE-DVVGISGDIHFGDEYHHYIPVSEYQCDDSSALRHIFFEQVQGTVP  131
            P M      H E +   +G + D+H    ++      +Y+ +D + +             
Sbjct  86   PRMEAFAGKHGEMQLKPIGRTADLHDSCPFNSKNYNGDYRFNDDTRI-------------  132

Query  132  YDREIKEYVPVKDNWFLSIDAIRSFIHKTQALGKTDYPVAEQYGRDNLIPFLNYVDVQNY  191
                                    +I   +   K +   A    +D          +QN+
Sbjct  133  -----------------------PWIENNKIYCKNNLQFATVSKKD----------IQNF  159

Query  192  IKRLRKYLFK--VLGSYESLHFYAVGEYGPVHFRPHFHLLLFTNSDEVAEVLRQCHDKSW  249
            +KRLRK + K  +  + + + ++   EYGP  +RPH+H +LF +S  V   ++    +SW
Sbjct  160  LKRLRKKIDKLNIPQNEKKIRYFIASEYGPKTYRPHYHGVLFIDSPTVLSKIKAFIVESW  219



Lambda      K        H        a         alpha
   0.325    0.139    0.427    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 4086221757660