



           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-12_CDS_annotation_glimmer3.pl_2_3

Length=613
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|547226430|ref|WP_021963493.1|  putative uncharacterized protein      426   6e-138
gi|490418709|ref|WP_004291032.1|  hypothetical protein                  416   3e-134
gi|575094354|emb|CDL65742.1|  unnamed protein product                   415   4e-133
gi|496050829|ref|WP_008775336.1|  hypothetical protein                  409   2e-131
gi|494822885|ref|WP_007558293.1|  hypothetical protein                  362   7e-113
gi|575094321|emb|CDL65708.1|  unnamed protein product                   259   1e-73
gi|517172762|ref|WP_018361580.1|  hypothetical protein                  204   4e-54
gi|647452987|ref|WP_025792807.1|  hypothetical protein                  194   1e-50
gi|494308783|ref|WP_007173938.1|  hypothetical protein                  193   3e-50
gi|575094339|emb|CDL65730.1|  unnamed protein product                   192   6e-50


>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
 gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573

 Score =   426 bits (1095),  Expect = 6e-138, Method: Compositional matrix adjust.
 Identities = 268/624 (43%), Positives = 359/624 (58%), Gaps = 62/624 (10%)

Query  1    MASYTGMSNLQNHPHRSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQP  60
            M+S   ++ L+N   R+GFD+  KNAFTAKVGELLP+      PGDK+    + FTRTQP
Sbjct  1    MSSVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQP  60

Query  61   VETSAYTRLREYFDFYAVPLRLLWKSAPSVLTQMQDVNKIQALSLTQNLSLGTFLPSIPL  120
            V ++AY+RLREY+DFY VP RLLW  AP+  T M D +    L  + NLS        P 
Sbjct  61   VNSAAYSRLREYYDFYFVPYRLLWNMAPTFFTNMPDPHHAADLVSSVNLS-----QRHPW  115

Query  121  SVLSDAM-YLLNGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGR-WWS  178
                D M YL N  S +       KN FGF R +L  KLL+YL YG        G+ + S
Sbjct  116  FTFFDIMEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYG-------FGKDYES  168

Query  179  TSVSSKNDASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvsp  238
              V S +D         +  ++ FPLLAYQKI +D+FR  QW+++ P  YN+DY  G S 
Sbjct  169  VKVPSDSD---------DIVLSPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSS  219

Query  239  slissllsvspDYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLG  298
                 + S + D +K+ TMFDL YCN+ KD   G+LP +Q+GDV+V   P  GD +    
Sbjct  220  GFHIPMSSFTNDAFKNPTMFDLNYCNFQKDYFTGMLPRAQYGDVSVAS-PIFGDLD----  274

Query  299  TDSHKSSVGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAE  358
                   +G +S++T  +AP           N I     +  + S+  +  +VLALRQAE
Sbjct  275  -------IGDSSSLTFASAP-------QQGANTIQSGVLVVNNNSNTTAGLSVLALRQAE  320

Query  359  ALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEG  418
             LQ+W+EI+QSG  DY+ Q++KHF V     LS  C Y+GG + NLDISEVVN NL T  
Sbjct  321  CLQKWREIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDISEVVNTNL-TGD  379

Query  419  DTAVIAGKGVGAGNGS-FEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIP  477
            + A I GKG G  NG+  ++ ++EH ++MCIYH +PLLD+++     Q   T      IP
Sbjct  380  NQADIQGKGTGTLNGNKVDFESSEHGIIMCIYHCLPLLDWSINRIARQNFKTTFTDYAIP  439

Query  478  EFDNIGMEVL-PMTQVFN-----SPKASIVNLFNAGYNPRYFNWKTKLDVINGAFTTTLK  531
            EFD++GM+ L P   +F      S  +SI    N GY PRY + KT +D I+G+F  TL 
Sbjct  440  EFDSVGMQQLYPSEMIFGLEDLPSDPSSI----NMGYVPRYADLKTSIDEIHGSFIDTLV  495

Query  532  SWVSPVTESLLSGW--FCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDT  589
            SWVSP+T+S +S +   C    KD    D  + M Y FFKVNP ++D IFGV ADST +T
Sbjct  496  SWVSPLTDSYISAYRQAC----KDAGFSD--ITMTYNFFKVNPHIVDNIFGVKADSTINT  549

Query  590  DQLLVNSYIGCYVVRNLSRDGVPY  613
            DQLL+NSY     VRN   +G+PY
Sbjct  550  DQLLINSYFDIKAVRNFDYNGLPY  573


>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
 gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 
20697]
Length=578

 Score =   416 bits (1070),  Expect = 3e-134, Method: Compositional matrix adjust.
 Identities = 251/622 (40%), Positives = 344/622 (55%), Gaps = 53/622 (9%)

Query  1    MASYTGMSNLQNHPHRSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQP  60
            MA+   + +++N P R+GFD+  K  FTAK GELLPV     +PGD ++ N++ FTRTQP
Sbjct  1    MANIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLKAFTRTQP  60

Query  61   VETSAYTRLREYFDFYAVPLRLLWKSAPSVLTQMQDVNKIQALSL--TQNLSLGTFLPSI  118
            V T+A+ R+REY+DF+ VP  LLW  A +VLTQM D N   A+S+  T+N  L   +P +
Sbjct  61   VNTAAFARIREYYDFFFVPYDLLWNKANTVLTQMYD-NPQHAVSIDPTRNFVLSGEMPYM  119

Query  119  PLSVLSDAMYLLNGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGRWWS  178
                ++     +N  S     +    N FG++RS    KLL YLGYGN   S  T  W  
Sbjct  120  TSEAIAS---YINALSTASALADYKSNYFGYNRSKSSVKLLEYLGYGNY-ESFLTDDW--  173

Query  179  TSVSSKNDASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvsp  238
                       T   + N   N+F LLAYQKIY DF+R SQWE  +PS++NVDY  G   
Sbjct  174  ----------NTAPLMANLNHNIFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYLDG---  220

Query  239  slissllsvspDYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLG  298
            S ++   + S +++++   FDL+YCNW KD+  GVLP+ Q+G+ AV  I       + L 
Sbjct  221  SSMNLDNAYSTEFYQNYNFFDLRYCNWQKDLFHGVLPHQQYGETAVASITPDVTGKLTL-  279

Query  299  TDSHKSSVGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAE  358
              S+ S+VG +    S TA   L A D   +                    ++L LRQAE
Sbjct  280  --SNFSTVGTSPTTASGTATKNLPAFDTVGD-------------------LSILVLRQAE  318

Query  359  ALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLATEG  418
             LQ+WKEI+QSG+ DY++Q+ KH+GV +    S +CTY+GGVS ++DI+EV+N N+ T  
Sbjct  319  FLQKWKEITQSGNKDYKDQLEKHWGVSVGDGFSELCTYLGGVSSSIDINEVINTNI-TGS  377

Query  419  DTAVIAGKGVGAGNGSFEYTTT-EHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIP  477
              A IAGKGVG  NG   + +   + ++MCIYH +PLLDYT    D   L  ++    IP
Sbjct  378  AAADIAGKGVGVANGEINFNSNGRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIP  437

Query  478  EFDNIGMEVLPMTQVFNSPKASIVNL--FNAGYNPRYFNWKTKLDVINGAFTTTLKSWVS  535
            EFD +GM+ +P+ Q+ N P  S  N      GY PRY ++KT +D   G F  TL SWV 
Sbjct  438  EFDRVGMQSMPLVQLMN-PLRSFANASGLVLGYVPRYIDYKTSVDQSVGGFKRTLNSWVI  496

Query  536  PVTESLLSGWFCFGYNKDDAAPDTKV----IMNYKFFKVNPSVLDPIFGVNADSTWDTDQ  591
                  +        +     P   V     MN+ FFKVNP  LDPIF V A    +TDQ
Sbjct  497  SYGNISVLKQVTLPNDAPPIEPSEPVPSVAPMNFTFFKVNPDCLDPIFAVQAGDDTNTDQ  556

Query  592  LLVNSYIGCYVVRNLSRDGVPY  613
             L +S+     VRNL  DG+PY
Sbjct  557  FLCSSFFDIKAVRNLDTDGLPY  578


>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615

 Score =   415 bits (1066),  Expect = 4e-133, Method: Compositional matrix adjust.
 Identities = 251/646 (39%), Positives = 359/646 (56%), Gaps = 72/646 (11%)

Query  7    MSNLQNHPHRSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQPVETSAY  66
            M++++N P R+GFD+  K  FTAK GELLPV   + +PGD +  N+  FTRTQP+ TSA+
Sbjct  3    MADIKNRPSRNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTSAF  62

Query  67   TRLREYFDFYAVPLRLLWKSAPSVLTQMQ-DVNKIQALSLTQNLSLGTFLPSIPLSVLSD  125
             R+REY+DFY VP   +W    S +TQM  +V      +L  N  L   +P      ++D
Sbjct  63   ARMREYYDFYFVPFEQMWNKFDSCITQMNANVQHASGPTLDDNTPLSGRMPYFTSEQIAD  122

Query  126  AMYLLNGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGRWWSTSVSSKN  185
                LN ++     +++ KN FGF+RS L  KLL YLGYG+  S +S    WS       
Sbjct  123  ---YLNDQA-----TAARKNPFGFNRSTLTCKLLQYLGYGDYNSFDSETNTWSA------  168

Query  186  DASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvspslissll  245
                 +  + N  ++ FPLLAYQKIY DF+R++QWE +NPS++N+DY  G S   +    
Sbjct  169  -----KPLLYNLELSPFPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYIKGTSDLQMDLTG  223

Query  246  svspDYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAV------LDIPDSGDSNVV---  296
              S D       FD++YCN+ KDM  GVLP +Q+G  +V      L++  +GDS  +   
Sbjct  224  LPSDD----NNFFDIRYCNYQKDMFHGVLPVAQYGSASVVPINGQLNVISNGDSGPIFKT  279

Query  297  ------------------LGTDSHKSSVGIASAITSKTAP-----FPLFALDASP--ENP  331
                              +G D+    V  ++    K+A      FP  A   S   ENP
Sbjct  280  STPDPGTPGTSYVTVGGNIGVDNRSFGVSGSTLNVGKSADPSGYGFPSNASTRSLLWENP  339

Query  332  IPINsklrldlsslksQFTVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALS  391
              I        ++      +LALRQAE LQ+WKE+S SG+ DY+ QI KH+G+K+   LS
Sbjct  340  NLI------IENNQGFYVPILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKVSDFLS  393

Query  392  NMCTYIGGVSRNLDISEVVNNNLATEGDTAVIAGKGVGAGNGSFEYTTT-EHCVVMCIYH  450
            +   Y+GG + +LDI+EV+NNN+ T  + A IAGKG   GNGS  + +  E+ ++MCIYH
Sbjct  394  HQARYLGGCATSLDINEVINNNI-TGDNAADIAGKGTFTGNGSIRFESKGEYGIIMCIYH  452

Query  451  AVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQVFNSPKASIVNLFNA--GY  508
             +P++DY  +G D    + DA S PIPE D IGME +P+ +  N  K S     +   GY
Sbjct  453  VLPIVDYVGSGVDHSCTLVDATSFPIPELDQIGMESVPLVRAMNPVKESDTPSADTFLGY  512

Query  509  NPRYFNWKTKLDVINGAFTTTLKSWVSPVTESLLSGWFCFGYNKD-DAAPDTKVIMNYKF  567
             PRY +WKT +D   G F  +L++W  PV +  L+      +  + +  PD+   +   F
Sbjct  513  APRYIDWKTSVDRSVGDFADSLRTWCLPVGDKELTSANSLNFPSNPNVEPDS---IAAGF  569

Query  568  FKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVPY  613
            FKVNPS++DP+F V ADST  TD+ L +S+    VVRNL  +G+PY
Sbjct  570  FKVNPSIVDPLFAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGLPY  615


>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580

 Score =   409 bits (1051),  Expect = 2e-131, Method: Compositional matrix adjust.
 Identities = 254/621 (41%), Positives = 356/621 (57%), Gaps = 49/621 (8%)

Query  1    MASYTGMSNLQNHPHRSGFDIGRKNAFTAKVGELLPVY-WDISMPGDKYRFNIEYFTRTQ  59
            MA+   + +L+N   R+GFD+  K  FTAK GELLPV  W++ +PGDK+  +++ FTRTQ
Sbjct  1    MANIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEV-LPGDKWSIDLKSFTRTQ  59

Query  60   PVETSAYTRLREYFDFYAVPLRLLWKSAPSVLTQMQDVNKIQALSL--TQNLSLGTFLPS  117
            P+ T+A+ R+REY+DFY VP  LLW  A +VLTQM D N   A S   + N +L   +P+
Sbjct  60   PLNTAAFARMREYYDFYFVPYNLLWNKANTVLTQMYD-NPQHATSYIPSANQALAGVMPN  118

Query  118  IPLSVLSDAMYLLNGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGRWW  177
            +    ++D + L+     T   +S  KN FG+ RS    KLL YLGYGN           
Sbjct  119  VTCKGIADYLNLVAPDVTT--TNSYEKNYFGYSRSLGTAKLLEYLGYGNFY---------  167

Query  178  STSVSSKNDASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvs  237
             T  +SKN+         N  +N++ +LAYQKIY D  R SQWE  +PS +NVDY +G  
Sbjct  168  -TYATSKNNTWTKSPLSSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCFNVDYLSGTV  226

Query  238  pslissllsvspD-YWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVV  296
             S ++    ++   +     MFDL+YCNW KD+  GVLP  Q+GD A +++     SNV+
Sbjct  227  DSAMTIDSMITGQGFAPFYNMFDLRYCNWQKDLFHGVLPRQQYGDTAAVNV---NLSNVL  283

Query  297  LGTDSHKSSVGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslks-QFTVLALR  355
                               +A + +   D  P    P +S      +   S  FTVLALR
Sbjct  284  -------------------SAQYMVQTPDGDPVGGSPFSSTGVNLQTVNGSGTFTVLALR  324

Query  356  QAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLA  415
            QAE LQ+WKEI+QSG+ DY++QI KH+ V + +A S M  Y+GG + +LDI+EVVNNN+ 
Sbjct  325  QAEFLQKWKEITQSGNKDYKDQIEKHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNI-  383

Query  416  TEGDTAVIAGKGVGAGNGSFEYTTTE-HCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESL  474
            T  + A IAGKGV  GNG   +   E + ++MCIYH++PLLDYT    +      ++   
Sbjct  384  TGSNAADIAGKGVVVGNGRISFDAGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDF  443

Query  475  PIPEFDNIGMEVLPMTQVFNSPKASIVNLFNA--GYNPRYFNWKTKLDVINGAFTTTLKS  532
             IPEFD +GME +P+  + N P  S  N+ ++  GY PRY ++KT +D   GAF TTLKS
Sbjct  444  AIPEFDRVGMESVPLVSLMN-PLQSSYNVGSSILGYAPRYISYKTDVDSSVGAFKTTLKS  502

Query  533  WVSPVTESLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQL  592
            WV       +     +   +DD       ++NY  FKVNP+ +DP+F V A ++ DTDQ 
Sbjct  503  WVMSYDNQSVINQLNY---QDDPNNSPGTLVNYTNFKVNPNCVDPLFAVAASNSIDTDQF  559

Query  593  LVNSYIGCYVVRNLSRDGVPY  613
            L +S+    VVRNL  DG+PY
Sbjct  560  LCSSFFDVKVVRNLDTDGLPY  580


>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
 gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 
17135]
Length=613

 Score =   362 bits (929),  Expect = 7e-113, Method: Compositional matrix adjust.
 Identities = 230/634 (36%), Positives = 347/634 (55%), Gaps = 49/634 (8%)

Query  1    MASYTGMSNLQNHPHRSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQP  60
            MA+   M +++N P R+G+D+ +K  FTAK G L+PV+W   +P D     ++ F RTQP
Sbjct  8    MANIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQP  67

Query  61   VETSAYTRLREYFDFYAVPLRLLWKSAPSVLTQMQDVNKIQALS--LTQNLSLGTFLPSI  118
            + T+A+ R+R YFDFY VP R +W   P+ +TQM+  N + A    L  N+ L   LP  
Sbjct  68   LNTAAFARMRGYFDFYFVPFRQMWNKFPTAITQMR-TNLLHASGPVLADNVPLSDELPYF  126

Query  119  PLSVLSDAMYLLNGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGRWWS  178
                ++D +  L          +  KN FG+ R+ L   +L YLGYG+          + 
Sbjct  127  TAEQVADYIVSL----------ADSKNQFGYYRAWLVCIILEYLGYGDFYP-------YI  169

Query  179  TSVSSKNDASYTQRYIQNNY-VNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvs  237
               +    A++  R + NN   + FPL AYQKIY DF R++QWE SNPS++N+DY +G  
Sbjct  170  VEAAGGEGATWATRPMLNNLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYISG--  227

Query  238  pslissllsvspDYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVL  297
             +    L      +  S  +FD++Y NW +D+L G +P +Q+G+ +   +P SG   VV 
Sbjct  228  SADSLQLDFTVEGFKDSFNLFDMRYSNWQRDLLHGTIPQAQYGEASA--VPVSGSMQVVE  285

Query  298  GTDSHKSSVG------IASAITSKTAPFPLFALDASPENPI-PINsklrldlsslksQF-  349
            G      + G      +   +T + +   L A  +  E+ I   N+     +    S F 
Sbjct  286  GPTPPAFTTGQDGVAFLNGNVTIQGSSGYLQAQTSVGESRILRFNNTNSGLIVEGDSSFG  345

Query  350  -TVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISE  408
             ++LALR+AEA Q+WKE++ + + DY  QI  H+G  + +A S+MC ++G ++ +L I+E
Sbjct  346  VSILALRRAEAAQKWKEVALASEEDYPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINE  405

Query  409  VVNNNLATEGDTAVIAGKGVGAGNGSFEYTT-TEHCVVMCIYHAVPLLDYTLTGQDGQLL  467
            VVNNN+  E + A IAGKG  +GNGS  +    ++ +VMC++H +P LDY  +       
Sbjct  406  VVNNNITGE-NAADIAGKGTMSGNGSINFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTT  464

Query  468  VTDAESLPIPEFDNIGMEVLPMTQVFNSPKAS------IVNLFNAGYNPRYFNWKTKLDV  521
            +T+    PIPEFD IGME +P+ +  N  K          NL+  GY P+Y+NWKT LD 
Sbjct  465  LTNVLDFPIPEFDKIGMEQVPVIRGLNPVKPKDGDFKVSPNLY-FGYAPQYYNWKTTLDK  523

Query  522  INGAFTTTLKSWVSPV-TESLLSG-WFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIF  579
              G F  +LK+W+ P   E+LL+     F  N +  A   K      FFKV+PSVLD +F
Sbjct  524  SMGEFRRSLKTWIIPFDDEALLAADSVDFPDNPNVEADSVKA----GFFKVSPSVLDNLF  579

Query  580  GVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVPY  613
             V A+S  +TDQ L ++     VVR+L  +G+PY
Sbjct  580  AVKANSDLNTDQFLCSTLFDVNVVRSLDPNGLPY  613


>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642

 Score =   259 bits (663),  Expect = 1e-73, Method: Compositional matrix adjust.
 Identities = 214/658 (33%), Positives = 317/658 (48%), Gaps = 76/658 (12%)

Query  6    GMSNLQNHPHRSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQPVETSA  65
            G+  L+N P R+ FD+  +N FTAKVGELLP +     PGD  + +  YFTRT P++++A
Sbjct  9    GLHGLKNKPSRNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPLQSNA  68

Query  66   YTRLREYFDFYAVPLRLLWKSAPSVLTQMQ------DVNKIQALSLTQNLSLGTFLPSIP  119
            +TRLRE   ++ VP   LWK   S +  M       D+++I A SL  N  + T +P + 
Sbjct  69   FTRLRENVQYFFVPYSALWKYFDSQVLNMTKNANGGDISRI-ASSLVGNQKVTTQMPCVN  127

Query  120  LSVLSDAMYLLNGRSWTPGNSSSLKNMF--GFDRSDLCYKLLSYLGYGNLISSESTGRWW  177
               L   +     RS T G+  S+   F  G  R     KLL  LGYGN     +  +  
Sbjct  128  YKTLHAYLLKFINRS-TVGSDGSVGPEFNRGCYRHAESAKLLQLLGYGNFPEQFANFKVN  186

Query  178  STSVSSKNDASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvs  237
            +   +          Y  + Y+++F LLAY KI  D + + QW+  N S  NVDY T  S
Sbjct  187  NDKHNQSGQNFKDVTYNNSPYLSIFRLLAYHKICNDHYLYRQWQPYNASLCNVDYLTPNS  246

Query  238  pslissllsvsp---DYWKSG--TMFDLKYCNWNKDMLMGVLPNSQFGDVAV--LDIPDS  290
             SL+S   ++     D  K+    + D+++ N   D   GVLP SQFG  +V  L++ ++
Sbjct  247  SSLLSIDDALLSIPDDSIKAEKLNLLDMRFSNLPLDYFTGVLPTSQFGSESVVNLNLGNA  306

Query  291  GDSNVVLGTDSH-----KSSVG-------IASAITSK----TAPFPLFALDASPENPIPI  334
              S V+ GT S      +++ G       +AS+         +     + D +    + I
Sbjct  307  SGSAVLNGTTSKDSGRWRTTTGEWEMEQRVASSANGNLKLDNSNGTFISHDHTFSGNVAI  366

Query  335  NsklrldlsslksQFTVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMC  394
            N       +SL    +++ALR A A Q++KEI  + D D++ Q+  HFG+K P   +   
Sbjct  367  N-------TSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQVEAHFGIK-PDEKNENS  418

Query  395  TYIGGVSRNLDISEVVNNNLATEGDTAVIAGKG-VGAGNGSFEYTTTEHCVVMCIYHAVP  453
             +IGG S  ++I+E +N NL+  GD     G    G G+ S ++T   + VV+ IY   P
Sbjct  419  LFIGGSSSMININEQINQNLS--GDNKATYGAAPQGNGSASIKFTAKTYGVVIGIYRCTP  476

Query  454  LLDYTLTGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQVFNSPKASIVNLFNA-------  506
            +LD+   G D  L  TDA    IPE D+IGM+     +V  +  A   + F A       
Sbjct  477  VLDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEV--AAPAPYNDEFKAFRVGDGS  534

Query  507  --------GYNPRYFNWKTKLDVINGAFTTTLKSWVSPVTESLLSG--WFCF-GYNKDDA  555
                    GY PRY  +KT  D  NGAF  +LKSWV+ +    +    W  + G N    
Sbjct  535  SPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAIQNNVWNTWAGIN----  590

Query  556  APDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVVRNLSRDGVPY  613
            AP+         F   P ++  +F V++ +  D DQL V     CY  RNLSR G+PY
Sbjct  591  APN--------MFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLPY  640


>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568

 Score =   204 bits (520),  Expect = 4e-54, Method: Compositional matrix adjust.
 Identities = 178/621 (29%), Positives = 264/621 (43%), Gaps = 96/621 (15%)

Query  16   RSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQPVETSAYTRLREYFDF  75
            R+ FDI +++ FTA  G LLPV     +P D    N   F RT P+ ++A+  +R  ++F
Sbjct  18   RNAFDISQRHLFTAPAGALLPVLSLDLLPHDHVEINASDFMRTLPMNSAAFMSMRGVYEF  77

Query  76   YAVPLRLLWKSAPSVLTQMQDVNKIQALSLTQNLSLGTFLPSIPLSVLS-DAMYLLNGRS  134
            Y VP + LW      +T M D          ++  +  F    P S +S D   L++   
Sbjct  78   YFVPYKQLWSGFDQFITGMSDY---------KSSFMYAFKGKTPPSCVSFDVQKLVD---  125

Query  135  WTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGRWWSTSVSSKNDASYTQRYI  194
            W   N++  K++ GFD++   Y++L  LGYG   +S          V   N  S T    
Sbjct  126  WCKTNTA--KDIHGFDKNKGVYRILDLLGYGKYANS--------AGVPYTNPTSTTM---  172

Query  195  QNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvspslissllsvspDYWKS  254
                   F  LAYQKIY DF+R + +E     S+NVD F G      +         W  
Sbjct  173  --GKCTPFRGLAYQKIYNDFYRNTTYEEYQLESFNVDMFYGSGKVKETIPNEPWDYDW--  228

Query  255  GTMFDLKYCNWNKDMLMGVLP---------NSQFGDVAVLDIPDSGDSNVVLGTDSHKSS  305
               F L+Y N  KD+L  V P         N QF      DI      NV  GT  ++ S
Sbjct  229  ---FTLRYRNAQKDLLTNVRPTPLFSIDDFNPQFF-TGGSDIVMEKGPNVTGGTHEYRDS  284

Query  306  VGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAEALQRWKE  365
            V I                    EN +           S ++  +V  +R A AL++   
Sbjct  285  VVIVGKNLK--------------ENGV----------DSKRTMISVADIRNAFALEKLAS  320

Query  366  ISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNNLAT---------  416
            ++      Y+EQ+  HFG+ + +     CTYIGG   N+ + +V  ++  T         
Sbjct  321  VTMRAGKTYKEQMEAHFGISVEEGRDGRCTYIGGFDSNIQVGDVTQSSGTTVTGTKDTSF  380

Query  417  EGDTAVIAGKGVGAGNGSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPI  476
             G      GK  G+G+G   +   EH ++MCIY  VP + Y     D  +   +     +
Sbjct  381  GGYLGRTTGKATGSGSGHIRFDAKEHGILMCIYSLVPDVQYDSKRVDPFVQKIERGDFFV  440

Query  477  PEFDNIGMEVLPMTQVF-----NSPKASIVNLFNAGYNPRYFNWKTKLDVINGAFTTTLK  531
            PEF+N+GM+ L    +      N+  + I NL   G+ PRY  +KT LD+ +G F     
Sbjct  441  PEFENLGMQPLFAKNISYKYNNNTANSRIKNLGAFGWQPRYSEYKTALDINHGQF-----  495

Query  532  SWVSPVTESLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQ  591
                 V +  LS W         A  ++    N   FK+NP  LD +F VN + T  TDQ
Sbjct  496  -----VHQEPLSYWTV-----ARARGESMSNFNISTFKINPKWLDDVFAVNYNGTELTDQ  545

Query  592  LLVNSYIGCYVVRNLSRDGVP  612
            +    Y     V ++S DG+P
Sbjct  546  VFGGCYFNIVKVSDMSIDGMP  566


>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584

 Score =   194 bits (494),  Expect = 1e-50, Method: Compositional matrix adjust.
 Identities = 177/644 (27%), Positives = 300/644 (47%), Gaps = 120/644 (19%)

Query  16   RSGFDIGRKNAFTAKVGELLPV-YWDISMPGDKYRFNIEYFTRTQPVETSAYTRLREYFD  74
            R+GFD+  +  F+AK G+LLP+  W+++ P + ++F+++   RT  + T++Y R++EY+ 
Sbjct  10   RNGFDLSSRRIFSAKAGQLLPIGCWEVN-PSEHFKFSVQDLVRTTTLNTASYARMKEYYH  68

Query  75   FYAVPLRLLWKSAPSVLTQMQD----VNKIQALSLTQNLSLGTFLPSIPLSVLSDAMYLL  130
            F+ V  R LW+     +    +    +N ++    T    + + +P+  L          
Sbjct  69   FFFVSYRSLWQWFDQFIVGTNNPHSALNGVKKNGTTNYNQICSSVPTFDL----------  118

Query  131  NGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYG-----------NLISSESTGRWWST  179
             G+  T   +S + +  GF+ S+   KLL+ L YG           NLI+S S       
Sbjct  119  -GKLITRLKTSDMDSQ-GFNYSEGAAKLLNMLNYGVTNKGKFMNLENLITSTSY------  170

Query  180  SVSSKNDASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvsps  239
             + SK+D   +  Y     V+ F LLAYQKI+ DF+R   W  S+  S+NVD +   S  
Sbjct  171  -LPSKDDKEPSSIYACK--VSPFRLLAYQKIFNDFYRNQDWTPSDVRSFNVDDYADDSNL  227

Query  240  lissllsvspDYWKSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPD--SGDSNVVL  297
             I   +++            ++Y  + KD L  + P   + D  + ++P+   G+ NV+L
Sbjct  228  TIEPDVAL--------KFCQMRYRPYAKDWLTSMKPTPNYSD-GIFNLPEYVRGNGNVIL  278

Query  298  GTDSHKSSVGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQA  357
             T++   SV + S   S ++                               F+V  LR A
Sbjct  279  -TNNKSGSVSLDSGTVSPSS-------------------------------FSVNDLRAA  306

Query  358  EALQRWKEISQSGDS-DYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVV--NNNL  414
             AL +  E ++  +  DY  QI  HFG K+P++ +N   ++GG   ++ +SEVV  N N 
Sbjct  307  FALDKMLEATRRANGLDYASQIEAHFGFKVPESRANDARFLGGFDNSIVVSEVVSTNGNA  366

Query  415  ATEGDTAVI---AGKGVGA-GNGSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTD  470
            A++G  A I    GKG+G+  +G+ E+ +TEH ++MCIY   P  +Y  +  D       
Sbjct  367  ASDGSHASIGDLGGKGIGSMSSGTIEFDSTEHGIIMCIYSVAPQSEYNASYLDPFNRKLT  426

Query  471  AESLPIPEFDNIGMEVLPMTQV------FNSPKA--SIVNLFN--AGYNPRYFNWKTKLD  520
             E    PEF ++G + L  + +       N  +A  S + L N   GY  RY  +KT  D
Sbjct  427  REQFYQPEFADLGYQALIGSDLICSTLGMNEKQAGFSDIELNNNLLGYQVRYNEYKTARD  486

Query  521  VINGAFTT--TLKSWVSPVTESLLSGWFCFGYNKDDAAPDTKVIMNYKF-----------  567
            ++ G F +  +L  W +P  +      F +G  +   AP+ K   +Y+            
Sbjct  487  LVFGDFESGKSLSYWCTPRFD------FGYGDTEKKIAPENKGGADYRKKGNRSHWSSRN  540

Query  568  FKVNPSVLDPIFGVNADSTWDTDQLLVNSYIGCYVVRNLSRDGV  611
            F +NP++++PIF     S    D  +VNS++    VR +S  G+
Sbjct  541  FYINPNLVNPIF---LTSAVQADHFIVNSFLDVKAVRPMSVTGL  581


>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
 gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM 
17361]
Length=553

 Score =   193 bits (490),  Expect = 3e-50, Method: Compositional matrix adjust.
 Identities = 167/613 (27%), Positives = 266/613 (43%), Gaps = 92/613 (15%)

Query  15   HRSGFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQPVETSAYTRLREYFD  74
            +R+ FD+ +++ FTA  G LLPV     +P D    N + F RT P+ T+A+  +R  ++
Sbjct  16   NRNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASMRGVYE  75

Query  75   FYAVPLRLLWKSAPSVLTQMQDVNKIQALSLTQNLSLGTFLPSIPLSVLSDAMYLLN-GR  133
            F+ VP   LW      +T M D +     S  +++  GT    +P   +      LN G+
Sbjct  76   FFFVPYHQLWAQFDQFITGMNDFHS----SANKSIQGGTSPLQVPYFNVDSVFNSLNTGK  131

Query  134  SWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGRWWSTSVSS-KNDASYTQR  192
                G++  L+  F +      ++LL  LGYG     +S G  +  +VS  KN+  Y   
Sbjct  132  ESGSGSTDDLQYKFKYG----AFRLLDLLGYGRKF--DSFGTAYPDNVSGLKNNLDYN--  183

Query  193  YIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTGvspslissllsvspDYW  252
                   ++F +LAY KIYQD++R S +E  +  S+N D F G              D  
Sbjct  184  ------CSVFRILAYNKIYQDYYRNSNYENFDTDSFNFDKFKGGLV-----------DAK  226

Query  253  KSGTMFDLKYCNWNKDMLMGVLPNSQFGDVAVLDIPDSGDSNVVLGTDSHKSSVGIASAI  312
                +F L+Y N   D    +  +  F      +  D    N+ +    +  S G  S  
Sbjct  227  VVADLFKLRYRNAQTDYFTNLRQSQLFSFTTAFEDVD----NINIAPRDYVKSDG--SNF  280

Query  313  TSKTAPFPLFALDASPENPIPINsklrldlsslksQFTVLALRQAEALQRWKEISQSGDS  372
            T        F +D                       F+V +LR A A+ +   ++     
Sbjct  281  TRVN-----FGVDTDSSE----------------GDFSVSSLRAAFAVDKLLSVTMRAGK  319

Query  373  DYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDISEVVNNN--LATE-----GDTAVIAG  425
             +++Q+R H+GV++P +      Y+GG   ++ +S+V   +   ATE     G    +AG
Sbjct  320  TFQDQMRAHYGVEIPDSRDGRVNYLGGFDSDMQVSDVTQTSGTTATEYKPEAGYLGRVAG  379

Query  426  KGVGAGNGSFEYTTTEHCVVMCIYHAVPLLDYTLTGQDGQLLVTDAESLPIPEFDNIGME  485
            KG G+G G   +   EH V+MCIY  VP + Y  T  D  +   D      PEF+N+GM+
Sbjct  380  KGTGSGRGRIVFDAKEHGVLMCIYSLVPQIQYDCTRLDPMVDKLDRFDYFTPEFENLGMQ  439

Query  486  VLPMTQVFN----SPKASIVNLFNAGYNPRYFNWKTKLDVINGAFTTT--LKSWVSPVTE  539
             L  + + +     PK  ++     GY PRY  +KT LDV +G F  +  L SW    + 
Sbjct  440  PLNSSYISSFCTTDPKNPVL-----GYQPRYSEYKTALDVNHGQFAQSDALSSW----SV  490

Query  540  SLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLDPIFGVNADSTWDTDQLLVNSYIG  599
            S    W  F        P  ++      FK++P  L+ IF V+ + T   D +       
Sbjct  491  SRFRRWTTF--------PQLEIAD----FKIDPGCLNSIFPVDYNGTEANDCVYGGCNFN  538

Query  600  CYVVRNLSRDGVP  612
               V ++S DG+P
Sbjct  539  IVKVSDMSVDGMP  551


>gi|575094339|emb|CDL65730.1| unnamed protein product [uncultured bacterium]
Length=588

 Score =   192 bits (489),  Expect = 6e-50, Method: Compositional matrix adjust.
 Identities = 187/658 (28%), Positives = 277/658 (42%), Gaps = 124/658 (19%)

Query  7    MSNLQNHP-HRS-----GFDIGRKNAFTAKVGELLPVYWDISMPGDKYRFNIEYFTRTQP  60
            M+N+   P HR+     GFD+ +++ FT+ VG+LLPV++D   PGDK R +   FTRTQP
Sbjct  1    MANINQKPSHRANLSKNGFDMSQRHPFTSSVGQLLPVFYDYLNPGDKIRISANLFTRTQP  60

Query  61   VETSAYTRLREYFDFYAVPLRLLWKSAPSVLTQMQDVNKIQALSLTQNLSLGTFLPSIPL  120
            ++++A  RL E+ +++ VP   ++    SV   + D N   +L    NL++  F  S  +
Sbjct  61   MKSTAMARLTEHIEYFFVPFEQMFSLFGSVFYGIDDYNS-SSLVKHNNLTM-PFFKSDAV  118

Query  121  SVLSDAMYL-----LNGRSWTPGNSSSLKNMFGFDRSDLCYKLLSYLGYGNLISSESTGR  175
            S   +A Y      +N +  TP       +M G  R     +L   LGYG+L+ S     
Sbjct  119  SAALEAAYTSFSSSINRKVLTP-------DMMGQPRVYGILRLSEMLGYGSLLLSNDNNL  171

Query  176  WWSTSVSSKNDASYTQRYIQNNYVNLFPLLAYQKIYQDFFRWSQWEASNPSSYNVDYFTG  235
                 +S                  +F   AYQKI+ DF+R   + +    SYNVDY  G
Sbjct  172  LPHADMS------------------VFLFTAYQKIFNDFYRLDDYTSVQHKSYNVDYAQG  213

Query  236  vspslissllsvspDYWKSGTMFDLKYCNWNKDMLMGVLPN---------SQFGDVAVLD  286
               +                +MF+L Y  W KD    V+PN         S FG   + D
Sbjct  214  QPIT--------------DNSMFELHYRPWKKDYFTNVIPNPYFSSVDNKSSFGGAGLFD  259

Query  287  IPDSGDSNVVLGTDSHKSSVGIASAITSKTAPFPLFALDASPENPIPINsklrldlsslk  346
             P  G S      D     +   S +++     P+F         +P+N           
Sbjct  260  RP-VGLSITSFNFDG-SDFLQAPSDLSTMENNQPIF-------QELPVNLTSASSAG---  307

Query  347  sQFTVLALRQAEALQRWKEISQSGDSDYREQIRKHFGVKLPQALSNMCTYIGGVSRNLDI  406
               +V  LR   A  +   I+Q     Y  Q   HFG ++PQ +S    YIGG S+ L I
Sbjct  308  --LSVSDLRYLYATDKLLRITQFAGKHYDAQTLAHFGKRVPQGVSGEVYYIGGQSQPLQI  365

Query  407  SEVVNNNLATEGDTAVIAGKGVG--AGNG--------SFEYTTTEHCVVMCIYHAVPLLD  456
            S V   + AT  D+  + G  +G  AG G         F +    H V+M IY AVP  D
Sbjct  366  SSV--ESTATTFDSGDVVGSVLGELAGKGYSQTGNQKDFSFEAPCHGVLMAIYSAVPEAD  423

Query  457  YTLTGQDGQLLVTDAESLPIPEFDNIGMEVLPMTQVFNSPKASIVNLFNAGYNPRYFNWK  516
            Y     D    +  +     PEFD++GME  P  ++       + N    G+  RY   K
Sbjct  424  YLDERIDYLNTLIQSNDFYKPEFDSLGMEPFPNYEL--DQYRMVGNNSRLGWRYRYSGLK  481

Query  517  TKLDVINGAFTTTLKSWVSPVTESLLSGWFCFGYNKDDAAPDTKVIMNYKFFKVNPSVLD  576
            +K D+I+GAF  TL+ WV+   +S               A D     +  F  ++P+ LD
Sbjct  482  SKPDLISGAFKYTLRDWVAVRNDSRY-------------AEDESWWQSAAFMYIDPAYLD  528

Query  577  PIFGV-----------NADSTWD-----------TDQLLVNSYIGCYVVRNLSRDGVP  612
             IF +           +A+ T+D            D LL + YI CY    +S  G+P
Sbjct  529  NIFELSFTPRLYQQQDSANVTYDGTFIDRSLVYQRDPLLHDLYIKCYKSSAMSTYGLP  586



Lambda      K        H        a         alpha
   0.318    0.134    0.414    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 4597148648223