bitscore colors: <40, 40-50 , 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-16_CDS_annotation_glimmer3.pl_2_2

Length=579
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|547226430|ref|WP_021963493.1|  putative uncharacterized protein      294   1e-87
gi|496050829|ref|WP_008775336.1|  hypothetical protein                  279   6e-82
gi|490418709|ref|WP_004291032.1|  hypothetical protein                  271   8e-79
gi|494822885|ref|WP_007558293.1|  hypothetical protein                  228   8e-63
gi|494308783|ref|WP_007173938.1|  hypothetical protein                  189   6e-49
gi|575094321|emb|CDL65708.1|  unnamed protein product                   181   1e-45
gi|490477384|ref|WP_004347761.1|  capsid protein                        173   2e-43
gi|647452987|ref|WP_025792807.1|  hypothetical protein                  172   3e-43
gi|494306153|ref|WP_007173049.1|  hypothetical protein                  166   3e-41
gi|496521299|ref|WP_009229582.1|  capsid protein                        164   1e-40


>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
 gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573

 Score =   294 bits (752),  Expect = 1e-87, Method: Compositional matrix adjust.
 Identities = 211/614 (34%), Positives = 296/614 (48%), Gaps = 76/614 (12%)

Query  1    MSDFNPLNRARISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVP  60
            MS    L   + S  R+ FDLS K  FTAKVGE+LP   +   PG+K+ I    FTRT P
Sbjct  1    MSSVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQP  60

Query  61   VNTAAYTRIKEYYDFYAVPLRLISRALPQAFTQMTD--YItsaasstansstltsVPFVS  118
            VN+AAY+R++EYYDFY VP RL+    P  FT M D  +     SS   S       F  
Sbjct  61   VNSAAYSRLREYYDFYFVPYRLLWNMAPTFFTNMPDPHHAADLVSSVNLSQRHPWFTFFD  120

Query  119  QTIFNAFFQTANAGDQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKKYLG  178
               +     + +   +   ++  G   V  S KLL+ L YG              K Y  
Sbjct  121  IMEYLGNLNSLSGAYEKYQKNFFGFSRVELSVKLLNYLNYG------------FGKDYES  168

Query  179  VDSLSDADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGAGN--  236
            V   SD+D+ ++       +  P LAYQKI  D+F + QW+    Y YN+DY  G  +  
Sbjct  169  VKVPSDSDDIVL-------SPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYLYGKSSGF  221

Query  237  ----IGLVTD------MVQLRYANYPKDYFMGMLPSSQYGSVAVLPsissssdsrsLLYY  286
                     D      M  L Y N+ KDYF GMLP +QYG V+V                
Sbjct  222  HIPMSSFTNDAFKNPTMFDLNYCNFQKDYFTGMLPRAQYGDVSVA---------------  266

Query  287  EPsssataalqsaggssssvrlsqtvsssQGIRL-------NSDLSALSIRATEYLQRWK  339
             P         S+  + +S       +   G+ +        + LS L++R  E LQ+W+
Sbjct  267  SPIFGDLDIGDSSSLTFASAPQQGANTIQSGVLVVNNNSNTTAGLSVLALRQAECLQKWR  326

Query  340  EIVQFSSKDYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTNLDADSSQASIA  399
            EI Q    DY  QM   F +     +  H  Y+GGW+S ++I+EVVNTNL  D +QA I 
Sbjct  327  EIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDISEVVNTNLTGD-NQADIQ  385

Query  400  GKGISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQPAFDQL  459
            GKG  + +G+ + ++  +EH IIMC+YH +P+LDW++   A Q   T  +D+  P FD +
Sbjct  386  GKGTGTLNGNKVDFE-SSEHGIIMCIYHCLPLLDWSINRIARQNFKTTFTDYAIPEFDSV  444

Query  460  GMQSV-PS---LNLQNNPGRNVSGALGYNLRYWQWKSNIDTVHAGFRAGAAYQSWVAPL-  514
            GMQ + PS     L++ P    S  +GY  RY   K++ID +H  F       SWV+PL 
Sbjct  445  GMQQLYPSEMIFGLEDLPSDPSSINMGYVPRYADLKTSIDEIHGSFIDTLV--SWVSPLT  502

Query  515  DGW---------NVLTSSGAWSYQSMKVRPQQLNSIFVPQVDSANCSVAFDQLLCNVNFQ  565
            D +         +   S    +Y   KV P  +++IF  + DS   ++  DQLL N  F 
Sbjct  503  DSYISAYRQACKDAGFSDITMTYNFFKVNPHIVDNIFGVKADS---TINTDQLLINSYFD  559

Query  566  VYAVQNLDRNGLPY  579
            + AV+N D NGLPY
Sbjct  560  IKAVRNFDYNGLPY  573


>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
 gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580

 Score =   279 bits (714),  Expect = 6e-82, Method: Compositional matrix adjust.
 Identities = 202/620 (33%), Positives = 295/620 (48%), Gaps = 81/620 (13%)

Query  1    MSDFNPLNRARISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVP  60
            M++   L   R  T R+ FDLSSK+ FTAK GE+LP      +PG+K+ I    FTRT P
Sbjct  1    MANIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEVLPGDKWSIDLKSFTRTQP  60

Query  61   VNTAAYTRIKEYYDFYAVPLRLISRALPQAFTQMTDYItsaasstansstltsVPFVSQT  120
            +NTAA+ R++EYYDFY VP  L+        TQM D               + +P  +Q 
Sbjct  61   LNTAAFARMREYYDFYFVPYNLLWNKANTVLTQMYD---------NPQHATSYIPSANQA  111

Query  121  IFNAFFQTANAG----------DQPNT----RDDAGLPIVYGSCKLLDMLGYGSMIASNN  166
            +          G          D   T    ++  G     G+ KLL+ LGYG+      
Sbjct  112  LAGVMPNVTCKGIADYLNLVAPDVTTTNSYEKNYFGYSRSLGTAKLLEYLGYGNFYTYAT  171

Query  167  PSKAAITKKYLGVDSLSDADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAY  226
                  TK            +PL   ++  +N    LAYQKIY D   +SQWEK     +
Sbjct  172  SKNNTWTK------------SPL--SSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCF  217

Query  227  NVDYWSGAGNIGLVTD-------------MVQLRYANYPKDYFMGMLPSSQYGSVAVLPs  273
            NVDY SG  +  +  D             M  LRY N+ KD F G+LP  QYG  A +  
Sbjct  218  NVDYLSGTVDSAMTIDSMITGQGFAPFYNMFDLRYCNWQKDLFHGVLPRQQYGDTAAVNV  277

Query  274  issssdsrsLLYYEPsssataalqsaggssssvrlsqtvsssQGIRLNSDLSALSIRATE  333
              S+  S   +   P          +    +           Q +  +   + L++R  E
Sbjct  278  NLSNVLSAQYMVQTPDGDPVGGSPFSSTGVNL----------QTVNGSGTFTVLALRQAE  327

Query  334  YLQRWKEIVQFSSKDYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTNLDADS  393
            +LQ+WKEI Q  +KDY DQ+   + +   E     S Y+GG ++ ++INEVVN N+   S
Sbjct  328  FLQKWKEITQSGNKDYKDQIEKHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNITG-S  386

Query  394  SQASIAGKGISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQ  453
            + A IAGKG+   +G  +++D G  + +IMC+YH++P+LD+      P  T    +DF  
Sbjct  387  NAADIAGKGVVVGNGR-ISFDAGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDFAI  445

Query  454  PAFDQLGMQSVPSLNLQN--NPGRNV-SGALGYNLRYWQWKSNIDTVHAGFRAGAAYQSW  510
            P FD++GM+SVP ++L N      NV S  LGY  RY  +K+++D+    F+     +SW
Sbjct  446  PEFDRVGMESVPLVSLMNPLQSSYNVGSSILGYAPRYISYKTDVDSSVGAFK--TTLKSW  503

Query  511  VAPLDGWNVL----------TSSGAW-SYQSMKVRPQQLNSIFVPQVDSANCSVAFDQLL  559
            V   D  +V+           S G   +Y + KV P  ++ +F     +A+ S+  DQ L
Sbjct  504  VMSYDNQSVINQLNYQDDPNNSPGTLVNYTNFKVNPNCVDPLFAV---AASNSIDTDQFL  560

Query  560  CNVNFQVYAVQNLDRNGLPY  579
            C+  F V  V+NLD +GLPY
Sbjct  561  CSSFFDVKVVRNLDTDGLPY  580


>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
 gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 
20697]
Length=578

 Score =   271 bits (692),  Expect = 8e-79, Method: Compositional matrix adjust.
 Identities = 206/617 (33%), Positives = 292/617 (47%), Gaps = 77/617 (12%)

Query  1    MSDFNPLNRARISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVP  60
            M++   L   R    R+ FDLS KK FTAK GE+LP   +  +PG+ ++I+   FTRT P
Sbjct  1    MANIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLKAFTRTQP  60

Query  61   VNTAAYTRIKEYYDFYAVPLRLISRALPQAFTQMTDYItsaass--tansstltsVPFVS  118
            VNTAA+ RI+EYYDF+ VP  L+        TQM D    A S   T N      +P+++
Sbjct  61   VNTAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDNPQHAVSIDPTRNFVLSGEMPYMT  120

Query  119  Q----TIFNAFFQTANAGDQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITK  174
                 +  NA    +   D  +     G      S KLL+ LGYG+  +           
Sbjct  121  SEAIASYINALSTASALADYKSNY--FGYNRSKSSVKLLEYLGYGNYESF----------  168

Query  175  KYLGVDSLSDADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGA  234
                   L+D  N      +   N    LAYQKIY DF+ +SQWE+     +NVDY  G+
Sbjct  169  -------LTDDWNTAPLMANLNHNIFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYLDGS  221

Query  235  G---NIGLVTDMVQ------LRYANYPKDYFMGMLPSSQYGSVAVLPsissssdsrsLLY  285
                +    T+  Q      LRY N+ KD F G+LP  QYG  AV       +   +L  
Sbjct  222  SMNLDNAYSTEFYQNYNFFDLRYCNWQKDLFHGVLPHQQYGETAVASITPDVTGKLTLSN  281

Query  286  YEPsssataalqsaggssssvrlsqtvsssQGIRLNSDLSALSIRATEYLQRWKEIVQFS  345
            +              G+S +        +        DLS L +R  E+LQ+WKEI Q  
Sbjct  282  FS-----------TVGTSPTTASGTATKNLPAFDTVGDLSILVLRQAEFLQKWKEITQSG  330

Query  346  SKDYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTNLDADSSQASIAGKGISS  405
            +KDY DQ+   +G+   +       Y+GG SS I+INEV+NTN+   S+ A IAGKG+  
Sbjct  331  NKDYKDQLEKHWGVSVGDGFSELCTYLGGVSSSIDINEVINTNITG-SAAADIAGKGVGV  389

Query  406  NSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQPAFDQLGMQSVP  465
             +G  + ++    + +IMC+YH +P+LD+      P       +D+  P FD++GMQS+P
Sbjct  390  ANGE-INFNSNGRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMP  448

Query  466  SLNLQNNPGR---NVSG-ALGYNLRYWQWKSNIDTVHAGFRAGAAYQSWV----------  511
             + L  NP R   N SG  LGY  RY  +K+++D    GF+      SWV          
Sbjct  449  LVQLM-NPLRSFANASGLVLGYVPRYIDYKTSVDQSVGGFK--RTLNSWVISYGNISVLK  505

Query  512  --------APLDGWNVLTSSGAWSYQSMKVRPQQLNSIFVPQV-DSANCSVAFDQLLCNV  562
                     P++    + S    ++   KV P  L+ IF  Q  D  N     DQ LC+ 
Sbjct  506  QVTLPNDAPPIEPSEPVPSVAPMNFTFFKVNPDCLDPIFAVQAGDDTNT----DQFLCSS  561

Query  563  NFQVYAVQNLDRNGLPY  579
             F + AV+NLD +GLPY
Sbjct  562  FFDIKAVRNLDTDGLPY  578


>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
 gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 
17135]
Length=613

 Score =   228 bits (582),  Expect = 8e-63, Method: Compositional matrix adjust.
 Identities = 175/629 (28%), Positives = 300/629 (48%), Gaps = 73/629 (12%)

Query  1    MSDFNPLNRARISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVP  60
            M++   +   R    R+ +DL+ K  FTAK G ++P +W   +P +    +   F RT P
Sbjct  8    MANIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQP  67

Query  61   VNTAAYTRIKEYYDFYAVPLRLISRALPQAFTQMTDYItsaasstansstltsVPFVSQT  120
            +NTAA+ R++ Y+DFY VP R +    P A TQM   +  A+      +    VP   + 
Sbjct  68   LNTAAFARMRGYFDFYFVPFRQMWNKFPTAITQMRTNLLHASGPVLADN----VPLSDEL  123

Query  121  IFNAFFQTAN-AGDQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKKYLGV  179
             +    Q A+      ++++  G    +  C +L+ LGYG               +  G 
Sbjct  124  PYFTAEQVADYIVSLADSKNQFGYYRAWLVCIILEYLGYGDFYP--------YIVEAAGG  175

Query  180  DSLSDADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGAGNIGL  239
            +  + A  P++   +   +  P  AYQKIY DF   +QWE+     +N+DY SG+ +  L
Sbjct  176  EGATWATRPML--NNLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYISGSAD-SL  232

Query  240  VTD-----------MVQLRYANYPKDYFMGMLPSSQYGSVAVL--------------Psi  274
              D           +  +RY+N+ +D   G +P +QYG  + +              P+ 
Sbjct  233  QLDFTVEGFKDSFNLFDMRYSNWQRDLLHGTIPQAQYGEASAVPVSGSMQVVEGPTPPAF  292

Query  275  ssssdsrsLLYYEPsssataalqsaggssssvrlsqtvsssQGIRLNSD----LSALSIR  330
            ++  D  + L    +   ++    A  S    R+ +  +++ G+ +  D    +S L++R
Sbjct  293  TTGQDGVAFLNGNVTIQGSSGYLQAQTSVGESRILRFNNTNSGLIVEGDSSFGVSILALR  352

Query  331  ATEYLQRWKEIVQFSSKDYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTNLD  390
              E  Q+WKE+   S +DY  Q+ A +G    +   +   ++G  +  ++INEVVN N+ 
Sbjct  353  RAEAAQKWKEVALASEEDYPSQIEAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNIT  412

Query  391  ADSSQASIAGKGISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISD  450
             +++ A IAGKG  S +G ++ ++ G ++ I+MCV+H +P LD+  +      T+T + D
Sbjct  413  GENA-ADIAGKGTMSGNG-SINFNVGGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLD  470

Query  451  FPQPAFDQLGMQSVPSLNLQNNPGRNVSGA--------LGYNLRYWQWKSNIDTVHAGFR  502
            FP P FD++GM+ VP +    NP +   G          GY  +Y+ WK+ +D     FR
Sbjct  471  FPIPEFDKIGMEQVPVIR-GLNPVKPKDGDFKVSPNLYFGYAPQYYNWKTTLDKSMGEFR  529

Query  503  AGAAYQSWVAPLDGWNVLTSSG----------AWSYQS--MKVRPQQLNSIFVPQVDSAN  550
               + ++W+ P D   +L +            A S ++   KV P  L+++F  +   AN
Sbjct  530  --RSLKTWIIPFDDEALLAADSVDFPDNPNVEADSVKAGFFKVSPSVLDNLFAVK---AN  584

Query  551  CSVAFDQLLCNVNFQVYAVQNLDRNGLPY  579
              +  DQ LC+  F V  V++LD NGLPY
Sbjct  585  SDLNTDQFLCSTLFDVNVVRSLDPNGLPY  613


>gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis]
 gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM 
17361]
Length=553

 Score =   189 bits (479),  Expect = 6e-49, Method: Compositional matrix adjust.
 Identities = 162/586 (28%), Positives = 262/586 (45%), Gaps = 62/586 (11%)

Query  10   ARISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVPVNTAAYTRI  69
             R + +R++FDLS + LFTA  G +LP      IP +   I++  F RT+P+NTAA+  +
Sbjct  11   TRPNRNRNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASM  70

Query  70   KEYYDFYAVPLRLISRALPQAFTQMTDYItsaasstansstltsVPFVS-QTIFNAFFQT  128
            +  Y+F+ VP   +     Q  T M D+ +SA  S    ++   VP+ +  ++FN+    
Sbjct  71   RGVYEFFFVPYHQLWAQFDQFITGMNDFHSSANKSIQGGTSPLQVPYFNVDSVFNSLNTG  130

Query  129  ANAGDQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKKYLGVDSLSDADNP  188
              +G    + DD      YG+ +LLD+LGYG    S   +           D++S   N 
Sbjct  131  KESG--SGSTDDLQYKFKYGAFRLLDLLGYGRKFDSFGTAYP---------DNVSGLKNN  179

Query  189  LVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGA-GNIGLVTDMVQLR  247
            L Y  S        LAY KIY D++ NS +E     ++N D + G   +  +V D+ +LR
Sbjct  180  LDYNCS----VFRILAYNKIYQDYYRNSNYENFDTDSFNFDKFKGGLVDAKVVADLFKLR  235

Query  248  YANYPKDYFMGMLPSSQYGSVAVLPsissssdsrsLLYYEPsssataalqsaggssssvr  307
            Y N   DYF  +  S                    L  +  +      +  A        
Sbjct  236  YRNAQTDYFTNLRQSQ-------------------LFSFTTAFEDVDNINIAPRDYVKSD  276

Query  308  lsqtvsssQGIRLNS---DLSALSIRATEYLQRWKEIVQFSSKDYSDQMAAQFGIKAPEY  364
             S     + G+  +S   D S  S+RA   + +   +   + K + DQM A +G++ P+ 
Sbjct  277  GSNFTRVNFGVDTDSSEGDFSVSSLRAAFAVDKLLSVTMRAGKTFQDQMRAHYGVEIPDS  336

Query  365  MGNHSHYIGGWSSVININEVVNTNLDADSSQ-------ASIAGKGISSNSGHTLTYDCGA  417
                 +Y+GG+ S + +++V  T+    +           +AGKG  S  G  + +D   
Sbjct  337  RDGRVNYLGGFDSDMQVSDVTQTSGTTATEYKPEAGYLGRVAGKGTGSGRGR-IVFDA-K  394

Query  418  EHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQPAFDQLGMQSVPSLNLQN----NP  473
            EH ++MC+Y  VP + ++ T   P +      D+  P F+ LGMQ + S  + +    +P
Sbjct  395  EHGVLMCIYSLVPQIQYDCTRLDPMVDKLDRFDYFTPEFENLGMQPLNSSYISSFCTTDP  454

Query  474  GRNVSGALGYNLRYWQWKSNIDTVHAGFRAGAAYQSW-VAPLDGWNVLTSSGAWSYQSMK  532
               V   LGY  RY ++K+ +D  H  F    A  SW V+    W   T+         K
Sbjct  455  KNPV---LGYQPRYSEYKTALDVNHGQFAQSDALSSWSVSRFRRW---TTFPQLEIADFK  508

Query  533  VRPQQLNSIFVPQVDSANCSVAFDQLLCNVNFQVYAVQNLDRNGLP  578
            + P  LNSIF   VD  N + A D +    NF +  V ++  +G+P
Sbjct  509  IDPGCLNSIF--PVD-YNGTEANDCVYGGCNFNIVKVSDMSVDGMP  551


>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642

 Score =   181 bits (458),  Expect = 1e-45, Method: Compositional matrix adjust.
 Identities = 178/640 (28%), Positives = 283/640 (44%), Gaps = 94/640 (15%)

Query  16   RSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVPVNTAAYTRIKEYYDF  75
            R+SFDLS + +FTAKVGE+LPC+ Q   PG+  ++SS +FTRT P+ + A+TR++E   +
Sbjct  19   RNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTRTAPLQSNAFTRLRENVQY  78

Query  76   YAVPLRLISRALPQAFTQMT------DYItsaasstansstltsVPFVSQTIFNAFF---  126
            + VP   + +        MT      D    A+S   N    T +P V+    +A+    
Sbjct  79   FFVPYSALWKYFDSQVLNMTKNANGGDISRIASSLVGNQKVTTQMPCVNYKTLHAYLLKF  138

Query  127  ---QTANAGDQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKKYLGVDSLS  183
                T  +        + G      S KLL +LGYG+        K    K      +  
Sbjct  139  INRSTVGSDGSVGPEFNRGCYRHAESAKLLQLLGYGNFPEQFANFKVNNDKHNQSGQNFK  198

Query  184  DADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGAGNIGLVTD-  242
            D    + Y  S  ++    LAY KI  D +   QW+ + A   NVDY +   +  L  D 
Sbjct  199  D----VTYNNSPYLSIFRLLAYHKICNDHYLYRQWQPYNASLCNVDYLTPNSSSLLSIDD  254

Query  243  ----------------MVQLRYANYPKDYFMGMLPSSQYGSVAVLPsissssdsrsLL--  284
                            ++ +R++N P DYF G+LP+SQ+GS +V+     ++   ++L  
Sbjct  255  ALLSIPDDSIKAEKLNLLDMRFSNLPLDYFTGVLPTSQFGSESVVNLNLGNASGSAVLNG  314

Query  285  ----------------YYEPsssataalqsaggssssvrlsqtvsssQGIRLNSDLSA--  326
                              E   +++A       +S+   +S   + S  + +N+ LS   
Sbjct  315  TTSKDSGRWRTTTGEWEMEQRVASSANGNLKLDNSNGTFISHDHTFSGNVAINTSLSGNL  374

Query  327  --LSIRATEYLQRWKEIVQFSSKDYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEV  384
              +++R     Q++KEI   +  D+  Q+ A FGIK P+    +S +IGG SS+ININE 
Sbjct  375  SIIALRNALAAQKYKEIQLANDVDFQSQVEAHFGIK-PDEKNENSLFIGGSSSMININEQ  433

Query  385  VNTNLDADSSQ---ASIAGKGISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAP  441
            +N NL  D+     A+  G G +S      TY       +++ +Y   P+LD+   G   
Sbjct  434  INQNLSGDNKATYGAAPQGNGSASIKFTAKTYG------VVIGIYRCTPVLDFAHLGIDR  487

Query  442  QLTVTAISDFPQPAFDQLGMQSV---------------PSLNLQNNPGRNVSGALGYNLR  486
             L  T  SDF  P  D +GMQ                  +  + +    ++S   GY  R
Sbjct  488  TLFKTDASDFVIPEMDSIGMQQTFRCEVAAPAPYNDEFKAFRVGDGSSPDMSETYGYAPR  547

Query  487  YWQWKSNIDTVHAGFRAGAAYQSWVAPL-------DGWNVLTSSGAWSYQSMKVRPQQLN  539
            Y ++K++ D  +  F    + +SWV  +       + WN  T +G  +      RP  + 
Sbjct  548  YSEFKTSYDRYNGAF--CHSLKSWVTGINFDAIQNNVWN--TWAGINAPNMFACRPDIVK  603

Query  540  SIFVPQVDSANCSVAFDQLLCNVNFQVYAVQNLDRNGLPY  579
            ++F+  V S N S   DQL   +    YA +NL R GLPY
Sbjct  604  NLFL--VSSTNNSDD-DQLYVGMVNMCYATRNLSRYGLPY  640


>gi|490477384|ref|WP_004347761.1| capsid protein [Prevotella buccalis]
 gi|281300712|gb|EFA93043.1| putative capsid protein (F protein) [Prevotella buccalis ATCC 
35310]
Length=552

 Score =   173 bits (438),  Expect = 2e-43, Method: Compositional matrix adjust.
 Identities = 160/601 (27%), Positives = 263/601 (44%), Gaps = 74/601 (12%)

Query  1    MSDFNPLNRA-RISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTV  59
            MS   PL +A R +  R++FDLS K LFTA  G +LP      IP +   I +  F R +
Sbjct  1    MSKKIPLIKASRANRPRNAFDLSQKHLFTAHAGMLLPVMTLDLIPHDHVSIQATDFMRCL  60

Query  60   PVNTAAYTRIKEYYDFYAVPLRLISRALPQAFTQMTDYItsaasstansstltsVP-FVS  118
            P+N+AA+  ++  Y+F+ VP   +     Q  T M DY +   S    S +   +P F  
Sbjct  61   PMNSAAFMSMRSVYEFFFVPYSQLWHPFDQFITGMNDYRSVLQSDLYKSKSPLVIPSFKR  120

Query  119  QTIFNAFFQTANAG---DQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKK  175
            + ++  F   A  G    Q N  D  G    +   +LLD+LGYG  + ++  S+      
Sbjct  121  KELYELF--NAPGGFLNQQSNQPDIFGFKSRFNFLRLLDLLGYGVYVNADGSSR------  172

Query  176  YLGVDSLSDADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGA-  234
               +D+ S      +   ++ ++     AYQKIY DF+ N+ +E     ++++D  + + 
Sbjct  173  ---IDAFSK-----LLDDTEKLSIFRLAAYQKIYSDFYRNTTYEAVDVSSFSLDNITDSI  224

Query  235  GNIGLVTDMVQLRYANYPKDYFMGMLPSSQYGSVAVLPsissssdsrsLLYYEPsssata  294
              I        LRY N   DYF  + P+         P     + S +  Y  P ++ + 
Sbjct  225  SAINAFKRFGTLRYRNAQLDYFTNLRPT---------PLFDLDNPSLNSFYNTPGNADSV  275

Query  295  alqsaggssssvrlsqtvsssQGIRLNSDL-SALSIRATEYLQRWKEIVQFSSKDYSDQM  353
            ++ S   +                +L+SDL +  SIR    L +   I Q + K Y++Q+
Sbjct  276  SIDSDSNAV-------------NFQLDSDLLTVQSIRNAFALDKLMRITQRAGKTYAEQI  322

Query  354  AAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTNLDADSSQ-----------ASIAGKG  402
             A FG +  E      +YIGG+ S I + +V   +    S +             + GK 
Sbjct  323  KAHFGFEVSEGRDGRVNYIGGFDSNIQVGDVTQMSGTTASPEQGVSIKHGGYLGRVTGKA  382

Query  403  ISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQPAFDQLGMQ  462
              S SGH + +D   EH I+MC+Y  VP + ++ T   P +T  +  DF  P F+ LGMQ
Sbjct  383  QGSGSGH-IEFDA-HEHGILMCIYSLVPDMQYDATRIDPFVTKLSRGDFFMPEFEDLGMQ  440

Query  463  SVPSLNLQNNPGRNVSGALGYNLRYWQWKSNIDTVHAGFRAGAAYQSWVAPLDGWNVLTS  522
             + +  + ++         G+  RY ++K+++D  H  F  G        PL  W V   
Sbjct  441  PLQTRYI-SDIRTQTEKFKGWQPRYSEYKTSLDINHGQFANG-------QPLSYWTVGRG  492

Query  523  SGAWSYQ-----SMKVRPQQLNSIFVPQVDSANCSVAFDQLLCNVNFQVYAVQNLDRNGL  577
                + +     S+K+ P+ L+SIF    +    +   D +     F V  V ++  NG 
Sbjct  493  RAGETLETFDIASLKINPKWLDSIFAVNYNGTQIT---DCVFGGCQFNVQKVSDMSENGE  549

Query  578  P  578
            P
Sbjct  550  P  550


>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584

 Score =   172 bits (437),  Expect = 3e-43, Method: Compositional matrix adjust.
 Identities = 175/626 (28%), Positives = 265/626 (42%), Gaps = 100/626 (16%)

Query  6    PLNRARISTHRSSFDLSSKKLFTAKVGEILP--CYWQIAIPGNKYRISSDWFTRTVPVNT  63
            P  + R++  R+ FDLSS+++F+AK G++LP  C W++  P   ++ S     RT  +NT
Sbjct  2    PAPKPRLA--RNGFDLSSRRIFSAKAGQLLPIGC-WEVN-PSEHFKFSVQDLVRTTTLNT  57

Query  64   AAYTRIKEYYDFYAVPLRLISRALPQAFTQMTDYItsaasstansstltsVPFVSQTIFN  123
            A+Y R+KEYY F+ V  R    +L Q F Q          +    S L  V     T +N
Sbjct  58   ASYARMKEYYHFFFVSYR----SLWQWFDQFI------VGTNNPHSALNGVKKNGTTNYN  107

Query  124  AFFQTANAGD--------QPNTRDDAGLPIVYGSCKLLDMLGYGSMIASN--NPSKAAIT  173
                +    D        + +  D  G     G+ KLL+ML YG        N      +
Sbjct  108  QICSSVPTFDLGKLITRLKTSDMDSQGFNYSEGAAKLLNMLNYGVTNKGKFMNLENLITS  167

Query  174  KKYLGVDSLSDADNPLVYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSG  233
              YL   S  D +   +Y     V+    LAYQKI+ DF+ N  W      ++NVD ++ 
Sbjct  168  TSYL--PSKDDKEPSSIYACK--VSPFRLLAYQKIFNDFYRNQDWTPSDVRSFNVDDYAD  223

Query  234  AGNIGLVTDM----VQLRYANYPKDYFMGMLPSSQYG-SVAVLPsissssdsrsLLYYEP  288
              N+ +  D+     Q+RY  Y KD+   M P+  Y   +  LP     + +  L     
Sbjct  224  DSNLTIEPDVALKFCQMRYRPYAKDWLTSMKPTPNYSDGIFNLPEYVRGNGNVIL-----  278

Query  289  sssataalqsaggssssvrlsqtvsssQGIRLNSDLSALSIRATEYLQRWKEIVQFSSK-  347
                            +   S +VS   G    S  S   +RA   L +  E  + ++  
Sbjct  279  ----------------TNNKSGSVSLDSGTVSPSSFSVNDLRAAFALDKMLEATRRANGL  322

Query  348  DYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTNLDA--DSSQASI---AGKG  402
            DY+ Q+ A FG K PE   N + ++GG+ + I ++EVV+TN +A  D S ASI    GKG
Sbjct  323  DYASQIEAHFGFKVPESRANDARFLGGFDNSIVVSEVVSTNGNAASDGSHASIGDLGGKG  382

Query  403  ISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQPAFDQLGMQ  462
            I S S  T+ +D   EH IIMC+Y   P  ++N +   P         F QP F  LG Q
Sbjct  383  IGSMSSGTIEFDS-TEHGIIMCIYSVAPQSEYNASYLDPFNRKLTREQFYQPEFADLGYQ  441

Query  463  SVPSLNL-QNNPGRNVSGA-----------LGYNLRYWQWKSNIDTVHAGFRAGAAYQSW  510
            ++   +L  +  G N   A           LGY +RY ++K+  D V   F +G +   W
Sbjct  442  ALIGSDLICSTLGMNEKQAGFSDIELNNNLLGYQVRYNEYKTARDLVFGDFESGKSLSYW  501

Query  511  VAPL-------------------DGWNVLTSSGAWSYQSMKVRPQQLNSIFVPQVDSANC  551
              P                      +    +   WS ++  + P  +N IF+        
Sbjct  502  CTPRFDFGYGDTEKKIAPENKGGADYRKKGNRSHWSSRNFYINPNLVNPIFL------TS  555

Query  552  SVAFDQLLCNVNFQVYAVQNLDRNGL  577
            +V  D  + N    V AV+ +   GL
Sbjct  556  AVQADHFIVNSFLDVKAVRPMSVTGL  581


>gi|494306153|ref|WP_007173049.1| hypothetical protein [Prevotella bergensis]
 gi|270333881|gb|EFA44667.1| putative capsid protein (F protein) [Prevotella bergensis DSM 
17361]
Length=519

 Score =   166 bits (420),  Expect = 3e-41, Method: Compositional matrix adjust.
 Identities = 147/562 (26%), Positives = 248/562 (44%), Gaps = 63/562 (11%)

Query  34   ILPCYWQIAIPGNKYRISSDWFTRTVPVNTAAYTRIKEYYDFYAVPLRLISRALPQAFTQ  93
            +LP      IP +   I++  F RT+P+NTAA+  ++  Y+F+ VP   +     Q  T 
Sbjct  2    LLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASMRGVYEFFFVPYHQLWAQFDQFITG  61

Query  94   MTDYItsaasstansstltsVPFVS-QTIFNAFFQTANAGDQPNTRDDAGLPIVYGSCKL  152
            M D+ +SA  S    ++   VP+ + +++F    +  +    P+ +DD      YG+ +L
Sbjct  62   MNDFHSSANKSIQGGTSPLQVPYFNLESVFKNIIERDST---PSFQDDLQYRFKYGAFRL  118

Query  153  LDMLGYGSMIASNNPSKAAITKKYLGVDSLSDADNPLVYQTSQTVNALPFLAYQKIYYDF  212
            LD+LGYG    S   +           D++S   N L Y  S        LAY KIY D+
Sbjct  119  LDLLGYGRKFDSFGTAYP---------DNVSGLKNNLDYNCS----VFRVLAYNKIYQDY  165

Query  213  FSNSQWEKHKAYAYNVDYWSGA-GNIGLVTDMVQLRYANYPKDYFMGMLPSSQYGSVAVL  271
            + NS +E     ++N D + G   +  +V D+ +LRY N   DYF  +  S         
Sbjct  166  YRNSNYENFDTDSFNFDKFKGGLVDAKVVADLFKLRYRNAQTDYFTNLRQSQ--------  217

Query  272  PsissssdsrsLLYYEPsssataalqsaggssssvrlsqtvsssQGIRLNSDL---SALS  328
                       L  + P  S    L       +    S     +  + ++++L   S  S
Sbjct  218  -----------LFTFIPEFSDDEHLNFDRDQYADQSKSNFTQLNFPVDVDNNLGYFSVSS  266

Query  329  IRATEYLQRWKEIVQFSSKDYSDQMAAQFGIKAPEYMGNHSHYIGGWSSVININEVVNTN  388
            +R+   + +   +   + K + DQM A +G++ P+      +Y+GG+ S + +++V  T+
Sbjct  267  LRSAFAVDKLLSVTMRAGKTFQDQMRAHYGVEIPDSRDGRVNYLGGFDSDLQVSDVTQTS  326

Query  389  LDADSSQ-------ASIAGKGISSNSGHTLTYDCGAEHQIIMCVYHAVPMLDWNLTGQAP  441
                +           IAGKG  S  G  + +D   EH ++MC+Y  VP + ++ T   P
Sbjct  327  GTTATEYKPEAGYLGRIAGKGTGSGRGR-IVFD-AKEHGVLMCIYSLVPQIQYDCTRLDP  384

Query  442  QLTVTAISDFPQPAFDQLGMQSVPSLNLQN----NPGRNVSGALGYNLRYWQWKSNIDTV  497
             +      DF  P F+ LGMQ + S  + +    +P   V   LGY  RY ++K+ +D  
Sbjct  385  MVDKLDRFDFFTPEFENLGMQPLNSSYISSFCTPDPKNPV---LGYQPRYSEYKTALDIN  441

Query  498  HAGFRAGAAYQSW-VAPLDGWNVLTSSGAWSYQSMKVRPQQLNSIFVPQVDSANCSVAFD  556
            H  F    A  SW V+    W   T+         K+ P  LNS+F  +    N + + D
Sbjct  442  HGQFAQNDALSSWSVSRFRRW---TTFPQLEIADFKIDPGCLNSVFPVEF---NGTESTD  495

Query  557  QLLCNVNFQVYAVQNLDRNGLP  578
             +    NF +  V ++  +G+P
Sbjct  496  CVFGGCNFNIVKVSDMSVDGMP  517


>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
 gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon 
317 str. F0108]
Length=541

 Score =   164 bits (415),  Expect = 1e-40, Method: Compositional matrix adjust.
 Identities = 155/585 (26%), Positives = 252/585 (43%), Gaps = 73/585 (12%)

Query  10   ARISTHRSSFDLSSKKLFTAKVGEILPCYWQIAIPGNKYRISSDWFTRTVPVNTAAYTRI  69
            +R +  RS+FDLS K L+TA  G +LP      +  +  RI +  F RT+P+N+AA+  +
Sbjct  12   SRANRPRSAFDLSQKHLYTAPAGALLPVLSVDLMFHDHIRIQAQDFMRTMPMNSAAFISM  71

Query  70   KEYYDFYAVPLRLISRALPQAFTQMTDYItsaasstansstltsVPFVSQTIFNAFFQTA  129
            +  Y+F+ VP   +     Q  T M DY +S  SS A    L SVP V       F +  
Sbjct  72   RGVYEFFFVPYSQLWHPYDQFITSMNDYRSSVVSSAAGDKALDSVPNVKLADMYKFVR--  129

Query  130  NAGDQPNTRDDAGLPIVYGSCKLLDMLGYGSMIASNNPSKAAITKKYLGVDSLSDADNPL  189
                +   +D  G P    SC+L+D+LGYG  I S   SK  +               PL
Sbjct  130  ----ERTDKDIFGYPHSNNSCRLMDLLGYGKPITS---SKTPV---------------PL  167

Query  190  VYQTSQTVNALPFLAYQKIYYDFFSNSQWEKHKAYAYNVDYWSGA--GNIGLVTDMVQLR  247
            +Y  +  VN    LAY KIY D++ N+ +E    Y++N+D+  G            + L 
Sbjct  168  LY--TGNVNLFRLLAYNKIYSDYYRNTTYEGVDVYSFNIDHKKGTFVPTADEFKKYLNLH  225

Query  248  YANYPKDYFMGMLPSSQYGSVAVLPsissssdsrsLLYYEPsssataalqsaggssssvr  307
            Y N P D++  + P+  +       +I S S S  L   +P+ SA  +        +   
Sbjct  226  YRNAPLDFYTNLRPTPLF-------TIGSDSFSSVLQLSDPTGSAGFSADGNSAKLNMAS  278

Query  308  lsqtvsssQGIRLNSDLSALSIRATEYLQRWKEIVQFSSKDYSDQMAAQFGIKAPEYMGN  367
                            L+  +IR+   L +   I   + K Y++Q+ A FG+   E    
Sbjct  279  PDV-------------LNVSAIRSAFALDKLLSISMRAGKTYAEQIEAHFGVTVSEGRDG  325

Query  368  HSHYIGGWSSVININEVVNT------------NLDADSSQASIAGKGISSNSGHTLTYDC  415
              +Y+GG+ S + + +V  T            N         I GKG  S  G  + +D 
Sbjct  326  QVYYLGGFDSNVQVGDVTQTSGTTNPNVSEVGNAKLAGYLGKITGKGTGSGYGE-IQFDA  384

Query  416  GAEHQIIMCVYHAVPMLDWNLTGQAPQLTVTAISDFPQPAFDQLGMQS-VPSLNLQNNPG  474
              E  ++MC+Y  VP + ++     P +      D+  P F+ LGMQ  VP+    N   
Sbjct  385  -KEPGVLMCIYSVVPAMQYDCMRLDPFVAKQTRGDYFIPEFENLGMQPIVPAFVSLN---  440

Query  475  RNVSGALGYNLRYWQWKSNIDTVHAGFRAGAAYQSW-VAPLDGWNVLTSSGAWSYQSMKV  533
            R    + G+  RY ++K+  D  H  F  G     W +A   G + L +   ++  ++K+
Sbjct  441  RAKDNSYGWQPRYSEYKTAFDINHGQFANGEPLSYWSIARARGSDTLNT---FNVAALKI  497

Query  534  RPQQLNSIFVPQVDSANCSVAFDQLLCNVNFQVYAVQNLDRNGLP  578
             P  L+S+F    +    +   D +    +F +  V ++  +G+P
Sbjct  498  NPHWLDSVFAVNYNGTEVT---DCMFGYAHFNIEKVSDMTEDGMP  539



Lambda      K        H        a         alpha
   0.319    0.133    0.409    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 4256619118725