bitscore colors: <40, 40-50 , 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-5_CDS_annotation_glimmer3.pl_2_3

Length=811
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|575094326|emb|CDL65712.1|  unnamed protein product                   451   6e-143
gi|547920049|ref|WP_022322420.1|  capsid protein VP1                    144   6e-33
gi|649569140|gb|KDS75238.1|  capsid family protein                      137   2e-31
gi|649555287|gb|KDS61824.1|  capsid family protein                      137   6e-31
gi|492501782|ref|WP_005867318.1|  hypothetical protein                  130   1e-28
gi|639237429|ref|WP_024568106.1|  hypothetical protein                  125   9e-27
gi|649557305|gb|KDS63784.1|  capsid family protein                      117   1e-25
gi|609718276|emb|CDN73650.1|  conserved hypothetical protein            120   3e-25
gi|12085136|ref|NP_073538.1|  major capsid protein                      111   2e-22
gi|530695351|gb|AGT39907.1|  major capsid protein                       105   2e-20


>gi|575094326|emb|CDL65712.1| unnamed protein product [uncultured bacterium]
Length=758

 Score =   451 bits (1161),  Expect = 6e-143, Method: Compositional matrix adjust.
 Identities = 244/446 (55%), Positives = 314/446 (70%), Gaps = 22/446 (5%)

Query  371  LTSDKPVDLTLGSS---PYYNSGSAN-KDKQIKISAYSFRAYEGIYNAYIRDNRNNPYYV  426
            +T DK +D+ +GSS   PYY  GSAN  DK IK+SAY FRAYE IYNAYIR+ RNNP+ +
Sbjct  330  ITFDK-LDVFIGSSGKYPYY--GSANMSDKAIKLSAYPFRAYEAIYNAYIRNTRNNPFVL  386

Query  427  NGQVQYNKWIPTYDGGADQ-NIYELRYANWEKDFLTTAVQSPQQGTAPLVGIttytetve  485
            NG+  YN+WI T  GG+D     +LR+ANW+ D  TTA+ +PQQG APLVG+TTY     
Sbjct  387  NGKKTYNRWITTDAGGSDTLTPRDLRFANWQSDAYTTALTAPQQGVAPLVGLTTYEIRSV  446

Query  486  ttSDDGTPVTRELSRIALVDEDGKKYQVSFDSDSEGLKGVSYVELDNEVKLRQPRNLIDV  545
              +D G  VT      A+VDE+G  Y+V F+S+ E LKGV+Y  L     +   ++L+  
Sbjct  447  --NDAGHEVT--TVNTAIVDEEGNAYKVDFESNGEALKGVNYTPLKAGEAVNM-QSLVSP  501

Query  546  VTSGISINDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDI  605
            VTSGISIND RNVNAYQ++LELN  +G+SY++IIEGRF+V VRYD L MPE+ GG +RDI
Sbjct  502  VTSGISINDFRNVNAYQRYLELNQFRGFSYKEIIEGRFDVNVRYDALNMPEYLGGITRDI  561

Query  606  EMHSISQTVDQDLDGSQTYAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLP  665
             ++ I+QTV+    GS +Y  +LGSQSG+A   G++  ++  FCDEESIVMGI+ V P+P
Sbjct  562  VVNPITQTVETT--GSGSYVGSLGSQSGLATCFGNTDGSISVFCDEESIVMGIMYVMPMP  619

Query  666  VYTQLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWY  725
            VY  LLPK  TYR  LD + PEF+HIG+QPI  KE+ P+Q   D  D  + VFGY RPWY
Sbjct  620  VYDSLLPKWLTYRERLDSFNPEFDHIGYQPIYAKELGPMQCVQDDIDP-NTVFGYQRPWY  678

Query  726  EYVQKYDQAHGLFRTNLSNFLMHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELA  785
            EYV K D+AHGLF ++L NF+M R F+  P+L QSF V+ P  V +VF+V      TE++
Sbjct  679  EYVAKPDRAHGLFLSSLRNFIMFRSFDNVPELGQSFTVMQPGSVNNVFSV------TEVS  732

Query  786  DKIYGQIWFDCTAKLPISRVAIPRLD  811
            DKI GQI FDCTA+LPISRV +PRL+
Sbjct  733  DKILGQIHFDCTAQLPISRVVVPRLE  758


 Score =   139 bits (349),  Expect = 1e-30, Method: Compositional matrix adjust.
 Identities = 64/129 (50%), Positives = 90/129 (70%), Gaps = 4/129 (3%)

Query  5    AFDATLDVNNEIKVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMV  64
             F+   D+ N++K N+FDWSH NN TT +GR+TPVF ELVP + S+RI P FGL+FMPM+
Sbjct  4    VFNKIGDIKNDVKRNSFDWSHDNNFTTDLGRITPVFTELVPPNSSIRIKPEFGLRFMPMM  63

Query  65   FPIQTRLRARMMFFKYPLRALWDGYRDFI-GNFREDLEEPYLDLNTV--TRLDAMAKTGS  121
            FPIQT+++A + F+K PLR LW  Y DFI  +  E+ + PY+  ++   +    +A +G 
Sbjct  64   FPIQTKMKAYLSFYKVPLRTLWADYMDFISSDNTEEFQPPYMSFDSTDYSEGGTLAPSG-  122

Query  122  LGDYLGLPT  130
            LGDY G+PT
Sbjct  123  LGDYFGIPT  131


>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
 gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553

 Score =   144 bits (362),  Expect = 6e-33, Method: Compositional matrix adjust.
 Identities = 131/430 (30%), Positives = 192/430 (45%), Gaps = 53/430 (12%)

Query  399  KISAYSFRAYEGIYNAYIRD-NRNNPYYVNGQVQYNKWIPTYDGGADQ--NIYELRYANW  455
            ++SA  FRAY+ IYN Y RD N   P      + +     T  GG DQ   +  LR   W
Sbjct  159  QVSALPFRAYQLIYNEYYRDQNLTEP------IDFTLGSGTTVGG-DQLMALMSLRRRAW  211

Query  456  EKDFLTTAVQSPQQG---TAPLVGIttytetvettSDDGTPVTRELSRIALVDEDGKKYQ  512
            EKD+ T+A+   Q+G   T P+ G     + V     D         R     E+G  Y 
Sbjct  212  EKDYFTSALPWLQRGPEVTVPVQGAGGSMDVVYERQSDSQKWVDSSGREF---ENGHAYD  268

Query  513  VSF----DSDSEGLKGVS------YVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQ  562
            ++     D +S  +  V+        ELD    L+     ++V   GI+INDLR  NA Q
Sbjct  269  ITMARANDPNSALMVAVNGGTNNRAPELDPNGTLK-----VNVDEMGININDLRTSNALQ  323

Query  563  KFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQ  622
            ++ E N R G  Y + I   F V+     L  P+F GG    I +  + QT   D    Q
Sbjct  324  RWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGRMPISVSEVLQTSSTDETSPQ  383

Query  623  TYAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLD  682
                  G  +GI           + + +E   ++GI+ +TP   Y Q +P+ FT    +D
Sbjct  384  ANMAGHGISAGI-------NNGFKHYFEEHGYIIGIMSITPRSGYQQGVPRDFTKFDNMD  436

Query  683  HYQPEFNHIGFQPILYKE--VCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRT  740
             Y PEF H+  Q I  +E  V    AY++G       FGY   + EY     +AHG FR 
Sbjct  437  FYFPEFAHLSEQEIKNQELFVSEDAAYNNG------TFGYTPRYAEYKYHPSEAHGDFRG  490

Query  741  NLSNFLMHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKL  800
            NLS + ++R+F  KP L  +F+   P+    VFA ++ +D     DK + Q++ D  A  
Sbjct  491  NLSFWHLNRIFEDKPNLNTTFVECKPS--NRVFATSETED-----DKFWVQMYQDVKALR  543

Query  801  PISRVAIPRL  810
             + +   P L
Sbjct  544  LMPKYGTPML  553


 Score = 55.8 bits (133),  Expect = 1e-04, Method: Compositional matrix adjust.
 Identities = 37/124 (30%), Positives = 60/124 (48%), Gaps = 10/124 (8%)

Query  17   KVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMM  76
            + N F+ S+ + LT  +G + P+ C  V +    R+     ++  P+V P+  R+     
Sbjct  14   RRNAFNLSYESKLTLNMGELVPIMCMPVVSGDKFRVKTESLVRLAPLVAPMMHRVNVFTH  73

Query  77   FFKYPLRALWDGYRDFI--GNFREDLEE-PYLDLNTVTRLDAMAKT-------GSLGDYL  126
            +F  P R +W+ + DFI  G   ED+   P + +N  + L + A          SL DYL
Sbjct  74   YFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSLWDYL  133

Query  127  GLPT  130
            GLPT
Sbjct  134  GLPT  137


>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str. 
3999B T(B) 6]
Length=390

 Score =   137 bits (345),  Expect = 2e-31, Method: Compositional matrix adjust.
 Identities = 115/417 (28%), Positives = 183/417 (44%), Gaps = 37/417 (9%)

Query  399  KISAYSFRAYEGIYNAYIRDNRNNPYYVNGQVQYNKWIPTYDGGADQNIYELRYANWEKD  458
            K+SA  FRAY  IYN Y RD       +  +++       Y    + ++++L    WEKD
Sbjct  6    KVSALPFRAYHLIYNEYYRDQN-----LTSELEITLDSGNYQLPVNSSLWQLHRRAWEKD  60

Query  459  FLTTAVQSPQQGTAPLVGIttytetvettSDD--GTPVTRELSRIALVDEDGKKYQVSFD  516
            + T+A+   Q+G    V I    E      +      +T    R  +   +      S  
Sbjct  61   YFTSALPWVQRGPEVTVPINGGGEIPVEMKEGFAAQKITTFPDRKPISGSEVLYSAPSVL  120

Query  517  SDSE--GLKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLELNMRKGYS  574
            S  +   +KG + +E DN V        ++    G++IND+R  NA Q++ E N R G  
Sbjct  121  SYGQIGSIKGQALIEPDNFV--------VNTDQMGVNINDIRTSNALQRWFERNARSGSR  172

Query  575  YRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSGI  634
            Y + I   F V+     L  P+F GG    I +  + QT   D    Q      G  +G+
Sbjct  173  YIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISAGV  232

Query  635  AGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQ  694
                         + +E   +MGI+ + P   Y Q +PK F     +D Y PEF H+G Q
Sbjct  233  -------NHGFTRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQ  285

Query  695  PILYKEVCPLQAYSDGPDTLSD-VFGYNRPWYEYVQKYDQAHGLFRTNLSNFLMHRVFNQ  753
             I  +E+     Y +  D  ++  FGY   + EY    ++ HG FR N++ + ++R+F +
Sbjct  286  EIKNEEL-----YLNESDAANEGTFGYTPRYAEYKYSQNEVHGDFRGNMAFWHLNRIFKE  340

Query  754  KPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPISRVAIPRL  810
            KP L  +F+  +P+    VFA  +  D     DK + QI+ D  A   + +   P L
Sbjct  341  KPNLNTTFVECNPS--NRVFATAETSD-----DKYWVQIYQDIKALRLMPKYGTPML  390


>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 6]
Length=541

 Score =   137 bits (346),  Expect = 6e-31, Method: Compositional matrix adjust.
 Identities = 115/417 (28%), Positives = 183/417 (44%), Gaps = 37/417 (9%)

Query  399  KISAYSFRAYEGIYNAYIRDNRNNPYYVNGQVQYNKWIPTYDGGADQNIYELRYANWEKD  458
            K+SA  FRAY  IYN Y RD       +  +++       Y    + ++++L    WEKD
Sbjct  157  KVSALPFRAYHLIYNEYYRDQN-----LTSELEITLDSGNYQLPVNSSLWQLHRRAWEKD  211

Query  459  FLTTAVQSPQQGTAPLVGIttytetvettSDD--GTPVTRELSRIALVDEDGKKYQVSFD  516
            + T+A+   Q+G    V I    E      +      +T    R  +   +      S  
Sbjct  212  YFTSALPWVQRGPEVTVPINGGGEIPVEMKEGFAAQKITTFPDRKPISGSEVLYSAPSVL  271

Query  517  SDSE--GLKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLELNMRKGYS  574
            S  +   +KG + +E DN V        ++    G++IND+R  NA Q++ E N R G  
Sbjct  272  SYGQIGSIKGQALIEPDNFV--------VNTDQMGVNINDIRTSNALQRWFERNARSGSR  323

Query  575  YRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSGI  634
            Y + I   F V+     L  P+F GG    I +  + QT   D    Q      G  +G+
Sbjct  324  YIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISAGV  383

Query  635  AGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQ  694
                         + +E   +MGI+ + P   Y Q +PK F     +D Y PEF H+G Q
Sbjct  384  -------NHGFTRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQ  436

Query  695  PILYKEVCPLQAYSDGPDTLSD-VFGYNRPWYEYVQKYDQAHGLFRTNLSNFLMHRVFNQ  753
             I  +E+     Y +  D  ++  FGY   + EY    ++ HG FR N++ + ++R+F +
Sbjct  437  EIKNEEL-----YLNESDAANEGTFGYTPRYAEYKYSQNEVHGDFRGNMAFWHLNRIFKE  491

Query  754  KPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPISRVAIPRL  810
            KP L  +F+  +P+    VFA  +  D     DK + QI+ D  A   + +   P L
Sbjct  492  KPNLNTTFVECNPS--NRVFATAETSD-----DKYWVQIYQDIKALRLMPKYGTPML  541


 Score = 60.1 bits (144),  Expect = 5e-06, Method: Compositional matrix adjust.
 Identities = 37/120 (31%), Positives = 58/120 (48%), Gaps = 6/120 (5%)

Query  17   KVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMM  76
            + N F+ S+ N LT   G + P+ C+ V      R+N    ++  P+V P+  R+     
Sbjct  14   RRNVFNLSYENKLTVNAGELIPIMCKPVVPGDKFRVNTEMLVRLAPLVAPMMHRVDVFTH  73

Query  77   FFKYPLRALWDGYRDFIGNFREDLEEP----YLDLNTVTRLDAMAK--TGSLGDYLGLPT  130
            +F  P R +W+ + DFI    +  + P    Y   +TV   +A      GSL DYLGLP+
Sbjct  74   YFFVPNRLIWNKWEDFITKGVDGTDSPVFPTYSFPSTVDTANAHNSFGDGSLWDYLGLPS  133


>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
 gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis 
CL09T03C24]
Length=538

 Score =   130 bits (328),  Expect = 1e-28, Method: Compositional matrix adjust.
 Identities = 117/417 (28%), Positives = 181/417 (43%), Gaps = 41/417 (10%)

Query  399  KISAYSFRAYEGIYNAYIRD-NRNNPYYVNGQVQYNKWIPTYDGGADQNIYELRYANWEK  457
            ++SA  FRAY+ IYN Y RD N   P     +   N  I          +  LR   WEK
Sbjct  158  QVSALPFRAYQLIYNEYYRDQNLTKPI----EFSLNSGI-VLSADEVTRLLTLRRRTWEK  212

Query  458  DFLTTAVQSPQQG---TAPLVGIttytetvettSDDGTPVTRELSRIALVDEDGKKYQVS  514
            D+ T+A+   Q+G   T P+ G              G  +   L   A  D        +
Sbjct  213  DYFTSALPWVQRGPEVTVPIQG-------------SGGNLDVTLKNDAHADTYRMPGTSN  259

Query  515  FDSDSEGLKGVSYVELDNEVKLRQPRNL-IDVVTSGISINDLRNVNAYQKFLELNMRKGY  573
              + +  L G + +    +    +P N  ++V   G+SINDLR  NA Q++ E N R G 
Sbjct  260  RPAGAMQLVGGALIAGGTDGAYLEPDNFQVNVDELGVSINDLRTSNALQRWFERNARSGS  319

Query  574  SYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSG  633
             Y + I   F V+     L  P+F GG    I +  + QT   D    Q      G  +G
Sbjct  320  RYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSATDSTSPQANMAGHGISAG  379

Query  634  IAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGF  693
            +           + + +E   ++GI+ + P   Y Q +PK F     +D Y PEF H+G 
Sbjct  380  V-------NHGFKRYFEEHGYIIGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGE  432

Query  694  QPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRTNLSNFLMHRVFNQ  753
            Q I  +EV   Q     P + +  FGY   + EY    ++ HG FR N++ + ++R+F++
Sbjct  433  QEIKNEEVYLQQT----PASNNGTFGYTPRYAEYKYSMNEVHGDFRGNMAFWHLNRIFSE  488

Query  754  KPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPISRVAIPRL  810
             P L  +F+  +P+    VFA  +  D     DK + Q++ D  A   + +   P L
Sbjct  489  SPNLNTTFVECNPS--NRVFATAETSD-----DKYWIQLYQDVKALRLMPKYGTPML  538


 Score = 63.2 bits (152),  Expect = 5e-07, Method: Compositional matrix adjust.
 Identities = 37/121 (31%), Positives = 55/121 (45%), Gaps = 7/121 (6%)

Query  17   KVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMM  76
            + N F+ S+ N LT   G + P+ C+ V      R+N    ++  P+V P+  R+     
Sbjct  14   RRNVFNLSYENKLTANAGELVPIMCKPVVPGDKFRVNTEMLVRLAPLVAPMMHRVDVFTH  73

Query  77   FFKYPLRALWDGYRDFIGNFREDLEEPYL-------DLNTVTRLDAMAKTGSLGDYLGLP  129
            +F  P R LW+ + DFI    +  + P         D    T    +   GSL DYLGLP
Sbjct  74   YFFVPNRLLWNQWEDFITKGVDGTDTPVFPKIALRPDWVNPTSAAVLLDDGSLWDYLGLP  133

Query  130  T  130
            T
Sbjct  134  T  134


>gi|639237429|ref|WP_024568106.1| hypothetical protein [Elizabethkingia anophelis]
Length=546

 Score =   125 bits (313),  Expect = 9e-27, Method: Compositional matrix adjust.
 Identities = 119/427 (28%), Positives = 185/427 (43%), Gaps = 39/427 (9%)

Query  399  KISAYSFRAYEGIYNAYIRD-NRNNPYYV--NGQVQ------YNKWIPTYDGGADQNIYE  449
            ++S   F AY+ I++ Y RD N  +  +V  NG  +       N W P+      Q +++
Sbjct  138  RVSMLPFLAYQKIWDEYYRDENLIDSVFVDKNGDKRELFIDGINYWNPSLPYEFRQ-LFD  196

Query  450  LRYANWEKDFLTTAVQSPQQGTAPLVGIttytetvettSDDGTPVTRE----LSRIALVD  505
            ++   W  D+ T+A+   Q+G A  V +          +  G    ++    LS      
Sbjct  197  IKKRAWHHDYFTSALPFAQKGAA--VKMPLQMTADLFYNPGGNTFVKKPDGSLSHTGFRL  254

Query  506  EDGKKYQVSFDSDSEGLKGVSYVELDNEVKLRQPRNL-IDVVT-SGISINDLRNVNAYQK  563
            EDG    V  D     +   S     N V +    NL +D+ T SG +INDLR     Q+
Sbjct  255  EDG---SVPADGIGHLMVETSSTGNSNPVNIDNSSNLGVDLKTASGSTINDLRRAFKLQE  311

Query  564  FLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQT  623
            +LE N R G  Y + I   F VK     L  PEF GG    I +  + Q    D    Q 
Sbjct  312  WLEKNARAGSRYAESILSFFGVKTSDGRLQRPEFLGGNKTPILISEVLQQSSTDSTTPQG  371

Query  624  YAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDH  683
                 G   G  G           F +E   V+G++ V P   Y+Q +P+HF+     D+
Sbjct  372  NMAGHGISVGKEG-------GFSKFFEEHGYVIGLMSVIPKTSYSQGIPRHFSKFDKFDY  424

Query  684  YQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRTNLS  743
            + P+F HIG QP+  KE+    A + G      VFGY   + EY       HG F+  L 
Sbjct  425  FWPQFEHIGEQPVYNKEIF---AKNVGDYDSGGVFGYVPRYSEYKYSPSTIHGDFKDTLY  481

Query  744  NFLMHRVFNQK--PQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLP  801
             + + R+F+    P+L + F+ ++ + ++ +FAV       + +DK Y  ++   TAK  
Sbjct  482  FWHLGRIFDSSAPPKLNRDFIEVNKSGLSRIFAV------EDNSDKFYCHLYQKITAKRK  535

Query  802  ISRVAIP  808
            +S    P
Sbjct  536  MSYFGDP  542


 Score = 57.0 bits (136),  Expect = 4e-05, Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 55/120 (46%), Gaps = 4/120 (3%)

Query  12   VNNEIKVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRL  71
            V+   K + F+ S+    +   G + P+ C+ +     + INP+   +  PM+ P+   +
Sbjct  8    VSKAPKSSTFNMSYDRKFSMNFGDLVPIHCQEIVPGDKISINPQHMTRLAPMLAPVMHEV  67

Query  72   RARMMFFKYPLRALWDGYRDFIGNFREDLEEPYLDLNTVTRLDAMAKTGSLGDYLGLPTT  131
               + +F  P R LW  +  FI   +  L+   L +  V  L       SLGDYLGLP T
Sbjct  68   NVFIHYFFVPNRILWKNWEAFITGGQSGLDAHMLPV--VQNLP--VPKSSLGDYLGLPLT  123


>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 6]
Length=245

 Score =   117 bits (292),  Expect = 1e-25, Method: Compositional matrix adjust.
 Identities = 79/263 (30%), Positives = 123/263 (47%), Gaps = 20/263 (8%)

Query  549  GISINDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMH  608
            G++IND+R  NA Q++ E N R G  Y + I   F V+     L  P+F GG    I + 
Sbjct  2    GVNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVS  61

Query  609  SISQTVDQDLDGSQTYAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYT  668
             + QT   D    Q      G  +G+             + +E   +MGI+ + P   Y 
Sbjct  62   EVLQTSSTDSTSPQANMAGHGISAGV-------NHGFTRYFEEHGYIMGIMSIRPRTGYQ  114

Query  669  QLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSD-VFGYNRPWYEY  727
            Q +PK F     +D Y PEF H+G Q I  +E+     Y +  D  ++  FGY   + EY
Sbjct  115  QGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEEL-----YLNESDAANEGTFGYTPRYAEY  169

Query  728  VQKYDQAHGLFRTNLSNFLMHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELADK  787
                ++ HG FR N++ + ++R+F +KP L  +F+  +P+    VFA  +  D     DK
Sbjct  170  KYSQNEVHGDFRGNMAFWHLNRIFKEKPNLNTTFVECNPS--NRVFATAETSD-----DK  222

Query  788  IYGQIWFDCTAKLPISRVAIPRL  810
             + QI+ D  A   + +   P L
Sbjct  223  YWVQIYQDIKALRLMPKYGTPML  245


>gi|609718276|emb|CDN73650.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=537

 Score =   120 bits (301),  Expect = 3e-25, Method: Compositional matrix adjust.
 Identities = 119/426 (28%), Positives = 190/426 (45%), Gaps = 48/426 (11%)

Query  400  ISAYSFRAYEGIYNAY----------IRDNRNNPYYVNGQVQYNKWIPTYDGGADQNIYE  449
            ++   F AY+ I++ +           RD+  NP  +      +  +P Y    +  +++
Sbjct  139  VNLLPFLAYQKIWDEFYRDENLIQPLFRDSNGNPVKMFNDGINDHNLPPYSKFTE--LFK  196

Query  450  LRYANWEKDFLTTAVQSPQQGTAPLVGIttytetvettSDDGTPVTRELSRIALVDEDGK  509
            +R   W  D+ T+A+   Q+G A  + I               P+T E+     + +   
Sbjct  197  MRKRAWHHDYFTSALPFAQKGNAVKIPIF---------PQGNVPLTYEMGSQTFIKDMAG  247

Query  510  KYQVSFD--SDSEG-LKGVSYVELDNEVKLRQPRNL-IDVVTSGIS-INDLRNVNAYQKF  564
                + D  SD  G L+ VS       + L   +NL +++ +  +S +NDLR     Q++
Sbjct  248  NPAPNKDLRSDVNGNLQDVS----GQPLSLDPSKNLKLNMASENVSTVNDLRRAFKLQEW  303

Query  565  LELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTY  624
            LE N R G  Y + I   F VK     L  PEF GG    I    IS+ + Q    S T 
Sbjct  304  LEKNARAGSRYAESILSFFGVKTSDGRLQRPEFLGGNKSPI---MISEVLQQSATDSTTP  360

Query  625  AKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHY  684
               +    GI G+  D G     F +E   V+G++ V P   Y+Q +P+HF+     D++
Sbjct  361  QGNMAGH-GI-GIGKDGG--FSRFFEEHGYVIGLMSVIPKTSYSQGIPRHFSKSDKFDYF  416

Query  685  QPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRTNLSN  744
             P+F HIG QP+  KE+       D  D+ + VFGY   + EY       HG F+ +L  
Sbjct  417  WPQFEHIGEQPVYNKEI--FAKNIDAFDSEA-VFGYLPRYSEYKFSPSTVHGDFKDDLYF  473

Query  745  FLMHRVF--NQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPI  802
            + + R+F  ++ P L QSF+  D   ++ +FAV   DD     DK Y  ++   TAK  +
Sbjct  474  WHLGRIFDTDKPPVLNQSFIECDKNALSRIFAV--EDD----TDKFYCHLYQKITAKRKM  527

Query  803  SRVAIP  808
            S    P
Sbjct  528  SYFGDP  533


 Score = 58.2 bits (139),  Expect = 2e-05, Method: Compositional matrix adjust.
 Identities = 37/136 (27%), Positives = 63/136 (46%), Gaps = 11/136 (8%)

Query  17   KVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMM  76
            K + F+ S+    +   G + P+ C+ V     + INP+   +  PM+ P+   +   + 
Sbjct  13   KSSTFNMSYDRKFSMNFGDLVPIHCQEVIPGDKISINPQHMTRLAPMIAPVMHEVNVFIH  72

Query  77   FFKYPLRALWDGYRDFIGNFREDLEEPYLDLNTVTRLDAM-AKTGSLGDYLGLPTTLFGK  135
            +F  P R +W  +  FI        E  LD + + R+  +    GSL D+LGLP T  G+
Sbjct  73   YFFVPNRIIWSNWEQFITG-----GESGLDQHLMPRVGNLPVSKGSLADHLGLPLTT-GR  126

Query  136  FGTTLSVATTGHIFGL  151
            F    +V   G ++ L
Sbjct  127  F----AVGNAGVLYNL  138


>gi|12085136|ref|NP_073538.1| major capsid protein [Bdellovibrio phage phiMH2K]
 gi|75089173|sp|Q9G059.1|F_BPPHM RecName: Full=Capsid protein VP1; Short=VP1 [Bdellovibrio phage 
phiMH2K]
 gi|12017984|gb|AAG45340.1|AF306496_1 Vp1 [Bdellovibrio phage phiMH2K]
Length=533

 Score =   111 bits (277),  Expect = 2e-22, Method: Compositional matrix adjust.
 Identities = 114/432 (26%), Positives = 180/432 (42%), Gaps = 36/432 (8%)

Query  384  SPYYNSGSANKDKQIKISAYSFRAYEGIYNAYIRDNRNNPYYVNGQVQYNKWIPTYDGGA  443
            S Y + G   +   ++I+A  FRAY  IYN + RD       + G++     +P  DG  
Sbjct  125  SIYDHFGIPTQVANLEINALPFRAYNLIYNDWFRDQN-----LIGKIA----VPKGDGPD  175

Query  444  DQNIYELRYANWEKDFLTTAVQSPQQGTAPLVGIttytetvettSDDGTPVTR--ELSRI  501
            +   Y+L  A    D+ T+A+  PQ+G A  + I          +    P      +   
Sbjct  176  NHADYQLLKAAKPHDYFTSALPWPQKGMAVEMPIGNSAPITYVPNAGNGPYPHFNWVQTP  235

Query  502  ALVDEDGKKYQVSFDSDSEGLKGVSYVELDNEVKLRQPRNLIDVVT-SGISINDLRNVNA  560
                 +G   QV+F     G K +S    D      Q   + D+ + +  +IN LR    
Sbjct  236  GGPGNNGALSQVTFG----GQKAISAAGNDPIGYDPQGTLIADLSSATAATINQLRQAMM  291

Query  561  YQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDG  620
             Q  LEL+ R G  Y +I++  FNV      L  PE+  G + D++ + + QT     D 
Sbjct  292  MQSLLELDARGGTRYVEILKSHFNVISLDFRLQRPEYLSGGTIDLQQNPVPQTSSSTTDS  351

Query  621  SQTYAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGL  680
             Q    A  + S      G S    + F  E   V+G +       Y Q L K ++ +  
Sbjct  352  PQGNLAAFSTASEFGNKIGFS----KSFV-EHGYVLGFIRARGQVTYQQGLHKMWSRQTR  406

Query  681  LDHYQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRT  740
             D + P+F  +G Q IL KE+     Y+ G  T S++FGY   + EY  +  +  G FR+
Sbjct  407  WDFFWPKFQELGEQAILNKEI-----YAQGNATDSEIFGYQERYGEYRFRPSEIKGQFRS  461

Query  741  NLSNFL----MHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDC  796
            N +  L    +   F  KP L ++F+  +   +     VT+ D        + G  WFD 
Sbjct  462  NFAESLDVWHLAEYFTVKPSLNKTFIESN-TPIERSLVVTRPD-----YPDLIGDFWFDY  515

Query  797  TAKLPISRVAIP  808
            T   P+    +P
Sbjct  516  THVRPMVTYGVP  527


 Score = 41.2 bits (95),  Expect = 3.0, Method: Compositional matrix adjust.
 Identities = 25/114 (22%), Positives = 50/114 (44%), Gaps = 0/114 (0%)

Query  19   NNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMMFF  78
            + F+ S     T +   +TP+F + +    ++ +N +  ++    V P+  R+     FF
Sbjct  23   SKFNRSFGTKDTFKFDDLTPIFIDEILPGDTINMNTKTFIRLATQVVPVMDRMMLDFYFF  82

Query  79   KYPLRALWDGYRDFIGNFREDLEEPYLDLNTVTRLDAMAKTGSLGDYLGLPTTL  132
              P R +WD +  F G      +     + T+T      +  S+ D+ G+PT +
Sbjct  83   FVPCRLVWDNWEKFNGAQDNPSDSTDYLIPTITAPAGGFENMSIYDHFGIPTQV  136


>gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus]
Length=539

 Score =   105 bits (263),  Expect = 2e-20, Method: Compositional matrix adjust.
 Identities = 109/428 (25%), Positives = 172/428 (40%), Gaps = 35/428 (8%)

Query  389  SGSANKDKQIKISAYSFRAYEGIYNAYIRDNRNNPYYVNGQVQYNKWIPTYDGGADQNIY  448
            +G  +    I  ++   RAY  I+N + RD           +Q +  +   DG      Y
Sbjct  137  AGQVDAGSSISHNSLFTRAYNLIWNEWFRDE---------NLQDSVVVDKGDGPDTYTDY  187

Query  449  ELRYANWEKDFLTTAVQSPQQGTAPLVGIttytetvettSDDGTPV-TRELSRIALVDED  507
             L       D+ T+A+  PQ+G A  V +          +D G P   RE+S   +    
Sbjct  188  TLLRRGKRHDYFTSALPWPQKGDA--VTLPLGGSANVVYNDTGDPAYIREVSTGNVWTTP  245

Query  508  GKKYQVSFDSDSEGLKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLEL  567
             ++   S   ++ G   V    ++ +              +  +IN +R     Q+ LE 
Sbjct  246  SRE---SVSKEANGNMSVPTGSVNAQYDPNGSLVADLSTATAATINAIRQSFQIQRLLER  302

Query  568  NMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKA  627
            + R G  Y +I+   F V      +  PE+ GG S  I ++ ++Q       G+ T    
Sbjct  303  DARGGTRYTEIVRSHFGVISPDARMQRPEYLGGGSAPIIVNPVAQQSASGASGTDTPLGT  362

Query  628  LGS-QSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQP  686
            LG+  +G+A     SG        E  +V+G+  V     Y Q L + F+     D + P
Sbjct  363  LGAVGTGLA-----SGHGFASSFTEHGVVVGLCSVRADLTYQQGLHRMFSRSTRYDFFFP  417

Query  687  EFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRTNLSNFL  746
             F+H+G QPIL KE+     Y+ G  T  DVFGY   W EY  K  Q  GL R+  +  L
Sbjct  418  VFSHLGEQPILNKEL-----YATGTSTDDDVFGYQEAWAEYRYKPSQVTGLMRSTAAGTL  472

Query  747  ----MHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPI  802
                + + F   P L  +F + D   V  V AV    +G +     +    FD     P+
Sbjct  473  DAWHLAQNFGSLPTLNSTF-IEDTPPVDRVVAVGSEANGQQFIFDAF----FDINMARPM  527

Query  803  SRVAIPRL  810
               ++P L
Sbjct  528  PMYSVPGL  535



Lambda      K        H        a         alpha
   0.320    0.137    0.411    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 6454025655732