bitscore colors: <40, 40-50, 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters



Query= Contig-30_CDS_annotation_glimmer3.pl_2_1

Length=544
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|547920049|ref|WP_022322420.1|  capsid protein VP1                   1128   0.0
gi|492501782|ref|WP_005867318.1|  hypothetical protein                  779   0.0
gi|649555287|gb|KDS61824.1|  capsid family protein                      759   0.0
gi|649569140|gb|KDS75238.1|  capsid family protein                      559   0.0
gi|649557305|gb|KDS63784.1|  capsid family protein                      451   1e-153
gi|609718276|emb|CDN73650.1|  conserved hypothetical protein            457   2e-151
gi|639237429|ref|WP_024568106.1|  hypothetical protein                  456   4e-151
gi|9629155|ref|NP_044312.1|  VP1                                        336   6e-104
gi|575094603|emb|CDL65960.1|  unnamed protein product                   319   2e-98
gi|47566141|ref|YP_022479.1|  structural protein                        320   3e-98


>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
 gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553

 Score =  1128 bits (2918),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 539/544 (99%), Positives = 540/544 (99%), Gaps = 0/544 (0%)

Query  1    MKRPRRNAFNLSYESKLTLNMGELVPIMCMPVVPGDKFRVKTESLVRLAPLVAPMMHRVN  60
            MKRPRRNAFNLSYESKLTLNMGELVPIMCMPVV GDKFRVKTESLVRLAPLVAPMMHRVN
Sbjct  10   MKRPRRNAFNLSYESKLTLNMGELVPIMCMPVVSGDKFRVKTESLVRLAPLVAPMMHRVN  69

Query  61   VFTHYFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSL  120
            VFTHYFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSL
Sbjct  70   VFTHYFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSL  129

Query  121  WDYLGLPTLSACGNKSYDVVNGVKVPSGFQVSALPFRAYQLIYNEYYRDQNLTEPIDFTL  180
            WDYLGLPTLSACGNKSYDVVNGVKVPSGFQVSALPFRAYQLIYNEYYRDQNLTEPIDFTL
Sbjct  130  WDYLGLPTLSACGNKSYDVVNGVKVPSGFQVSALPFRAYQLIYNEYYRDQNLTEPIDFTL  189

Query  181  GSGTTVGGDQLMALMSLRRRAWEKDYFTSALPWLQRGPEVTVPVQGAGGSMDVVYERQSD  240
            GSGTTVGGDQLMALMSLRRRAWEKDYFTSALPWLQRGPEVTVPVQGAGGSMDVVYERQSD
Sbjct  190  GSGTTVGGDQLMALMSLRRRAWEKDYFTSALPWLQRGPEVTVPVQGAGGSMDVVYERQSD  249

Query  241  SQKWVDSSGREFENGRAYDITMTRANDPNSALMVAVNGGTNNRAPELDPNGTLKVNVDEM  300
            SQKWVDSSGREFENG AYDITM RANDPNSALMVAVNGGTNNRAPELDPNGTLKVNVDEM
Sbjct  250  SQKWVDSSGREFENGHAYDITMARANDPNSALMVAVNGGTNNRAPELDPNGTLKVNVDEM  309

Query  301  GININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGRMPISVS  360
            GININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGRMPISVS
Sbjct  310  GININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGRMPISVS  369

Query  361  EVLQTSSTDETSPQANMAGHGISAGINNGFKHYFEEHGYIIGIMSITPRSGYQQGVPRDF  420
            EVLQTSSTDETSPQANMAGHGISAGINNGFKHYFEEHGYIIGIMSITPRSGYQQGVPRDF
Sbjct  370  EVLQTSSTDETSPQANMAGHGISAGINNGFKHYFEEHGYIIGIMSITPRSGYQQGVPRDF  429

Query  421  TKFDNMDFYFPEFAHLSEQEIKNQELFVSEDAAYNNGTFGYTPRYAEYKYHPSEAHGDFR  480
            TKFDNMDFYFPEFAHLSEQEIKNQELFVSEDAAYNNGTFGYTPRYAEYKYHPSEAHGDFR
Sbjct  430  TKFDNMDFYFPEFAHLSEQEIKNQELFVSEDAAYNNGTFGYTPRYAEYKYHPSEAHGDFR  489

Query  481  SNLSFWHLNRIFEDKPNLNTTFVECRPSNRVFATSETEDDKFWVQMYQDVKALRLMPKYG  540
             NLSFWHLNRIFEDKPNLNTTFVEC+PSNRVFATSETEDDKFWVQMYQDVKALRLMPKYG
Sbjct  490  GNLSFWHLNRIFEDKPNLNTTFVECKPSNRVFATSETEDDKFWVQMYQDVKALRLMPKYG  549

Query  541  TPML  544
            TPML
Sbjct  550  TPML  553


>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
 gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis 
CL09T03C24]
Length=538

 Score =   779 bits (2011),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 384/550 (70%), Positives = 444/550 (81%), Gaps = 27/550 (5%)

Query  1    MKRPRRNAFNLSYESKLTLNMGELVPIMCMPVVPGDKFRVKTESLVRLAPLVAPMMHRVN  60
            +KRPRRN FNLSYE+KLT N GELVPIMC PVVPGDKFRV TE LVRLAPLVAPMMHRV+
Sbjct  10   LKRPRRNVFNLSYENKLTANAGELVPIMCKPVVPGDKFRVNTEMLVRLAPLVAPMMHRVD  69

Query  61   VFTHYFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSL  120
            VFTHYFFVPNRL+WN+WEDFITKGVDG D P+FPKI +  D    +SA+++ +   D SL
Sbjct  70   VFTHYFFVPNRLLWNQWEDFITKGVDGTDTPVFPKIALRPDWVNPTSAAVLLD---DGSL  126

Query  121  WDYLGLPTLSACGNKSYD--VVNGVKVPSGFQVSALPFRAYQLIYNEYYRDQNLTEPIDF  178
            WDYLGLPT+    N ++     N V  P G+QVSALPFRAYQLIYNEYYRDQNLT+PI+F
Sbjct  127  WDYLGLPTIGGFNNVAFPNRSPNSVMPPVGYQVSALPFRAYQLIYNEYYRDQNLTKPIEF  186

Query  179  TLGSGTTVGGDQLMALMSLRRRAWEKDYFTSALPWLQRGPEVTVPVQGAGGSMDVVYERQ  238
            +L SG  +  D++  L++LRRR WEKDYFTSALPW+QRGPEVTVP+QG+GG++DV  +  
Sbjct  187  SLNSGIVLSADEVTRLLTLRRRTWEKDYFTSALPWVQRGPEVTVPIQGSGGNLDVTLKND  246

Query  239  SDSQKWVDSSGREFENGRAYDITMTRANDPNSALMVA----VNGGTNNRAPELDPNGTLK  294
            + +                Y +  T +N P  A+ +     + GGT+    E D     +
Sbjct  247  AHAD--------------TYRMPGT-SNRPAGAMQLVGGALIAGGTDGAYLEPD---NFQ  288

Query  295  VNVDEMGININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGR  354
            VNVDE+G++INDLRTSNALQRWFERNAR GSRYIEQILSHFGVRSSDARLQRPQFLGGGR
Sbjct  289  VNVDELGVSINDLRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGR  348

Query  355  MPISVSEVLQTSSTDETSPQANMAGHGISAGINNGFKHYFEEHGYIIGIMSITPRSGYQQ  414
             PISVSEVLQTS+TD TSPQANMAGHGISAG+N+GFK YFEEHGYIIGIMSI PR+GYQQ
Sbjct  349  TPISVSEVLQTSATDSTSPQANMAGHGISAGVNHGFKRYFEEHGYIIGIMSIRPRTGYQQ  408

Query  415  GVPRDFTKFDNMDFYFPEFAHLSEQEIKNQELFVSEDAAYNNGTFGYTPRYAEYKYHPSE  474
            GVP+DF KFDNMDFYFPEFAHL EQEIKN+E+++ +  A NNGTFGYTPRYAEYKY  +E
Sbjct  409  GVPKDFRKFDNMDFYFPEFAHLGEQEIKNEEVYLQQTPASNNGTFGYTPRYAEYKYSMNE  468

Query  475  AHGDFRSNLSFWHLNRIFEDKPNLNTTFVECRPSNRVFATSETEDDKFWVQMYQDVKALR  534
             HGDFR N++FWHLNRIF + PNLNTTFVEC PSNRVFAT+ET DDK+W+Q+YQDVKALR
Sbjct  469  VHGDFRGNMAFWHLNRIFSESPNLNTTFVECNPSNRVFATAETSDDKYWIQLYQDVKALR  528

Query  535  LMPKYGTPML  544
            LMPKYGTPML
Sbjct  529  LMPKYGTPML  538


>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 6]
Length=541

 Score =   759 bits (1961),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 379/549 (69%), Positives = 432/549 (79%), Gaps = 22/549 (4%)

Query  1    MKRPRRNAFNLSYESKLTLNMGELVPIMCMPVVPGDKFRVKTESLVRLAPLVAPMMHRVN  60
            +KRPRRN FNLSYE+KLT+N GEL+PIMC PVVPGDKFRV TE LVRLAPLVAPMMHRV+
Sbjct  10   LKRPRRNVFNLSYENKLTVNAGELIPIMCKPVVPGDKFRVNTEMLVRLAPLVAPMMHRVD  69

Query  61   VFTHYFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSL  120
            VFTHYFFVPNRL+WN+WEDFITKGVDG D P+FP          V +A+     FGD SL
Sbjct  70   VFTHYFFVPNRLIWNKWEDFITKGVDGTDSPVFPTYSF---PSTVDTAN-AHNSFGDGSL  125

Query  121  WDYLGLPTLSACGNKSYDVV--NGVKVPSGFQVSALPFRAYQLIYNEYYRDQNLTEPIDF  178
            WDYLGLP+++  G   + V   NGVK P+GF+VSALPFRAY LIYNEYYRDQNLT  ++ 
Sbjct  126  WDYLGLPSINQIGEAVFQVQSPNGVKAPAGFKVSALPFRAYHLIYNEYYRDQNLTSELEI  185

Query  179  TLGSGTTVGGDQL---MALMSLRRRAWEKDYFTSALPWLQRGPEVTVPVQGAGGSMDVVY  235
            TL SG      QL    +L  L RRAWEKDYFTSALPW+QRGPEVTVP+ G GG + V  
Sbjct  186  TLDSGNY----QLPVNSSLWQLHRRAWEKDYFTSALPWVQRGPEVTVPING-GGEIPVEM  240

Query  236  ERQSDSQKWVDSSGREFENGRAYDITMTRANDPNSALMVAVNGGTNNRAPELDPNGTLKV  295
            +    +QK      R+  +G     +        S L     G    +A  ++P+  + V
Sbjct  241  KEGFAAQKITTFPDRKPISGSEVLYSAP------SVLSYGQIGSIKGQA-LIEPDNFV-V  292

Query  296  NVDEMGININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGRM  355
            N D+MG+NIND+RTSNALQRWFERNAR GSRYIEQILSHFGVRSSDARLQRPQFLGGGR 
Sbjct  293  NTDQMGVNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRT  352

Query  356  PISVSEVLQTSSTDETSPQANMAGHGISAGINNGFKHYFEEHGYIIGIMSITPRSGYQQG  415
            PISVSEVLQTSSTD TSPQANMAGHGISAG+N+GF  YFEEHGYI+GIMSI PR+GYQQG
Sbjct  353  PISVSEVLQTSSTDSTSPQANMAGHGISAGVNHGFTRYFEEHGYIMGIMSIRPRTGYQQG  412

Query  416  VPRDFTKFDNMDFYFPEFAHLSEQEIKNQELFVSEDAAYNNGTFGYTPRYAEYKYHPSEA  475
            VP+DF KFDNMDFYFPEFAHL EQEIKN+EL+++E  A N GTFGYTPRYAEYKY  +E 
Sbjct  413  VPKDFRKFDNMDFYFPEFAHLGEQEIKNEELYLNESDAANEGTFGYTPRYAEYKYSQNEV  472

Query  476  HGDFRSNLSFWHLNRIFEDKPNLNTTFVECRPSNRVFATSETEDDKFWVQMYQDVKALRL  535
            HGDFR N++FWHLNRIF++KPNLNTTFVEC PSNRVFAT+ET DDK+WVQ+YQD+KALRL
Sbjct  473  HGDFRGNMAFWHLNRIFKEKPNLNTTFVECNPSNRVFATAETSDDKYWVQIYQDIKALRL  532

Query  536  MPKYGTPML  544
            MPKYGTPML
Sbjct  533  MPKYGTPML  541


>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str. 
3999B T(B) 6]
Length=390

 Score =   559 bits (1441),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 280/403 (69%), Positives = 319/403 (79%), Gaps = 16/403 (4%)

Query  145  VPSGFQVSALPFRAYQLIYNEYYRDQNLTEPIDFTLGSGTTVGGDQL---MALMSLRRRA  201
             P+GF+VSALPFRAY LIYNEYYRDQNLT  ++ TL SG      QL    +L  L RRA
Sbjct  1    APAGFKVSALPFRAYHLIYNEYYRDQNLTSELEITLDSGNY----QLPVNSSLWQLHRRA  56

Query  202  WEKDYFTSALPWLQRGPEVTVPVQGAGGSMDVVYERQSDSQKWVDSSGREFENGRAYDIT  261
            WEKDYFTSALPW+QRGPEVTVP+ G GG + V  +    +QK      R+  +G     +
Sbjct  57   WEKDYFTSALPWVQRGPEVTVPING-GGEIPVEMKEGFAAQKITTFPDRKPISGSEVLYS  115

Query  262  MTRANDPNSALMVAVNGGTNNRAPELDPNGTLKVNVDEMGININDLRTSNALQRWFERNA  321
                    S L     G    +A  ++P+  + VN D+MG+NIND+RTSNALQRWFERNA
Sbjct  116  AP------SVLSYGQIGSIKGQA-LIEPDNFV-VNTDQMGVNINDIRTSNALQRWFERNA  167

Query  322  RGGSRYIEQILSHFGVRSSDARLQRPQFLGGGRMPISVSEVLQTSSTDETSPQANMAGHG  381
            R GSRYIEQILSHFGVRSSDARLQRPQFLGGGR PISVSEVLQTSSTD TSPQANMAGHG
Sbjct  168  RSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHG  227

Query  382  ISAGINNGFKHYFEEHGYIIGIMSITPRSGYQQGVPRDFTKFDNMDFYFPEFAHLSEQEI  441
            ISAG+N+GF  YFEEHGYI+GIMSI PR+GYQQGVP+DF KFDNMDFYFPEFAHL EQEI
Sbjct  228  ISAGVNHGFTRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQEI  287

Query  442  KNQELFVSEDAAYNNGTFGYTPRYAEYKYHPSEAHGDFRSNLSFWHLNRIFEDKPNLNTT  501
            KN+EL+++E  A N GTFGYTPRYAEYKY  +E HGDFR N++FWHLNRIF++KPNLNTT
Sbjct  288  KNEELYLNESDAANEGTFGYTPRYAEYKYSQNEVHGDFRGNMAFWHLNRIFKEKPNLNTT  347

Query  502  FVECRPSNRVFATSETEDDKFWVQMYQDVKALRLMPKYGTPML  544
            FVEC PSNRVFAT+ET DDK+WVQ+YQD+KALRLMPKYGTPML
Sbjct  348  FVECNPSNRVFATAETSDDKYWVQIYQDIKALRLMPKYGTPML  390


>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 6]
Length=245

 Score =   451 bits (1161),  Expect = 1e-153, Method: Compositional matrix adjust.
 Identities = 207/245 (84%), Positives = 227/245 (93%), Gaps = 0/245 (0%)

Query  300  MGININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGRMPISV  359
            MG+NIND+RTSNALQRWFERNAR GSRYIEQILSHFGVRSSDARLQRPQFLGGGR PISV
Sbjct  1    MGVNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISV  60

Query  360  SEVLQTSSTDETSPQANMAGHGISAGINNGFKHYFEEHGYIIGIMSITPRSGYQQGVPRD  419
            SEVLQTSSTD TSPQANMAGHGISAG+N+GF  YFEEHGYI+GIMSI PR+GYQQGVP+D
Sbjct  61   SEVLQTSSTDSTSPQANMAGHGISAGVNHGFTRYFEEHGYIMGIMSIRPRTGYQQGVPKD  120

Query  420  FTKFDNMDFYFPEFAHLSEQEIKNQELFVSEDAAYNNGTFGYTPRYAEYKYHPSEAHGDF  479
            F KFDNMDFYFPEFAHL EQEIKN+EL+++E  A N GTFGYTPRYAEYKY  +E HGDF
Sbjct  121  FRKFDNMDFYFPEFAHLGEQEIKNEELYLNESDAANEGTFGYTPRYAEYKYSQNEVHGDF  180

Query  480  RSNLSFWHLNRIFEDKPNLNTTFVECRPSNRVFATSETEDDKFWVQMYQDVKALRLMPKY  539
            R N++FWHLNRIF++KPNLNTTFVEC PSNRVFAT+ET DDK+WVQ+YQD+KALRLMPKY
Sbjct  181  RGNMAFWHLNRIFKEKPNLNTTFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLMPKY  240

Query  540  GTPML  544
            GTPML
Sbjct  241  GTPML  245


>gi|609718276|emb|CDN73650.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=537

 Score =   457 bits (1175),  Expect = 2e-151, Method: Compositional matrix adjust.
 Identities = 246/568 (43%), Positives = 341/568 (60%), Gaps = 71/568 (13%)

Query  2    KRPRRNAFNLSYESKLTLNMGELVPIMCMPVVPGDKFRVKTESLVRLAPLVAPMMHRVNV  61
            K P+ + FN+SY+ K ++N G+LVPI C  V+PGDK  +  + + RLAP++AP+MH VNV
Sbjct  10   KAPKSSTFNMSYDRKFSMNFGDLVPIHCQEVIPGDKISINPQHMTRLAPMIAPVMHEVNV  69

Query  62   FTHYFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSLW  121
            F HYFFVPNR++W+ WE FIT G  G D  + P++      +L  S           SL 
Sbjct  70   FIHYFFVPNRIIWSNWEQFITGGESGLDQHLMPRV-----GNLPVSKG---------SLA  115

Query  122  DYLGLPTLS---ACGNKS--YDVVNGVKVPSGFQVSALPFRAYQLIYNEYYRDQNLTEPI  176
            D+LGLP  +   A GN    Y++VN            LPF AYQ I++E+YRD+NL +P+
Sbjct  116  DHLGLPLTTGRFAVGNAGVLYNLVN-----------LLPFLAYQKIWDEFYRDENLIQPL  164

Query  177  DFTLGSGTTVGG-------------DQLMALMSLRRRAWEKDYFTSALPWLQRGPEVTVP  223
             F   +G  V                +   L  +R+RAW  DYFTSALP+ Q+G  V +P
Sbjct  165  -FRDSNGNPVKMFNDGINDHNLPPYSKFTELFKMRKRAWHHDYFTSALPFAQKGNAVKIP  223

Query  224  VQGAGGSMDVVYERQSDSQKWVDSSGREFENGRAYDITMTRANDPNSALMVAVNGGTNNR  283
            +    G++ + YE  S                + +   M     PN  L   VNG   + 
Sbjct  224  I-FPQGNVPLTYEMGS----------------QTFIKDMAGNPAPNKDLRSDVNGNLQDV  266

Query  284  APE---LDPNGTLKVNVDEMGIN-INDLRTSNALQRWFERNARGGSRYIEQILSHFGVRS  339
            + +   LDP+  LK+N+    ++ +NDLR +  LQ W E+NAR GSRY E ILS FGV++
Sbjct  267  SGQPLSLDPSKNLKLNMASENVSTVNDLRRAFKLQEWLEKNARAGSRYAESILSFFGVKT  326

Query  340  SDARLQRPQFLGGGRMPISVSEVLQTSSTDETSPQANMAGHGISAGINNGFKHYFEEHGY  399
            SD RLQRP+FLGG + PI +SEVLQ S+TD T+PQ NMAGHGI  G + GF  +FEEHGY
Sbjct  327  SDGRLQRPEFLGGNKSPIMISEVLQQSATDSTTPQGNMAGHGIGIGKDGGFSRFFEEHGY  386

Query  400  IIGIMSITPRSGYQQGVPRDFTKFDNMDFYFPEFAHLSEQEIKNQELFVSE-DAAYNNGT  458
            +IG+MS+ P++ Y QG+PR F+K D  D+++P+F H+ EQ + N+E+F    DA  +   
Sbjct  387  VIGLMSVIPKTSYSQGIPRHFSKSDKFDYFWPQFEHIGEQPVYNKEIFAKNIDAFDSEAV  446

Query  459  FGYTPRYAEYKYHPSEAHGDFRSNLSFWHLNRIFE-DKPN-LNTTFVECRPS--NRVFAT  514
            FGY PRY+EYK+ PS  HGDF+ +L FWHL RIF+ DKP  LN +F+EC  +  +R+FA 
Sbjct  447  FGYLPRYSEYKFSPSTVHGDFKDDLYFWHLGRIFDTDKPPVLNQSFIECDKNALSRIFAV  506

Query  515  SETEDDKFWVQMYQDVKALRLMPKYGTP  542
             E + DKF+  +YQ + A R M  +G P
Sbjct  507  -EDDTDKFYCHLYQKITAKRKMSYFGDP  533


>gi|639237429|ref|WP_024568106.1| hypothetical protein [Elizabethkingia anophelis]
Length=546

 Score =   456 bits (1174),  Expect = 4e-151, Method: Compositional matrix adjust.
 Identities = 249/567 (44%), Positives = 335/567 (59%), Gaps = 60/567 (11%)

Query  2    KRPRRNAFNLSYESKLTLNMGELVPIMCMPVVPGDKFRVKTESLVRLAPLVAPMMHRVNV  61
            K P+ + FN+SY+ K ++N G+LVPI C  +VPGDK  +  + + RLAP++AP+MH VNV
Sbjct  10   KAPKSSTFNMSYDRKFSMNFGDLVPIHCQEIVPGDKISINPQHMTRLAPMLAPVMHEVNV  69

Query  62   FTHYFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSLW  121
            F HYFFVPNR++W  WE FIT G  G D  M P +Q         +  + K     SSL 
Sbjct  70   FIHYFFVPNRILWKNWEAFITGGQSGLDAHMLPVVQ---------NLPVPK-----SSLG  115

Query  122  DYLGLPTLSACGNKSYDVVNGVKVPSGFQVSALPFRAYQLIYNEYYRDQNLTEPIDFTLG  181
            DYLGLP         + V N   +P   +VS LPF AYQ I++EYYRD+NL + + F   
Sbjct  116  DYLGLPLTEG----RFAVGNDGVLPD--RVSMLPFLAYQKIWDEYYRDENLIDSV-FVDK  168

Query  182  SGTT----VGGD---------QLMALMSLRRRAWEKDYFTSALPWLQRGPEVTVPVQ---  225
            +G      + G          +   L  +++RAW  DYFTSALP+ Q+G  V +P+Q   
Sbjct  169  NGDKRELFIDGINYWNPSLPYEFRQLFDIKKRAWHHDYFTSALPFAQKGAAVKMPLQMTA  228

Query  226  ----GAGGSMDVVYERQSDSQKWVDSSGREFENGRAYDITMTRANDPNSALMVAVNGGTN  281
                  GG+    + ++ D    +  +G   E+G           D    LMV  +   N
Sbjct  229  DLFYNPGGN---TFVKKPDGS--LSHTGFRLEDGSV-------PADGIGHLMVETSSTGN  276

Query  282  NRAPELDPNGTLKVNVDEM-GININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSS  340
            +    +D +  L V++    G  INDLR +  LQ W E+NAR GSRY E ILS FGV++S
Sbjct  277  SNPVNIDNSSNLGVDLKTASGSTINDLRRAFKLQEWLEKNARAGSRYAESILSFFGVKTS  336

Query  341  DARLQRPQFLGGGRMPISVSEVLQTSSTDETSPQANMAGHGISAGINNGFKHYFEEHGYI  400
            D RLQRP+FLGG + PI +SEVLQ SSTD T+PQ NMAGHGIS G   GF  +FEEHGY+
Sbjct  337  DGRLQRPEFLGGNKTPILISEVLQQSSTDSTTPQGNMAGHGISVGKEGGFSKFFEEHGYV  396

Query  401  IGIMSITPRSGYQQGVPRDFTKFDNMDFYFPEFAHLSEQEIKNQELFVSEDAAYNN-GTF  459
            IG+MS+ P++ Y QG+PR F+KFD  D+++P+F H+ EQ + N+E+F      Y++ G F
Sbjct  397  IGLMSVIPKTSYSQGIPRHFSKFDKFDYFWPQFEHIGEQPVYNKEIFAKNVGDYDSGGVF  456

Query  460  GYTPRYAEYKYHPSEAHGDFRSNLSFWHLNRIFEDK--PNLNTTFVECRPS--NRVFATS  515
            GY PRY+EYKY PS  HGDF+  L FWHL RIF+    P LN  F+E   S  +R+FA  
Sbjct  457  GYVPRYSEYKYSPSTIHGDFKDTLYFWHLGRIFDSSAPPKLNRDFIEVNKSGLSRIFAV-  515

Query  516  ETEDDKFWVQMYQDVKALRLMPKYGTP  542
            E   DKF+  +YQ + A R M  +G P
Sbjct  516  EDNSDKFYCHLYQKITAKRKMSYFGDP  542


>gi|9629155|ref|NP_044312.1| VP1 [Chlamydia phage 1]
 gi|139180|sp|P19192.2|F_BPCHP RecName: Full=Capsid protein VP1; AltName: Full=Protein VP1; 
Short=VP1 [Chlamydia phage 1]
 gi|93817|pir||JU0345 major capsid protein VP1 - Chlamydophila psittaci phage Chp1
 gi|217762|dbj|BAA00515.1| VP1 [Chlamydia phage 1]
Length=596

 Score =   336 bits (862),  Expect = 6e-104, Method: Compositional matrix adjust.
 Identities = 213/593 (36%), Positives = 300/593 (51%), Gaps = 76/593 (13%)

Query  5    RRNAFNLSYESKLTLNMGELVPIMCMPVVPGDKFRVKTESLVRLAPLVAPMMHRVNVFTH  64
            RR++F+ S+  K T +M  LVP     V+PGD F +    L RL  LV P+M  + + T 
Sbjct  24   RRSSFDRSHGYKTTFDMDYLVPFFVDEVLPGDTFSLSETHLCRLTTLVQPIMDNIQLTTQ  83

Query  65   YFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSLWDYL  124
            +FFVPNRL+W+ WE FIT G    D P+        + + V   +     + ++S++DY 
Sbjct  84   FFFVPNRLLWDNWESFITGG----DEPVAWTSTNPANEYFVPQVTSPDGGYAENSIYDYF  139

Query  125  GLPTLSACGNKSYDVVNGVKVPSGFQVSALPFRAYQLIYNEYYRDQNLTEPIDFTLGSGT  184
            GLPT  A                 ++   LP RAY LI+NEYYRD+NL E +    G   
Sbjct  140  GLPTKVA----------------NYRHQVLPLRAYNLIFNEYYRDENLQESLPVWTGDAD  183

Query  185  -----TVG-----GDQLMALMSLRRRAWEKDYFTSALPWLQRGPEVTVPVQGA-------  227
                 T G      D +  +  L RR    DYFTSALP LQ+GP V + + G        
Sbjct  184  PKVDPTTGEESQEDDAVPYVYKLMRRNKRYDYFTSALPGLQKGPSVGIGITGGDSGRLPV  243

Query  228  -GGSMDVVYERQSD------------SQKWVDSSGREFENGRAYDITMTRANDPNSALM-  273
             G ++    +  SD            SQKW  + GR   +G       T  N P   ++ 
Sbjct  244  HGLAIRSYLDDSSDDQFSFGVSYVNASQKWFTADGR-LTSGMGSVPVGTTGNFPIDNVVY  302

Query  274  -------VAVNGGTNNRAPELDPNGTLKVNVD---EMGININDLRTSNALQRWFERNARG  323
                   VA  G  ++ +      G   V VD      + IN LR +  LQ+WFE++AR 
Sbjct  303  PSYFGTTVAQTGSPSSSSTPPFVKGDFPVYVDLAASSSVTINSLRNAITLQQWFEKSARY  362

Query  324  GSRYIEQILSHFGVRSSDARLQRPQFLGGGRMPISVSEVLQTSSTDETSPQANMAGHGIS  383
            GSRY+E +  HFGV   D R QRP +LGG +  +SV+ V+Q SSTD  SPQ N++ + +S
Sbjct  363  GSRYVESVQGHFGVHLGDYRAQRPIYLGGSKSYVSVNPVVQNSSTDSVSPQGNLSAYALS  422

Query  384  AGINNGFKHYFEEHGYIIGIMSITPRSGYQQGVPRDFTKFDNMDFYFPEFAHLSEQEIKN  443
                + F   F EHG++IG++S T    YQQG+ R +++F   D+Y+P FAHL EQ + N
Sbjct  423  TDTKHLFTKSFVEHGFVIGLLSATADLTYQQGLERQWSRFSRYDYYWPTFAHLGEQPVYN  482

Query  444  QELFVSED-------AAYNNGTFGYTPRYAEYKYHPSEAHGDFRSN----LSFWHLNRIF  492
            +E++   D       +A N+  FGY  RYAEY+Y PS+  G FRSN    L  WHL++ F
Sbjct  483  KEIYCQSDTVMDPSGSAVNDVPFGYQERYAEYRYKPSKVTGLFRSNATGTLDSWHLSQNF  542

Query  493  EDKPNLNTTFVECR-PSNRVFATSETEDDKFWVQMYQDVKALRLMPKYGTPML  544
             + P LN TF++   P +R  A  +  D  F    Y + + +R MP Y  P L
Sbjct  543  ANLPTLNETFIQSNTPIDRALAVPDQPD--FICDFYFNYRCIRPMPVYSVPGL  593


>gi|575094603|emb|CDL65960.1| unnamed protein product [uncultured bacterium]
Length=507

 Score =   319 bits (817),  Expect = 2e-98, Method: Compositional matrix adjust.
 Identities = 209/558 (37%), Positives = 298/558 (53%), Gaps = 72/558 (13%)

Query  5    RRNAFNLSYESKLTLNMGELVPIMCMPVVPGDKFRVKTESLVRLAPLVAPMMHRVNVFTH  64
            +R+  +LS+ + LT  MG + PI C+ V+PGD F       +R  PLVAP+MH V+V   
Sbjct  3    KRSIHSLSHFNLLTTQMGLITPIECVDVLPGDSFMQDNSVFLRTMPLVAPVMHPVHVEIR  62

Query  65   YFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSLWDYL  124
             FFVP R++W+E+EDFIT G  G +  + P        H V+     K   G+  L DYL
Sbjct  63   SFFVPLRIIWDEFEDFITGGPKGLNNSVHP--------HFVADEQNCK--LGE--LNDYL  110

Query  125  GLPTLSACGNKSYDVVNGVKVPSGFQVSALPFRAYQLIYNEYYRDQNLTE--PIDFTLGS  182
            G+P  +  G K               V+ L  RAY LIYNEY+RDQ++ E  PI F  G 
Sbjct  111  GIPPDTLLGTK---------------VNCLYARAYALIYNEYFRDQDIDEELPISFESGQ  155

Query  183  GTTVGGDQLMALMSLRRRAWEKDYFTSALPWLQRGPEVTVPVQGAGGSMDVVYERQ----  238
             TT           L+R  W KDYFT A P+ Q+GP   VP+  +GG++  +        
Sbjct  156  DTTTPS-------VLQRADWRKDYFTLARPFEQKGPAAVVPITSSGGAITGLTGTTTTTV  208

Query  239  ---SDSQKWVDS----SGREFENGRAYDIT----MTRANDPNSALMVAVNGGTNNRAPEL  287
               SDS +W  +    SG    N R+         T++N  ++A  V+    + N + +L
Sbjct  209  TIGSDSGRWNPATGLKSGAGVNNSRSLIFGDGDGFTQSNHTHAATAVSNTVVSGNASFDL  268

Query  288  DPNGTLKVNVDEMGININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRP  347
               GT+         +I+D R +  LQR+ E  +  GSRY E   +  GVRSSD RLQ P
Sbjct  269  ---GTM---------SISDFREAMQLQRFEEIRSLYGSRY-EDYQAVLGVRSSDGRLQNP  315

Query  348  QFLGGGRMPISVSEVLQTSSTDETSPQANMAGHGISAGINNGFKHYFEEHGYIIGIMSIT  407
            ++LGGG+  I  SEVLQTS  + +SP   + GHGI A     F  +F EHG ++ ++ + 
Sbjct  316  EYLGGGQRTIQFSEVLQTS--EGSSPVGTLRGHGIGALKTKRFLRFFNEHGILLTLLVVR  373

Query  408  PRSGYQQGVPRDFTKFDNMDFYFPEFAHLSEQEIKNQELFVSEDAAYNNG--TFGYTPRY  465
            P S Y QG+ R F +    D++ P+F H+ +Q + N+EL+    AA  NG  TFG++ RY
Sbjct  374  PISIYTQGLNRMFLRETRFDYWQPQFEHIGQQSVLNKELY----AASPNGDDTFGFSNRY  429

Query  466  AEYKYHPSEAHGDFRSNLSFWHLNRIFEDKPNLNTTFVECRPSNRVFATSETEDDKFWVQ  525
             EY+YHPS  HG+FR+    WHL R FE  P LN+ F++C P+ R+FA +    D+  V 
Sbjct  430  NEYRYHPSNVHGEFRTYYEDWHLGRKFESTPTLNSDFLKCHPTTRIFAETSGNYDQLLVM  489

Query  526  MYQDVKALRLMPKYGTPM  543
                ++A RL+ K G P+
Sbjct  490  CQNHIRARRLISKNGDPI  507


>gi|47566141|ref|YP_022479.1| structural protein [Chlamydia phage 3]
 gi|47522476|emb|CAD79477.1| structural protein [Chlamydia phage 3]
Length=565

 Score =   320 bits (821),  Expect = 3e-98, Method: Compositional matrix adjust.
 Identities = 208/579 (36%), Positives = 297/579 (51%), Gaps = 80/579 (14%)

Query  3    RPRRNAFNLSYESKLTLNMGELVPIMCMPVVPGDKFRVKTESLVRLAPLVAPMMHRVNVF  62
            R +R++F+ S   K T N G L+PI C  V+PGD F +K   L R+A  + P+M  + + 
Sbjct  22   RIQRSSFDRSCGLKTTFNAGYLIPIFCDEVLPGDTFSLKEAFLARMATPIFPLMDNLRLD  81

Query  63   THYFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSLWD  122
            T YFFVP RL+W+ ++ F  +  + +D   F          L    +     F + S+ D
Sbjct  82   TQYFFVPLRLIWSNFQKFCGEQDNPDDSTDF----------LTPVLTAPTGGFTEGSIHD  131

Query  123  YLGLPTLSACGNKSYDVVNGVKVPSGFQVSALPFRAYQLIYNEYYRDQNLTEPIDFTLGS  182
            YLGLPT  A                G Q +A   RAY LI+N+YYRD+N+ E ++  +G 
Sbjct  132  YLGLPTKVA----------------GVQCAAFWHRAYNLIWNQYYRDENIQESVEVQMGD  175

Query  183  GTTVGGDQLMALMSLRRRAWEKDYFTSALPWLQRGPEVTVPVQGA----GGSMDV-----  233
             TT   D++     L +R    DYFTS LPW Q+GP VT+ V G     G  M+V     
Sbjct  176  TTT---DEVKN-YELLKRGKRYDYFTSCLPWPQKGPAVTIGVGGKAPIEGLYMNVNSNNP  231

Query  234  ----VYERQSDSQKWVDSSGREFENGRAYDITMTRANDPNSALMVAVNGGTNNRAPELDP  289
                V + QS  +   D  G +     AY+ T            V VN       P+ +P
Sbjct  232  VGKFVLDSQSTPRVLQDLQGNKLSGIAAYNQT---------GKHVYVNSAWYTVTPQSEP  282

Query  290  NGTLKVN----------VDEMG----ININDLRTSNALQRWFERNARGGSRYIEQILSHF  335
              TL+              ++G    + IN LR +  LQ+ +ER+ARGG+RYIE I SHF
Sbjct  283  GATLENGNYYTTQKPQIYADLGATSPVTINSLREAFQLQKLYERDARGGTRYIEIIRSHF  342

Query  336  GVRSSDARLQRPQFLGGGRMPISVSEVLQTSSTDETSPQANMAGHGISAGINNGFKHYFE  395
             V+S DARLQR ++LGG   P+++S + QTSSTD TSPQ N+A +G + G    F   F 
Sbjct  343  NVQSPDARLQRAEYLGGSSTPVNISPIPQTSSTDSTSPQGNLAAYGTAIGSKRVFTKSFT  402

Query  396  EHGYIIGIMSITPRSGYQQGVPRDFTKFDNMDFYFPEFAHLSEQEIKNQELFVSEDAAYN  455
            EHG I+G+ S+     YQQG+ R +++    DFY+P  +HL EQ + N+E++    +  N
Sbjct  403  EHGVILGLASVRADLNYQQGLDRMWSRRTRWDFYWPALSHLGEQAVLNKEIYCQGPSVKN  462

Query  456  NG-------TFGYTPRYAEYKYHPSEAHGDFRSN----LSFWHLNRIFEDKPNLNTTFVE  504
            +G        FGY  R+AEY+Y  S+  G FRSN    L  WHL + FE+ P L+  F+E
Sbjct  463  SGGEIVDEQVFGYQERFAEYRYKTSKITGKFRSNATSSLDSWHLAQEFENLPTLSPEFIE  522

Query  505  CRPS-NRVFATSETEDDKFWVQMYQDVKALRLMPKYGTP  542
              P  +RV A S   +  F +  +  ++  R MP Y  P
Sbjct  523  ENPPMDRVLAVS--NEPHFLLDGWFSLRCARPMPVYSVP  559



Lambda      K        H        a         alpha
   0.319    0.135    0.412    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 3935252973510