           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-22_CDS_annotation_glimmer3.pl_2_6

Length=561
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|575094431|emb|CDL65804.1|  unnamed protein product                   480   7e-160
gi|575094544|emb|CDL65904.1|  unnamed protein product                   465   4e-154
gi|575096056|emb|CDL66947.1|  unnamed protein product                   457   4e-151
gi|575094572|emb|CDL65928.1|  unnamed protein product                   450   2e-148
gi|575094492|emb|CDL65859.1|  unnamed protein product                   437   1e-143
gi|575094496|emb|CDL65862.1|  unnamed protein product                   435   4e-142
gi|575094415|emb|CDL65790.1|  unnamed protein product                   423   1e-137
gi|557745632|ref|YP_008798242.1|  major capsid protein                  402   6e-130
gi|313766927|gb|ADR80653.1|  putative major coat protein                382   4e-122
gi|530695351|gb|AGT39907.1|  major capsid protein                       382   5e-122


>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560

 Score =   480 bits (1235),  Expect = 7e-160, Method: Compositional matrix adjust.
 Identities = 268/578 (46%), Positives = 360/578 (62%), Gaps = 42/578 (7%)

Query  1    VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR  60
            +NRN+  +F + P +  SR+RFNR    L TFD+G+++P YVDEVLPGDTF +D +AIIR
Sbjct  1    MNRNSNFNFARNPGVSLSRSRFNRTSDRLDTFDTGEIVPIYVDEVLPGDTFELDMTAIIR  60

Query  61   MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVI--NGK  118
             +TP +PVMD++F+D Y+F+ PNR+ W+++++ MGE   T W    +YSVP++     G 
Sbjct  61   GSTPIFPVMDNSFLDVYFFFVPNRLTWEHWRELMGENRTTAWTQPVDYSVPQVTAPAGGW  120

Query  119  EKSPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDAS  178
            E      E S+ D+MGIPTKV  +  VNALP RAY  I+NEFFR++N+ N   ++  DA+
Sbjct  121  E------ELSLADHMGIPTKVDNI-SVNALPFRAYGLIYNEFFRNQNLTNPTQVEVTDAN  173

Query  179  IDYQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGN-APV  237
            I  ++ N+ +++ D     AI G +CL   KF DYFT  LP PQ+G  V + +  +  PV
Sbjct  174  IAGKNPNDVKNSND----WAITGAKCLKSAKFFDYFTGALPQPQKGEPVEINLASSWLPV  229

Query  238  GM----------YKNDSLT-EFGTINGNSE--IFLNQALNGSALAPKISNSFKEGARRAL  284
            G+            +D+LT E  +  GN++    L        + P    +F+  A    
Sbjct  230  GIGDYHGPLDKVSNSDTLTWESPSSEGNTKRTYALGMVQQEGEVNPNGLKNFETKA----  285

Query  285  VTGSTNPTTQVSDAAYLAANLGE---TTATTINDLRKAVAVQQYYEALARGGSRYREQVQ  341
              GS + +  V  AAY   NL     T A T+N LR+A  VQ+  E  ARGG+RYRE ++
Sbjct  286  -GGSFSESGAV--AAY-PTNLWASPVTAAATVNQLRQAFQVQKLLEKDARGGTRYREILK  341

Query  342  ALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTD-TPIGETGAMSVTPVNESS  400
              + V  SD  +QIPEYLGG +  +N++Q+VQTS   +STD +P G T A+SVTP ++S 
Sbjct  342  NHFGVTTSDARMQIPEYLGGCKVPINVSQVVQTS---ASTDASPQGNTAAISVTPFSKSM  398

Query  401  FTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLT  460
            FTKSF+EHGFIIGV   R   SYQQG+ER+WSR DRLDYY P  AN+GEQ +  KEI   
Sbjct  399  FTKSFDEHGFIIGVATARTAQSYQQGIERMWSRKDRLDYYFPVLANIGEQAILNKEIYAQ  458

Query  461  GEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAER  520
            G A D+E FGYQEAWADYR KPN +  + RSNA  +LD WHY  +Y  +PTLS  WM + 
Sbjct  459  GNAKDDEAFGYQEAWADYRYKPNTICGRFRSNAQQSLDAWHYGQDYDKLPTLSTDWMEQS  518

Query  521  KTEIARTLIVQDEPQFFGAIRVANKTTRRMPLYSVPGL  558
              E+ RTL VQ EP F    R   KT R MPLYS+PGL
Sbjct  519  DIEMKRTLAVQTEPDFIANFRFNCKTVRVMPLYSIPGL  556


>gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium]
Length=551

 Score =   465 bits (1196),  Expect = 4e-154, Method: Compositional matrix adjust.
 Identities = 247/564 (44%), Positives = 347/564 (62%), Gaps = 23/564 (4%)

Query  1    VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR  60
            +NRN E HF+++P +  SR++F+R  ++ TTF+ G LIPFY+DEVLPGDTF+V +S +IR
Sbjct  1    MNRNVESHFSRLPSVDISRSQFDRSSSLKTTFNVGDLIPFYIDEVLPGDTFNVKSSKVIR  60

Query  61   MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVINGKEK  120
            M +   P+MD+ ++D YYF+ PNR++W +++QF GE  E+ W+P  EY VP++       
Sbjct  61   MQSLVTPIMDNIYLDTYYFFVPNRLVWSHWQQFNGENTESAWLPTTEYQVPQVTAPANGW  120

Query  121  SPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASID  180
            S      +I DY GIPT V     VNALP RAY  I NE+FRDEN+ +   I   DA++ 
Sbjct  121  S----IGTIADYFGIPTGV--ACSVNALPFRAYALICNEWFRDENLSDPLNIPISDATVV  174

Query  181  YQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMY  240
              +G       D+ +   + GG      K+HDYFTSCLP PQ+GP+V LP+  ++PV + 
Sbjct  175  GSNG-------DNYITDIVKGGMPFKACKYHDYFTSCLPAPQKGPDVLLPL-SSSPVPVT  226

Query  241  KNDSLTEFGTINGNSEIFLNQALNGSALAPKISNSFK--EGARRAL--VTGSTNPTTQVS  296
             +D++ +    +      ++       L   I   F+  EGA   +   TG   PT    
Sbjct  227  TSDTMVDPLQYSKYPMAGVDSWNLSPTLMRNIIRPFEGVEGANYQVHQFTGDI-PTIDAF  285

Query  297  DAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTVQIP  356
                L ANL   TA +IN LR A  +Q+ YE  ARGG+RY E +++ + V   D  +Q P
Sbjct  286  RPLNLVANLQNATAASINQLRLAFQIQRLYERDARGGTRYIEILKSHFGVTSPDARLQRP  345

Query  357  EYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEEHGFIIGVCC  416
            EYLGG R  +N+NQ++Q S  ++++ +P G     S+T    + F KSF EHGF+IG+  
Sbjct  346  EYLGGNRIPININQVLQQS--ETTSTSPQGNPVGQSLTTDTNADFVKSFVEHGFVIGLMV  403

Query  417  VRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQEAWA  476
             R++H+YQQGLER WSR DR DYY P FA++GEQ V  KEI  +G A D+E FGYQEA+A
Sbjct  404  ARYDHTYQQGLERFWSRKDRFDYYWPVFAHIGEQAVLNKEIYTSGTAVDDEVFGYQEAYA  463

Query  477  DYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQD--EP  534
            DYR KP+RV+ +MRS A  +LD WH AD+Y S+P+LS  W+ E  + + R L V      
Sbjct  464  DYRYKPSRVTGEMRSAAPQSLDVWHLADDYASLPSLSDSWIRESASTVDRVLAVSSNVSA  523

Query  535  QFFGAIRVANKTTRRMPLYSVPGL  558
            Q F  I + N++TR MP+YSVPGL
Sbjct  524  QLFCDIYIQNRSTRPMPMYSVPGL  547


>gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium]
Length=570

 Score =   457 bits (1177),  Expect = 4e-151, Method: Compositional matrix adjust.
 Identities = 253/586 (43%), Positives = 355/586 (61%), Gaps = 49/586 (8%)

Query  1    VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR  60
            +NRN E HF+ +P +  SR+RF+R  +I TTF++G ++PF+++EVLPGDTFSVD+S ++R
Sbjct  2    MNRNTESHFSLLPHVDISRSRFDRSSSIKTTFNAGDVVPFFLEEVLPGDTFSVDSSKVVR  61

Query  61   MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVINGKEK  120
            M T   P+MD+ ++D YYF+ PNR++W ++K+F GE  E+ W+P+ EY++P++      K
Sbjct  62   MQTLLTPMMDNVYLDTYYFFVPNRLVWQHWKEFCGENNESAWIPQTEYAIPQL------K  115

Query  121  SP-EPYE-DSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDAS  178
            SP   +E  +I DY G+PT V  +  V+ALP RAY  I NE+FRDEN+ +   + TDDA+
Sbjct  116  SPVGGFEVGTIADYFGLPTGVANL-SVSALPFRAYALIMNEWFRDENLMDPLVVPTDDAT  174

Query  179  IDYQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQ--GNAP  236
            +       +  NT   +     GG+     K+HDYFTS LP PQ+GP+V +P+   GN  
Sbjct  175  V-------TGVNTGIFVTDVAKGGKPFVAAKYHDYFTSALPAPQKGPDVVIPVASAGNYN  227

Query  237  V-----GMYKNDSLTEFGTING-------NSEIFLNQALNGSALAPKISNSFKEGARRAL  284
            V     G+  +D        NG        +E+F +  L     +     S        +
Sbjct  228  VVGNGKGLALSDGSKMSIICNGLSGSNGQGTELFASGILGSQVGSSGGFGSGSSLRGDGI  287

Query  285  VTGSTNPTTQVSDAAYLAANLGET----------TATTINDLRKAVAVQQYYEALARGGS  334
            + G       V  AA L  NL  +           A TIN LR A  +Q++YE  ARGGS
Sbjct  288  ILG-------VPTAAQLGNNLENSGLIAIASGNAAAATINQLRMAFQIQKFYEKQARGGS  340

Query  335  RYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVT  394
            RY E +++ + V   D  +Q  EYLGG R  +N+NQ++Q SG  S++ TP G    MS T
Sbjct  341  RYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQVIQQSGTGSASTTPQGTVVGMSQT  400

Query  395  PVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKK  454
                S FTKSF EHGFIIGV C R++H+YQQG++R+WSR D+ DYY P F+N+GEQ +K 
Sbjct  401  TDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMWSRKDKFDYYWPVFSNIGEQAIKN  460

Query  455  KEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQ  514
            KEI   G A+D+E FGYQEAWA+YR KP+RV+ +MRS+   +LD WH AD+Y  +P+LS 
Sbjct  461  KEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMRSSYAQSLDVWHLADDYSKLPSLSD  520

Query  515  GWMAERKTEIARTLIVQDE--PQFFGAIRVANKTTRRMPLYSVPGL  558
             W+ E    + R L V D+   QFF  I V N  TR MP+YS+PGL
Sbjct  521  EWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRPMPMYSIPGL  566


>gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium]
Length=556

 Score =   450 bits (1158),  Expect = 2e-148, Method: Compositional matrix adjust.
 Identities = 246/575 (43%), Positives = 348/575 (61%), Gaps = 40/575 (7%)

Query  1    VNRNNERHFNQIP-EMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAII  59
            +NRN E HF + P  +  SR+ F+R  ++  TF++G++IPF+++EVLPGDTF V TS +I
Sbjct  1    MNRNVESHFAKNPTNIDISRSTFDRSSSVKLTFNTGEIIPFFIEEVLPGDTFKVKTSKVI  60

Query  60   RMTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVINGKE  119
            R+ T   P+MD+ ++D YYF+ PNR++W+++K+F GE  ++ W+P+ EY +P++      
Sbjct  61   RLQTLLTPMMDNIYLDTYYFFVPNRLVWEHWKEFNGENTQSAWIPEVEYQIPQLT-----  115

Query  120  KSPEPYED--SILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDA  177
             +PE   +  ++ DY GIPT V  +  VNALP RAY  + NE+FRD+N+ +   I   DA
Sbjct  116  -APEGGWNIGTLADYFGIPTGVSGI-SVNALPFRAYALVCNEWFRDQNLSDPLNIPVGDA  173

Query  178  SIDYQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQG--NA  235
            ++       +  NT   +   + GG      K+HDYFTSCLP PQ+GP+VT+P+    N 
Sbjct  174  TV-------TGVNTGTFITDVVKGGLPYTAAKYHDYFTSCLPAPQKGPDVTIPVTSGHNL  226

Query  236  PVGMYKNDS-----LTEFGTINGNSEI---FLNQALNGSALAPKISNSFKEGARRALVTG  287
            PV M+ N++        FG    NSE+   +   + +  A +   ++S  E        G
Sbjct  227  PV-MFLNETHDAGPYKPFGVGIQNSELRNFYGFGSGSSGATSTSDTSSTVEVGSDGTGIG  285

Query  288  ST--NPTTQVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWD  345
                 PT         A   G+    TIN LR A  +Q+ YE  ARGG+RY E +++ + 
Sbjct  286  QNFWTPTNM------WAVESGDVGMATINQLRLAFQLQKLYEKDARGGTRYTEIIRSHFG  339

Query  346  VVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSF  405
            VV  D  +Q PEYLGG R  +N+NQI+Q S  QS+  +P+G    MSVT    S F KSF
Sbjct  340  VVSPDSRLQRPEYLGGNRIPINVNQIIQQS--QSTEQSPLGALAGMSVTTDKNSDFIKSF  397

Query  406  EEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASD  465
             EHG+IIG+   R++H+YQQGL+R+WSR DR D+Y P  AN+GEQ V  KEI + G  +D
Sbjct  398  VEHGYIIGLVVARYDHTYQQGLDRMWSRKDRFDFYWPVLANIGEQAVLNKEIYIDGSDTD  457

Query  466  EETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIA  525
            +E FGYQEAWA+YR KPNRV  +MRS+A  +LD WH  D+Y S+P LS  W+ E KT + 
Sbjct  458  DEVFGYQEAWAEYRYKPNRVCGEMRSSAPQSLDVWHLGDDYSSLPYLSDSWIREDKTNVD  517

Query  526  RTLIVQD--EPQFFGAIRVANKTTRRMPLYSVPGL  558
            R L V      Q F  I + NK TR MP+YS+PGL
Sbjct  518  RVLAVTSSVSDQLFADIYICNKATRPMPMYSIPGL  552


>gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium]
Length=551

 Score =   437 bits (1125),  Expect = 1e-143, Method: Compositional matrix adjust.
 Identities = 241/547 (44%), Positives = 325/547 (59%), Gaps = 48/547 (9%)

Query  30   TTFDSGKLIPFYVDEVLPGDTFSVDTSAIIRMTTPKYPVMDDAFIDFYYFYCPNRILWDN  89
            TTF+ G LIPFYVDE+LPGDTFS+DTS ++RM +   PVMD+ ++D Y+F+ PNR+ W +
Sbjct  31   TTFNVGDLIPFYVDEILPGDTFSIDTSKVVRMQSLLTPVMDNIYLDTYFFFVPNRLTWSH  90

Query  90   FKQFMGEVEETPWMPKQEYSVPKIVINGKEKSPEPYED--SILDYMGIPTKVKKVFKVNA  147
            +++ MGE  ++ W P+ EYSVP+I       +PE   +  +I DYMGIPT V  +  VNA
Sbjct  91   WRELMGENTQSAWTPQVEYSVPQIT------APEGGWNVGTIADYMGIPTGVSGL-SVNA  143

Query  148  LPIRAYVKIWNEFFRDENVDNAATIKTDDASIDYQDGNESEDNTDDILKKAIGGGRCLPV  207
            +P RAY  I NE+FRDEN+ +   I   DA++       +  NT   +     GG     
Sbjct  144  MPFRAYALICNEWFRDENLTDPLNIPVGDATV-------AGVNTGTYVTDVAKGGLPFKA  196

Query  208  NKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMYKNDSLTEFGTIN-------GNSEIFLN  260
             K+HDYFTSCLP PQ+GP+V +   G+  V +   D+  +   +N       GNS   +N
Sbjct  197  AKYHDYFTSCLPAPQKGPDVLISAVGSGIVPVTATDNDNDSLNVNSPGMRFVGNSSTSVN  256

Query  261  QALNGSALAPKISNSFKEGARRALVTGSTNPTTQVSDAAYLAANLGE--TTAT-----TI  313
                G             G    +VT +  P+T +   + +  NL    +TAT     TI
Sbjct  257  YLAFG-------------GGDGYVVTDTPKPSTPIHGISMIPTNLWADLSTATDLPVATI  303

Query  314  NDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQ  373
            N LR A  +Q+ YE  ARGG+RY E +++ + V   D  +Q PEYLGG R  +N+NQ++Q
Sbjct  304  NQLRTAFQIQKLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGSRVPININQVIQ  363

Query  374  TSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSR  433
            +S    +  TP G   A S+T  + S FTKSF EHGFIIG+   R++HSYQQGL+R WSR
Sbjct  364  SS---ETGATPQGNAAAYSLTTDSHSEFTKSFVEHGFIIGLMVARYDHSYQQGLQRFWSR  420

Query  434  TDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNA  493
             DR DYY P FANLGE  VK KEI   G   D+E FGYQEAWADYR KP+ V+ +MRS  
Sbjct  421  KDRFDYYWPVFANLGEMAVKNKEIFAQGTDVDDEVFGYQEAWADYRYKPSVVTGEMRSQY  480

Query  494  TGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQD--EPQFFGAIRVANKTTRRMP  551
              +LD WH AD+Y+++P+LS  W+ E  + + R L V D    Q F  I +    TR MP
Sbjct  481  AQSLDIWHLADDYENLPSLSDSWIREDSSTVNRVLAVSDSVSAQLFCDIYIRCLATRPMP  540

Query  552  LYSVPGL  558
            LYS+PGL
Sbjct  541  LYSIPGL  547


>gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium]
Length=568

 Score =   435 bits (1118),  Expect = 4e-142, Method: Compositional matrix adjust.
 Identities = 250/590 (42%), Positives = 343/590 (58%), Gaps = 61/590 (10%)

Query  3    RNNERHFNQIP-EMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIRM  61
            RN    F++ P  +   R+ FNR  T  T+ + G+LIPFY DEVLPGDTF V T+ ++R+
Sbjct  2    RNENSRFSENPVTLDIQRSTFNRSSTYKTSANIGELIPFYYDEVLPGDTFQVKTNKVVRL  61

Query  62   TTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVINGKEKS  121
                   MD+ + D YYF+ PNR++W+++++FMGE ++  W+P+ EY++P+I       +
Sbjct  62   QPLVSAPMDNLYFDTYYFFVPNRLVWEHWEEFMGENKQGAWIPQTEYTIPQIT----SPA  117

Query  122  PEPYE-DSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASID  180
               +E  +I DY GIPT V  +  V+ALP RAY  I +E+FRD+N+     I  DD ++ 
Sbjct  118  STGFEIGTIADYFGIPTGVPNL-SVSALPFRAYALIVDEWFRDQNLQLPLNIPLDDTTLQ  176

Query  181  YQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMY  240
                     NT D +   + GG+     K+HDYFTSCLP PQ+GP+VT+   G+ PV  Y
Sbjct  177  -------GVNTGDYVTDTVKGGKPFVAAKYHDYFTSCLPSPQKGPDVTIAAVGDFPV--Y  227

Query  241  KNDSLTEFGTINGNSEIFLNQALN-GSALAPKISNSFKEG---ARRALVTGST-------  289
              D     G+         N+AL+ G +     S SF +G       L TGST       
Sbjct  228  TGDPHNNNGS---------NKALHYGISNISSGSVSFSQGNYIIPSVLTTGSTQSVPAQG  278

Query  290  -----NPTTQVS----DAAY----------LAANLGETTATTINDLRKAVAVQQYYEALA  330
                 N T   S    D+++          L A+ G  TATTIN LR A  +Q+ YE  A
Sbjct  279  KLNASNITMTTSPGSPDSSFGSKLSVYPDNLYASSG--TATTINQLRMAFQIQKLYEKDA  336

Query  331  RGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGA  390
            R GSRYRE +++ + V   D  +Q+PEYLGG R  +N+NQ+VQTS  Q+S  +P G    
Sbjct  337  RAGSRYRELIRSHFSVTPLDARMQVPEYLGGNRIPININQVVQTS--QTSDVSPQGNVAG  394

Query  391  MSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQ  450
             S+T  +   F KSF EHG +IGV   R++H+YQQG+ +LWSR  R DYY P  AN+GEQ
Sbjct  395  QSLTSDSHGDFIKSFTEHGMLIGVAVARYDHTYQQGVSKLWSRKTRFDYYWPVLANIGEQ  454

Query  451  PVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVP  510
             V  KEI   G A DEE FGYQEAWA+YR KP+ V+ +MRS+A  +LD WH+AD+Y S+P
Sbjct  455  AVLNKEIYAQGTAQDEEVFGYQEAWAEYRYKPSIVTGEMRSSARTSLDSWHFADDYNSLP  514

Query  511  TLSQGWMAERKTEIARTLIVQDEP--QFFGAIRVANKTTRRMPLYSVPGL  558
             LS  W+ E KT I R L V      Q+F    + N+TTR +P YS+PGL
Sbjct  515  KLSADWIKEDKTNIDRVLAVSSSVSNQYFADFYIENETTRALPFYSIPGL  564


>gi|575094415|emb|CDL65790.1| unnamed protein product [uncultured bacterium]
Length=569

 Score =   423 bits (1087),  Expect = 1e-137, Method: Compositional matrix adjust.
 Identities = 238/596 (40%), Positives = 326/596 (55%), Gaps = 68/596 (11%)

Query  1    VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR  60
            +NRN E H++QIP     R +F RD + LTT + G L+P YVDEVLPGDT  +   +++R
Sbjct  1    MNRNAEAHYSQIPHANIQRAKFKRDFSYLTTINEGDLVPIYVDEVLPGDTIKIKQRSLVR  60

Query  61   MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVINGKEK  120
            M+TP YPVMD+ ++D +YF+ P R++WD+++  MGE  ++ W P  +Y+ P         
Sbjct  61   MSTPLYPVMDNCYLDIWYFFVPCRLVWDHWQNLMGENTKSYWAPDVQYTTPLT----SAP  116

Query  121  SPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASID  180
            S      +I DYMGIPT V  + KVN++P+RAY +IWNE+FRDEN+    T  +DDA+  
Sbjct  117  SGGWQVGTIADYMGIPTGVSGI-KVNSMPMRAYARIWNEWFRDENLQQPVTQHSDDATT-  174

Query  181  YQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRG----------PEV---  227
                  +  NT   L  A  GG  L V KF DYFTSCLP PQ+G          P+V   
Sbjct  175  ------TGSNTGTELTDAESGGLPLKVAKFKDYFTSCLPAPQKGEAIGFDFNQTPKVKGI  228

Query  228  --TLPMQGNAP---------------VGMYKNDSLTEFG------TINGNSEIFLNQALN  264
                P++ N                 VG   N S   F       T+NG    F N    
Sbjct  229  GLVFPLETNTGHTATDILWRQPDAQLVGENYNTSYNNFNSITTQTTVNGKKAFFFNNG-K  287

Query  265  GSALAPKISNSFKEGARRALVTGSTNPTTQVSDAAYLAANLGETTATTINDLRKAVAVQQ  324
            G  L+ +  + +  G  +  +T              +A N   T   +INDLR+A+A+Q 
Sbjct  288  GPMLSARFEDDYNGGVEQVELTA-------------VAEN--STNFLSINDLRQAIALQH  332

Query  325  YYEALARGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTP  384
              EA ARGG+RY E ++  + V   D  +Q  EY+GG R  +N++Q++Q+S   S T +P
Sbjct  333  ILEADARGGTRYVEILKNEFGVSSPDARLQRSEYIGGERIPINVSQVIQSSA--SDTTSP  390

Query  385  IGETGAMSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQF  444
             G   A S+T    +    S  EHG+I+G+  +R +HSYQQGL R+W+R+DR  YY P  
Sbjct  391  QGNAAAYSLTTSANTIRAYSAVEHGYILGLAAIRVDHSYQQGLSRMWTRSDRFSYYHPML  450

Query  445  ANLGEQPVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYAD  504
            ANLGEQ V  +EI   G  +D E FGYQEAWADYR + N ++ +MRS    +LD WHY D
Sbjct  451  ANLGEQAVLNQEIYAQGTTADTEVFGYQEAWADYRYRTNMITGEMRSTYAQSLDAWHYGD  510

Query  505  NYKSVPTLSQGWMAERKTEIARTLIVQDE--PQFFGAIRVANKTTRRMPLYSVPGL  558
             Y  +P LS  W+ E +  I RTL VQ E   QF   +       R MP+YSVPGL
Sbjct  511  KYTDLPRLSNDWIKEGQENIDRTLAVQSENSHQFICNLYFDQTWVRPMPIYSVPGL  566


>gi|557745632|ref|YP_008798242.1| major capsid protein [Marine gokushovirus]
 gi|530695345|gb|AGT39902.1| major capsid protein [Marine gokushovirus]
Length=538

 Score =   402 bits (1033),  Expect = 6e-130, Method: Compositional matrix adjust.
 Identities = 232/570 (41%), Positives = 323/570 (57%), Gaps = 61/570 (11%)

Query  1    VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR  60
            +    +  F+++P     R+ F+R   + TTF++G+L+P YVDE LPGDTFS + +A  R
Sbjct  13   IGSAKQHQFSEVPHADIQRSTFDRSHGLKTTFNAGQLVPIYVDEALPGDTFSCNLTAFSR  72

Query  61   MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGE-----------VEETPWMPKQEYS  109
            + TP +P MD+AF+D ++F  P R++WD+F++FMGE           ++ TP        
Sbjct  73   LATPIHPTMDNAFMDTHFFAVPVRLVWDDFEEFMGETKTYKAAGSDRLDGTPDFSVAAPV  132

Query  110  VPKIVINGKEKSPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNA  169
             P I  +G  ++    E S+ DY GIPTKV  + + +AL  RAY  +WN++FRDEN+   
Sbjct  133  PPTITASGSGEA----EASLSDYFGIPTKVGGL-EFSALWHRAYTLVWNDWFRDENLQAP  187

Query  170  ATIKTDDASIDYQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTL  229
             TI       D   GN++   T  +L +           K HDYFTS LP+PQ+G +VT+
Sbjct  188  KTI-------DTTSGNDT--TTYALLNRG----------KKHDYFTSALPWPQKGADVTI  228

Query  230  PMQGNAPV--GMYKNDSLTEFGTINGNSEIFLNQALNGSALAPKISNSFKEGARRALVTG  287
            P+  +APV      N  +T F    GN+  FLN A   + + P   N+  + ARR     
Sbjct  229  PLGTSAPVTTANSSNQDVTIFTPNIGNTHRFLNSA--STNVYPGDENT--DEARR-----  279

Query  288  STNPTTQVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVV  347
                         L A+L E T+ TIN LR A A Q++ E  ARGGSRY E ++  ++V 
Sbjct  280  -------------LYADLSEATSATINQLRLAFATQKFLEIQARGGSRYIEVIKNHFNVT  326

Query  348  ISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEE  407
              D  +Q PEYLGGG   VN++ + QTS   ++T  P G   A+  T ++  SFTKSF E
Sbjct  327  SPDARLQRPEYLGGGSSPVNISPVAQTSSTDATT--PQGNLSAIGTTVLSGHSFTKSFTE  384

Query  408  HGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEE  467
            H  +IG+  VR + +YQQGL R++SR    DYY P  + +GEQ VK KEI   G A+DE 
Sbjct  385  HTIVIGMVSVRTDLTYQQGLNRMFSRETIYDYYWPTLSTIGEQAVKNKEIYAQGSAADET  444

Query  468  TFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIART  527
            TFGYQE +A+YR KP+ V+ K RSNATGTL+ WHYA  Y S+P L   W+    T + RT
Sbjct  445  TFGYQERYAEYRYKPSSVTGKFRSNATGTLESWHYAQEYASLPLLGDSWIQVTDTNVQRT  504

Query  528  LIVQDEPQFFGAIRVANKTTRRMPLYSVPG  557
            L V  EPQF        + TR MP+ S+PG
Sbjct  505  LAVASEPQFIFDSLFKLRCTRPMPVNSIPG  534


>gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae]
Length=533

 Score =   382 bits (980),  Expect = 4e-122, Method: Compositional matrix adjust.
 Identities = 219/552 (40%), Positives = 316/552 (57%), Gaps = 52/552 (9%)

Query  7    RHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIRMTTPKY  66
              F+++P+    R+ F+R   + TTF+SG LIP YVDEVLPGDTF ++ +   R+ TP Y
Sbjct  15   HEFSRVPQADIQRSTFSRVHGLKTTFNSGDLIPIYVDEVLPGDTFQMNATGFGRLATPLY  74

Query  67   PVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVINGKEKSPEPYE  126
            PVMD+ +++ ++FY PNRI+WDN+++F G  ++       ++ VP+I      +S    E
Sbjct  75   PVMDNMYVETFFFYVPNRIIWDNWEKFNGAQDDP--NDSTDFLVPQI------QSATVAE  126

Query  127  DSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASIDYQDGNE  186
             S+ DYMG+PT++  +   N L  RAY  IWNE+FRDEN+ ++  +  DD    Y     
Sbjct  127  GSLFDYMGLPTQIAGI-DFNNLHGRAYNLIWNEWFRDENLQDSLGVPKDDGPDTY-----  180

Query  187  SEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMYKNDSLT  246
                T   ++K           K HDYFTS LP+PQ+G  V+LP+  +A +        T
Sbjct  181  ----TGYTIQKR---------GKRHDYFTSALPWPQKGDAVSLPLGTSADIHTAAAAG-T  226

Query  247  EFGTINGNSEIFLNQALNGSALAPKISNSFKEGARRALVTGSTNPTTQVSDAAYLAANLG  306
            + G  +  S  F  + L    +   +S             G T P T       + A+L 
Sbjct  227  DIGIYSVGSSDF--RLLTSDPVEVALS-------------GGTPPETN-----KMFADLS  266

Query  307  ETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHV  366
              TA TIN LR+A  +Q+ YE  ARGG+RY E +Q+ + V   D  +Q PEYLGG +  V
Sbjct  267  NATAATINQLREAFQIQRLYEKDARGGTRYTEILQSHFGVTSPDARLQRPEYLGGQKTEV  326

Query  367  NMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQG  426
             M  + QTS   S+  +P G   A+  T  +   F+KSF EHG +IG+ CV  + +YQQG
Sbjct  327  MMQTVPQTSSTDST--SPQGNLAALG-TATSRGGFSKSFVEHGVLIGLACVFADLTYQQG  383

Query  427  LERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVS  486
            + R+WSR DR D+Y P  A+LGEQ V  +EI   G ++D +TFGYQE +A+YR KP++++
Sbjct  384  MNRMWSRRDRWDFYWPSLAHLGEQAVLNQEIYTQGTSADTQTFGYQERFAEYRYKPSQIT  443

Query  487  SKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQDEPQFFGAIRVANKT  546
             KMRSNATGTLD WH A ++ ++P L+  ++ E    + R + V  EP+F        KT
Sbjct  444  GKMRSNATGTLDAWHLAQDFTALPALNASFI-EENPPVDRVIAVPSEPEFIWDWYFDLKT  502

Query  547  TRRMPLYSVPGL  558
            TR MP+YSVPGL
Sbjct  503  TRPMPVYSVPGL  514


>gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus]
Length=539

 Score =   382 bits (980),  Expect = 5e-122, Method: Compositional matrix adjust.
 Identities = 219/569 (38%), Positives = 323/569 (57%), Gaps = 50/569 (9%)

Query  2    NRNNERH-FNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR  60
            N++   H F+ IP  +  R++F+  +T+ T FDSG L+P  VDEVLPGD+ ++  +A  R
Sbjct  5    NKSASAHQFSMIPRAEIPRSKFDAQKTLKTAFDSGYLVPILVDEVLPGDSMNLRMTAFTR  64

Query  61   MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKQEYSVPKIVI-NGKE  119
            + TP +PVMD+ ++D ++F+ PNR+LW N+++FMGE +  P     +Y++P +   NG  
Sbjct  65   LATPLFPVMDNMYLDTFFFFVPNRLLWSNWQRFMGERDPDP-DSSIDYTIPTMTSPNGGY  123

Query  120  KSPEPYEDSILDYMGIPTK----VKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTD  175
                   +S+ DYMG+PT            N+L  RAY  IWNE+FRDEN+ ++  +   
Sbjct  124  AV-----NSLQDYMGLPTAGQVDAGSSISHNSLFTRAYNLIWNEWFRDENLQDSVVVDKG  178

Query  176  DASIDYQDGNESEDNTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNA  235
            D    Y D          +L++           K HDYFTS LP+PQ+G  VTLP+ G+A
Sbjct  179  DGPDTYTDYT--------LLRRG----------KRHDYFTSALPWPQKGDAVTLPLGGSA  220

Query  236  PVGMYKNDSLTEFGTINGNSEIFLNQALNGSA-LAPKISNSFKEG-ARRALVTGSTNPTT  293
             V    ND+             ++ +   G+    P   +  KE     ++ TGS N   
Sbjct  221  NV--VYNDT---------GDPAYIREVSTGNVWTTPSRESVSKEANGNMSVPTGSVN--A  267

Query  294  QVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTV  353
            Q      L A+L   TA TIN +R++  +Q+  E  ARGG+RY E V++ + V+  D  +
Sbjct  268  QYDPNGSLVADLSTATAATINAIRQSFQIQRLLERDARGGTRYTEIVRSHFGVISPDARM  327

Query  354  QIPEYLGGGRYHVNMNQIVQTSGQQSS-TDTPIGETGAMSVTPVNESSFTKSFEEHGFII  412
            Q PEYLGGG   + +N + Q S   +S TDTP+G  GA+     +   F  SF EHG ++
Sbjct  328  QRPEYLGGGSAPIIVNPVAQQSASGASGTDTPLGTLGAVGTGLASGHGFASSFTEHGVVV  387

Query  413  GVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQ  472
            G+C VR + +YQQGL R++SR+ R D++ P F++LGEQP+  KE+  TG ++D++ FGYQ
Sbjct  388  GLCSVRADLTYQQGLHRMFSRSTRYDFFFPVFSHLGEQPILNKELYATGTSTDDDVFGYQ  447

Query  473  EAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQD  532
            EAWA+YR KP++V+  MRS A GTLD WH A N+ S+PTL+  ++ E    + R + V  
Sbjct  448  EAWAEYRYKPSQVTGLMRSTAAGTLDAWHLAQNFGSLPTLNSTFI-EDTPPVDRVVAVGS  506

Query  533  EP---QFFGAIRVANKTTRRMPLYSVPGL  558
            E    QF           R MP+YSVPGL
Sbjct  507  EANGQQFIFDAFFDINMARPMPMYSVPGL  535



Lambda      K        H        a         alpha
   0.316    0.133    0.398    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 4106350928880