bitscore colors: <40, 40-50, 50-80, 80-200, >200
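
The legend above sorts hits into five bit-score ranges. The following is a minimal sketch of that binning, not the viewer's actual code. The function name `bitscore_bin` is hypothetical, and the handling of boundary values is an assumption, because the legend does not say which range a score of exactly 40, 50, 80 or 200 belongs to. Lower bounds are treated as inclusive here, and 200 is placed in the 80-200 range because the top range is written as ">200".

```python
def bitscore_bin(score: float) -> str:
    """Return the legend range label that a bit score falls into.

    Boundary handling is an assumption: 40, 50 and 80 go to the range
    they open, and exactly 200 stays in "80-200".
    """
    if score < 40:
        return "<40"
    if score < 50:
        return "40-50"
    if score < 80:
        return "50-80"
    if score <= 200:
        return "80-200"
    return ">200"


# Bit scores taken from the hit summary in this report.
for s in (82.4, 76.3, 63.9):
    print(s, bitscore_bin(s))
```

Under these assumptions only the top hit (82.4 bits) reaches the 80-200 range. The other nine hits fall in 50-80.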




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-17_CDS_annotation_glimmer3.pl_2_6

Length=592
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|547312923|ref|WP_022044635.1|  putative uncharacterized protein    82.4    7e-14
gi|492501782|ref|WP_005867318.1|  hypothetical protein                76.3    2e-11
gi|649557305|gb|KDS63784.1|  capsid family protein                    69.7    4e-10
gi|547920049|ref|WP_022322420.1|  capsid protein VP1                  72.0    5e-10
gi|649569140|gb|KDS75238.1|  capsid family protein                    70.5    1e-09
gi|649555287|gb|KDS61824.1|  capsid family protein                    70.5    2e-09
gi|599088027|gb|AHN52939.1|  major capsid protein                     65.5    8e-09
gi|599087961|gb|AHN52906.1|  major capsid protein                     65.1    1e-08
gi|599087475|gb|AHN52663.1|  major capsid protein                     64.7    2e-08
gi|599088021|gb|AHN52936.1|  major capsid protein                     63.9    3e-08
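
Each row of the summary above holds a sequence identifier, a description, a bit score and an E-value, separated by runs of spaces. The sketch below shows one way to split such a row. The names `Hit` and `parse_summary_line` are illustrative and not part of BLAST. It assumes the last two whitespace-separated fields are always the score and the E-value, and that a description is present.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    seq_id: str
    description: str
    bits: float
    evalue: float


def parse_summary_line(line: str) -> Hit:
    """Split one row of the 'Sequences producing significant alignments' table."""
    # The last two fields are the bit score and the E-value; everything
    # before them is the identifier followed by the free-text description.
    head, bits, evalue = line.rstrip().rsplit(None, 2)
    seq_id, description = head.split(None, 1)
    return Hit(seq_id, description.strip(), float(bits), float(evalue))


row = "gi|599088027|gb|AHN52939.1|  major capsid protein                     65.5    8e-09"
hit = parse_summary_line(row)
print(hit.seq_id, hit.description, hit.bits, hit.evalue)
```

For anything beyond a quick look, rerunning the search with tabular output (`-outfmt 6`) is usually less fragile than parsing this pairwise report.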


>gi|547312923|ref|WP_022044635.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
 gi|524208404|emb|CCZ76639.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
Length=338

 Score = 82.4 bits (202),  Expect = 7e-14, Method: Compositional matrix adjust.
 Identities = 96/330 (29%), Positives = 138/330 (42%), Gaps = 35/330 (11%)

Query  294  GLCLKTYNSDLLQNWINTEWIDGVNGISEVTAIDV---TDGKLTMDALNLQQKVYNMLNR  350
            GL    Y+ DL  N I       V  I  + A+D+   T   + +  L L+ K+ N ++R
Sbjct  11   GLLSVPYSPDLFGNIIKQGSSPAVE-IEVMNALDLNISTGFSVAVPELRLRTKIQNWMDR  69

Query  351  IAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIV---FQEVISNSASGEQP-LGT  406
            + VSGG   D   T++   +       P F G     I     + + + SASGE   LG 
Sbjct  70   LFVSGGRVGDVFRTLWGTKSSAIYVNKPDFLGVWQASINPSNVRAMANGSASGEDANLGQ  129

Query  407  LAGRGYDTGKQKGGHVKIK--VTEPSFIMGIGSITPRIDYSQGNEFYNEFLTLDDIHKPA  464
            LA    D      GH  I     EP   M I  + P   YSQG       ++  D   P 
Sbjct  130  LAAC-VDRYCDFSGHSGIDYYAKEPGTFMLITMLVPEPAYSQGLHPDLASISFGDDFNPE  188

Query  465  LDGIGYQ----------------DSLNWQRAWWDDNRMEKNGRL----QPSAGKTVAWLN  504
            L+GIG+Q                  L+ + + W  +     G L      S G+ VAW  
Sbjct  189  LNGIGFQLVPRHRFSMMPRGFNFTGLDQEASPWFGH--TGTGVLVDPNMVSVGEEVAWSW  246

Query  505  YMTNINRTFGNFAINDNEAFMVLNRNYEMS-PSAGTNETKIADLT-TYIDPVKYNYIFAE  562
              T+ +R  G+FA N N  + VL R +    P  GT   +  + T TYI+P+ + Y+F +
Sbjct  247  LRTDYSRLHGDFAQNGNYQYWVLTRRFTTYFPDDGTGFYQDGEYTGTYINPLDWQYVFVD  306

Query  563  TNLDAMNFWVQTKFDIKVRRLISAKQIPNL  592
              L A NF     FD+ V   +SA  +P L
Sbjct  307  QTLMAGNFAYYGTFDLNVTSSLSANYMPYL  336


>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
 gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis 
CL09T03C24]
Length=538

 Score = 76.3 bits (186),  Expect = 2e-11, Method: Compositional matrix adjust.
 Identities = 73/270 (27%), Positives = 111/270 (41%), Gaps = 23/270 (9%)

Query  326  IDVTDGKLTMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMS  385
            ++V +  ++++ L     +     R A SG  Y + + + F   +   R + P F GG  
Sbjct  289  VNVDELGVSINDLRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGR  348

Query  386  TEIVFQEVISNSASGE-QPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDY  444
            T I   EV+  SA+    P   +AG G   G   G   K    E  +I+GI SI PR  Y
Sbjct  349  TPISVSEVLQTSATDSTSPQANMAGHGISAGVNHG--FKRYFEEHGYIIGIMSIRPRTGY  406

Query  445  SQGNEFYNEFLTLD--DIHKPALDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAW  502
             QG     +F   D  D + P    +G Q+  N +  +        NG      G T  +
Sbjct  407  QQGVP--KDFRKFDNMDFYFPEFAHLGEQEIKN-EEVYLQQTPASNNGTF----GYTPRY  459

Query  503  LNYMTNINRTFGNFAINDNEAFMVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAE  562
              Y  ++N   G+F    N AF  LNR +  SP+  T         T+++    N +FA 
Sbjct  460  AEYKYSMNEVHGDF--RGNMAFWHLNRIFSESPNLNT---------TFVECNPSNRVFAT  508

Query  563  TNLDAMNFWVQTKFDIKVRRLISAKQIPNL  592
                   +W+Q   D+K  RL+     P L
Sbjct  509  AETSDDKYWIQLYQDVKALRLMPKYGTPML  538


>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 6]
Length=245

 Score = 69.7 bits (169),  Expect = 4e-10, Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 101/248 (41%), Gaps = 23/248 (9%)

Query  348  LNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEVISNSASGE-QPLGT  406
              R A SG  Y + + + F   +   R + P F GG  T I   EV+  S++    P   
Sbjct  18   FERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQAN  77

Query  407  LAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQGNEFYNEFLTLD--DIHKPA  464
            +AG G   G   G        E  +IMGI SI PR  Y QG     +F   D  D + P 
Sbjct  78   MAGHGISAGVNHG--FTRYFEEHGYIMGIMSIRPRTGYQQGVP--KDFRKFDNMDFYFPE  133

Query  465  LDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAWLNYMTNINRTFGNFAINDNEAF  524
               +G Q+  N +  + +++     G      G T  +  Y  + N   G+F    N AF
Sbjct  134  FAHLGEQEIKN-EELYLNESDAANEGTF----GYTPRYAEYKYSQNEVHGDF--RGNMAF  186

Query  525  MVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAETNLDAMNFWVQTKFDIKVRRLI  584
              LNR ++  P+  T         T+++    N +FA        +WVQ   DIK  RL+
Sbjct  187  WHLNRIFKEKPNLNT---------TFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLM  237

Query  585  SAKQIPNL  592
                 P L
Sbjct  238  PKYGTPML  245


>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
 gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553

 Score = 72.0 bits (175),  Expect = 5e-10, Method: Compositional matrix adjust.
 Identities = 72/270 (27%), Positives = 112/270 (41%), Gaps = 23/270 (9%)

Query  326  IDVTDGKLTMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMS  385
            ++V +  + ++ L     +     R A  G  Y + + + F   +   R + P F GG  
Sbjct  304  VNVDEMGININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGR  363

Query  386  TEIVFQEVISNSASGE-QPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDY  444
              I   EV+  S++ E  P   +AG G   G   G   K    E  +I+GI SITPR  Y
Sbjct  364  MPISVSEVLQTSSTDETSPQANMAGHGISAGINNG--FKHYFEEHGYIIGIMSITPRSGY  421

Query  445  SQGNEFYNEFLTLD--DIHKPALDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAW  502
             QG     +F   D  D + P    +  Q+  N Q  +  ++    NG      G T  +
Sbjct  422  QQGVP--RDFTKFDNMDFYFPEFAHLSEQEIKN-QELFVSEDAAYNNGTF----GYTPRY  474

Query  503  LNYMTNINRTFGNFAINDNEAFMVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAE  562
              Y  + +   G+F    N +F  LNR +E  P+  T         T+++    N +FA 
Sbjct  475  AEYKYHPSEAHGDF--RGNLSFWHLNRIFEDKPNLNT---------TFVECKPSNRVFAT  523

Query  563  TNLDAMNFWVQTKFDIKVRRLISAKQIPNL  592
            +  +   FWVQ   D+K  RL+     P L
Sbjct  524  SETEDDKFWVQMYQDVKALRLMPKYGTPML  553


>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str. 
3999B T(B) 6]
Length=390

 Score = 70.5 bits (171),  Expect = 1e-09, Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 101/248 (41%), Gaps = 23/248 (9%)

Query  348  LNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEVISNSASGE-QPLGT  406
              R A SG  Y + + + F   +   R + P F GG  T I   EV+  S++    P   
Sbjct  163  FERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQAN  222

Query  407  LAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQGNEFYNEFLTLD--DIHKPA  464
            +AG G   G   G        E  +IMGI SI PR  Y QG     +F   D  D + P 
Sbjct  223  MAGHGISAGVNHG--FTRYFEEHGYIMGIMSIRPRTGYQQGVP--KDFRKFDNMDFYFPE  278

Query  465  LDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAWLNYMTNINRTFGNFAINDNEAF  524
               +G Q+  N +  + +++     G      G T  +  Y  + N   G+F    N AF
Sbjct  279  FAHLGEQEIKN-EELYLNESDAANEGTF----GYTPRYAEYKYSQNEVHGDF--RGNMAF  331

Query  525  MVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAETNLDAMNFWVQTKFDIKVRRLI  584
              LNR ++  P+  T         T+++    N +FA        +WVQ   DIK  RL+
Sbjct  332  WHLNRIFKEKPNLNT---------TFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLM  382

Query  585  SAKQIPNL  592
                 P L
Sbjct  383  PKYGTPML  390


>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 6]
Length=541

 Score = 70.5 bits (171),  Expect = 2e-09, Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 101/248 (41%), Gaps = 23/248 (9%)

Query  348  LNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEVISNSASGE-QPLGT  406
              R A SG  Y + + + F   +   R + P F GG  T I   EV+  S++    P   
Sbjct  314  FERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQAN  373

Query  407  LAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQGNEFYNEFLTLD--DIHKPA  464
            +AG G   G   G        E  +IMGI SI PR  Y QG     +F   D  D + P 
Sbjct  374  MAGHGISAGVNHG--FTRYFEEHGYIMGIMSIRPRTGYQQGVP--KDFRKFDNMDFYFPE  429

Query  465  LDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAWLNYMTNINRTFGNFAINDNEAF  524
               +G Q+  N +  + +++     G      G T  +  Y  + N   G+F    N AF
Sbjct  430  FAHLGEQEIKN-EELYLNESDAANEGTF----GYTPRYAEYKYSQNEVHGDF--RGNMAF  482

Query  525  MVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAETNLDAMNFWVQTKFDIKVRRLI  584
              LNR ++  P+  T         T+++    N +FA        +WVQ   DIK  RL+
Sbjct  483  WHLNRIFKEKPNLNT---------TFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLM  533

Query  585  SAKQIPNL  592
                 P L
Sbjct  534  PKYGTPML  541


>gi|599088027|gb|AHN52939.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=219

 Score = 65.5 bits (158),  Expect = 8e-09, Method: Compositional matrix adjust.
 Identities = 52/143 (36%), Positives = 68/143 (48%), Gaps = 5/143 (3%)

Query  334  TMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEV  393
            T++ L    ++  +L R A SG  Y + ++  F G N+M+    P F GG ST I    V
Sbjct  77   TINQLRQAFQIQKLLERDARSGTRYAEIVKAHF-GVNFMDVTYRPEFLGGTSTPINVTSV  135

Query  394  ISNSASGEQPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQG-NEFYN  452
               S SG  P GTLA  G  T    GG      TE   +MGI S+   + Y QG N  ++
Sbjct  136  PQTSESGTTPQGTLAAFG--TATVNGGGFTKSFTEHCIVMGIASVRADLTYQQGLNRMFS  193

Query  453  EFLTLDDIHKPALDGIGYQDSLN  475
               T  D + PAL  IG Q  LN
Sbjct  194  R-STRYDFYFPALAHIGEQAVLN  215


>gi|599087961|gb|AHN52906.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=210

 Score = 65.1 bits (157),  Expect = 1e-08, Method: Compositional matrix adjust.
 Identities = 52/145 (36%), Positives = 69/145 (48%), Gaps = 5/145 (3%)

Query  334  TMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEV  393
            T++ L    ++  +L R A SG  Y + ++  F G N+M+    P F GG ST I    V
Sbjct  68   TINQLRQAFQIQKLLERDARSGTRYSEIVKAHF-GVNFMDVTYRPEFLGGTSTPINVTSV  126

Query  394  ISNSASGEQPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQG-NEFYN  452
               S SG  P GTLA  G  T    GG      TE   +MGI S+   + Y QG N  ++
Sbjct  127  PQTSESGTTPQGTLAAFG--TATINGGGFTKSFTEHCIVMGIASVRADLTYQQGLNRMFS  184

Query  453  EFLTLDDIHKPALDGIGYQDSLNWQ  477
               T  D + PAL  IG Q  LN +
Sbjct  185  R-STRYDFYFPALAHIGEQSVLNKE  208


>gi|599087475|gb|AHN52663.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=210

 Score = 64.7 bits (156),  Expect = 2e-08, Method: Compositional matrix adjust.
 Identities = 52/145 (36%), Positives = 69/145 (48%), Gaps = 5/145 (3%)

Query  334  TMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEV  393
            T++ L    ++  +L R A SG  Y + ++  F G N+M+    P F GG ST I    V
Sbjct  68   TINQLRQAFQIQKLLERDARSGTRYSEIVKAHF-GVNFMDVTYRPEFLGGTSTPINVTSV  126

Query  394  ISNSASGEQPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQG-NEFYN  452
               S SG  P GTLA  G  T    GG      TE   +MGI S+   + Y QG N  ++
Sbjct  127  PQTSESGTTPQGTLAAFG--TATINGGGFTKSFTEHCILMGIASVRADLTYQQGLNRMFS  184

Query  453  EFLTLDDIHKPALDGIGYQDSLNWQ  477
               T  D + PAL  IG Q  LN +
Sbjct  185  R-STRYDFYFPALAHIGEQSVLNKE  208


>gi|599088021|gb|AHN52936.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=220

 Score = 63.9 bits (154),  Expect = 3e-08, Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 69/145 (48%), Gaps = 5/145 (3%)

Query  334  TMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEV  393
            T++ L    ++  +L R A SG  Y + ++  F G N+M+    P F GG ST +    V
Sbjct  78   TINQLRQAFQIQKLLERDARSGTRYSEIVKAHF-GVNFMDVTYRPEFLGGSSTPVNVTSV  136

Query  394  ISNSASGEQPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQG-NEFYN  452
               S SG  P GTLA  G  T    GG      TE   +MGI S+   + Y QG N  ++
Sbjct  137  PQTSESGTTPQGTLAAFG--TATINGGGFTKSFTEHCIVMGIASVRADLTYQQGLNRMFS  194

Query  453  EFLTLDDIHKPALDGIGYQDSLNWQ  477
               T  D + PAL  IG Q  LN +
Sbjct  195  R-STRYDFYFPALAHIGEQSVLNKE  218



Lambda      K        H        a         alpha
   0.316    0.133    0.400    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 4386821585886
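
The bit scores in this report can be checked against the raw scores shown in parentheses on each "Score =" line, using the gapped Lambda and K printed above and the standard normalization S' = (Lambda * S - ln K) / ln 2. The sketch below assumes the rounded values shown here (Lambda = 0.267, K = 0.0410), so agreement is only expected to the one decimal place the report prints. The function name `bit_score` is illustrative.

```python
import math

# Gapped Karlin-Altschul parameters as printed in this report (rounded).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score: float,
              lam: float = GAPPED_LAMBDA,
              k: float = GAPPED_K) -> float:
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores of the first three alignments in this report.
for raw in (202, 186, 169):
    print(raw, round(bit_score(raw), 1))
```

For raw scores 202, 186 and 169 this gives 82.4, 76.3 and 69.7 bits, matching the values reported for the first three alignments.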