bitscore colors: <40, 40-50 , 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-11_CDS_annotation_glimmer3.pl_2_7

Length=648
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|547312923|ref|WP_022044635.1|  putative uncharacterized protein    82.0    1e-13
gi|492501782|ref|WP_005867318.1|  hypothetical protein                76.3    2e-11
gi|649557305|gb|KDS63784.1|  capsid family protein                    69.7    5e-10
gi|547920049|ref|WP_022322420.1|  capsid protein VP1                  72.0    6e-10
gi|649569140|gb|KDS75238.1|  capsid family protein                    70.5    1e-09
gi|649555287|gb|KDS61824.1|  capsid family protein                    70.5    2e-09
gi|599088027|gb|AHN52939.1|  major capsid protein                     65.9    9e-09
gi|599087961|gb|AHN52906.1|  major capsid protein                     65.1    1e-08
gi|599087475|gb|AHN52663.1|  major capsid protein                     64.7    2e-08
gi|599088021|gb|AHN52936.1|  major capsid protein                     64.3    3e-08


>gi|547312923|ref|WP_022044635.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
 gi|524208404|emb|CCZ76639.1| putative uncharacterized protein [Alistipes finegoldii CAG:68]
Length=338

 Score = 82.0 bits (201),  Expect = 1e-13, Method: Compositional matrix adjust.
 Identities = 96/330 (29%), Positives = 138/330 (42%), Gaps = 35/330 (11%)

Query  350  GLCLKTYNSDLLQNWINTEWIDGVNGISEVTAIDV---TDGKLTMDALNLQQKVYNMLNR  406
            GL    Y+ DL  N I       V  I  + A+D+   T   + +  L L+ K+ N ++R
Sbjct  11   GLLSVPYSPDLFGNIIKQGSSPAVE-IEVMNALDLNISTGFSVAVPELRLRTKIQNWMDR  69

Query  407  IAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIV---FQEVISNSASGEQP-LGT  462
            + VSGG   D   T++   +       P F G     I     + + + SASGE   LG 
Sbjct  70   LFVSGGRVGDVFRTLWGTKSSAIYVNKPDFLGVWQASINPSNVRAMANGSASGEDANLGQ  129

Query  463  LAGRGYDTGKQKGGHVKIK--VTEPSFIMGIGSITPRIDYSQGNEFYNEFLTLDDIHKPA  520
            LA    D      GH  I     EP   M I  + P   YSQG       ++  D   P 
Sbjct  130  LAAC-VDRYCDFSGHSGIDYYAKEPGTFMLITMLVPEPAYSQGLHPDLASISFGDDFNPE  188

Query  521  LDGIGYQ----------------DSLNWQRAWWDDNRMEKNGRL----QPSAGKTVAWLN  560
            L+GIG+Q                  L+ + + W  +     G L      S G+ VAW  
Sbjct  189  LNGIGFQLVPRHRFSMMPRGFNFTGLDQEASPWFGH--TGTGVLVDPNMVSVGEEVAWSW  246

Query  561  YMTNINRTFGNFAINDNEAFMVLNRNYE-MSPSAGTNETKIADLT-TYIDPVKYNYIFAE  618
              T+ +R  G+FA N N  + VL R +    P  GT   +  + T TYI+P+ + Y+F +
Sbjct  247  LRTDYSRLHGDFAQNGNYQYWVLTRRFTTYFPDDGTGFYQDGEYTGTYINPLDWQYVFVD  306

Query  619  TNLDAMNFWVQTKFDIKVRRLISAKQIPNL  648
              L A NF     FD+ V   +SA  +P L
Sbjct  307  QTLMAGNFAYYGTFDLNVTSSLSANYMPYL  336


>gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis]
 gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis 
CL09T03C24]
Length=538

 Score = 76.3 bits (186),  Expect = 2e-11, Method: Compositional matrix adjust.
 Identities = 73/270 (27%), Positives = 111/270 (41%), Gaps = 23/270 (9%)

Query  382  IDVTDGKLTMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMS  441
            ++V +  ++++ L     +     R A SG  Y + + + F   +   R + P F GG  
Sbjct  289  VNVDELGVSINDLRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGR  348

Query  442  TEIVFQEVISNSASGE-QPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDY  500
            T I   EV+  SA+    P   +AG G   G   G   K    E  +I+GI SI PR  Y
Sbjct  349  TPISVSEVLQTSATDSTSPQANMAGHGISAGVNHG--FKRYFEEHGYIIGIMSIRPRTGY  406

Query  501  SQGNEFYNEFLTLD--DIHKPALDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAW  558
             QG     +F   D  D + P    +G Q+  N +  +        NG      G T  +
Sbjct  407  QQGVP--KDFRKFDNMDFYFPEFAHLGEQEIKN-EEVYLQQTPASNNGTF----GYTPRY  459

Query  559  LNYMTNINRTFGNFAINDNEAFMVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAE  618
              Y  ++N   G+F    N AF  LNR +  SP+  T         T+++    N +FA 
Sbjct  460  AEYKYSMNEVHGDF--RGNMAFWHLNRIFSESPNLNT---------TFVECNPSNRVFAT  508

Query  619  TNLDAMNFWVQTKFDIKVRRLISAKQIPNL  648
                   +W+Q   D+K  RL+     P L
Sbjct  509  AETSDDKYWIQLYQDVKALRLMPKYGTPML  538


>gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 6]
Length=245

 Score = 69.7 bits (169),  Expect = 5e-10, Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 101/248 (41%), Gaps = 23/248 (9%)

Query  404  LNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEVISNSASGE-QPLGT  462
              R A SG  Y + + + F   +   R + P F GG  T I   EV+  S++    P   
Sbjct  18   FERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQAN  77

Query  463  LAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQGNEFYNEFLTLD--DIHKPA  520
            +AG G   G   G        E  +IMGI SI PR  Y QG     +F   D  D + P 
Sbjct  78   MAGHGISAGVNHG--FTRYFEEHGYIMGIMSIRPRTGYQQGVP--KDFRKFDNMDFYFPE  133

Query  521  LDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAWLNYMTNINRTFGNFAINDNEAF  580
               +G Q+  N +  + +++     G      G T  +  Y  + N   G+F    N AF
Sbjct  134  FAHLGEQEIKN-EELYLNESDAANEGTF----GYTPRYAEYKYSQNEVHGDF--RGNMAF  186

Query  581  MVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAETNLDAMNFWVQTKFDIKVRRLI  640
              LNR ++  P+  T         T+++    N +FA        +WVQ   DIK  RL+
Sbjct  187  WHLNRIFKEKPNLNT---------TFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLM  237

Query  641  SAKQIPNL  648
                 P L
Sbjct  238  PKYGTPML  245


>gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
 gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48]
Length=553

 Score = 72.0 bits (175),  Expect = 6e-10, Method: Compositional matrix adjust.
 Identities = 72/270 (27%), Positives = 112/270 (41%), Gaps = 23/270 (9%)

Query  382  IDVTDGKLTMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMS  441
            ++V +  + ++ L     +     R A  G  Y + + + F   +   R + P F GG  
Sbjct  304  VNVDEMGININDLRTSNALQRWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGR  363

Query  442  TEIVFQEVISNSASGE-QPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDY  500
              I   EV+  S++ E  P   +AG G   G   G   K    E  +I+GI SITPR  Y
Sbjct  364  MPISVSEVLQTSSTDETSPQANMAGHGISAGINNG--FKHYFEEHGYIIGIMSITPRSGY  421

Query  501  SQGNEFYNEFLTLD--DIHKPALDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAW  558
             QG     +F   D  D + P    +  Q+  N Q  +  ++    NG      G T  +
Sbjct  422  QQGVP--RDFTKFDNMDFYFPEFAHLSEQEIKN-QELFVSEDAAYNNGTF----GYTPRY  474

Query  559  LNYMTNINRTFGNFAINDNEAFMVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAE  618
              Y  + +   G+F    N +F  LNR +E  P+  T         T+++    N +FA 
Sbjct  475  AEYKYHPSEAHGDF--RGNLSFWHLNRIFEDKPNLNT---------TFVECKPSNRVFAT  523

Query  619  TNLDAMNFWVQTKFDIKVRRLISAKQIPNL  648
            +  +   FWVQ   D+K  RL+     P L
Sbjct  524  SETEDDKFWVQMYQDVKALRLMPKYGTPML  553


>gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str. 
3999B T(B) 6]
Length=390

 Score = 70.5 bits (171),  Expect = 1e-09, Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 101/248 (41%), Gaps = 23/248 (9%)

Query  404  LNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEVISNSASGE-QPLGT  462
              R A SG  Y + + + F   +   R + P F GG  T I   EV+  S++    P   
Sbjct  163  FERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQAN  222

Query  463  LAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQGNEFYNEFLTLD--DIHKPA  520
            +AG G   G   G        E  +IMGI SI PR  Y QG     +F   D  D + P 
Sbjct  223  MAGHGISAGVNHG--FTRYFEEHGYIMGIMSIRPRTGYQQGVP--KDFRKFDNMDFYFPE  278

Query  521  LDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAWLNYMTNINRTFGNFAINDNEAF  580
               +G Q+  N +  + +++     G      G T  +  Y  + N   G+F    N AF
Sbjct  279  FAHLGEQEIKN-EELYLNESDAANEGTF----GYTPRYAEYKYSQNEVHGDF--RGNMAF  331

Query  581  MVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAETNLDAMNFWVQTKFDIKVRRLI  640
              LNR ++  P+  T         T+++    N +FA        +WVQ   DIK  RL+
Sbjct  332  WHLNRIFKEKPNLNT---------TFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLM  382

Query  641  SAKQIPNL  648
                 P L
Sbjct  383  PKYGTPML  390


>gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 4]
 gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B 
T(B) 6]
Length=541

 Score = 70.5 bits (171),  Expect = 2e-09, Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 101/248 (41%), Gaps = 23/248 (9%)

Query  404  LNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEVISNSASGE-QPLGT  462
              R A SG  Y + + + F   +   R + P F GG  T I   EV+  S++    P   
Sbjct  314  FERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQAN  373

Query  463  LAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQGNEFYNEFLTLD--DIHKPA  520
            +AG G   G   G        E  +IMGI SI PR  Y QG     +F   D  D + P 
Sbjct  374  MAGHGISAGVNHG--FTRYFEEHGYIMGIMSIRPRTGYQQGVP--KDFRKFDNMDFYFPE  429

Query  521  LDGIGYQDSLNWQRAWWDDNRMEKNGRLQPSAGKTVAWLNYMTNINRTFGNFAINDNEAF  580
               +G Q+  N +  + +++     G      G T  +  Y  + N   G+F    N AF
Sbjct  430  FAHLGEQEIKN-EELYLNESDAANEGTF----GYTPRYAEYKYSQNEVHGDF--RGNMAF  482

Query  581  MVLNRNYEMSPSAGTNETKIADLTTYIDPVKYNYIFAETNLDAMNFWVQTKFDIKVRRLI  640
              LNR ++  P+  T         T+++    N +FA        +WVQ   DIK  RL+
Sbjct  483  WHLNRIFKEKPNLNT---------TFVECNPSNRVFATAETSDDKYWVQIYQDIKALRLM  533

Query  641  SAKQIPNL  648
                 P L
Sbjct  534  PKYGTPML  541


>gi|599088027|gb|AHN52939.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=219

 Score = 65.9 bits (159),  Expect = 9e-09, Method: Compositional matrix adjust.
 Identities = 52/143 (36%), Positives = 68/143 (48%), Gaps = 5/143 (3%)

Query  390  TMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEV  449
            T++ L    ++  +L R A SG  Y + ++  F G N+M+    P F GG ST I    V
Sbjct  77   TINQLRQAFQIQKLLERDARSGTRYAEIVKAHF-GVNFMDVTYRPEFLGGTSTPINVTSV  135

Query  450  ISNSASGEQPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQG-NEFYN  508
               S SG  P GTLA  G  T    GG      TE   +MGI S+   + Y QG N  ++
Sbjct  136  PQTSESGTTPQGTLAAFG--TATVNGGGFTKSFTEHCIVMGIASVRADLTYQQGLNRMFS  193

Query  509  EFLTLDDIHKPALDGIGYQDSLN  531
               T  D + PAL  IG Q  LN
Sbjct  194  R-STRYDFYFPALAHIGEQAVLN  215


>gi|599087961|gb|AHN52906.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=210

 Score = 65.1 bits (157),  Expect = 1e-08, Method: Compositional matrix adjust.
 Identities = 52/145 (36%), Positives = 69/145 (48%), Gaps = 5/145 (3%)

Query  390  TMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEV  449
            T++ L    ++  +L R A SG  Y + ++  F G N+M+    P F GG ST I    V
Sbjct  68   TINQLRQAFQIQKLLERDARSGTRYSEIVKAHF-GVNFMDVTYRPEFLGGTSTPINVTSV  126

Query  450  ISNSASGEQPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQG-NEFYN  508
               S SG  P GTLA  G  T    GG      TE   +MGI S+   + Y QG N  ++
Sbjct  127  PQTSESGTTPQGTLAAFG--TATINGGGFTKSFTEHCIVMGIASVRADLTYQQGLNRMFS  184

Query  509  EFLTLDDIHKPALDGIGYQDSLNWQ  533
               T  D + PAL  IG Q  LN +
Sbjct  185  R-STRYDFYFPALAHIGEQSVLNKE  208


>gi|599087475|gb|AHN52663.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=210

 Score = 64.7 bits (156),  Expect = 2e-08, Method: Compositional matrix adjust.
 Identities = 52/145 (36%), Positives = 69/145 (48%), Gaps = 5/145 (3%)

Query  390  TMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEV  449
            T++ L    ++  +L R A SG  Y + ++  F G N+M+    P F GG ST I    V
Sbjct  68   TINQLRQAFQIQKLLERDARSGTRYSEIVKAHF-GVNFMDVTYRPEFLGGTSTPINVTSV  126

Query  450  ISNSASGEQPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQG-NEFYN  508
               S SG  P GTLA  G  T    GG      TE   +MGI S+   + Y QG N  ++
Sbjct  127  PQTSESGTTPQGTLAAFG--TATINGGGFTKSFTEHCILMGIASVRADLTYQQGLNRMFS  184

Query  509  EFLTLDDIHKPALDGIGYQDSLNWQ  533
               T  D + PAL  IG Q  LN +
Sbjct  185  R-STRYDFYFPALAHIGEQSVLNKE  208


>gi|599088021|gb|AHN52936.1| major capsid protein, partial [uncultured Gokushovirinae]
Length=220

 Score = 64.3 bits (155),  Expect = 3e-08, Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 69/145 (48%), Gaps = 5/145 (3%)

Query  390  TMDALNLQQKVYNMLNRIAVSGGTYRDWLETVFTGGNYMERCETPIFEGGMSTEIVFQEV  449
            T++ L    ++  +L R A SG  Y + ++  F G N+M+    P F GG ST +    V
Sbjct  78   TINQLRQAFQIQKLLERDARSGTRYSEIVKAHF-GVNFMDVTYRPEFLGGSSTPVNVTSV  136

Query  450  ISNSASGEQPLGTLAGRGYDTGKQKGGHVKIKVTEPSFIMGIGSITPRIDYSQG-NEFYN  508
               S SG  P GTLA  G  T    GG      TE   +MGI S+   + Y QG N  ++
Sbjct  137  PQTSESGTTPQGTLAAFG--TATINGGGFTKSFTEHCIVMGIASVRADLTYQQGLNRMFS  194

Query  509  EFLTLDDIHKPALDGIGYQDSLNWQ  533
               T  D + PAL  IG Q  LN +
Sbjct  195  R-STRYDFYFPALAHIGEQSVLNKE  218



Lambda      K        H        a         alpha
   0.315    0.132    0.393    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 4913515649712