bitscore colors: <40, 40-50 , 50-80, 80-200, >200




           BLASTP 2.2.30+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters





Query= Contig-38_CDS_annotation_glimmer3.pl_2_2

Length=346
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|547312922|ref|WP_022044634.1|  putative replication initiation...  79.3    2e-13
gi|547920048|ref|WP_022322419.1|  putative replication protein        76.3    1e-12
gi|575094374|emb|CDL65755.1|  unnamed protein product                 74.7    2e-11
gi|649555288|gb|KDS61825.1|  hypothetical protein M095_3808           72.4    3e-11
gi|492501778|ref|WP_005867316.1|  hypothetical protein                71.2    7e-11
gi|609718275|emb|CDN73649.1|  conserved hypothetical protein          65.5    7e-09
gi|575096096|emb|CDL66976.1|  unnamed protein product                 56.2    1e-05
gi|530695361|gb|AGT39916.1|  replication initiator                    55.8    1e-05
gi|575094557|emb|CDL65915.1|  unnamed protein product                 53.5    1e-04
gi|568293148|gb|ETN80369.1|  hypothetical protein NECAME_18023        49.7    0.002


>gi|547312922|ref|WP_022044634.1| putative replication initiation protein [Alistipes finegoldii 
CAG:68]
 gi|524208442|emb|CCZ76638.1| putative replication initiation protein [Alistipes finegoldii 
CAG:68]
Length=320

 Score = 79.3 bits (194),  Expect = 2e-13, Method: Compositional matrix adjust.
 Identities = 66/229 (29%), Positives = 105/229 (46%), Gaps = 33/229 (14%)

Query  34   VEVECGHCFECRKKKRREWRIRNYEQLKETP--IAVFFTGTVSPQRYEHICKQYGYKNDG  91
            +EV CG+C  C+K    ++RIR   +L++ P    +F T T +    E   K        
Sbjct  40   LEVPCGYCHSCQKSYNNQYRIRLLYELRKYPPGTCLFVTLTFNDDSLEKFSKD-------  92

Query  92   SQDNEIITKIQRLFLERIRKEKGYSIKHWCVTEKGHTNTRRIHIHGLYYAT-------HG  144
                    K  RLFL+R RK  G  I+HW V E G T   R H HG+ +         + 
Sbjct  93   ------TNKAVRLFLDRFRKVYGKQIRHWFVCEFG-TLHGRPHYHGILFNVPQALIDGYD  145

Query  145  ETKWQLTKTLFENWIDGYRFYGSYVNEKTINYVSKYMTKK---DEDNPDYIGIVLCSKGL  201
                     L   W  G+ F G YV+++T +Y++KY+TK    D+  P     V+ S G+
Sbjct  146  SDMPGHHPLLASCWKYGFVFVG-YVSDETCSYITKYVTKSINGDKVRPR----VISSFGI  200

Query  202  GANYAK-RMAYKHEWNKEKTNITYKAKNGADLPLPRYYKTQLYTEDQRQ  249
            G+NY     +  H+   ++    +   NG    +PRYY  +++++  +Q
Sbjct  201  GSNYLNTEESSLHKLGNQRYQ-PFMVLNGFQQAMPRYYYNKIFSDVDKQ  248


>gi|547920048|ref|WP_022322419.1| putative replication protein [Parabacteroides merdae CAG:48]
 gi|524592960|emb|CDD13572.1| putative replication protein [Parabacteroides merdae CAG:48]
Length=278

 Score = 76.3 bits (186),  Expect = 1e-12, Method: Compositional matrix adjust.
 Identities = 65/228 (29%), Positives = 106/228 (46%), Gaps = 18/228 (8%)

Query  34   VEVECGHCFECRKKKRREWRIRNYEQLKETPIAVFFTGTVSPQRYEHI-CKQYGYKNDGS  92
             +V CG C  CR+ KR+ W  R   + KE P+++F T T      EH+  ++ G  +D  
Sbjct  8    AKVPCGWCVNCRQNKRQSWVYRLQAEAKEYPLSLFVTLTYDD---EHLPIERIG--SDLF  62

Query  93   QDNEIITKIQ--RLFLERIRKE-KGYSIKHWCVTEKGHTNTRRIHIHGLYYATHGETKWQ  149
            Q N  +   +  +LF++R+RK+ + Y ++++  +E G  N R  H H + +      K  
Sbjct  63   QTNVAVVSKRDVQLFMKRLRKKYEDYKMRYFVTSEYGAKNGRP-HYHMILFGFPFTGK-M  120

Query  150  LTKTLFENWIDGYRFYGSYVNEKTINYVSKYMTKKD------EDNPDYIGIVLCSKGLGA  203
                L E W +G+      +  K I YV KYM +K        D   Y   +LCS+  G 
Sbjct  121  AGDLLAECWQNGF-VQAHPLTIKEIAYVCKYMYEKSMCPEILRDEKKYKPFMLCSRNPGI  179

Query  204  NYAKRMAYKHEWNKEKTNITYKAKNGADLPLPRYYKTQLYTEDQRQLL  251
             +    A   E+ +       +A  G  + +PRYY  +LY +D +  L
Sbjct  180  GFGFMKADIIEFYRRHPRDYVRAWAGHKMAMPRYYADKLYDDDMKAFL  227


>gi|575094374|emb|CDL65755.1| unnamed protein product [uncultured bacterium]
Length=487

 Score = 74.7 bits (182),  Expect = 2e-11, Method: Compositional matrix adjust.
 Identities = 48/158 (30%), Positives = 76/158 (48%), Gaps = 12/158 (8%)

Query  36   VECGHCFECRKKKRREWRIRNYEQLKETPIAVFFTGTVSPQRYEHICKQYGYKNDGSQDN  95
            V CGHC++C+  K  +W++R  E+L     + F+T T+ P+        YG   DGS   
Sbjct  26   VPCGHCYDCKSAKTTDWQVRCSEELNNNSQSYFYTLTLDPR----FIDTYGTLPDGSPRY  81

Query  96   EIITKIQRLFLERIRKEKG---YSIKHWCVTEKGHTNTRRIHIHGLYYATHGETKWQLTK  152
                +  +LFL+R+RK       S+K+  V E G T T R H H ++Y +     ++   
Sbjct  82   VFNKRHIQLFLKRLRKALSKYNISLKYVIVGELGET-THRPHYHAIFYLSSSVNPFKFRI  140

Query  153  TLFENWIDGYRFYGS----YVNEKTINYVSKYMTKKDE  186
             +  +W  G+   G      +N   ++YV KYM K D 
Sbjct  141  MVRNSWSLGFIKSGDNNGIILNNDAVSYVIKYMHKTDS  178


>gi|649555288|gb|KDS61825.1| hypothetical protein M095_3808 [Parabacteroides distasonis str. 
3999B T(B) 4]
 gi|649560564|gb|KDS66872.1| hypothetical protein M095_2449 [Parabacteroides distasonis str. 
3999B T(B) 4]
 gi|649561011|gb|KDS67298.1| hypothetical protein M095_2409 [Parabacteroides distasonis str. 
3999B T(B) 4]
Length=284

 Score = 72.4 bits (176),  Expect = 3e-11, Method: Compositional matrix adjust.
 Identities = 66/226 (29%), Positives = 104/226 (46%), Gaps = 18/226 (8%)

Query  36   VECGHCFECRKKKRREWRIRNYEQLKETPIAVFFTGTVSPQRYEHICKQYGYKNDGSQDN  95
            V CG C  CRK KR+ W  R   +  E P ++F T T      EHI      ++      
Sbjct  15   VPCGRCVNCRKNKRQSWVYRLQAEADEYPFSLFVTLTYDD---EHIPTAMIGEDLFKTTV  71

Query  96   EIITK--IQRLFLERIRKEKG-YSIKHWCVTEKGHTNTRRIHIHGLYYATHGETKWQLTK  152
             +++K  IQ LF++R+RK+   Y ++++  +E G +   R H H + +      K     
Sbjct  72   GVVSKRDIQ-LFMKRLRKKYAQYRLRYFLTSEYG-SQGGRPHYHMILFGFPFTGK-HGGD  128

Query  153  TLFENWIDGYRFYGSYVNEKTINYVSKYMTKKDEDNPD-------YIGIVLCSKGLGANY  205
             L E W +G+      +  K I+YV+KYM +K    PD       Y   +LCSK  G  Y
Sbjct  129  LLAECWKNGF-VQAHPLTTKEISYVTKYMYEKSM-IPDILKGVKEYQPFMLCSKMPGIGY  186

Query  206  AKRMAYKHEWNKEKTNITYKAKNGADLPLPRYYKTQLYTEDQRQLL  251
                    ++ +       +A NG  + +PRYY  +LY +D ++ L
Sbjct  187  HFLREQILDFYRLHPRDYVRAFNGMRMAMPRYYADKLYDDDMKEYL  232


>gi|492501778|ref|WP_005867316.1| hypothetical protein [Parabacteroides distasonis]
 gi|409230407|gb|EKN23271.1| hypothetical protein HMPREF1059_03256 [Parabacteroides distasonis 
CL09T03C24]
Length=284

 Score = 71.2 bits (173),  Expect = 7e-11, Method: Compositional matrix adjust.
 Identities = 63/225 (28%), Positives = 104/225 (46%), Gaps = 16/225 (7%)

Query  36   VECGHCFECRKKKRREWRIRNYEQLKETPIAVFFTGTVSPQRYEHICKQYGYKNDGSQDN  95
            V CG C  CRK KR+ W  R   +  E P ++F T T      EH+      ++      
Sbjct  15   VPCGRCVNCRKNKRQSWVYRLQAEADEYPFSLFVTLTYDD---EHMPTAMIGEDLFKSTV  71

Query  96   EIITK--IQRLFLERIRKE-KGYSIKHWCVTEKGHTNTRRIHIHGLYYATHGETKWQLTK  152
             +++K  IQ LF++R+RK+   Y ++++  +E G +   R H H + +      K     
Sbjct  72   GVVSKRDIQ-LFMKRLRKKYDQYRLRYFLTSEYG-SQGGRPHYHMILFGFPFTGK-HGGD  128

Query  153  TLFENWIDGYRFYGSYVNEKTINYVSKYMTKKD------EDNPDYIGIVLCSKGLGANYA  206
             L E W +G+      +  K I YV+KYM +K       +D  +Y   +LCS+  G  Y 
Sbjct  129  LLAECWKNGF-VQAHPLTTKEIAYVTKYMYEKSMVPDILKDVKEYQPFMLCSRIPGIGYH  187

Query  207  KRMAYKHEWNKEKTNITYKAKNGADLPLPRYYKTQLYTEDQRQLL  251
                   ++ +       +A NG  + +PRYY  +LY +D ++ L
Sbjct  188  FLREQILDFYRLHPRDYVRAFNGMRMAMPRYYADKLYDDDMKEYL  232


>gi|609718275|emb|CDN73649.1| conserved hypothetical protein [Elizabethkingia anophelis]
Length=265

 Score = 65.5 bits (158),  Expect = 7e-09, Method: Compositional matrix adjust.
 Identities = 63/221 (29%), Positives = 95/221 (43%), Gaps = 32/221 (14%)

Query  38   CGHCFECRKKKRREWRIRNYEQLKETPIAVFFTGTVSPQRYEHICKQYGYKNDGSQDNEI  97
            CG C ECRK +   W  R  E+LK +  A F T T     Y  +   Y        DN +
Sbjct  25   CGKCLECRKARTNSWFARLTEELKVSKSAHFVTLT-----YSDVYLPY-------SDNGL  72

Query  98   ITKIQR---LFLERIRKEKGYSIKHWCVTEKGHTNTRRIHIHGLYYATHGETKWQLTKTL  154
            I+   R   LF++R RK +   IK++ V E G   T R H H + +              
Sbjct  73   ISLDYRDFQLFMKRARKLQKSKIKYFLVGEYG-AQTYRPHYHAIVFGVEN------IDAF  125

Query  155  FENWIDGYRFYGSYVNEKTINYVSKYMTKKDEDNPDYIGI-------VLCSKGLGANYAK  207
               W  G    G+ V  K+I Y  KY TK   + PD            L SKGLG ++  
Sbjct  126  LGEWRMGNVHAGT-VTAKSIYYTLKYCTKSITEGPDKDPDDDRKPEKALMSKGLGLSHLT  184

Query  208  RMAYKHEWNKEKTNITYKAKNGADLPLPRYYKTQLYTEDQR  248
                K  + K+  + ++    G  + LPRYY+ +++++ ++
Sbjct  185  ESMIK--YYKDDVSRSFSLLGGTTIALPRYYRDKVFSDIEK  223


>gi|575096096|emb|CDL66976.1| unnamed protein product [uncultured bacterium]
Length=296

 Score = 56.2 bits (134),  Expect = 1e-05, Method: Compositional matrix adjust.
 Identities = 61/236 (26%), Positives = 101/236 (43%), Gaps = 52/236 (22%)

Query  33   YVEVECGHCFECRKKKRREWRIRNYEQLKETPIAVFFTGTVSPQRYEHICKQYGYKNDGS  92
            YV V CG C ECR  +  EW +R   +LK     +F T T              Y +D  
Sbjct  33   YVLVPCGQCLECRLHRASEWALRCCHELKSHDKGIFLTLT--------------YNDDNL  78

Query  93   QDN-EIITKIQRLFLERIRKEKGY-----SIKHWCVTEKGHTNTRRIHIHGLYYATH---  143
              N  ++ K  + F++R+R+   Y      I++ C  E G  + R  H H L +  +   
Sbjct  79   PPNGTLVKKHVQDFIKRLRRHIDYYGDCTKIRYLCAGEYGDLSLRP-HYHLLVFGYYPSD  137

Query  144  ----------GETKWQLTKTLFENWIDGYRFYGSYVNEKTINYVSKYMTKKD--EDNPDY  191
                      G+     + TL + W  G+  +G+ +  ++  Y  +Y  KK   E +  Y
Sbjct  138  PRLLHGLQKIGKNSLFTSPTLTKLWGKGHISFGA-ITFESARYTCQYALKKQTGEHSHYY  196

Query  192  I--GIV----LCS--KGLGANYAKRMAYKHEWNKEKTNITYKAKNGADLPLPRYYK  239
            +  G++    +CS   GLG ++    A  H+   E+  +T    NG  + +PRYY+
Sbjct  197  VDRGVIPEFMICSNRNGLGYDF----AVSHDNMFERGYLT---MNGKKIGIPRYYQ  245


>gi|530695361|gb|AGT39916.1| replication initiator [Marine gokushovirus]
Length=289

 Score = 55.8 bits (133),  Expect = 1e-05, Method: Compositional matrix adjust.
 Identities = 50/163 (31%), Positives = 76/163 (47%), Gaps = 22/163 (13%)

Query  32   RYVEVECGHCFECRKKKRREWRIRNYEQLKETPIAVFFTGTVSPQRYEHICKQYGYKNDG  91
            R   + CG C  CR    R+W IR   + +      F T T      EHI K+   KN  
Sbjct  26   RGFNLPCGQCIGCRLDYSRQWAIRCVHEAQTHEDNCFITLTFDN---EHIAKR---KNPE  79

Query  92   SQDNEIITKIQRLFLERIRKEKGYSIKHWCVTEKGHTNTRRIHIHGLYY----------A  141
            S DN   T+ QR F++R+RK+  + I+ +   E G  N +R H H L +          +
Sbjct  80   SLDN---TEFQR-FMKRLRKKYPHKIRFFHCGEYGDQN-KRPHYHALLFGHDFKDKKLWS  134

Query  142  THGETKWQLTKTLFENWIDGYRFYGSYVNEKTINYVSKYMTKK  184
              G+ K  +++ L E W  G+   G+ V+  T  Y ++Y+ KK
Sbjct  135  NKGDFKLFVSQELAELWPYGFHTIGA-VSFDTAAYCARYVMKK  176


>gi|575094557|emb|CDL65915.1| unnamed protein product [uncultured bacterium]
Length=354

 Score = 53.5 bits (127),  Expect = 1e-04, Method: Compositional matrix adjust.
 Identities = 49/218 (22%), Positives = 93/218 (43%), Gaps = 19/218 (9%)

Query  31   FRYVEVECGHCFECRKKKRREWRIRNYEQLKETPIAVFFTGTVSPQRYEHICKQYGYKND  90
             R V   CG C  CR K RREW  R   ++       F T T S + Y  + +  G++  
Sbjct  25   LRGVPFGCGKCLACRVKTRREWTSRLILEMLGHDSGAFVTLTYS-EDYVPVTES-GHRTL  82

Query  91   GSQDNEIITKIQRLFLERIRKEKGYSIKHWCVTEKGHTNTRRIHIHGLYYATHGETKWQL  150
              +D ++  K  R  LE  RK   + I+++   E G   T+R H H +++    +     
Sbjct  83   SLRDLQLFLKRLRRNLEE-RKRSKHPIRYYACGEYGTRGTQRPHYHIIFFGV-SDLDLDF  140

Query  151  TKTLFENWIDGYRF--------YGSY----VNEKTINYVSKYMTKKDEDNPDYIGIVLCS  198
             K+++  W +  ++        +G+     +N KT+ Y + Y  KK         +V+ S
Sbjct  141  IKSVYAAWSEPAKYGQKGQTPQFGNITIEPLNAKTVAYTAGYNMKKLISPKKVHKVVVSS  200

Query  199  KGLGA---NYAKRMAYKHEWNKEKTNITYKAKNGADLP  233
              +G+    + K +  +   N++   +  + +  + +P
Sbjct  201  AEIGSRRVTFEKLVLDRKNSNRDDNGVLAEFRVMSRMP  238


>gi|568293148|gb|ETN80369.1| hypothetical protein NECAME_18023 [Necator americanus]
Length=345

 Score = 49.7 bits (117),  Expect = 0.002, Method: Compositional matrix adjust.
 Identities = 52/222 (23%), Positives = 95/222 (43%), Gaps = 25/222 (11%)

Query  34   VEVECGHCFECRKKKRREWRIRNYEQLKETPIAVFFTGTVSPQRYEHICKQYGYKNDGSQ  93
            V V CG C  C++++   W  R  ++  +   A F T T    R+  I K      D  +
Sbjct  18   VPVPCGRCPPCKRRRVDSWVFRLLQEELQHENASFVTLTYD-TRFVPISKNGFMTLDRGE  76

Query  94   DNEIITKIQRLFLERIRKEKGYSIKHWCVTEKGHTNTRRIHIHGLYYATHGETKWQLTKT  153
                + ++++L         G  +K++   E G +   R H H + +    ++ +    T
Sbjct  77   FPRYMKRLRKLV-------PGRKLKYYMCGEYG-SQRFRPHYHAIIFGVPQDSLFADAWT  128

Query  154  LFENWIDGYRFYGSYVNEKTINYVSKYMTK--------KDEDNPDYIGIVLCSKGLGANY  205
            L  N           V  K+I Y  KY+ K        +D+  P++    L SKG+G +Y
Sbjct  129  L--NGDSLGGVVVGTVTGKSIAYTMKYIDKSTWKQKHGRDDRVPEF---SLMSKGMGVSY  183

Query  206  AKRMAYKHEWNKEKTNITYKAKNGAD-LPLPRYYKTQLYTED  246
                    E++KE  +  +  + G   + +PRYY+ ++Y++D
Sbjct  184  LTPQMV--EYHKEDISRLFCTREGGSRIAMPRYYRQKIYSDD  223



Lambda      K        H        a         alpha
   0.319    0.136    0.428    0.792     4.96 

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6 

Effective search space used: 2041309051650