bitscore colors: <40, 40-50 , 50-80, 80-200, >200
BLASTP 2.2.30+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 49,011,213 sequences; 17,563,301,199 total letters Query= Contig-27_CDS_annotation_glimmer3.pl_2_4 Length=250 Score E Sequences producing significant alignments: (Bits) Value gi|575096063|emb|CDL66946.1| unnamed protein product 131 4e-34 gi|575094491|emb|CDL65858.1| unnamed protein product 130 8e-34 gi|575094571|emb|CDL65927.1| unnamed protein product 109 3e-26 gi|575094543|emb|CDL65903.1| unnamed protein product 108 1e-25 gi|575094497|emb|CDL65863.1| unnamed protein product 102 4e-23 gi|575094414|emb|CDL65789.1| unnamed protein product 96.7 6e-21 gi|530695387|gb|AGT39940.1| portal protein 47.8 0.001 gi|17402853|ref|NP_510875.1| capsid protein VP3 45.1 0.007 gi|9791182|ref|NP_063897.1| capsid protein VP3 44.7 0.010 gi|77020118|ref|YP_338242.1| putative capsid protein 44.7 0.011 >gi|575096063|emb|CDL66946.1| unnamed protein product [uncultured bacterium] Length=163 Score = 131 bits (330), Expect = 4e-34, Method: Compositional matrix adjust. Identities = 68/157 (43%), Positives = 92/157 (59%), Gaps = 0/157 (0%) Query 79 MPMFETWHREQQHFCSEPGCGEKILYSPEFDRFGVMTLKESGKEDLYAFIQSHKDSVDLH 138 M F+T + +E G E I Y +D G + L+E+G+ +LY IQSH +SVDLH Sbjct 1 MMEFKTQYDPHDRIFTEAGSREHITYGGHYDDEGRVVLEETGRINLYDEIQSHAESVDLH 60 Query 139 KIMDRFNAGDASALQKVQGMFGDFSEMPQTYAGLLNHMIEAEQTFMSLPLETREKFGQSF 198 +M R++ GD L + QG FGD + PQTYA LNHM E E+ FMSLPL+ R KFG SF Sbjct 61 VLMQRYSCGDVDCLSQRQGFFGDVLDFPQTYAEALNHMQEMERQFMSLPLDVRAKFGHSF 120 Query 199 HAWLAQAGSVSWLEAMGMVTPPTSQNSAGEPPAAPQS 235 +LA +G +LE +G P S + PA P++ Sbjct 121 SEFLAASGDDDFLERLGFKPEPESAPAVDPVPAVPEA 157 >gi|575094491|emb|CDL65858.1| unnamed protein product [uncultured bacterium] Length=176 Score = 130 bits (328), Expect = 8e-34, Method: Compositional matrix adjust. Identities = 61/141 (43%), Positives = 87/141 (62%), Gaps = 0/141 (0%) Query 79 MPMFETWHREQQHFCSEPGCGEKILYSPEFDRFGVMTLKESGKEDLYAFIQSHKDSVDLH 138 M F+T PG K++YSP +D GV+ L+ +G+E+LY FIQSH S D+H Sbjct 1 MVKFKTQFHSHARVFQRPGDPIKVVYSPRYDENGVLDLQPTGEENLYDFIQSHAQSTDIH 60 Query 139 KIMDRFNAGDASALQKVQGMFGDFSEMPQTYAGLLNHMIEAEQTFMSLPLETREKFGQSF 198 I+DRF +G+ L ++QG + D S+MP+TYA +LN +I EQTF LP+E ++KFG SF Sbjct 61 VILDRFASGETDVLSQIQGFYADASDMPKTYAEVLNAVIAGEQTFDRLPVEIKQKFGNSF 120 Query 199 HAWLAQAGSVSWLEAMGMVTP 219 WL+ + + E MG P Sbjct 121 STWLSSMDNPDFAERMGFPKP 141 >gi|575094571|emb|CDL65927.1| unnamed protein product [uncultured bacterium] Length=133 Score = 109 bits (273), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 49/103 (48%), Positives = 68/103 (66%), Gaps = 0/103 (0%) Query 112 GVMTLKESGKEDLYAFIQSHKDSVDLHKIMDRFNAGDASALQKVQGMFGDFSEMPQTYAG 171 G + L E+GK++LY IQSHKDSVDL+ ++ RFN G+ L ++QG + D S MP+TYA Sbjct 8 GAIDLVENGKKNLYDEIQSHKDSVDLNLLLQRFNNGEVDVLSRMQGTYADLSNMPKTYAD 67 Query 172 LLNHMIEAEQTFMSLPLETREKFGQSFHAWLAQAGSVSWLEAM 214 +LN + + E F+SLP++ R KF SF WL GS W+ M Sbjct 68 MLNLIKKGEADFLSLPVDVRAKFDHSFEKWLVTFGSQDWIVNM 110 >gi|575094543|emb|CDL65903.1| unnamed protein product [uncultured bacterium] Length=145 Score = 108 bits (269), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 48/123 (39%), Positives = 74/123 (60%), Gaps = 0/123 (0%) Query 81 MFETWHREQQHFCSEPGCGEKILYSPEFDRFGVMTLKESGKEDLYAFIQSHKDSVDLHKI 140 MF + + G I+Y +D GV L SG++++Y IQSH+DSVD+H + Sbjct 1 MFRSQFDDHDRVYCSSGSPIHIVYQGRYDDDGVFDLFPSGQDNIYDQIQSHRDSVDIHVL 60 Query 141 MDRFNAGDASALQKVQGMFGDFSEMPQTYAGLLNHMIEAEQTFMSLPLETREKFGQSFHA 200 +DR+ GD L QG++GDF+ MP +Y+ +LN ++ E+ FM LP++ R K+G SF Sbjct 61 LDRYQRGDVDVLSARQGVYGDFTGMPASYSEILNAVLAGERAFMDLPVDERAKYGHSFAQ 120 Query 201 WLA 203 WL+ Sbjct 121 WLS 123 >gi|575094497|emb|CDL65863.1| unnamed protein product [uncultured bacterium] Length=170 Score = 102 bits (254), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 59/171 (35%), Positives = 94/171 (55%), Gaps = 5/171 (3%) Query 81 MFET-WHREQQHFCSEPGCGEKILYSPEFDRFGVMTLKESGKEDLYAFIQSHKDSVDLHK 139 +F T ++R ++ F S G G + Y G L++ G+ DLYA IQS+KDS D++ Sbjct 2 LFRTKYNRNKEVFKSTSGDGYQPTYKMRVHEDGKRELEKVGRIDLYAQIQSYKDSCDINY 61 Query 140 IMDRFNAGDASALQKVQGMFGDFSEMPQTYAGLLNHMIEAEQTFMSLPLETREKFGQSFH 199 I++RF GD SAL K+QG++GDF+ MP A L +++AE F +LP++ R +F S Sbjct 62 ILERFARGDESALSKIQGVYGDFTAMPTNLAELQQRVVDAEALFYNLPVDIRAEFNHSPS 121 Query 200 AWLAQAGSVSWLEAMGMVTPPTSQNSAGEPPAAPQSPEGGETDPAPASPAG 250 + + G+ + +A+G +TP +P +P + DP P G Sbjct 122 EFYSAIGTDKFNKAVG-ITPQVDPQPIIDPQPVISTP---QVDPQPVVQKG 168 >gi|575094414|emb|CDL65789.1| unnamed protein product [uncultured bacterium] Length=167 Score = 96.7 bits (239), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 54/149 (36%), Positives = 79/149 (53%), Gaps = 5/149 (3%) Query 94 SEPGCGEKILYSPEFDRFGVMTLKESGKEDLYAFIQSHKDSVDLHKIMDRFNAGDASALQ 153 +EPG + Y + G L++ G+ + YA IQS+KD D+H I+ R+ AGD S L Sbjct 18 NEPGNELEPHYIEKIGENGRTYLEKDGETNTYAEIQSYKDECDVHSILCRYFAGDTSVLS 77 Query 154 KVQGMFGDFSEMPQTYAGLLNHMIEAEQTFMSLPLETREKFGQSFHAWLAQAGSVSWLEA 213 + QG++ D +++P TY + N M E F LPLE + KF SF+ W + AGS W + Sbjct 78 R-QGVYIDATQLPTTYHEMYNLMAEQRDKFDQLPLEIKRKFDNSFNVWASTAGSEEWYKL 136 Query 214 MGMVTPPTSQ----NSAGEPPAAPQSPEG 238 MG+V + N +GE EG Sbjct 137 MGIVQNAENNKPEGNGSGENEGENAQKEG 165 >gi|530695387|gb|AGT39940.1| portal protein [Marine gokushovirus] Length=164 Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust. Identities = 34/129 (26%), Positives = 63/129 (49%), Gaps = 10/129 (8%) Query 126 AFIQSH-KDSVDLHKIMDRFN-AGDASALQKVQGMFGDFSEMPQTYAGLLNHMIEAEQTF 183 + Q H KD ++ I+ + + G +Q+ Q +GDFS++ Y L+ + +A+Q F Sbjct 32 SLTQQHFKDECEVINIIKKHDRNGIIEHVQRGQARYGDFSQVAD-YREALDLVRDAQQEF 90 Query 184 MSLPLETREKFGQS---FHAWLAQAGSVSWLEAMGMVTPPTSQNSAGEPPAAPQSPEGGE 240 MS+P + R+KF F+ +++ + L+ MG + P G+P + P Sbjct 91 MSVPSDIRKKFDNDPGKFYEFVSNPDNKEELKQMGFIETP----EVGKPSSVPTKALSEA 146 Query 241 TDPAPASPA 249 +P+ A A Sbjct 147 GEPSTAQEA 155 >gi|17402853|ref|NP_510875.1| capsid protein VP3 [Guinea pig Chlamydia phage] Length=148 Score = 45.1 bits (105), Expect = 0.007, Method: Compositional matrix adjust. Identities = 35/126 (28%), Positives = 58/126 (46%), Gaps = 20/126 (16%) Query 129 QSHKDSVDLHKIMDRFNA-GDASALQKVQGMFGDFSEMPQTYAGLLNHMIEAEQTFMSLP 187 Q +KD D++ I+ + NA G +++ + D + P Y+ LN +IEA++ F SLP Sbjct 24 QHNKDECDINNIVAKLNATGVLEHVERRSPRYMDCMD-PMEYSEALNVVIEAQEQFDSLP 82 Query 188 LETREKFGQSFHA---WLAQAGSVSWLEAMGMV---------------TPPTSQNSAGEP 229 + RE+FG A +L++ + +A+G V P QN A + Sbjct 83 AKVRERFGNDPEAMLDFLSREENYEEAKALGFVYEDGTSGAPHTFCEADPKDDQNVANQE 142 Query 230 PAAPQS 235 P Q Sbjct 143 PGLAQK 148 >gi|9791182|ref|NP_063897.1| capsid protein VP3 [Chlamydia pneumoniae phage CPAR39] gi|7190961|gb|AAF39721.1| capsid protein VP3 [Chlamydia pneumoniae phage CPAR39] Length=148 Score = 44.7 bits (104), Expect = 0.010, Method: Compositional matrix adjust. Identities = 35/126 (28%), Positives = 58/126 (46%), Gaps = 20/126 (16%) Query 129 QSHKDSVDLHKIMDRFNA-GDASALQKVQGMFGDFSEMPQTYAGLLNHMIEAEQTFMSLP 187 Q +KD D++ I+ + NA G +++ + D + P Y+ LN +IEA++ F SLP Sbjct 24 QHNKDECDINNIVAKLNATGVLEHVERRSPRYMDCMD-PMEYSEALNVVIEAQEQFDSLP 82 Query 188 LETREKFGQSFHA---WLAQAGSVSWLEAMGMV---------------TPPTSQNSAGEP 229 + RE+FG A +L++ + +A+G V P QN A + Sbjct 83 AKVRERFGNDPEAMLDFLSREENYEEAKALGFVYEDGTSGAPQTFFEADPKDDQNVANQE 142 Query 230 PAAPQS 235 P Q Sbjct 143 PGLAQK 148 >gi|77020118|ref|YP_338242.1| putative capsid protein [Chlamydia phage 4] gi|59940017|gb|AAX12546.1| putative capsid protein [Chlamydia phage 4] Length=148 Score = 44.7 bits (104), Expect = 0.011, Method: Compositional matrix adjust. Identities = 35/126 (28%), Positives = 58/126 (46%), Gaps = 20/126 (16%) Query 129 QSHKDSVDLHKIMDRFNA-GDASALQKVQGMFGDFSEMPQTYAGLLNHMIEAEQTFMSLP 187 Q +KD D++ I+ + NA G +++ + D + P Y+ LN +IEA++ F SLP Sbjct 24 QHNKDECDINNIVAKLNATGVLEHVERRSPRYMDCMD-PMEYSEALNVVIEAQEQFDSLP 82 Query 188 LETREKFGQSFHA---WLAQAGSVSWLEAMGMV---------------TPPTSQNSAGEP 229 + RE+FG A +L++ + +A+G V P QN A + Sbjct 83 AKIRERFGNDPEAMLDFLSREENYEEAKALGFVYEDGTSGAPQTFFEADPKDDQNVANQE 142 Query 230 PAAPQS 235 P Q Sbjct 143 PGLAQK 148 Lambda K H a alpha 0.319 0.132 0.411 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 1113602771862