BLASTP 2.2.24 [Aug-08-2010] Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= Eace_0015_orf3 (314 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 14,777,732 sequences; 5,058,227,080 total letters
Score E Sequences producing significant alignments: (bits) Value gi|145505976|ref|XP_001438954.1| hypothetical protein [Parameciu... 42 0.17 gi|324114172|gb|EGC08145.1| type VI secretion protein [Escherich... 39 1.4 gi|325497722|gb|EGC95581.1| hypothetical protein ECD227_1819 [Es... 38 1.6 gi|218549310|ref|YP_002383101.1| hypothetical protein EFER_1972 ... 38 1.6 gi|148236153|ref|NP_001084637.1| protein kinase, cAMP-dependent,... 38 2.0 gi|239813377|ref|YP_002942287.1| FAD dependent oxidoreductase [V... 36 9.9 >gi|145505976|ref|XP_001438954.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124406127|emb|CAK71557.1| unnamed protein product [Paramecium tetraurelia] Length = 750 Score = 41.6 bits (96), Expect = 0.17, Method: Compositional matrix adjust. Identities = 35/130 (26%), Positives = 58/130 (44%), Gaps = 33/130 (25%) Query: 37 VFDTVVVKASSALSPQ----EQMRAARFVREKAKGELYRGAVVDSGYPTSFSVVAVGNET 92 VF V+VK + + PQ Q+ ++++A G F + NET Sbjct: 183 VFSAVLVKDNQIIHPQPLPNTQVEETHQIQQEAHG---------------FDTLQAMNET 227 Query: 93 GTVYGVLK---GHVEDLTCTIEFLGASGLHENFVEAMLLADTLPFM-----------ERL 138 TV G K ++E++ ++ LG S ENF++ ++ DTL + E L Sbjct: 228 VTVKGKKKMQDWNIEEVCIWLDCLGLSQYKENFIKNHMIGDTLHDLTDVELKEELGIEIL 287 Query: 139 GVRKVVLKSI 148 G RK++L+ I Sbjct: 288 GHRKLILQQI 297 >gi|324114172|gb|EGC08145.1| type VI secretion protein [Escherichia fergusonii B253] Length = 345 Score = 38.5 bits (88), Expect = 1.4, Method: Compositional matrix adjust. Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 11/78 (14%) Query: 176 MVRDGSLPPPDTDCITLPRSFRAQYGVLKNFEGRH--RAPIDGKYLDVLELARMMVYEAP 233 V D SL P +D ++L + QY + +F G H ++P+ G YLD ELA + E Sbjct: 56 FVADASLAFPKSDVVSLTKKENGQYAITVSFSGLHGSQSPLPGYYLD--ELALEVAQEEE 113 Query: 234 G-------FIKRRTQELF 244 G F R TQ L+ Sbjct: 114 GLAGFLDLFAHRWTQFLY 131 >gi|325497722|gb|EGC95581.1| hypothetical protein ECD227_1819 [Escherichia fergusonii ECD227] Length = 316 Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust. Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 11/78 (14%) Query: 176 MVRDGSLPPPDTDCITLPRSFRAQYGVLKNFEGRH--RAPIDGKYLDVLELARMMVYEAP 233 V D SL P +D ++L + QY + +F G H ++P+ G YLD ELA + E Sbjct: 27 FVADASLAFPKSDVVSLTKKENGQYAITVSFSGLHGSQSPLPGYYLD--ELALEVAQEEE 84 Query: 234 G-------FIKRRTQELF 244 G F R TQ L+ Sbjct: 85 GLAGFLDLFAHRWTQFLY 102 >gi|218549310|ref|YP_002383101.1| hypothetical protein EFER_1972 [Escherichia fergusonii ATCC 35469] gi|218356851|emb|CAQ89479.1| conserved hypothetical protein, Aec22 [Escherichia fergusonii ATCC 35469] Length = 345 Score = 38.1 bits (87), Expect = 1.6, Method: Compositional matrix adjust. Identities = 27/78 (34%), Positives = 38/78 (48%), Gaps = 11/78 (14%) Query: 176 MVRDGSLPPPDTDCITLPRSFRAQYGVLKNFEGRH--RAPIDGKYLDVLELARMMVYEAP 233 V D SL P +D ++L + QY + +F G H ++P+ G YLD ELA + E Sbjct: 56 FVADASLVFPKSDVVSLTKKENGQYAITVSFSGLHGSQSPLPGYYLD--ELALEVAQEEE 113 Query: 234 G-------FIKRRTQELF 244 G F R TQ L+ Sbjct: 114 GLAGFLDLFAHRWTQFLY 131 >gi|148236153|ref|NP_001084637.1| protein kinase, cAMP-dependent, regulatory, type II, beta [Xenopus laevis] gi|46249649|gb|AAH68931.1| MGC83177 protein [Xenopus laevis] Length = 402 Score = 37.7 bits (86), Expect = 2.0, Method: Compositional matrix adjust. Identities = 26/87 (29%), Positives = 46/87 (52%), Gaps = 7/87 (8%) Query: 118 LHENFVEAMLLADTLPFMERLGVRKVVLKSIPICDAGMV---GLAAELGFVIKASSGDSH 174 L+ENF+E++ +L F ERL V V+ + CD + G A+ F+++ SG+ Sbjct: 250 LYENFIESLPFLKSLEFSERLKVVDVI-GTKTYCDEEQIIAQGDGADSFFIVE--SGEVR 306 Query: 175 LMVRDGSLPPPDTD-CITLPRSFRAQY 200 + ++ S P +T+ + + R R QY Sbjct: 307 ITMKSKSKPDAETNEAVEIARYIRGQY 333 >gi|239813377|ref|YP_002942287.1| FAD dependent oxidoreductase [Variovorax paradoxus S110] gi|239799954|gb|ACS17021.1| FAD dependent oxidoreductase [Variovorax paradoxus S110] Length = 453 Score = 35.8 bits (81), Expect = 9.9, Method: Compositional matrix adjust. Identities = 25/93 (26%), Positives = 41/93 (44%), Gaps = 13/93 (13%) Query: 166 IKASSGDSHLMVRDGSLPPPDTDCITLPRSFRAQYGVLKNFEGRHRAPIDGKYL------ 219 ++A ++ R+G++ P+ D PR +RA ++N GR +D + L Sbjct: 231 VQAQKPGAYRFPREGAILRPNID----PREWRANVTQIRNAAGRAMNGVDARELTEGELE 286 Query: 220 ---DVLELARMMVYEAPGFIKRRTQELFLQVGV 249 + E R M E PGF E+ QVG+ Sbjct: 287 GRRQIAEYFRFMKAEVPGFAHSAIVEIAPQVGI 319 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jul 22, 2011 4:42 PM Number of letters in database: 5,058,227,080 Number of sequences in database: 14,777,732 Lambda K H 0.321 0.136 0.389 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 14777732 Number of Hits to DB: 2,946,963,253 Number of extensions: 118618262 Number of successful extensions: 283644 Number of sequences better than 10.0: 10 Number of HSP's gapped: 284781 Number of HSP's successfully gapped: 10 Length of query: 314 Length of database: 5,058,227,080 Length adjustment: 139 Effective length of query: 175 Effective length of database: 3,004,122,332 Effective search space: 525721408100 Effective search space used: 525721408100 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 81 (35.8 bits)