BLASTP 2.2.24 [Aug-08-2010] Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= Eten_4023_orf2 (170 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 14,777,732 sequences; 5,058,227,080 total letters
Score E Sequences producing significant alignments: (bits) Value gi|237832557|ref|XP_002365576.1| hypothetical protein, conserved... 57 9e-07 gi|221488028|gb|EEE26242.1| conserved hypothetical protein [Toxo... 57 9e-07 gi|325118419|emb|CBZ53970.1| conserved hypothetical protein [Neo... 50 1e-04 gi|156088155|ref|XP_001611484.1| hypothetical protein [Babesia b... 41 0.059 gi|124504897|ref|XP_001351191.1| conserved protein, unknown func... 37 0.86 gi|83273695|ref|XP_729511.1| hypothetical protein [Plasmodium yo... 35 5.4 >gi|237832557|ref|XP_002365576.1| hypothetical protein, conserved [Toxoplasma gondii ME49] gi|211963240|gb|EEA98435.1| hypothetical protein, conserved [Toxoplasma gondii ME49] Length = 536 Score = 57.0 bits (136), Expect = 9e-07, Method: Composition-based stats. Identities = 32/97 (32%), Positives = 40/97 (41%), Gaps = 28/97 (28%) Query: 89 IDWAFWERSIAHKDIVNCLRSHYEQQESAYGRLLGXXXXX-------------------- 128 IDWA WE IAHKDI+NCL++ Y Q R LG Sbjct: 62 IDWAKWESQIAHKDILNCLKTFYTNQVQILDRALGALETAKTPAPCEGAEKGWALFDAAL 121 Query: 129 --------XXXXXXXKGAAALWLSCRNPPLSALSTND 157 GA ALW+SC NPP+ ++TN+ Sbjct: 122 SACAKSVEKSEELLSNGARALWVSCSNPPVWKVNTNE 158 >gi|221488028|gb|EEE26242.1| conserved hypothetical protein [Toxoplasma gondii GT1] gi|221508549|gb|EEE34118.1| conserved hypothetical protein [Toxoplasma gondii VEG] Length = 536 Score = 57.0 bits (136), Expect = 9e-07, Method: Composition-based stats. Identities = 32/97 (32%), Positives = 40/97 (41%), Gaps = 28/97 (28%) Query: 89 IDWAFWERSIAHKDIVNCLRSHYEQQESAYGRLLGXXXXX-------------------- 128 IDWA WE IAHKDI+NCL++ Y Q R LG Sbjct: 62 IDWAKWESQIAHKDILNCLKTFYTNQVQILDRALGALETAKTPAPCEGAEKGWALFDAAL 121 Query: 129 --------XXXXXXXKGAAALWLSCRNPPLSALSTND 157 GA ALW+SC NPP+ ++TN+ Sbjct: 122 SACAKSVEKSEELLSNGARALWVSCSNPPVWKVNTNE 158 >gi|325118419|emb|CBZ53970.1| conserved hypothetical protein [Neospora caninum Liverpool] Length = 536 Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 29/97 (29%), Positives = 39/97 (40%), Gaps = 28/97 (28%) Query: 89 IDWAFWERSIAHKDIVNCLRSHYEQQESAYGRLLGX------------------------ 124 IDWA WE IAHKDI++C+++ Y Q LG Sbjct: 62 IDWAKWEGQIAHKDILHCMKTFYTSQVQILDLALGALEKARTPAPCEGAEKGWALYDAAL 121 Query: 125 ----XXXXXXXXXXXKGAAALWLSCRNPPLSALSTND 157 GA ALW+SC NPP+ ++TN+ Sbjct: 122 RACTKSVEKSEELLANGARALWVSCNNPPVWKVNTNE 158 >gi|156088155|ref|XP_001611484.1| hypothetical protein [Babesia bovis T2Bo] gi|154798738|gb|EDO07916.1| conserved hypothetical protein [Babesia bovis] Length = 548 Score = 40.8 bits (94), Expect = 0.059, Method: Composition-based stats. Identities = 24/94 (25%), Positives = 36/94 (38%), Gaps = 22/94 (23%) Query: 86 NKGIDWAFWERSIAHKDIVNCLRSHYEQQESAYGRLLGX--------------------- 124 N+G+DW WE+ IAHKDI+ ++ Y+ + R Sbjct: 67 NEGVDWDKWEKLIAHKDILQHMKRTYDANMAMIERTANMEGDMSHLDTDWELYENARQNC 126 Query: 125 -XXXXXXXXXXXKGAAALWLSCRNPPLSALSTND 157 G+ ALW+S NPP + TN+ Sbjct: 127 NQATRTVKKIIADGSKALWVSQNNPPAWKVDTNE 160 >gi|124504897|ref|XP_001351191.1| conserved protein, unknown function [Plasmodium falciparum 3D7] gi|7672220|emb|CAA15604.2| conserved protein, unknown function [Plasmodium falciparum 3D7] Length = 609 Score = 37.0 bits (84), Expect = 0.86, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 37/94 (39%), Gaps = 22/94 (23%) Query: 86 NKGIDWAFWERSIAHKDIVNCLRSHYEQQESAYGRLLGXXXXXXXXXXXXK--------- 136 +K IDW W I++K++++C+++ Y+ Q L G K Sbjct: 62 SKSIDWVDWNEKISNKELLSCMKNFYDNQMKMLQELGGEKEGNLKEGEEEKIFEDALKNC 121 Query: 137 -------------GAAALWLSCRNPPLSALSTND 157 GA LW++ NP +S + N+ Sbjct: 122 KESEEISKKLLVDGAKTLWINFHNPNVSNVDNNE 155 >gi|83273695|ref|XP_729511.1| hypothetical protein [Plasmodium yoelii yoelii str. 17XNL] gi|23487522|gb|EAA21076.1| hypothetical protein [Plasmodium yoelii yoelii] Length = 572 Score = 34.7 bits (78), Expect = 5.4, Method: Composition-based stats. Identities = 21/95 (22%), Positives = 36/95 (37%), Gaps = 24/95 (25%) Query: 87 KGIDWAFWERSIAHKDIVNCLRSHYEQQESAYGRLLGXXXXXXXXXXXXK---------- 136 K IDW W I++K+++ C+++ Y+ Q SA + + Sbjct: 63 KKIDWNKWNEKISNKELLLCMKNFYDNQMSALEAMEEGEKKESGSKKSEEDKLFEEALNN 122 Query: 137 --------------GAAALWLSCRNPPLSALSTND 157 GA LW+S NP ++ L N+ Sbjct: 123 CKKAEETSAKLLIDGAKTLWISFHNPSVNNLDNNE 157 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jul 22, 2011 4:42 PM Number of letters in database: 5,058,227,080 Number of sequences in database: 14,777,732 Lambda K H 0.321 0.135 0.435 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 14777732 Number of Hits to DB: 1,048,385,600 Number of extensions: 23041888 Number of successful extensions: 29523 Number of sequences better than 10.0: 7 Number of HSP's gapped: 29563 Number of HSP's successfully gapped: 7 Length of query: 170 Length of database: 5,058,227,080 Length adjustment: 129 Effective length of query: 41 Effective length of database: 3,151,899,652 Effective search space: 129227885732 Effective search space used: 129227885732 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 76 (33.9 bits)