BLASTP 2.2.24 [Aug-08-2010] Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= Eten_6626_orf3 (177 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 14,777,732 sequences; 5,058,227,080 total letters
Score E Sequences producing significant alignments: (bits) Value gi|325115322|emb|CBZ50877.1| conserved hypothetical protein [Neo... 71 7e-11 gi|221502097|gb|EEE27843.1| hypothetical protein TGVEG_086490 [T... 68 6e-10 gi|237838863|ref|XP_002368729.1| hypothetical protein TGME49_066... 57 1e-06 gi|294948148|ref|XP_002785640.1| conserved hypothetical protein ... 44 0.011 gi|294874922|ref|XP_002767154.1| hypothetical protein Pmar_PMAR0... 41 0.071 gi|209875447|ref|XP_002139166.1| hypothetical protein [Cryptospo... 37 1.2 gi|154422195|ref|XP_001584110.1| Beige/BEACH domain containing p... 36 1.7 gi|326431520|gb|EGD77090.1| hypothetical protein PTSG_07428 [Sal... 36 2.5 gi|71014452|ref|XP_758714.1| hypothetical protein UM02567.1 [Ust... 35 4.4 >gi|325115322|emb|CBZ50877.1| conserved hypothetical protein [Neospora caninum Liverpool] Length = 971 Score = 70.9 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 35/64 (54%), Positives = 48/64 (75%), Gaps = 1/64 (1%) Query: 2 LLPVELELREKNYTAYMKCEIYKFTKDKEEWQKRQEKRSKTRELFKDIIS-DAAIEVNNQ 60 LL VE ELREKNY AY+KCE++KF + +E W RQEK+SKTRELF+D+++ D +E + Sbjct: 732 LLDVEEELREKNYAAYVKCEVHKFKQSEEGWTARQEKKSKTRELFRDLLAEDEGVEATDA 791 Query: 61 EAAD 64 A+ Sbjct: 792 TVAE 795 >gi|221502097|gb|EEE27843.1| hypothetical protein TGVEG_086490 [Toxoplasma gondii VEG] Length = 950 Score = 67.8 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 32/51 (62%), Positives = 42/51 (82%) Query: 2 LLPVELELREKNYTAYMKCEIYKFTKDKEEWQKRQEKRSKTRELFKDIISD 52 LL VE ELREKNY AY+ CEI+ F +++E W RQEK+SKTRELFKD++++ Sbjct: 729 LLEVEDELREKNYAAYVNCEIHTFKRNEEGWATRQEKKSKTRELFKDLLAE 779 >gi|237838863|ref|XP_002368729.1| hypothetical protein TGME49_066400 [Toxoplasma gondii ME49] gi|211966393|gb|EEB01589.1| hypothetical protein TGME49_066400 [Toxoplasma gondii ME49] Length = 960 Score = 56.6 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 24/41 (58%), Positives = 34/41 (82%) Query: 12 KNYTAYMKCEIYKFTKDKEEWQKRQEKRSKTRELFKDIISD 52 KNY AY+ CEI+ F +++E W RQEK+SKTRELFKD++++ Sbjct: 739 KNYAAYVNCEIHTFKRNEEGWATRQEKKSKTRELFKDLLAE 779 >gi|294948148|ref|XP_002785640.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983] gi|239899619|gb|EER17436.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983] Length = 1098 Score = 43.5 bits (101), Expect = 0.011, Method: Composition-based stats. Identities = 20/64 (31%), Positives = 42/64 (65%), Gaps = 4/64 (6%) Query: 2 LLPVELELREKNYTAYMKCEIYKFTKDKEEWQKRQEKRSKTRELFKDII----SDAAIEV 57 L+ +E +++EKN + + +++F EEW+++ +K+S+ RELF DI+ S A +E Sbjct: 974 LVKIEEDIKEKNMVLWKRLGLHEFKVANEEWEEKMQKQSQARELFDDILNLDTSTAGVET 1033 Query: 58 NNQE 61 ++++ Sbjct: 1034 SHKK 1037 >gi|294874922|ref|XP_002767154.1| hypothetical protein Pmar_PMAR006543 [Perkinsus marinus ATCC 50983] gi|239868603|gb|EEQ99871.1| hypothetical protein Pmar_PMAR006543 [Perkinsus marinus ATCC 50983] Length = 516 Score = 40.8 bits (94), Expect = 0.071, Method: Compositional matrix adjust. Identities = 20/59 (33%), Positives = 38/59 (64%), Gaps = 4/59 (6%) Query: 2 LLPVELELREKNYTAYMKCEIYKFTKDKEEWQKRQEKRSKTRELFKDII----SDAAIE 56 L+ +E +++EKN + + +++F EEW+++ +K+S+ RELF DI+ S A +E Sbjct: 391 LVKIEEDIKEKNMVLWKRLGLHEFKVANEEWEEKMQKQSQARELFDDILNLDTSTAGVE 449 >gi|209875447|ref|XP_002139166.1| hypothetical protein [Cryptosporidium muris RN66] gi|209554772|gb|EEA04817.1| hypothetical protein, conserved [Cryptosporidium muris RN66] Length = 816 Score = 37.0 bits (84), Expect = 1.2, Method: Compositional matrix adjust. Identities = 15/43 (34%), Positives = 29/43 (67%), Gaps = 1/43 (2%) Query: 9 LREKNYTAYMKCEIYKFTK-DKEEWQKRQEKRSKTRELFKDII 50 ++ +NY+ Y +CEI + + D++ W Q +++KTR +F DI+ Sbjct: 697 IKSRNYSLYKRCEIPLYLELDEKLWDISQSQKTKTRIMFSDIL 739 >gi|154422195|ref|XP_001584110.1| Beige/BEACH domain containing protein [Trichomonas vaginalis G3] gi|121918355|gb|EAY23124.1| Beige/BEACH domain containing protein [Trichomonas vaginalis G3] Length = 2734 Score = 36.2 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 28/114 (24%), Positives = 55/114 (48%), Gaps = 3/114 (2%) Query: 18 MKCEIYKFTKDKEEWQKRQEKRSKTR---ELFKDIISDAAIEVNNQEAADLLPDNPKGKK 74 ++ E KF K ++E +K EK+ + KD+ + I+ + + + + K +K Sbjct: 1149 IQAEKQKFEKIQDENEKVDEKKEENEVKSNDNKDLDQENKIKNQKSQQENQVKNEEKVEK 1208 Query: 75 AEAESTPLQNEGKGFRKPKNENKNKKGTINGDEEEEINTKGKAEKEAKRAIDEI 128 ES +NEGK K + +++ K N +E+EIN + + ++ D+I Sbjct: 1209 QPNESQTKENEGKQLNKDETKDQTNKNEENPVQEKEINKENVSPVKSDENHDKI 1262 >gi|326431520|gb|EGD77090.1| hypothetical protein PTSG_07428 [Salpingoeca sp. ATCC 50818] Length = 849 Score = 35.8 bits (81), Expect = 2.5, Method: Composition-based stats. Identities = 18/51 (35%), Positives = 29/51 (56%), Gaps = 2/51 (3%) Query: 2 LLPVELELRE--KNYTAYMKCEIYKFTKDKEEWQKRQEKRSKTRELFKDII 50 LL E +L+E + T C + F K+EW ++QE ++ ++FKDII Sbjct: 601 LLAAEAKLKESPQGRTVMRNCRLAHFKWKKQEWMQKQEANARKAKMFKDII 651 >gi|71014452|ref|XP_758714.1| hypothetical protein UM02567.1 [Ustilago maydis 521] gi|74702568|sp|Q4PBE6.1|HAT1_USTMA RecName: Full=Histone acetyltransferase type B catalytic subunit gi|46098504|gb|EAK83737.1| hypothetical protein UM02567.1 [Ustilago maydis 521] Length = 448 Score = 35.0 bits (79), Expect = 4.4, Method: Compositional matrix adjust. Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 3/87 (3%) Query: 12 KNYTAYMKCEIYKFTKDKEEWQKRQEKRSKTRELFKDIISDAAIEVNNQEAADLLPDNPK 71 K Y +K IY+ KD ++Q++RSK +E F+ ++ + ++ + DLL D P Sbjct: 353 KQYRLQVKARIYRQNKDILMQLEKQQQRSKLQETFEGVVEEYG-DMVGVDVEDLLDDGPS 411 Query: 72 GKKA--EAESTPLQNEGKGFRKPKNEN 96 G A A+ Q +G+G R N N Sbjct: 412 GTGALYGADEEEDQEQGQGDRHYSNGN 438 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jul 22, 2011 4:42 PM Number of letters in database: 5,058,227,080 Number of sequences in database: 14,777,732 Lambda K H 0.306 0.128 0.347 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 14777732 Number of Hits to DB: 1,362,009,380 Number of extensions: 55719218 Number of successful extensions: 209916 Number of sequences better than 10.0: 1757 Number of HSP's gapped: 220019 Number of HSP's successfully gapped: 2298 Length of query: 177 Length of database: 5,058,227,080 Length adjustment: 130 Effective length of query: 47 Effective length of database: 3,137,121,920 Effective search space: 147444730240 Effective search space used: 147444730240 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 43 (21.9 bits) S2: 76 (33.9 bits)