BLASTP 2.2.24 [Aug-08-2010] Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= Eten_1973_orf5 (113 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 14,777,732 sequences; 5,058,227,080 total letters
Score E Sequences producing significant alignments: (bits) Value gi|157133750|ref|XP_001662996.1| nonsense-mediated mrna decay pr... 36 2.3 gi|94468868|gb|ABF18283.1| NMD protein [Aedes aegypti] 36 2.3 gi|242093730|ref|XP_002437355.1| hypothetical protein SORBIDRAFT... 35 2.8 >gi|157133750|ref|XP_001662996.1| nonsense-mediated mrna decay protein [Aedes aegypti] gi|108881501|gb|EAT45726.1| nonsense-mediated mrna decay protein [Aedes aegypti] Length = 510 Score = 35.8 bits (81), Expect = 2.3, Method: Composition-based stats. Identities = 32/114 (28%), Positives = 47/114 (41%), Gaps = 20/114 (17%) Query: 1 QQLLLHFRAHFASFRLGFIPSPRGG-------EVSARRARG--ERALPS---------VH 42 +Q++L +AH + LG PSP GG + AR+ + LP H Sbjct: 179 EQMILKHKAHENT--LGIKPSPSGGLDFYYATDAHARKMVDFVQAVLPVKVTNSKKLISH 236 Query: 43 SFHKTETNDKYIYLVLIYPVISASLVRIGKWRRAQLGCTCTSVVRGHAAAAIQL 96 H N KY Y + + P+ SLV + K + QLG + A A+ L Sbjct: 237 DIHSNSYNYKYSYALDVVPISKGSLVCLNKKLQQQLGHISPICLVTKVANAVHL 290 >gi|94468868|gb|ABF18283.1| NMD protein [Aedes aegypti] Length = 510 Score = 35.8 bits (81), Expect = 2.3, Method: Composition-based stats. Identities = 32/114 (28%), Positives = 47/114 (41%), Gaps = 20/114 (17%) Query: 1 QQLLLHFRAHFASFRLGFIPSPRGG-------EVSARRARG--ERALPS---------VH 42 +Q++L +AH + LG PSP GG + AR+ + LP H Sbjct: 179 EQMILKHKAHENT--LGIKPSPSGGLDFYYATDAHARKMVDFVQAVLPVKVTNSKKLISH 236 Query: 43 SFHKTETNDKYIYLVLIYPVISASLVRIGKWRRAQLGCTCTSVVRGHAAAAIQL 96 H N KY Y + + P+ SLV + K + QLG + A A+ L Sbjct: 237 DIHSNSYNYKYSYALDVVPISKGSLVCLNKKLQQQLGHISPICLVTKVANAVHL 290 >gi|242093730|ref|XP_002437355.1| hypothetical protein SORBIDRAFT_10g025460 [Sorghum bicolor] gi|241915578|gb|EER88722.1| hypothetical protein SORBIDRAFT_10g025460 [Sorghum bicolor] Length = 470 Score = 35.4 bits (80), Expect = 2.8, Method: Composition-based stats. Identities = 24/83 (28%), Positives = 42/83 (50%), Gaps = 10/83 (12%) Query: 19 IPSPRGGEVSARRARGERALPSVHSFHKTETNDKYIYLVLIYPVISASLVRIGKWRRAQL 78 +PS +G SA ARG +P + FH +T+ K + ++ YP++ A+ GK+ + + Sbjct: 123 LPSGKGAR-SADTARGIMLVPILSKFHSLKTSRKGLVAMIEYPLLEAA---TGKFSESNV 178 Query: 79 ------GCTCTSVVRGHAAAAIQ 95 GC +V G AA++ Sbjct: 179 LGVGGFGCVYKAVFDGGVTAAVK 201 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jul 22, 2011 4:42 PM Number of letters in database: 5,058,227,080 Number of sequences in database: 14,777,732 Lambda K H 0.329 0.140 0.427 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 14777732 Number of Hits to DB: 1,020,903,146 Number of extensions: 33046471 Number of successful extensions: 87636 Number of sequences better than 10.0: 3 Number of HSP's gapped: 87904 Number of HSP's successfully gapped: 3 Length of query: 113 Length of database: 5,058,227,080 Length adjustment: 81 Effective length of query: 32 Effective length of database: 3,861,230,788 Effective search space: 123559385216 Effective search space used: 123559385216 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.8 bits) S2: 76 (33.9 bits)