bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.24+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: egene_temp_file_orthology_annotation_similarity_blast_database_966
164,496 sequences; 82,071,388 total letters
Query= Eace_0346_orf2
Length=253
Score E
Sequences producing significant alignments: (Bits) Value
tpv:TP04_0751 hypothetical protein; K12842 U2-associated prote... 108 1e-23
bbo:BBOV_III008490 17.m07743; surp module family protein; K128... 103 7e-22
dre:100037380 zgc:163098; K12842 U2-associated protein SR140 91.7 2e-18
pfa:PF14_0028 pre-mRNA splicing factor, putative; K12842 U2-as... 87.8 4e-17
mmu:67958 2610101N10Rik, AU023006, Sr140, U2surp; RIKEN cDNA 2... 83.6 6e-16
hsa:23350 U2SURP, KIAA0332, MGC133197, SR140, fSAPa; U2 snRNP-... 83.6 6e-16
dre:567461 KIAA0332, mp:zf637-1-000918, wu:fa03e02, wu:fb38b01... 83.2 8e-16
ath:AT5G25060 RNA recognition motif (RRM)-containing protein; ... 79.3 1e-14
ath:AT5G10800 RNA recognition motif (RRM)-containing protein 77.0 5e-14
cel:C07H6.4 hypothetical protein; K12842 U2-associated protein... 65.9 1e-10
tgo:TGME49_040710 RRM domain-containing protein 55.1 2e-07
cpv:cgd8_1480 RPR domain containing protein, present in protei... 51.2 3e-06
mmu:237504 Rassf9, AW322107, MGC41694, Pamci; Ras association ... 32.7 1.4
dre:100150148 ras association domain-containing protein 9-like 31.2 3.4
xla:100158456 rassf9, P-CIP1; Ras association (RalGDS/AF-6) do... 30.4 6.9
> tpv:TP04_0751 hypothetical protein; K12842 U2-associated protein
SR140
Length=730
Score = 108 bits (271), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/232 (28%), Positives = 113/232 (48%), Gaps = 28/232 (12%)
Query 1 LLRNITRDRASICAAMSFCMSHGSSSAEVCSCLTESLTLPDTALTMRLARMYLLSDLLAN 60
+LR +T R IC AM F +++ S+ + L P+T + +++ +Y++SD+L N
Sbjct 408 ILRGVTSVRNDICNAMLFVINNSESAYHLTDLLFNYFNDPNTNVQQKISILYVISDVLYN 467
Query 61 SSAQTAHAWTYRHHLEKHLPFIADRWSKVLEDLRLGVGVGGASPRTQQEQWQGVERALYK 120
SS+ ++W YR+ +EKHLP + + E T + Q + A+ K
Sbjct 468 SSSSKQYSWVYRNSIEKHLPQLFHSIKQYKE------------KSTSKISSQQLIDAVMK 515
Query 121 MTKLWEAWGVYPPAFLRGIEATLCGRDITA--TILDPEAAAAAAAKCEAAALELADPQLD 178
+ +W++W VY FL G+EATL G D+ + T+ + E + + ++ D
Sbjct 516 LLSIWDSWTVYSQQFLNGLEATLLGDDLDSFKTLPEFEQHKDLLSTNDGVVVDYFD---- 571
Query 179 GEPLAENLRPLIAQFPLWLRPAAMEWLLLDRYQLDRLCQQRGLGVLPTRSSS 230
L+A PL R A ++LL+ +L +C QRGL V P+ +S
Sbjct 572 ----------LLATLPLKYRETAYKYLLMRLKELKSMCLQRGLLVQPSDRNS 613
> bbo:BBOV_III008490 17.m07743; surp module family protein; K12842
U2-associated protein SR140
Length=717
Score = 103 bits (256), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 71/230 (30%), Positives = 107/230 (46%), Gaps = 24/230 (10%)
Query 1 LLRNITRDRASICAAMSFCMSHGSSSAEVCSCLTESLTLPDTALTMRLARMYLLSDLLAN 60
+L N T R I AM F ++HG S+ +V CL E + + +++R+Y+LSD+L N
Sbjct 393 ILSNATTIRGYIADAMMFMINHGESAVQVTDCLVEYIMKDSPTVDTKISRLYILSDVLYN 452
Query 61 SSAQTAHAWTYRHHLEKHLPFIADRWSKVLEDLRLGVGVGGASPRTQQEQWQGVERALYK 120
+SA W YR EK +P + ++ + E ++ S QE VER L
Sbjct 453 TSASHQFGWIYRLTFEKKIPEV---FAHIREYIK-----SSTSKIAVQELISCVERIL-- 502
Query 121 MTKLWEAWGVYPPAFLRGIEATLCGRDITATILDPEAAAAAAAKCEAAALELADPQLDGE 180
W W YP +L G+E+ L G P + ++ DP DG+
Sbjct 503 --NAWHQWDAYPQEYLYGLESMLWG---------PYPSEFVTLPIFDQHRDIIDPGYDGD 551
Query 181 PLAENLRPLIAQFPLWLRPAAMEWLLLDRYQLDRLCQQRGLGVLP-TRSS 229
L L+++F L RP A E+ L+ +L +LCQ RG+ P TR S
Sbjct 552 EFI--LFDLLSRFTLKWRPVAYEYSLMRLKKLKQLCQNRGVVSTPYTRES 599
> dre:100037380 zgc:163098; K12842 U2-associated protein SR140
Length=874
Score = 91.7 bits (226), Expect = 2e-18, Method: Composition-based stats.
Identities = 64/210 (30%), Positives = 103/210 (49%), Gaps = 19/210 (9%)
Query 1 LLRNITRDRASICAAMSFCMSHGSSSAEVCSCLTESLTLPDTALTMRLARMYLLSDLLAN 60
LLR +T R I AM FC+ ++ E+ SC+TESL++ T L ++AR+YL+SD+L N
Sbjct 436 LLRGLTPRRDEIGDAMLFCLERAEAAEEIVSCITESLSIAHTPLQKKIARLYLISDILYN 495
Query 61 SSAQTAHAWTYRHHLEKHLPFIADRWSKVLEDLRLGVGVGGASPRTQQEQWQGVERALYK 120
S A+ A+A YR E LP I S+ +++ R Q EQ++ + +
Sbjct 496 SCAKVANASYYRKFFETKLPEIFGDISEAYRNIQ---------ARLQAEQFK---QRIMG 543
Query 121 MTKLWEAWGVYPPAFLRGIEATLCG--RDITATILDPEAAAAAAAKCEAAALELADPQLD 178
+ WE W VYP +FL ++ G + I PE + + L ++D
Sbjct 544 CFRAWEDWAVYPDSFLIQLQNIFLGLIKPGEEVIERPEPESPDLDGAPLDGVPLDRGEID 603
Query 179 GEPLAENLRPLIAQFPLWLRPAAMEWLLLD 208
G PL + + PL P+A++ + +D
Sbjct 604 GTPLDD-----LDGSPLTWDPSAIDGVPVD 628
> pfa:PF14_0028 pre-mRNA splicing factor, putative; K12842 U2-associated
protein SR140
Length=655
Score = 87.8 bits (216), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 54/227 (23%), Positives = 94/227 (41%), Gaps = 36/227 (15%)
Query 1 LLRNITRDRASICAAMSFCMSHGSSSAEVCSCLTESLT--LPDTALTMRLARMYLLSDLL 58
+L + + R SIC AM FC H S ++ ++ LT D +++ +YLLSD+L
Sbjct 403 ILSTLNKKRVSICRAMIFCTRHSDYSLDIIKIISNFLTDVKYDLLKKVKINLIYLLSDIL 462
Query 59 ANSSAQTAHAWTYRHHLEKHLP----FIADRWSKVLEDLRLGVGVGGASPRTQQEQWQGV 114
N S + +W+YR H+E LP F+ KV ++ + +
Sbjct 463 YNCSNEFFSSWSYRKHIEDELPRIFYFLRKHIKKVDSKIKGKLFID-------------- 508
Query 115 ERALYKMTKLWEAWGVYPPAFLRGIEATLCGRDITATILDPEAAAAAAAKCEAAALELAD 174
+L + +W W +Y F+ G+ LC I ++ + ++ D
Sbjct 509 --SLINIFNMWNCWAIYNSVFVNGLLCLLCNMKINYIDVNKSEEENDEENIDGHKIQYYD 566
Query 175 PQLDGEPLAENLRPLIAQFPLWLRPAAMEWLLLDRYQLDRLCQQRGL 221
I ++PL LR A + + ++++LC QRGL
Sbjct 567 D--------------IKRYPLSLRRNAYMYFKKNDDEINKLCLQRGL 599
> mmu:67958 2610101N10Rik, AU023006, Sr140, U2surp; RIKEN cDNA
2610101N10 gene; K12842 U2-associated protein SR140
Length=1029
Score = 83.6 bits (205), Expect = 6e-16, Method: Composition-based stats.
Identities = 58/186 (31%), Positives = 91/186 (48%), Gaps = 22/186 (11%)
Query 1 LLRNITRDRASICAAMSFCMSHGSSSAEVCSCLTESLTLPDTALTMRLARMYLLSDLLAN 60
+LR +T + I AM FC+++ ++ E+ C+TESL++ T L ++AR+YL+SD+L N
Sbjct 545 ILRGLTPRKNDIGDAMVFCLNNAEAAEEIVDCITESLSILKTPLPKKIARLYLVSDVLYN 604
Query 61 SSAQTAHAWTYRHHLEKHLPFIADRWSKVLEDLRLGVGVGGASPRTQQEQWQGVERALYK 120
SSA+ A+A YR E L ++ DL A+ RT Q Q
Sbjct 605 SSAKVANASYYRKFFETKL-------CQIFSDL-------NATYRTIQGHLQSENFKQRV 650
Query 121 MT--KLWEAWGVYPPAFLRGIEATLCGRDITATILDPEAAAAAAAKCEAAALELADPQLD 178
MT + WE W +YP FL ++ G I++ + + A +E +LD
Sbjct 651 MTCFRAWEDWAIYPEPFLIKLQNIFLG---LVNIIEEKETEDVPDDLDGAPIE---EELD 704
Query 179 GEPLAE 184
G PL +
Sbjct 705 GAPLED 710
> hsa:23350 U2SURP, KIAA0332, MGC133197, SR140, fSAPa; U2 snRNP-associated
SURP domain containing; K12842 U2-associated protein
SR140
Length=1029
Score = 83.6 bits (205), Expect = 6e-16, Method: Composition-based stats.
Identities = 58/186 (31%), Positives = 91/186 (48%), Gaps = 22/186 (11%)
Query 1 LLRNITRDRASICAAMSFCMSHGSSSAEVCSCLTESLTLPDTALTMRLARMYLLSDLLAN 60
+LR +T + I AM FC+++ ++ E+ C+TESL++ T L ++AR+YL+SD+L N
Sbjct 545 ILRGLTPRKNDIGDAMVFCLNNAEAAEEIVDCITESLSILKTPLPKKIARLYLVSDVLYN 604
Query 61 SSAQTAHAWTYRHHLEKHLPFIADRWSKVLEDLRLGVGVGGASPRTQQEQWQGVERALYK 120
SSA+ A+A YR E L ++ DL A+ RT Q Q
Sbjct 605 SSAKVANASYYRKFFETKL-------CQIFSDL-------NATYRTIQGHLQSENFKQRV 650
Query 121 MT--KLWEAWGVYPPAFLRGIEATLCGRDITATILDPEAAAAAAAKCEAAALELADPQLD 178
MT + WE W +YP FL ++ G I++ + + A +E +LD
Sbjct 651 MTCFRAWEDWAIYPEPFLIKLQNIFLG---LVNIIEEKETEDVPDDLDGAPIE---EELD 704
Query 179 GEPLAE 184
G PL +
Sbjct 705 GAPLED 710
> dre:567461 KIAA0332, mp:zf637-1-000918, wu:fa03e02, wu:fb38b01,
wu:fc75d03; si:dkey-181m9.9; K12842 U2-associated protein
SR140
Length=968
Score = 83.2 bits (204), Expect = 8e-16, Method: Composition-based stats.
Identities = 57/190 (30%), Positives = 90/190 (47%), Gaps = 26/190 (13%)
Query 1 LLRNITRDRASICAAMSFCMSHGSSSAEVCSCLTESLTLPDTALTMRLARMYLLSDLLAN 60
LLR +T ++ I M FC++H ++ E+ C+ ESL++ T L ++AR+YL+SD+L N
Sbjct 480 LLRGLTPRKSDIAGGMFFCLTHADAAEEIVECIAESLSILKTPLPKKIARLYLVSDVLYN 539
Query 61 SSAQTAHAWTYRHHLEKHLPFIADRWSKVLEDLRLGVGVGGASPRTQQE--QWQGVERAL 118
SSA+ +A YR E L ++ DL A+ +T Q Q + ++ +
Sbjct 540 SSAKVTNASYYRKFFETKL-------CQIFSDL-------NATYKTIQGHLQSENFKQRV 585
Query 119 YKMTKLWEAWGVYPPAFLRGIEATLCGRDITATILDPEAAAAAAAKCEAAALE------L 172
+ WE W VYP FL ++ G LDPE +E +
Sbjct 586 MSCFRAWEEWAVYPDPFLIKLQNIFLG----LVSLDPEKEPEELVPELPEKVEDIDGAPI 641
Query 173 ADPQLDGEPL 182
+ +LDG PL
Sbjct 642 VEEELDGAPL 651
> ath:AT5G25060 RNA recognition motif (RRM)-containing protein;
K12842 U2-associated protein SR140
Length=946
Score = 79.3 bits (194), Expect = 1e-14, Method: Composition-based stats.
Identities = 61/232 (26%), Positives = 113/232 (48%), Gaps = 30/232 (12%)
Query 1 LLRNITRDRASICAAMSFCMSHGSSSAEVCSCLTESLTLPDTALTMRLARMYLLSDLLAN 60
+LR +T +R+ I AM F + + ++ EV LTESLTL +T++ ++AR+ L+SD+L N
Sbjct 448 MLRALTLERSQIKEAMGFALDNADAAGEVVEVLTESLTLKETSIPTKVARLMLVSDILHN 507
Query 61 SSAQTAHAWTYRHHLEKHLPFIADRWSKVLEDLRLGVGVGGASPRTQQEQWQGVERALYK 120
SSA+ +A YR E LP I + ++ + + + R E + ER L K
Sbjct 508 SSARVKNASAYRTKFEATLPDIMESFNDLYRSI---------TGRITAEALK--ERVL-K 555
Query 121 MTKLWEAWGVYPPAFLRGIEATLCGRDITATILDPEAAAAAAAKCEAAALELADPQLDGE 180
+ ++W W ++ A++ G+ +T ++ + + C A P+++ +
Sbjct 556 VLQVWADWFLFSDAYIYGLRSTFLRSGVSGV-------TSFHSICGDA------PEIENK 602
Query 181 PLAENLRPL-----IAQFPLWLRPAAMEWLLLDRYQLDRLCQQRGLGVLPTR 227
A+N+ + A + A E + L +L+R C+ GL ++ R
Sbjct 603 SYADNMSDIGKINPDAALAIGKGAARQELMNLPIAELERRCRHNGLSLVGGR 654
> ath:AT5G10800 RNA recognition motif (RRM)-containing protein
Length=947
Score = 77.0 bits (188), Expect = 5e-14, Method: Composition-based stats.
Identities = 63/228 (27%), Positives = 108/228 (47%), Gaps = 22/228 (9%)
Query 1 LLRNITRDRASICAAMSFCMSHGSSSAEVCSCLTESLTLPDTALTMRLARMYLLSDLLAN 60
+LR +T +R+ I AM F + + ++ EV LTESLTL +T++ ++AR+ L+SD++ N
Sbjct 455 MLRALTLERSQIREAMGFALDNAEAAGEVVEVLTESLTLKETSIPTKVARLMLVSDIIHN 514
Query 61 SSAQTAHAWTYRHHLEKHLPFIADRWSKVLEDLRLGVGVGGASPRTQQEQWQGVERALYK 120
SSA+ +A YR E LP I + ++ + + R E + ER L K
Sbjct 515 SSARVKNASAYRTKFEATLPDIMESFNDLYHSVH---------GRITAEALR--ERVL-K 562
Query 121 MTKLWEAWGVYPPAFLRGIEAT-LCGRDITATILDPEAAAAAAAKCEAAALELADPQLDG 179
+ ++W W ++ A++ G+ AT L R+ T A + + + D
Sbjct 563 VLQVWADWFLFSDAYINGLRATFLRSRNFGVTSFHSICGDAPDIEKKGLIGNMNDADKIN 622
Query 180 EPLAENLRPLIAQFPLWLRPAAMEWLLLDRYQLDRLCQQRGLGVLPTR 227
+ A + A+ L RP + +L+R C+ GL +L R
Sbjct 623 QDAALAMGEGAARQELMNRPIS---------ELERRCRHNGLSLLGGR 661
> cel:C07H6.4 hypothetical protein; K12842 U2-associated protein
SR140
Length=933
Score = 65.9 bits (159), Expect = 1e-10, Method: Composition-based stats.
Identities = 39/135 (28%), Positives = 71/135 (52%), Gaps = 17/135 (12%)
Query 1 LLRNITRDRASICAAMSFCMSHGSSSAEVCSCLTESLTLPDTALTMRLARMYLLSDLLAN 60
L+R +T ++ SI A M +C+ + +AE+C C+ +SLT+ D L +++R+YL++D+L+N
Sbjct 533 LIRELTPEKTSIAATMVWCIQNAKYAAEICECVYDSLTIEDIPLYKKISRLYLINDILSN 592
Query 61 SSAQTAH-AWTYRHHLEKHLPFIADRWSKVLEDLRLGVGVG--GASPRTQQEQWQGVERA 117
+ + YR H E +LE + + VG R +QEQ++
Sbjct 593 CVQRNVRDVFLYRSHFE-----------ALLEKIFVAVGKAYRAIPSRIKQEQFKQRVMC 641
Query 118 LYKMTKLWEAWGVYP 132
+++ +E VYP
Sbjct 642 VFRN---FEEMAVYP 653
> tgo:TGME49_040710 RRM domain-containing protein
Length=859
Score = 55.1 bits (131), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 48/83 (57%), Gaps = 4/83 (4%)
Query 137 RGIEATLCGRDITATILDPEAAAAAAAKCEAAALELADPQLDGEPLAENLRPLIAQFPLW 196
+G+EA+L D A + EA A EAA EL D +DG+P+A L+A++PL
Sbjct 524 KGLEASLFSADFPAQLA--EAPEEAGPPAEAA--ELVDDDVDGQPIAAAEACLLAKYPLQ 579
Query 197 LRPAAMEWLLLDRYQLDRLCQQR 219
LR +W LDR +L++LC QR
Sbjct 580 LREQIHQWCRLDRSELEKLCLQR 602
> cpv:cgd8_1480 RPR domain containing protein, present in proteins
involved in mRNA splicing
Length=324
Score = 51.2 bits (121), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/212 (18%), Positives = 87/212 (41%), Gaps = 23/212 (10%)
Query 1 LLRNITRDRASICAAMSFCMSHGSS-SAEVCSCLTESLTLPDTALTMRLARMYLLSDLLA 59
LL ++T R I M+F + + S ++ L + P++ ++++ +Y +SD+L
Sbjct 91 LLDSLTTKRMKINEIMTFVVDYSDQHSVQMIELLIQRF--PNSTFEVKVSILYCISDILY 148
Query 60 NSSAQTAHAWTYRHHLEKHLPFIADRWSKVLEDLRLGVGVGGASPRTQQEQWQGVERALY 119
NS + AW R+ + P++ + S R + +
Sbjct 149 NSHSSKIGAWKLRNCIMNIFPYLVSH-------------ISFQSNRGNSCYIDLINKT-K 194
Query 120 KMTKLWEAWGVYPPAFLRGIEATLCGRDITATILDPEAAAAAAAKCEAAALELADPQLDG 179
+ ++W W ++P ++ G+ ++LC +LD E++D L
Sbjct 195 SIIEIWIDWKIFPSEYIEGLSSSLC----FDKMLDEMKNDKKTKLISNINDEISDGILLD 250
Query 180 EPLAENLR--PLIAQFPLWLRPAAMEWLLLDR 209
+ + L P+ ++ P+W M LL+++
Sbjct 251 KDIVSILSIWPIKSRLPVWKVWREMNNLLINK 282
> mmu:237504 Rassf9, AW322107, MGC41694, Pamci; Ras association
(RalGDS/AF-6) domain family (N-terminal) member 9
Length=435
Score = 32.7 bits (73), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 23/40 (57%), Gaps = 3/40 (7%)
Query 99 VGGASPRTQQEQWQGVERALYKMT---KLWEAWGVYPPAF 135
+G AS E+W+G ERAL +T KLW+AWG P
Sbjct 71 LGKASDYCIVEKWRGSERALPPLTRILKLWKAWGDEQPNM 110
> dre:100150148 ras association domain-containing protein 9-like
Length=435
Score = 31.2 bits (69), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 22/34 (64%), Gaps = 4/34 (11%)
Query 109 EQWQGVERALYKMT---KLWEAWGVYPPAFLRGI 139
E+W+G ERAL +T +LW AWG P F++ +
Sbjct 81 ERWKGFERALPPLTRILRLWNAWGDEKP-FIKFV 113
> xla:100158456 rassf9, P-CIP1; Ras association (RalGDS/AF-6)
domain family (N-terminal) member 9
Length=420
Score = 30.4 bits (67), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 13/24 (54%), Positives = 17/24 (70%), Gaps = 3/24 (12%)
Query 109 EQWQGVERAL---YKMTKLWEAWG 129
E+W+G ER L K+ KLW+AWG
Sbjct 72 EKWRGFERVLPPPTKILKLWKAWG 95
Lambda K H
0.321 0.132 0.408
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 9148686404
Database: egene_temp_file_orthology_annotation_similarity_blast_database_966
Posted date: Sep 16, 2011 8:45 PM
Number of letters in database: 82,071,388
Number of sequences in database: 164,496
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40