bitscore colors: <40, 40-50 , 50-80, 80-200, >200
BLASTP 2.2.24+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: egene_temp_file_orthology_annotation_similarity_blast_database_865 164,496 sequences; 82,071,388 total letters Query= Emax_1655_orf1 Length=124 Score E Sequences producing significant alignments: (Bits) Value tgo:TGME49_114280 hypothetical protein ; K13205 A1 cistron-spl... 62.4 3e-10 hsa:25980 C20orf4, DKFZp564N1363, bA234K24.2; chromosome 20 op... 38.5 0.005 dre:431768 im:7153138; zgc:92018; K13205 A1 cistron-splicing f... 38.1 0.006 xla:446520 c20orf4, MGC80226; chromosome 20 open reading frame... 38.1 0.007 mmu:68295 0610011L14Rik; RIKEN cDNA 0610011L14 gene; K13205 A1... 37.7 0.009 xla:379159 MGC130823; hypothetical protein MGC52890 34.3 0.10 cel:F10B5.2 hypothetical protein; K13205 A1 cistron-splicing f... 32.3 0.39 ath:AT3G02760 ATP binding / aminoacyl-tRNA ligase/ histidine-t... 30.8 1.2 hsa:100510383 hypothetical protein LOC100510383 30.0 1.7 pfa:MAL13P1.28 conserved Plasmodium protein, unknown function 30.0 2.1 pfa:PF14_0333 conserved Plasmodium protein, unknown function 29.6 2.5 cel:Y37E3.17 hypothetical protein 29.6 2.5 dre:556023 dynein, axonemal, heavy chain 2-like; K10408 dynein... 29.3 3.3 hsa:100507035 Golgin subfamily A member 8-like protein 1-like 29.3 3.6 xla:398229 bcan, Xbcan; brevican; K06795 brevican 28.9 4.7 hsa:23030 KDM4B, FLJ44906, JMJD2B, KIAA0876; lysine (K)-specif... 28.5 5.2 mmu:236798 Gpr112, Gm367, PGR17; G protein-coupled receptor 11... 28.1 7.2 cel:C49F5.2 set-6; SET (trithorax/polycomb) domain containing ... 28.1 7.3 sce:YBR136W MEC1, ESR1, SAD3; Genome integrity checkpoint prot... 28.1 7.7 dre:321166 apobl, cb235, hm:zeh1207, sb:cb235, si:ch211-219i10... 27.7 8.6 ath:AT1G54920 hypothetical protein 27.7 9.1 > tgo:TGME49_114280 hypothetical protein ; K13205 A1 cistron-splicing factor AAR2 Length=653 Score = 62.4 bits (150), Expect = 3e-10, Method: Composition-based stats. Identities = 28/72 (38%), Positives = 42/72 (58%), Gaps = 0/72 (0%) Query 1 LAHHFPSLLHWRELIQLLSNAGRAVYLLPGLYGKFLDTFYAQLSQSPDDIILGPLEEGSF 60 L H F W++L+ LL + RA+ LP +Y FL + QL Q PDD++ L + SF Sbjct 475 LGHRFDGFEQWKQLLLLLCHCERAIRFLPEMYVSFLRLLHCQLEQCPDDLLSDSLLQKSF 534 Query 61 LSSSCASLLENC 72 L SS ++L++ C Sbjct 535 LFSSASALVDTC 546 > hsa:25980 C20orf4, DKFZp564N1363, bA234K24.2; chromosome 20 open reading frame 4; K13205 A1 cistron-splicing factor AAR2 Length=384 Score = 38.5 bits (88), Expect = 0.005, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 41/85 (48%), Gaps = 2/85 (2%) Query 1 LAHHFPSLLHWRELIQLLSNAGRAVYLLPGLYGKFLDTFYAQLSQSPDDIILGPLEEGSF 60 L + + + HW+ L+ LL + A+ LY + Y QL + P D + + + +F Sbjct 264 LGNVYEAFEHWKRLLNLLCRSEAAMMKHHTLYINLISILYHQLGEIPADFFVDIVSQDNF 323 Query 61 LSSSCASLLEN-CSY-IDRQLAKKA 83 L+S+ + CS +D L KKA Sbjct 324 LTSTLQVFFSSACSIAVDATLRKKA 348 > dre:431768 im:7153138; zgc:92018; K13205 A1 cistron-splicing factor AAR2 Length=384 Score = 38.1 bits (87), Expect = 0.006, Method: Composition-based stats. Identities = 22/81 (27%), Positives = 38/81 (46%), Gaps = 2/81 (2%) Query 5 FPSLLHWRELIQLLSNAGRAVYLLPGLYGKFLDTFYAQLSQSPDDIILGPLEEGSFLSSS 64 + + HW+ L+ LL + A+ LY + Y QL + P D L + + +FL+S+ Sbjct 268 YEAFEHWKSLLALLCRSEEAMKDRKELYLGLIAVLYHQLGEIPPDFFLDIVSQSNFLTST 327 Query 65 CASLLENCSY--IDRQLAKKA 83 + S +D L K+A Sbjct 328 LQDFFQFASSPGVDSTLRKRA 348 > xla:446520 c20orf4, MGC80226; chromosome 20 open reading frame 4; K13205 A1 cistron-splicing factor AAR2 Length=388 Score = 38.1 bits (87), Expect = 0.007, Method: Composition-based stats. Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 0/69 (0%) Query 1 LAHHFPSLLHWRELIQLLSNAGRAVYLLPGLYGKFLDTFYAQLSQSPDDIILGPLEEGSF 60 L + + + W+ L+ LL A P LY K + Y QL+Q P D + + + +F Sbjct 258 LGNVYEAFEQWKSLLNLLCRAETFSLQHPELYIKVISVLYHQLAQVPTDFFVDIVSQNNF 317 Query 61 LSSSCASLL 69 L+S+ L Sbjct 318 LTSTLQVLF 326 > mmu:68295 0610011L14Rik; RIKEN cDNA 0610011L14 gene; K13205 A1 cistron-splicing factor AAR2 Length=384 Score = 37.7 bits (86), Expect = 0.009, Method: Composition-based stats. Identities = 23/85 (27%), Positives = 41/85 (48%), Gaps = 2/85 (2%) Query 1 LAHHFPSLLHWRELIQLLSNAGRAVYLLPGLYGKFLDTFYAQLSQSPDDIILGPLEEGSF 60 L + + + HW+ L+ LL + A+ LY + Y QL + P D + + + +F Sbjct 264 LGNVYEAFEHWKRLLNLLCRSESAMGKYHALYISLISILYHQLGEIPADFFVDIVSQDNF 323 Query 61 LSSSCASLLEN-CSY-IDRQLAKKA 83 L+S+ + CS ++ L KKA Sbjct 324 LTSTLQVFFSSACSIAVEATLRKKA 348 > xla:379159 MGC130823; hypothetical protein MGC52890 Length=330 Score = 34.3 bits (77), Expect = 0.10, Method: Compositional matrix adjust. Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 6/68 (8%) Query 50 IILGPLEEGS--FLSSSCASLLE----NCSYIDRQLAKKALIQQITNNIQEKKQKKIQNI 103 II G +E G+ +L+ CA+LL+ NC +D AL Q NN++ + I Sbjct 92 IIHGFIERGTDKWLTHMCANLLQVEDVNCLCVDWAGGSYALFSQAANNVRVVGAEVAHFI 151 Query 104 HQLSNKKG 111 LSNK G Sbjct 152 QLLSNKYG 159 > cel:F10B5.2 hypothetical protein; K13205 A1 cistron-splicing factor AAR2 Length=357 Score = 32.3 bits (72), Expect = 0.39, Method: Compositional matrix adjust. Identities = 15/67 (22%), Positives = 31/67 (46%), Gaps = 0/67 (0%) Query 5 FPSLLHWRELIQLLSNAGRAVYLLPGLYGKFLDTFYAQLSQSPDDIILGPLEEGSFLSSS 64 F W+ +I L+S ++ L+ F+ + QL + P D + + +FL+++ Sbjct 258 FEGFEQWKRIIHLMSCCPNSLGSEKELFMSFIRVLFFQLKECPTDFFVDIVSRDNFLTTT 317 Query 65 CASLLEN 71 + L N Sbjct 318 LSMLFAN 324 > ath:AT3G02760 ATP binding / aminoacyl-tRNA ligase/ histidine-tRNA ligase/ nucleotide binding (EC:6.1.1.21); K01892 histidyl-tRNA synthetase [EC:6.1.1.21] Length=883 Score = 30.8 bits (68), Expect = 1.2, Method: Composition-based stats. Identities = 22/72 (30%), Positives = 34/72 (47%), Gaps = 7/72 (9%) Query 57 EGSFLSSSCASLLENCSYIDRQLAKKALIQQITNNIQ----EKKQKKIQNIHQLSNKKGK 112 E L SCASL+ CS ID K + QI +++ E + I + L + G Sbjct 120 ENIVLEKSCASLIGICSIIDH---KSTTLSQIVDSVAALSCEVTKADIASFSLLDSGDGN 176 Query 113 GNKNNIIIQEDM 124 G+K+ I + D+ Sbjct 177 GDKDVIGVAGDL 188 > hsa:100510383 hypothetical protein LOC100510383 Length=389 Score = 30.0 bits (66), Expect = 1.7, Method: Composition-based stats. Identities = 22/63 (34%), Positives = 30/63 (47%), Gaps = 1/63 (1%) Query 45 QSPDDIILGPLEEGSFLSSSCASLLENCSYIDRQLAKKAL-IQQITNNIQEKKQKKIQNI 103 QSP D G EG S++ L C L +++ I Q+ N I+ KQ+K Q Sbjct 323 QSPGDSATGIHGEGPTSSATLKDLESPCQEPAVVLNPRSVKISQLKNTIKSLKQQKKQVE 382 Query 104 HQL 106 HQL Sbjct 383 HQL 385 > pfa:MAL13P1.28 conserved Plasmodium protein, unknown function Length=609 Score = 30.0 bits (66), Expect = 2.1, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 41/90 (45%), Gaps = 2/90 (2%) Query 1 LAHHFPSLLHWRELIQLLSNAGRAVYLLPGLYGKFLDTFYAQLSQSPDDIILGPLEEGSF 60 L +++ S + WR+L +L++N+ V + + L QLS +D + SF Sbjct 490 LGYNYQSFIQWRKLFELMTNSSYLVTNSTNFFEEVLKAVCIQLSYLSEDFFDN--SDNSF 547 Query 61 LSSSCASLLENCSYIDRQLAKKALIQQITN 90 + ++ E + ID ++ K + I N Sbjct 548 ILFGIYNIYEIINNIDEKIRPKNITTLIDN 577 > pfa:PF14_0333 conserved Plasmodium protein, unknown function Length=359 Score = 29.6 bits (65), Expect = 2.5, Method: Compositional matrix adjust. Identities = 20/61 (32%), Positives = 32/61 (52%), Gaps = 7/61 (11%) Query 54 PLEEGSFLSSSCASLLENCSYIDRQLAKKALIQQITNNIQEKKQKKIQNIHQLSNKKGKG 113 P ++GSFL +EN I KK +IQ++ N + + Q ++ N+ + NKKG Sbjct 271 PQKDGSFL-------IENVDPIKYNNKKKLIIQELLNEHENRFQSQVSNLKKKINKKGYT 323 Query 114 N 114 N Sbjct 324 N 324 > cel:Y37E3.17 hypothetical protein Length=837 Score = 29.6 bits (65), Expect = 2.5, Method: Composition-based stats. Identities = 15/42 (35%), Positives = 25/42 (59%), Gaps = 1/42 (2%) Query 23 RAVYLLPGLYGKFLDTFYAQLSQSPDDI-ILGPLEEGSFLSS 63 RA L+PGL G +D A S +PD ++GP ++ ++S+ Sbjct 339 RACDLIPGLQGSKVDARAAVFSMTPDGYPLVGPYDKNYWMST 380 > dre:556023 dynein, axonemal, heavy chain 2-like; K10408 dynein heavy chain, axonemal Length=4424 Score = 29.3 bits (64), Expect = 3.3, Method: Composition-based stats. Identities = 23/49 (46%), Positives = 26/49 (53%), Gaps = 8/49 (16%) Query 44 SQSPDDI--ILGPLEEGSFLSSSCASLLENCSYIDRQLAKKAL--IQQI 88 +QS DI LGPLEE SF S CA L I QL K + IQ+I Sbjct 259 AQSVSDIGDTLGPLEEISFWKSRCADL----DSISTQLQKPGVRHIQEI 303 > hsa:100507035 Golgin subfamily A member 8-like protein 1-like Length=123 Score = 29.3 bits (64), Expect = 3.6, Method: Compositional matrix adjust. Identities = 24/75 (32%), Positives = 34/75 (45%), Gaps = 1/75 (1%) Query 33 GKFLDTFYAQLSQSPDDIILGPLEEGSFLSSSCASLLENCSYIDRQLAKKAL-IQQITNN 91 G +T + QSP D G EG S++ L C L +++ I Q+ N Sbjct 45 GSIPETATSGGCQSPGDSATGIHGEGPTSSATLKDLESPCQEPAVVLNPRSVKISQLKNT 104 Query 92 IQEKKQKKIQNIHQL 106 I+ KQ+K Q HQL Sbjct 105 IKSLKQQKKQVEHQL 119 > xla:398229 bcan, Xbcan; brevican; K06795 brevican Length=1152 Score = 28.9 bits (63), Expect = 4.7, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 36/82 (43%), Gaps = 10/82 (12%) Query 37 DTFYAQLSQSPDDIILGPLEEGSFLSSSCASLLENCSYIDRQLAKKALIQQITNNIQEKK 96 D FY ++ PD+ F S +L + + + + Q+ NN EK+ Sbjct 535 DYFYDNITNLPDE----------FGSEESETLNQTFNTTTQSPQRSLETDQLMNNHGEKE 584 Query 97 QKKIQNIHQLSNKKGKGNKNNI 118 +++ NI +NK+ + + N + Sbjct 585 NEEVYNISVTTNKQSEASSNPV 606 > hsa:23030 KDM4B, FLJ44906, JMJD2B, KIAA0876; lysine (K)-specific demethylase 4B; K06709 jumonji domain-containing protein 2 [EC:1.14.11.-] Length=1096 Score = 28.5 bits (62), Expect = 5.2, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 34/79 (43%), Gaps = 12/79 (15%) Query 33 GKFLDTFYAQLSQSPDDIILGPLEEGSFLSSSCASLLENCSYIDRQLAKKALIQQITNNI 92 G + D Y + S D + LGP EG L + D L K I +T++I Sbjct 955 GSYSDNLYPESITSRDCVQLGPPSEGE---------LVELRWTDGNLYKAKFISSVTSHI 1005 Query 93 QEKKQKKIQNIHQLSNKKG 111 Q + ++ QL+ K+G Sbjct 1006 Y---QVEFEDGSQLTVKRG 1021 > mmu:236798 Gpr112, Gm367, PGR17; G protein-coupled receptor 112; K08455 G protein-coupled receptor 112 Length=3056 Score = 28.1 bits (61), Expect = 7.2, Method: Composition-based stats. Identities = 30/95 (31%), Positives = 44/95 (46%), Gaps = 15/95 (15%) Query 27 LLPGLYGKFLDTFYAQLS-QSPDDI---ILG------PL--EEGSFLSSSCASLLENCSY 74 LL GL K +D +S ++ DD+ IL PL EE + S A + NC Sbjct 2381 LLQGLPDKIVDLANITISDENADDVAEHILNLVNESPPLDEEETKIIVSKVADI-SNCDE 2439 Query 75 IDRQLAKKALIQQITNNIQEKKQKKIQNIHQLSNK 109 I L + +I QI N + EK+ N+ +SN+ Sbjct 2440 ISINLTQ--IILQILNTVTEKQSDSASNLPPVSNE 2472 > cel:C49F5.2 set-6; SET (trithorax/polycomb) domain containing family member (set-6) Length=708 Score = 28.1 bits (61), Expect = 7.3, Method: Composition-based stats. Identities = 9/27 (33%), Positives = 21/27 (77%), Gaps = 0/27 (0%) Query 86 QQITNNIQEKKQKKIQNIHQLSNKKGK 112 + I+NN+ EK+++ +++ H+L +K+ K Sbjct 89 RAISNNLTEKRRRSVESFHRLHSKERK 115 > sce:YBR136W MEC1, ESR1, SAD3; Genome integrity checkpoint protein and PI kinase superfamily member; signal transducer required for cell cycle arrest and transcriptional responses prompted by damaged or unreplicated DNA; monitors and participates in meiotic recombination (EC:2.7.11.1); K02543 cell cycle checkpoint protein MEC1 Length=2368 Score = 28.1 bits (61), Expect = 7.7, Method: Compositional matrix adjust. Identities = 20/65 (30%), Positives = 31/65 (47%), Gaps = 5/65 (7%) Query 1 LAHHFPSLLHWRELIQLLSNAGRAVYLLPGLYGKFLDTFYAQLSQSPDDIILGPLEEGSF 60 LA +PS + W + SN+ + V L GK + Y Q SQ+P D++ L+ Sbjct 1926 LAVEYPSHILWYITALVNSNSSKRV-----LRGKHILEKYRQHSQNPHDLVSSALDLTKA 1980 Query 61 LSSSC 65 L+ C Sbjct 1981 LTRVC 1985 > dre:321166 apobl, cb235, hm:zeh1207, sb:cb235, si:ch211-219i10.1, wu:fb39c11, wu:fc51e12, zeh1207; apolipoprotein B, like Length=3730 Score = 27.7 bits (60), Expect = 8.6, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 39/76 (51%), Gaps = 8/76 (10%) Query 28 LPGLYGKFLDTFYAQLSQSPDDIILGPLEEGSFLSSSCASLLENCSYIDRQLAKKALIQQ 87 L G+ G F DT ++ + D + G + S +L N I Q A ++++++ Sbjct 662 LIGIDGFFRDTMQKTINYAADKVPRG----NDIMQSMFPTLWNN---IKMQKAPQSIVKE 714 Query 88 ITNNIQEKKQK-KIQN 102 ITNN+ + QK K+Q+ Sbjct 715 ITNNVNKLIQKLKVQD 730 > ath:AT1G54920 hypothetical protein Length=766 Score = 27.7 bits (60), Expect = 9.1, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 15/80 (18%) Query 50 IILGPLEEGSF-------LSSSCASLLENCSYIDRQLAKKALIQQITNNIQEKKQKKIQN 102 I LG LE+ S +SS+CA L + + I +Q+ I N++QE+ K Q Sbjct 693 INLGSLEQQSKYASTWFEISSTCAQELRHAASIWKQV--------IKNDVQEEILSKPQE 744 Query 103 IHQLSNKKGKGNKNNIIIQE 122 + LS + + +KN+ I + Sbjct 745 LCSLSGRNLQSSKNSESINQ 764 Lambda K H 0.318 0.135 0.386 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 2069971060 Database: egene_temp_file_orthology_annotation_similarity_blast_database_865 Posted date: Sep 17, 2011 11:19 AM Number of letters in database: 82,071,388 Number of sequences in database: 164,496 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40