bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.24+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: egene_temp_file_orthology_annotation_similarity_blast_database_865
164,496 sequences; 82,071,388 total letters
Query= Emax_0234_orf1
Length=174
Score E
Sequences producing significant alignments: (Bits) Value
tgo:TGME49_004130 membrane-attack complex / perforin domain-co... 127 3e-29
tgo:TGME49_072430 membrane-attack complex / perforin domain-co... 109 6e-24
pfa:PFL0805w MAC/Perforin, putative 84.0 3e-16
tpv:TP01_0164 hypothetical protein 83.6 3e-16
pfa:PFD0430c MAC/Perforin, putative; K13834 sporozoite microne... 73.2 5e-13
bbo:BBOV_IV001370 21.m02755; MAC/perforin domain containing pr... 68.9 8e-12
pfa:PFI1145w MAC/Perforin, putative 50.8 2e-06
bbo:BBOV_II001970 18.m09950; mac/perforin domain containing me... 42.7 6e-04
pfa:PF08_0052 perforin like protein 5 32.3
mmu:20537 Slc5a1, Sglt1; solute carrier family 5 (sodium/gluco... 30.8 2.7
mmu:23801 Aloxe3, MGC143829, MGC143830, e-LOX-3; arachidonate ... 30.4 3.0
hsa:59344 ALOXE3, E-LOX, MGC119694, MGC119695, MGC119696, eLOX... 30.0 3.9
tgo:TGME49_032360 RNA polymerase Rpb1 C-terminal repeat domain... 30.0 3.9
hsa:79009 DDX50, GU2, GUB, MGC3199, RH-II/GuB; DEAD (Asp-Glu-A... 28.9 9.9
> tgo:TGME49_004130 membrane-attack complex / perforin domain-containing
protein
Length=1054
Score = 127 bits (318), Expect = 3e-29, Method: Composition-based stats.
Identities = 70/179 (39%), Positives = 99/179 (55%), Gaps = 9/179 (5%)
Query 1 PPGFAMCDPHERILFGFAFQVNFLDDGAIAERAEIKSCPAGRDKCDGLERPAKEGDDARI 60
PPGFA C + ++ GFA +NF + G + I SCP GR+KCDG+ + E D+ RI
Sbjct 732 PPGFAKCPEGQVVILGFAMHLNFKEPGT--DNFRIISCPPGREKCDGVGTASSETDEGRI 789
Query 61 FALCGAETITGLEQVVVQSPLKA-----VAVCPQGSLILTGFSLSLTGGREGPLRTGFFP 115
+ LCG E I ++QVV +SP A A CP ++++ GF +S+ GG +G
Sbjct 790 YILCGEEPINEIQQVVAESPAHAGASVLEASCPDETVVVGGFGISVRGGSDGLDSFSIES 849
Query 116 CRAGLPTCTALGVRGTQQNMVWVACVEDTTPGLQRLTNVGA-AVVGVATKRSY-SDGLV 172
C G CT RG+++N +W+ CV+ PGL+ L NV G A KR+ SDG V
Sbjct 850 CTTGQTICTKAPTRGSEKNFLWMMCVDKQYPGLRELVNVAELGSHGNANKRAVNSDGNV 908
Score = 37.0 bits (84), Expect = 0.037, Method: Composition-based stats.
Identities = 22/98 (22%), Positives = 40/98 (40%), Gaps = 2/98 (2%)
Query 67 ETITGLEQVVVQSPLKAVAVCPQGSLILTGFSLSLTGGREGPLRTGFFPCRAGLPTCTAL 126
+ +T QV P A CP+G +++ GF++ L G C G C +
Sbjct 718 KQLTQATQVAWSGPPPGFAKCPEGQVVILGFAMHLNFKEPGTDNFRIISCPPGREKCDGV 777
Query 127 GVRG--TQQNMVWVACVEDTTPGLQRLTNVGAAVVGVA 162
G T + +++ C E+ +Q++ A G +
Sbjct 778 GTASSETDEGRIYILCGEEPINEIQQVVAESPAHAGAS 815
> tgo:TGME49_072430 membrane-attack complex / perforin domain-containing
protein ; K13834 sporozoite microneme protein 2
Length=854
Score = 109 bits (272), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 89/179 (49%), Gaps = 6/179 (3%)
Query 1 PPGFAMCDPHERILFGFAFQVNF-LDDGAIAERAEIKSCPAGRDKCDGLERPAKEGDDAR 59
PPG+A C + +LFGFA + NF + I C AGR+KCDG+ GDD R
Sbjct 595 PPGYARCPREQVVLFGFAMRFNFKVTISNNLANYHIAPCTAGREKCDGIGAEEAAGDDER 654
Query 60 IFALCGAETITGLEQVVVQSPLK---AVAVCPQGSLILTGFSLSLTGGREGPLRTGFFPC 116
I+ CG E + QVV ++ AVA CP+ ++I GF +S+ G T PC
Sbjct 655 IYMACGPEVVNEFYQVVAETEAGENVAVATCPEDTVIAFGFGISIGTGFYSSENTQVEPC 714
Query 117 RAGLPTCTALGVRGTQQNMVWVACVEDTTPGLQRLTNVG-AAVVGVATKR-SYSDGLVR 173
AG CT T ++ VW+ C E + PG+ +L N+ G A R +DG+V
Sbjct 715 TAGQTRCTKARTSNTVKSYVWMVCAEKSFPGIAQLNNIAEVGTRGKANSRMKNTDGIVN 773
> pfa:PFL0805w MAC/Perforin, putative
Length=1073
Score = 84.0 bits (206), Expect = 3e-16, Method: Composition-based stats.
Identities = 54/155 (34%), Positives = 76/155 (49%), Gaps = 7/155 (4%)
Query 1 PPGFAMCDPHERILFGFAFQVNFLDDGAIAERAEIKSCPAGRDKC--DGLERPAKEGDDA 58
PPG C IL GF+ +NF + ++ I C ++ C +G E K D
Sbjct 823 PPGLLTCPIGTTILMGFSINLNFYKNKYLSSTNGITLCEPMKESCSGNGFE---KNYSDI 879
Query 59 RIFALCGAETITGLEQVVVQSPL-KAVAVCPQGSLILTGFSLSLTGGREGPLRTGFFPCR 117
RIFALC + + QVV Q K A CP +IL GF+L G + +PCR
Sbjct 880 RIFALCTNKPFDFITQVVQQGEAPKISASCPGELVILFGFALMKGIGSSSANKIDIYPCR 939
Query 118 AGLPTCTA-LGVRGTQQNMVWVACVEDTTPGLQRL 151
G +C A L +Q+M+++ACV+ TT GL+ L
Sbjct 940 TGQNSCEAVLQNHKFKQSMIYLACVDKTTNGLEYL 974
Score = 37.4 bits (85), Expect = 0.028, Method: Composition-based stats.
Identities = 35/143 (24%), Positives = 53/143 (37%), Gaps = 19/143 (13%)
Query 1 PPGFAMCDPHERILFGFAFQVNFLDDGAIAERAEIKSCPAGRDKCDGLERPAKEGDDARI 60
P A C ILFGFA + A + +I C G++ C+ + + K + I
Sbjct 903 PKISASCPGELVILFGFALMKGI--GSSSANKIDIYPCRTGQNSCEAVLQNHK-FKQSMI 959
Query 61 FALCGAETITGLEQVVVQSPLKAVA---------------VCPQGSLILTGFSLSLTGGR 105
+ C +T GLE + S K + CPQ + ++ GFSL
Sbjct 960 YLACVDKTTNGLEYLQTYSKTKNLGDVISDKYKSDGYLNFSCPQNNTLVFGFSLEFHTNF 1019
Query 106 EGPLRTGFFPCRAGLPTCTALGV 128
+ R F C C G+
Sbjct 1020 QAT-RNNFLNCSKYTNICEISGI 1041
> tpv:TP01_0164 hypothetical protein
Length=1182
Score = 83.6 bits (205), Expect = 3e-16, Method: Composition-based stats.
Identities = 56/159 (35%), Positives = 77/159 (48%), Gaps = 16/159 (10%)
Query 3 GFAMCDPHERILFGFAFQVNFLDDGAIAERAE----IKSCPAGRDKCD-GLERPAKEGDD 57
G A+C I+ GF+ + L I + E I CP G +KC + P E
Sbjct 944 GSAICPNKTVIIMGFSLSI--LKKKNIVGKNEFTLHITQCPVGEEKCIVSSDNPMSE--- 998
Query 58 ARIFALCGAETITGLEQVV---VQSPLKAVAVCPQGSLILTGFSLSLTGGREGPLRTGFF 114
+RI+A+CG +TI L Q + P A A CP G I GF+LS+ G L T +
Sbjct 999 SRIWAVCGEDTIPLLNQQTSSEIDEP--ATATCPVGYSIAYGFALSVPKGNVA-LNTDSY 1055
Query 115 PCRAGLPTCTALGVRGTQQNMVWVACVEDTTPGLQRLTN 153
CR+G +CT T N VW+ACVE+ P L ++N
Sbjct 1056 ACRSGTQSCTHESTDKTATNAVWIACVENGAPQLSEISN 1094
Score = 40.8 bits (94), Expect = 0.002, Method: Composition-based stats.
Identities = 27/94 (28%), Positives = 38/94 (40%), Gaps = 8/94 (8%)
Query 66 AETITGLEQVVVQSPLKAVAVCPQGSLILTGFSLS------LTGGREGPLRTGFFPCRAG 119
A I Q+V A+CP ++I+ GFSLS + G E L C G
Sbjct 927 ASVIEKGHQIVYSGNKSGSAICPNKTVIIMGFSLSILKKKNIVGKNEFTLH--ITQCPVG 984
Query 120 LPTCTALGVRGTQQNMVWVACVEDTTPGLQRLTN 153
C ++ +W C EDT P L + T+
Sbjct 985 EEKCIVSSDNPMSESRIWAVCGEDTIPLLNQQTS 1018
> pfa:PFD0430c MAC/Perforin, putative; K13834 sporozoite microneme
protein 2
Length=842
Score = 73.2 bits (178), Expect = 5e-13, Method: Composition-based stats.
Identities = 52/176 (29%), Positives = 85/176 (48%), Gaps = 7/176 (3%)
Query 1 PPGFAMCDPHERI-LFGFAFQVNFLDDGAIAERAEIKSCPAGRDKCDGLERPAKEGDDAR 59
PP A C PH ++ +FGF+ + NF D+ + I+ C AG + C + + + D +
Sbjct 600 PPINAQC-PHGKVVMFGFSLKQNFWDNTNALKGYNIEVCEAGSNSCTSKQGSSNKYDTSY 658
Query 60 IFALCGAETITGLEQVVVQSPLKAVAV-CPQGSLILTGFSLSLTGGREGPLRTGF-FPCR 117
++ CG + + EQV+ +S V CP IL GF +S + GR + PC
Sbjct 659 LYMECGDQPLPFSEQVISESTSTYNTVKCPNDYSILLGFGISSSSGRINSAEYVYSTPCI 718
Query 118 AGLPTCTALGVRGTQQNMVWVACVEDTT-PGLQRLTNVG--AAVVGVATKRSYSDG 170
G+ +C+ Q++ ++V CV+ T G+ L+ V A V + YSDG
Sbjct 719 PGMKSCSLNMNNDNQKSYIYVLCVDTTIWSGVNNLSLVALDGAHGKVNRSKKYSDG 774
> bbo:BBOV_IV001370 21.m02755; MAC/perforin domain containing
protein; K13834 sporozoite microneme protein 2
Length=978
Score = 68.9 bits (167), Expect = 8e-12, Method: Composition-based stats.
Identities = 52/163 (31%), Positives = 78/163 (47%), Gaps = 25/163 (15%)
Query 3 GFAMCDPHERILFGFAFQVNFLDDGAIAERAEIKS----------CPAGRDKCDGLERPA 52
G A+C + I+ GF+ ++ + RA + S CP G++KC P
Sbjct 685 GSAVCPNGKVIMMGFSVVIS-------SSRATVFSKPQYTISMTPCPIGQEKCMVSVPPG 737
Query 53 KEGDDARIFALCGAETITGL-EQVVVQSPLKAVAVCPQGSLILTGFSLSL-TGGREGPLR 110
EG R++ LCG+E+I L ++ V + A A CP I GF LS+ G + P+
Sbjct 738 AEG---RVWILCGSESIPLLIQETNVANNEAATAQCPDEYAIAFGFGLSIPDGAKLTPVD 794
Query 111 TGFFPCRAGLPTCTALGVRGTQQNMVWVACVEDTTPGLQRLTN 153
+ CRAG +CT + + N VW+ACVE P L + N
Sbjct 795 C--YACRAGQQSCTQASPK-SPYNAVWIACVEKNAPELGSIVN 834
Score = 40.0 bits (92), Expect = 0.004, Method: Composition-based stats.
Identities = 25/76 (32%), Positives = 36/76 (47%), Gaps = 6/76 (7%)
Query 85 AVCPQGSLILTGFSLSLTGGR----EGPLRT-GFFPCRAGLPTCTALGVRGTQQNMVWVA 139
AVCP G +I+ GFS+ ++ R P T PC G C + V + VW+
Sbjct 687 AVCPNGKVIMMGFSVVISSSRATVFSKPQYTISMTPCPIGQEKCM-VSVPPGAEGRVWIL 745
Query 140 CVEDTTPGLQRLTNVG 155
C ++ P L + TNV
Sbjct 746 CGSESIPLLIQETNVA 761
> pfa:PFI1145w MAC/Perforin, putative
Length=821
Score = 50.8 bits (120), Expect = 2e-06, Method: Composition-based stats.
Identities = 36/142 (25%), Positives = 62/142 (43%), Gaps = 11/142 (7%)
Query 7 CDPHERILFGFAFQVNFLDDGAIAERAEIKSCPAGRDKCDGLERPAKEGDDAR--IFALC 64
C+ + L GF+ + +D + + + SC DKC +K D+A IFA+C
Sbjct 605 CEEKQNFLLGFSLSIP--NDLSNLKDFYLNSCDEDSDKC-----YSKMSDNAYSYIFAMC 657
Query 65 GAETITGLEQVVVQS-PLKAVAVCPQGSLILTGFSLSLTGGREGPLRTGFFPCRAGLPTC 123
E I EQ V L + + +IL GF +S+ + P+ +PC+ G +C
Sbjct 658 KEEMIPFFEQKVKSGVGLLTLECSEKNQVILFGFGISVLNTND-PISISLYPCKYGKASC 716
Query 124 TALGVRGTQQNMVWVACVEDTT 145
+ G +W+ C + +
Sbjct 717 SMQGSTDQSAVGLWIVCAHEES 738
> bbo:BBOV_II001970 18.m09950; mac/perforin domain containing
membrane protein
Length=559
Score = 42.7 bits (99), Expect = 6e-04, Method: Composition-based stats.
Identities = 36/152 (23%), Positives = 62/152 (40%), Gaps = 17/152 (11%)
Query 7 CDPHERILFGFAFQVNFLDDGAIAERA-EIKSCPAGRDKCDGLERPAKEGDDARIFALCG 65
C +++++FGF ++ I +R + CP C ER K D I+ LCG
Sbjct 399 CPHNDKVIFGFILEME------IEQRTFSVYQCPTDAYSCSK-ERQKK--CDIVIWMLCG 449
Query 66 AETITGLEQVVVQ---SPLKAVAVCPQGSLILTGFSLSLTGGREGPLRTGFFPCRAGLPT 122
+ + Q S + C G +LTGF + ++ L PC G
Sbjct 450 SSMGLNVMQYAHNFDASNSEKEVKCLSGYKLLTGFIAESSPEKDKSL-ANLIPCHTGADL 508
Query 123 CTALGVRGTQQNMVWVACVEDTTPGLQRLTNV 154
C + + + +W C+++ PGL R + +
Sbjct 509 CRS---NSSLETHIWAVCIDERLPGLGRTSTL 537
> pfa:PF08_0052 perforin like protein 5
Length=676
Score = 32.3 bits (72), Expect = 0.85, Method: Composition-based stats.
Identities = 33/141 (23%), Positives = 53/141 (37%), Gaps = 16/141 (11%)
Query 7 CDPHERILFGFAF--QVNFLDDGAIAERAEIKSCPAGRDKCDGLERPAKEGDDARIFA-- 62
C ++IL GF + +D I I CP+ G+ E D F+
Sbjct 455 CKNGDKILSGFILTNKKKSYEDNHI-----IHMCPSNTVCSSGIN---IESDKNFEFSWI 506
Query 63 LCGAETITGLEQVVVQSPLKA--VAVCPQGSLILTGFSLSLTGGREGPLRTGFFPCRAGL 120
LC E + + Q++ ++ + A CP I GF SLT + PC +
Sbjct 507 LCSKENRSEIHQILTKNTFQGNGKASCPYNMKI--GFGFSLTFQKSINTNIKIEPCESNK 564
Query 121 PTCTALGVRGTQQNMVWVACV 141
C + + Q W+ C+
Sbjct 565 KECKRTNLASSSQTYFWINCL 585
> mmu:20537 Slc5a1, Sglt1; solute carrier family 5 (sodium/glucose
cotransporter), member 1; K14158 solute carrier family
5 (sodium/glucose cotransporter), member 1
Length=665
Score = 30.8 bits (68), Expect = 2.7, Method: Composition-based stats.
Identities = 17/48 (35%), Positives = 24/48 (50%), Gaps = 0/48 (0%)
Query 60 IFALCGAETITGLEQVVVQSPLKAVAVCPQGSLILTGFSLSLTGGREG 107
+ A+ TITG V+ + A+ GS ILTGF+ + GG E
Sbjct 184 LLAITALYTITGGLAAVIYTDTLQTAIMLVGSFILTGFAFNEVGGYEA 231
> mmu:23801 Aloxe3, MGC143829, MGC143830, e-LOX-3; arachidonate
lipoxygenase 3
Length=711
Score = 30.4 bits (67), Expect = 3.0, Method: Composition-based stats.
Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 4/59 (6%)
Query 57 DARIFALCGAETITGLEQVVVQSPLKAVAVCPQGSLILTGFSLSLTGGREGPLRTGFFP 115
D I A I GL+Q V +PL + + PQG L+ LS T G E P+ F P
Sbjct 327 DYWILAEAPVHCINGLQQYVT-APLCLLWLNPQGVLLPLAIQLSQTPGPESPI---FLP 381
> hsa:59344 ALOXE3, E-LOX, MGC119694, MGC119695, MGC119696, eLOX3;
arachidonate lipoxygenase 3 (EC:1.13.11.-)
Length=843
Score = 30.0 bits (66), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 3/71 (4%)
Query 41 GRDKC--DGLERPAKEGDDARIFALCGAETITGLEQVVVQSPLKAVAVCPQGSLILTGFS 98
G+D C LER D I A + G +Q V +PL + + PQG+L+
Sbjct 441 GQDTCLQTELERGNIFLADYWILAEAPTHCLNGRQQYVA-APLCLLWLSPQGALVPLAIQ 499
Query 99 LSLTGGREGPL 109
LS T G + P+
Sbjct 500 LSQTPGPDSPI 510
> tgo:TGME49_032360 RNA polymerase Rpb1 C-terminal repeat domain-containing
protein / exonuclease domain-containing protein
(EC:3.1.13.4); K12571 PAB-dependent poly(A)-specific ribonuclease
subunit 2 [EC:3.1.13.4]
Length=2155
Score = 30.0 bits (66), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 28/59 (47%), Gaps = 1/59 (1%)
Query 26 DGAIAERAEIKSCPAGRDKCDGLERPAKEGDDARIFALCGAETITGLEQVVVQSPLKAV 84
DG + E A S + RD G ER +E +R + G ET G E+ +SPL A
Sbjct 1095 DGGLREEAAAVSA-SSRDTVTGEERSVREEKRSREQDIGGRETRNGEEKEQDESPLSAY 1152
> hsa:79009 DDX50, GU2, GUB, MGC3199, RH-II/GuB; DEAD (Asp-Glu-Ala-Asp)
box polypeptide 50 (EC:3.6.4.13); K13183 ATP-dependent
RNA helicase DDX50 [EC:3.6.4.13]
Length=737
Score = 28.9 bits (63), Expect = 9.9, Method: Composition-based stats.
Identities = 18/73 (24%), Positives = 34/73 (46%), Gaps = 6/73 (8%)
Query 44 KCDGLERPAKEGDDARIF------ALCGAETITGLEQVVVQSPLKAVAVCPQGSLILTGF 97
+C +RPA GD +++ A+ ET + ++ + +K A C G + +
Sbjct 368 QCHWSQRPAVIGDVLQVYSGSEGRAIIFCETKKNVTEMAMNPHIKQNAQCLHGDIAQSQR 427
Query 98 SLSLTGGREGPLR 110
++L G REG +
Sbjct 428 EITLKGFREGSFK 440
Lambda K H
0.322 0.139 0.433
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 4471152252
Database: egene_temp_file_orthology_annotation_similarity_blast_database_865
Posted date: Sep 17, 2011 11:19 AM
Number of letters in database: 82,071,388
Number of sequences in database: 164,496
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40