bitscore colors: <40, 40-50 , 50-80, 80-200, >200

BLASTP 2.2.24+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics: Alejandro A. Schaffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.
Database: egene_temp_file_orthology_annotation_similarity_blast_database_866
164,496 sequences; 82,071,388 total letters
Query= Eten_0193_orf4
Length=252
Score E
Sequences producing significant alignments: (Bits) Value
tgo:TGME49_049670 cysteine proteinase, putative (EC:3.4.22.1);... 320 3e-87
dre:406645 ctsba, MGC55862, ctsb, id:ibd1201, wu:fa13g05, wu:f... 199 8e-51
dre:569298 ctsbb; capthepsin B, b; K01363 cathepsin B [EC:3.4.... 192 7e-49
xla:380102 cg10992; hypothetical protein MGC52983; K01363 cath... 189 7e-48
xla:379257 ctsb, MGC53360, apps, cpsb; cathepsin B (EC:3.4.22.... 189 1e-47
mmu:13030 Ctsb, CB; cathepsin B (EC:3.4.22.1); K01363 cathepsi... 185 1e-46
hsa:1508 CTSB, APPS, CPSB; cathepsin B (EC:3.4.22.1); K01363 c... 184 3e-46
cel:C25B8.3 cpr-6; Cysteine PRotease related family member (cp... 178 2e-44
cel:W07B8.5 cpr-5; Cysteine PRotease related family member (cp... 177 3e-44
cel:W07B8.4 hypothetical protein; K01363 cathepsin B [EC:3.4.2... 173 5e-43
cel:C52E4.1 cpr-1; Cysteine PRotease related family member (cp... 167 3e-41
cel:F36D3.9 cpr-2; Cysteine PRotease related family member (cp... 163 5e-40
cel:F57F5.1 hypothetical protein; K01363 cathepsin B [EC:3.4.2... 163 5e-40
cel:F44C4.3 cpr-4; Cysteine PRotease related family member (cp... 157 3e-38
cel:T10H4.12 cpr-3; Cysteine PRotease related family member (c... 155 1e-37
ath:AT1G02305 cathepsin B-like cysteine protease, putative 154 3e-37
cel:W07B8.1 hypothetical protein; K01363 cathepsin B [EC:3.4.2... 153 4e-37
ath:AT1G02300 cathepsin B-like cysteine protease, putative 150 4e-36
ath:AT4G01610 cathepsin B-like cysteine protease, putative; K0... 149 1e-35
cel:F32H5.1 hypothetical protein; K01363 cathepsin B [EC:3.4.2... 116 7e-26
mmu:94242 Tinagl1, 1110021J17Rik, AZ-1, AZ1, Arg1, Lcn7, TARP,... 102 2e-21
hsa:64129 TINAGL1, ARG1, LCN7, LIECG3, TINAGRP; tubulointersti... 99.0 1e-20
cel:Y65B4A.2 hypothetical protein 97.4 4e-20
cel:F26E4.3 hypothetical protein 95.1 2e-19
dre:562116 tinagl1, si:dkey-158b13.1; tubulointerstitial nephr... 95.1 2e-19
xla:380203 ctsc, MGC69126; cathepsin C (EC:3.4.14.1); K01275 c... 92.8 1e-18
hsa:27283 TINAG, TIN-AG; tubulointerstitial nephritis antigen 91.3 3e-18
mmu:26944 Tinag, AI452335, TIN-ag; tubulointerstitial nephriti... 90.1 7e-18
dre:368704 ctsc, cb912, ik:tdsubc_1h2, sb:cb146, wu:fb34g12, w... 86.7 7e-17
mmu:13032 Ctsc, AI047818, DPP1, DPPI; cathepsin C (EC:3.4.14.1... 78.6 2e-14
xla:100036949 ctsh; cathepsin H (EC:3.4.22.16) 73.2 8e-13
ath:AT5G60360 AALP; AALP (Arabidopsis aleurain-like protease);... 72.4 1e-12
cpv:cgd4_2110 preprocathepsin c precursor ; K01275 cathepsin C... 68.9 1e-11
mmu:13036 Ctsh, AL022844; cathepsin H (EC:3.4.22.16); K01366 c... 68.6 2e-11
hsa:1522 CTSZ, CTSX, FLJ17088; cathepsin Z (EC:3.4.18.1); K085... 67.4 5e-11
dre:100333521 Cathepsin Z-like 67.4 5e-11
cel:M04G12.2 cpz-2; CathePsin Z family member (cpz-2); K08568 ... 67.0 5e-11
ath:AT3G45310 cysteine proteinase, putative; K01366 cathepsin ... 65.9 1e-10
dre:450022 ctsz, wu:fj81f10, zgc:103420; cathepsin Z (EC:3.4.1... 65.5 2e-10
xla:494800 ctsz; cathepsin Z (EC:3.4.18.1); K08568 cathepsin X... 64.3 4e-10
tpv:TP03_0283 cysteine proteinase (EC:3.4.22.-); K01376 [EC:3... 63.5 7e-10
xla:432187 hypothetical protein MGC82409; K08568 cathepsin X [... 63.5 7e-10
xla:380516 ctss-a, MGC69026; cathepsin S (EC:3.4.22.27); K0136... 62.4 1e-09
dre:324818 ctsh, fc44c02, wu:fc44c02, zgc:85774; cathepsin H (... 62.0 2e-09
hsa:1512 CTSH, ACC-4, ACC-5, CPSB, DKFZp686B24257, MGC1519, mi... 61.6 3e-09
cel:F32B5.8 cpz-1; CathePsin Z family member (cpz-1); K08568 c... 61.2 4e-09
pfa:PFB0360c SERA-1; serine repeat antigen 1 (SERA-1) 60.8 4e-09
mmu:64138 Ctsz, AI787083, AU019819, CTSX, D2Wsu143e; cathepsin... 59.7 1e-08
pfa:PFB0330c SERA-7; serine repeat antigen 7 (SERA-7) 58.9 2e-08
xla:398927 hypothetical protein MGC68723; K01368 cathepsin S [... 58.9 2e-08
> tgo:TGME49_049670 cysteine proteinase, putative (EC:3.4.22.1);
K01363 cathepsin B [EC:3.4.22.1]
Length=572
Score = 320 bits (820), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 144/232 (62%), Positives = 173/232 (74%), Gaps = 2/232 (0%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAFASTEA NDR CI S G+ LS QHTTSCC+ +HC SFGC+GGQP MAWRWF
Sbjct 305 SCWAFASTEAFNDRLCIRSQGKGLMPLSAQHTTSCCNAIHCASFGCNGGQPGMAWRWFER 364
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLP--KAPKCRKDCEEAEYTS 122
GVVTGGD++ L G +CWPYE+PFC HH++ P+P C+ L K PKCRKDCEE Y
Sbjct 365 KGVVTGGDFDALGKGTTCWPYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYAD 424
Query 123 KVKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPM 182
V PF D H ATSAYS+ RD +KR++M +G ++GAF+VYEDFL YK GVY HV+G+P+
Sbjct 425 NVHPFDQDTHKATSAYSLRSRDDVKRDMMTHGPVSGAFMVYEDFLSYKSGVYKHVSGLPV 484
Query 183 GGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGEAGIDKEFCGGE 234
GGHA+K+IG+G E+G +YW AVNSWN YWGD G FKI MG+ GID E GE
Sbjct 485 GGHAIKIIGWGTENGEEYWHAVNSWNTYWGDGGQFKIAMGQCGIDGEMVAGE 536
> dre:406645 ctsba, MGC55862, ctsb, id:ibd1201, wu:fa13g05, wu:fb34e12,
zgc:55862, zgc:65809, zgc:77181; cathepsin B, a; K01363
cathepsin B [EC:3.4.22.1]
Length=330
Score = 199 bits (506), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 109/234 (46%), Positives = 137/234 (58%), Gaps = 13/234 (5%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAF + EA++DR CI S + +S Q +CCD GC+GG P AW +++
Sbjct 106 SCWAFGAAEAISDRVCIHSDAKVSVEISSQDLLTCCD---SCGMGCNGGYPSAAWDFWAT 162
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
+G+VTGG YN H G C PY I C HH G P C G P C CE S
Sbjct 163 EGLVTGGLYNS-HIG--CRPYTIEPCEHHVNGSRPPCSGEGGDTPNCDMKCEPGYSPS-- 217
Query 125 KPFKDDLHFATSAYSV-EGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMG 183
+K D HF ++YSV ++ I EL +NG + GAF VYEDFLLYK GVY H++G P+G
Sbjct 218 --YKQDKHFGKTSYSVPSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGVYQHMSGSPVG 275
Query 184 GHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGE--AGIDKEFCGGEP 235
GHA+K++G+G E+G YWLA NSWN WGD G FKI GE GI+ E G P
Sbjct 276 GHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAGIP 329
> dre:569298 ctsbb; capthepsin B, b; K01363 cathepsin B [EC:3.4.22.1]
Length=326
Score = 192 bits (489), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 108/236 (45%), Positives = 135/236 (57%), Gaps = 14/236 (5%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAF + E+++DR CI S G+ +S + SCCD FGCSGG P AW ++
Sbjct 102 SCWAFGAVESISDRICIHSKGKQSPEISAEDLLSCCDQC---GFGCSGGFPAEAWDYWRR 158
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
G+VTGG YN + C PY I C HH G P C G PKC C +Y+
Sbjct 159 SGLVTGGLYN---SDVGCRPYSIAPCEHHVNGTRPPCSGE-QDTPKCTGVCI-PKYSV-- 211
Query 125 KPFKDDLHFATSAYSVEG-RDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMG 183
P+K D HF + Y+V + QI EL NG + AF VYEDF LYK GVY H+TG +G
Sbjct 212 -PYKQDKHFGSKVYNVPSDQQQIMTELYTNGPVEAAFTVYEDFPLYKSGVYQHLTGSALG 270
Query 184 GHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKV 237
GHAVK++G+G E+G +WL NSWN WGD G FKI G E GI+ E G PK+
Sbjct 271 GHAVKILGWGEENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIESEMVAGLPKL 326
> xla:380102 cg10992; hypothetical protein MGC52983; K01363 cathepsin
B [EC:3.4.22.1]
Length=333
Score = 189 bits (480), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 104/235 (44%), Positives = 138/235 (58%), Gaps = 12/235 (5%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAF + EA++DR C+ + G+ +S + SCC GC+GG P AWR+++
Sbjct 107 SCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCGFK--CGMGCNGGYPSGAWRFWTE 164
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
G+V+GG Y+ H G C PY IP C HH G P C+G PKC K CEE YT
Sbjct 165 TGLVSGGLYDS-HVG--CRPYSIPPCEHHVNGSRPSCKGEEGDTPKCMKTCEEG-YTPA- 219
Query 125 KPFKDDLHFATSAYSVEGRD-QIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMG 183
+ D HF ++Y V + +I ++ +NG + GAF+VY DF LYK GVY H TG +G
Sbjct 220 --YGSDKHFGATSYGVPSSEKEIMADIYKNGPVEGAFVVYADFPLYKSGVYQHETGEELG 277
Query 184 GHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGE--AGIDKEFCGGEPK 236
GHA+K++G+G E+G YWL NSWN WGD G FKI G+ GI+ E G PK
Sbjct 278 GHAIKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEVVAGIPK 332
> xla:379257 ctsb, MGC53360, apps, cpsb; cathepsin B (EC:3.4.22.1);
K01363 cathepsin B [EC:3.4.22.1]
Length=333
Score = 189 bits (479), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 104/235 (44%), Positives = 137/235 (58%), Gaps = 12/235 (5%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAF + EA++DR C+ + G+ +S + SCC GC+GG P AW++++
Sbjct 107 SCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCCG--DECGMGCNGGYPSGAWQFWTE 164
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
G+V+GG Y+ H G C PY IP C HH G P C+G PKC K CEE +
Sbjct 165 TGLVSGGLYDS-HVG--CRPYSIPPCEHHVNGSRPACKGEEGDTPKCVKQCEEGYSPA-- 219
Query 125 KPFKDDLHFATSAYSV-EGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMG 183
+ D HF T++Y V +I E+ +NG + GAFLVY DF LYK GVY H TG +G
Sbjct 220 --YGTDKHFGTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYADFPLYKSGVYQHETGEELG 277
Query 184 GHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGE--AGIDKEFCGGEPK 236
GHA+K++G+G E+G YWL NSWN WGD G FKI G+ GI+ E G PK
Sbjct 278 GHAIKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAGVPK 332
> mmu:13030 Ctsb, CB; cathepsin B (EC:3.4.22.1); K01363 cathepsin
B [EC:3.4.22.1]
Length=339
Score = 185 bits (469), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 104/236 (44%), Positives = 136/236 (57%), Gaps = 13/236 (5%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAF + EA++DR CI + GR +S + +CC + GC+GG P AW +++
Sbjct 107 SCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQ--CGDGCNGGYPSGAWSFWTK 164
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
G+V+GG YN H G C PY IP C HH G P C G P+C K CE S
Sbjct 165 KGLVSGGVYNS-HVG--CLPYTIPPCEHHVNGSRPPCTGE-GDTPRCNKSCEAGYSPS-- 218
Query 125 KPFKDDLHFATSAYSVEGR-DQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMG 183
+K+D HF ++YSV +I E+ +NG + GAF V+ DFL YK GVY H G MG
Sbjct 219 --YKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMG 276
Query 184 GHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGE--AGIDKEFCGGEPKV 237
GHA++++G+G E+G YWLA NSWN WGD G FKI GE GI+ E G P+
Sbjct 277 GHAIRILGWGVENGVPYWLAANSWNLDWGDNGFFKILRGENHCGIESEIVAGIPRT 332
> hsa:1508 CTSB, APPS, CPSB; cathepsin B (EC:3.4.22.1); K01363
cathepsin B [EC:3.4.22.1]
Length=339
Score = 184 bits (467), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 103/236 (43%), Positives = 136/236 (57%), Gaps = 13/236 (5%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAF + EA++DR CI + +S + +CC + GC+GG P AW +++
Sbjct 107 SCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSM--CGDGCNGGYPAEAWNFWTR 164
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
G+V+GG Y E H G C PY IP C HH G P C G PKC K CE +
Sbjct 165 KGLVSGGLY-ESHVG--CRPYSIPPCEHHVNGSRPPCTGE-GDTPKCSKICEPGYSPT-- 218
Query 125 KPFKDDLHFATSAYSVEGRDQ-IKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMG 183
+K D H+ ++YSV ++ I E+ +NG + GAF VY DFLLYK GVY HVTG MG
Sbjct 219 --YKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMG 276
Query 184 GHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGE--AGIDKEFCGGEPKV 237
GHA++++G+G E+G YWL NSWN WGD G FKI G+ GI+ E G P+
Sbjct 277 GHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRT 332
> cel:C25B8.3 cpr-6; Cysteine PRotease related family member (cpr-6)
Length=379
Score = 178 bits (451), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 106/241 (43%), Positives = 135/241 (56%), Gaps = 13/241 (5%)
Query 1 ATAXSGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWR 60
++ S WAF + EA++DR CI S G + LS SCC FGC+GG P AWR
Sbjct 128 SSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCC---KSCGFGCNGGDPLAAWR 184
Query 61 WFSNDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGP-YPKCEGPLPKAPKCRKDCEEAE 119
++ DG+VTG +Y C PY P C HHS+ + C L PKC K C ++
Sbjct 185 YWVKDGIVTGSNYT---ANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCV-SD 240
Query 120 YTSKVKPFKDDLHFATSAYSV-EGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVT 178
YT K + +D F SAY V + + I++ELM +G L AF VYEDFL Y GVY H
Sbjct 241 YTDKT--YSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTG 298
Query 179 GMPMGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPK 236
G GGHAVK+IG+G +DG YW NSWN WG+ G F+I G E GI+ GG PK
Sbjct 299 GKLGGGHAVKLIGWGIDDGIPYWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPK 358
Query 237 V 237
+
Sbjct 359 L 359
> cel:W07B8.5 cpr-5; Cysteine PRotease related family member (cpr-5);
K01363 cathepsin B [EC:3.4.22.1]
Length=344
Score = 177 bits (449), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 98/237 (41%), Positives = 128/237 (54%), Gaps = 9/237 (3%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAFA+ EA++DR CI S G LS + SCC + GC GG P AW+W+
Sbjct 109 SCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWVK 168
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEG-PYPKCEGPLPKAPKCRKDCEEAEYTSK 123
G+VTGG Y T C PY I C G +P C PKC C +
Sbjct 169 HGLVTGGSY---ETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKN--NY 223
Query 124 VKPFKDDLHFATSAYSVEGR-DQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPM 182
P+ D HF ++AY+V + +QI+ E++ NG + AF VYEDF Y GVY H G +
Sbjct 224 ATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASL 283
Query 183 GGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKV 237
GGHAVK++G+G ++G YWL NSWN WG+KG F+I G E GI+ G P +
Sbjct 284 GGHAVKILGWGVDNGTPYWLVANSWNVAWGEKGYFRIIRGLNECGIEHSAVAGIPDL 340
> cel:W07B8.4 hypothetical protein; K01363 cathepsin B [EC:3.4.22.1]
Length=335
Score = 173 bits (439), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 98/242 (40%), Positives = 134/242 (55%), Gaps = 10/242 (4%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WA A+ EA++DR CI S G LS + +CC GC GG P AWR++
Sbjct 100 SCWAVAAAEAISDRTCIASNGDVNTLLSAEDILTCCTGKFNCGDGCEGGYPIQAWRYWVK 159
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEG-PYPKCEGPLPKAPKCRKDCEEAEYTSK 123
+G+VTGG + + C PY I C +G +P+C + PKC C S
Sbjct 160 NGLVTGGSFESQY---GCKPYSIAPCGETIDGVTWPECPMKISDTPKCEHHCTGNN--SY 214
Query 124 VKPFKDDLHFATSAYSV-EGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPM 182
P+ D HF SAY++ QI+ E++ +G + F+VYEDF LYK G+Y HV G +
Sbjct 215 PIPYDQDKHFGASAYAIGRSAKQIQTEILAHGPVEVGFIVYEDFYLYKTGIYTHVAGGEL 274
Query 183 GGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKVPND 240
GGHAVK++G+G ++G YWLA NSWN WG+KG F+I G E GI+ G P + N
Sbjct 275 GGHAVKMLGWGVDNGTPYWLAANSWNTVWGEKGYFRILRGVDECGIESAAVAGMPDL-NR 333
Query 241 KN 242
+N
Sbjct 334 RN 335
> cel:C52E4.1 cpr-1; Cysteine PRotease related family member (cpr-1);
K01363 cathepsin B [EC:3.4.22.1]
Length=329
Score = 167 bits (423), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 99/240 (41%), Positives = 124/240 (51%), Gaps = 21/240 (8%)
Query 1 ATAXSGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWR 60
AT S WAF + E ++DR CI + G + +SP SCC GC GG P A R
Sbjct 108 ATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSCCG--SSCGNGCEGGYPIQALR 165
Query 61 WFSNDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEY 120
W+ + GVVTGGDY+ G C PY I C + P K P C C+
Sbjct 166 WWDSKGVVTGGDYH----GAGCKPYPIAPCTSGN--------CPESKTPSCSMSCQSGYS 213
Query 121 TSKVKPFKDDLHFATSAYSV-EGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTG 179
T+ K D HF SAY+V + I+ E+ NG + AF VYEDF YK GVY H G
Sbjct 214 TAYAK----DKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAG 269
Query 180 MPMGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKV 237
+GGHA+K+IG+G E G YWL NSW WG+ G FKI G + GI+ G+ KV
Sbjct 270 KYLGGHAIKIIGWGTESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAVVAGKAKV 329
> cel:F36D3.9 cpr-2; Cysteine PRotease related family member (cpr-2);
K01363 cathepsin B [EC:3.4.22.1]
Length=344
Score = 163 bits (413), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 94/235 (40%), Positives = 127/235 (54%), Gaps = 21/235 (8%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAF++ E ++DR CI S G + +SP +CC + GC GG P A++W++
Sbjct 128 SCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGM--SCGEGCDGGFPYRAFQWWAR 185
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
GVVTGGDY G C PY I C + C + P CR C+ T+
Sbjct 186 RGVVTGGDY----LGTGCKPYPIRPCNSDN------CVNL--QTPPCRLSCQPGYRTT-- 231
Query 125 KPFKDDLHFATSAYSV-EGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMG 183
+ +D ++ SAY V I+ ++ NG + AF+VYEDF YK G+Y H+ G G
Sbjct 232 --YTNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFIVYEDFEKYKSGIYRHIAGRSKG 289
Query 184 GHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPK 236
GHAVK+IG+G E G YWLAVNSW WG+ GTF+I G E GI+ G P+
Sbjct 290 GHAVKLIGWGTERGTPYWLAVNSWGSQWGESGTFRILRGVDECGIESRIVAGLPR 344
> cel:F57F5.1 hypothetical protein; K01363 cathepsin B [EC:3.4.22.1]
Length=400
Score = 163 bits (413), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 89/220 (40%), Positives = 124/220 (56%), Gaps = 11/220 (5%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WA ++ E ++DR CI S + ++S +CC ++ GC+GG P AWR +
Sbjct 173 SCWAVSAAETISDRICIASNAKTILSISADDINACCGMV--CGNGCNGGYPIEAWRHYVK 230
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGP-YPKCEGPLPKAPKCRKDCEEAEYTSK 123
G VTGG Y + TG C PY P C HH G Y C + KC + C+ +
Sbjct 231 KGYVTGGSYQD-KTG--CKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYALT- 286
Query 124 VKPFKDDLHFATSAYSVEGRD-QIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPM 182
++ DLHF SAY+V + +I++E+M +G + AF VYEDF Y GVY H G +
Sbjct 287 ---YQQDLHFGQSAYAVSKKAAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASL 343
Query 183 GGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMG 222
GGHAVK++G+G ++G YWL NSWNE WG+ G F+I G
Sbjct 344 GGHAVKMLGWGVDNGTPYWLCANSWNEDWGENGYFRIIRG 383
> cel:F44C4.3 cpr-4; Cysteine PRotease related family member (cpr-4);
K01363 cathepsin B [EC:3.4.22.1]
Length=335
Score = 157 bits (397), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 97/237 (40%), Positives = 127/237 (53%), Gaps = 13/237 (5%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAFA+ EA +DRFCI S G LS + SCC +C +GC GG P AW++
Sbjct 108 SCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCS--NC-GYGCEGGYPINAWKYLVK 164
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFC-RHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSK 123
G TGG Y E G C PY + C +P C P C C Y
Sbjct 165 SGFCTGGSY-EAQFG--CKPYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVA 221
Query 124 VKPFKDDLHFATSAYSVEGR-DQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPM 182
+ D HF ++AY+V + QI+ E++ +G + AF VYEDF YK GVY H TG +
Sbjct 222 ---YTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQEL 278
Query 183 GGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKV 237
GGHA++++G+G ++G YWL NSWN WG+ G F+I G E GI+ GG PKV
Sbjct 279 GGHAIRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPKV 335
> cel:T10H4.12 cpr-3; Cysteine PRotease related family member
(cpr-3); K01363 cathepsin B [EC:3.4.22.1]
Length=370
Score = 155 bits (392), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 91/242 (37%), Positives = 126/242 (52%), Gaps = 23/242 (9%)
Query 1 ATAXSGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWR 60
AT S WAF + E ++DR CI S G + +S + SCC +GC GG A R
Sbjct 115 ATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTT--CGYGCKGGYSIEALR 172
Query 61 WFSNDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEY 120
++++ G VTGGDY G C PY C + P P C+ C+ +
Sbjct 173 FWASSGAVTGGDYG----GHGCMPYSFAPCTKNC---------PESTTPSCKTTCQSS-- 217
Query 121 TSKVKPFKDDLHFATSAYSV---EGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHV 177
K + +K D H+ SAY V + +I+ E+ G + ++ VYEDF YK GVYH+
Sbjct 218 -YKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYT 276
Query 178 TGMPMGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEP 235
+G +GGHAVK+IG+G E+G DYWL NSW +G+KG FKI G E I+ G
Sbjct 277 SGKLVGGHAVKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAGIA 336
Query 236 KV 237
K+
Sbjct 337 KL 338
> ath:AT1G02305 cathepsin B-like cysteine protease, putative
Length=362
Score = 154 bits (389), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 101/244 (41%), Positives = 130/244 (53%), Gaps = 34/244 (13%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAF + E+L+DRFCI +LS +CC L GC+GG P AWR+F +
Sbjct 133 SCWAFGAVESLSDRFCI--KYNMNVSLSVNDLLACCGFL--CGQGCNGGYPIAAWRYFKH 188
Query 65 DGVVTGGDYNELHTGKSCWPY-EIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSK 123
GVVT + C PY + C H P CE P PKC + C S
Sbjct 189 HGVVT----------EECDPYFDNTGCSH------PGCEPAYP-TPKCARKC-----VSG 226
Query 124 VKPFKDDLHFATSAYSVEGR-DQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPM 182
+ +++ H+ SAY V D I E+ +NG + AF VYEDF YK GVY H+TG +
Sbjct 227 NQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNI 286
Query 183 GGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKVPN 239
GGHAVK+IG+G ++DG DYWL N WN WGD G FKI G E GI+ G +P+
Sbjct 287 GGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAG---LPS 343
Query 240 DKNA 243
D+N
Sbjct 344 DRNV 347
> cel:W07B8.1 hypothetical protein; K01363 cathepsin B [EC:3.4.22.1]
Length=335
Score = 153 bits (387), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 86/240 (35%), Positives = 120/240 (50%), Gaps = 15/240 (6%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
+ WAFA+ E+++DR CI SGG LS + SCC + GC GG P AW++
Sbjct 103 TSWAFAAAESMSDRLCINSGGFKNTILSAEELLSCCTGMFSCGEGCEGGNPFKAWQYIQK 162
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFC-RHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSK 123
G+ TGG Y C PY IP C + YP C P C K C TS+
Sbjct 163 HGIPTGGSYESQF---GCKPYSIPPCGKTVGNVTYPACTNTTSPTPSCEKKC-----TSR 214
Query 124 V---KPFKDDLHFATSAYSV-EGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTG 179
+ D H+ S + + +I+ ++M NG + F VY+DFL Y G+Y H+TG
Sbjct 215 IGYPIDIDKDRHYGVSVDQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTGIYVHLTG 274
Query 180 MPMGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKV 237
G +V++IG+G G YWL NSW WG+ GTF++ G E G++ G PK+
Sbjct 275 NKQGHLSVRIIGWGVWQGVPYWLCANSWGRQWGENGTFRVLRGTNECGLESNCVSGMPKL 334
> ath:AT1G02300 cathepsin B-like cysteine protease, putative
Length=379
Score = 150 bits (379), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 98/240 (40%), Positives = 126/240 (52%), Gaps = 31/240 (12%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAF + E+L+DRFCI +LS +CC LL FGC+GG P AW +F
Sbjct 150 SCWAFGAVESLSDRFCI--KYNLNVSLSANDVIACCGLL--CGFGCNGGFPMGAWLYFKY 205
Query 65 DGVVTGGDYNELHTGKSCWPY-EIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSK 123
GVVT + C PY + C H P CE P PKC + C S+
Sbjct 206 HGVVT----------QECDPYFDNTGCSH------PGCEPTYP-TPKCERKC-----VSR 243
Query 124 VKPFKDDLHFATSAYSVEGRDQ-IKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPM 182
+ + + H+ AY + Q I E+ +NG + AF VYEDF YK GVY ++TG +
Sbjct 244 NQLWGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKI 303
Query 183 GGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEPKVPN 239
GGHAVK+IG+G ++DG DYWL N WN WGD G FKI G E GI++ G P N
Sbjct 304 GGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSEKN 363
> ath:AT4G01610 cathepsin B-like cysteine protease, putative;
K01363 cathepsin B [EC:3.4.22.1]
Length=359
Score = 149 bits (375), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 104/259 (40%), Positives = 132/259 (50%), Gaps = 43/259 (16%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSF----GCSGGQPRMAWR 60
S WAF + E+L+DRFCI G + S DLL C F GC GG P AW+
Sbjct 130 SCWAFGAVESLSDRFCIQFG--------MNISLSVNDLLACCGFRCGDGCDGGYPIAAWQ 181
Query 61 WFSNDGVVTGGDYNELHTGKSCWPY-EIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAE 119
+FS GVVT + C PY + C H P CE P PKC + C
Sbjct 182 YFSYSGVVT----------EECDPYFDNTGCSH------PGCEPAYP-TPKCSRKC---- 220
Query 120 YTSKVKPFKDDLHFATSAYSVEGRDQ-IKRELMENGTLTGAFLVYEDFLLYKEGVYHHVT 178
S K + + H++ S Y+V+ Q I E+ +NG + +F VYEDF YK GVY H+T
Sbjct 221 -VSDNKLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHIT 279
Query 179 GMPMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEP 235
G +GGHAVK+IG+G + +G DYWL N WN WGD G F I G E GI+ E G P
Sbjct 280 GSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLP 339
Query 236 KVPN----DKNASLLPSAQ 250
N D ++ LP A
Sbjct 340 SSKNVFRVDTGSNDLPVAS 358
> cel:F32H5.1 hypothetical protein; K01363 cathepsin B [EC:3.4.22.1]
Length=356
Score = 116 bits (291), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 75/236 (31%), Positives = 109/236 (46%), Gaps = 17/236 (7%)
Query 9 FASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCL---SFGCSGGQPRMAWRWFSND 65
+ E +DR CI S G LS Q SCC L + +GC G P+ +W+
Sbjct 123 LVAVEIASDRTCIASNGTFNWPLSAQDPLSCCVGLMSICGDGWGCDGSWPKDILKWWQTH 182
Query 66 GVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVK 125
G+ TGG+YN+ C PY I C P P C + C TS +
Sbjct 183 GLCTGGNYNDQF---GCKPYSIYPCDKKYANGTTSVPCPGYHTPTCEEHC-----TSNIT 234
Query 126 ---PFKDDLHFATSAYSV-EGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMP 181
+K D HF + Y+V + I+ E+M NG + +F++Y+DF YK G+Y H G
Sbjct 235 WPIAYKQDKHFGKAHYNVGKKMTDIQIEIMTNGPVIASFIIYDDFWDYKTGIYVHTAGDQ 294
Query 182 MGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCGGEP 235
GG K+IG+G ++G YWL V+ W +G+ G + G E I+ + P
Sbjct 295 EGGMDTKIIGWGVDNGVPYWLCVHQWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 350
> mmu:94242 Tinagl1, 1110021J17Rik, AZ-1, AZ1, Arg1, Lcn7, TARP,
Tinagl; tubulointerstitial nephritis antigen-like 1
Length=466
Score = 102 bits (253), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/238 (33%), Positives = 112/238 (47%), Gaps = 33/238 (13%)
Query 7 WAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSNDG 66
WAF++ +DR I S G LSPQ+ SC D H GC GG+ AW + G
Sbjct 229 WAFSTAAVASDRVSIHSLGHMTPILSPQNLLSC-DTHH--QQGCRGGRLDGAWWFLRRRG 285
Query 67 VVTGGDYNELHTGKSCWPYEIPFCRHHSEG-PYPKCEGPLPKAPKCRKDCEEAEYTSKVK 125
VV+ +C+P+ R +E P P+C + ++ +V
Sbjct 286 VVS----------DNCYPFSG---REQNEASPTPRCMMHSRAMGRGKRQATSRCPNGQVD 332
Query 126 PFKDDLHFATSAYSV-EGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHV---TGMP 181
+D++ T AY + +I +ELMENG + V+EDF LY+ G+Y H G P
Sbjct 333 --SNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRP 390
Query 182 -----MGGHAVKVIGFGNE---DGR--DYWLAVNSWNEYWGDKGTFKIEMGEAGIDKE 229
G H+VK+ G+G E DGR YW A NSW +WG++G F+I G D E
Sbjct 391 EQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIE 448
> hsa:64129 TINAGL1, ARG1, LCN7, LIECG3, TINAGRP; tubulointerstitial
nephritis antigen-like 1
Length=436
Score = 99.0 bits (245), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 81/238 (34%), Positives = 108/238 (45%), Gaps = 33/238 (13%)
Query 7 WAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSNDG 66
WAF++ +DR I S G LSPQ+ SC D GC GG+ AW + G
Sbjct 199 WAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC-DTHQ--QQGCRGGRLDGAWWFLRRRG 255
Query 67 VVTGGDYNELHTGKSCWPYEIPFCRHHSE-GPYPKCEGPLPKAPKCRKDCEEAEYTSKVK 125
VV+ C+P+ R E GP P C + ++ S V
Sbjct 256 VVS----------DHCYPFS---GRERDEAGPAPPCMMHSRAMGRGKRQATAHCPNSYVN 302
Query 126 PFKDDLHFATSAYSVEGRD-QIKRELMENGTLTGAFLVYEDFLLYKEGVYHHV---TGMP 181
+D++ T Y + D +I +ELMENG + V+EDF LYK G+Y H G P
Sbjct 303 --NNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRP 360
Query 182 -----MGGHAVKVIGFGNE---DGR--DYWLAVNSWNEYWGDKGTFKIEMGEAGIDKE 229
G H+VK+ G+G E DGR YW A NSW WG++G F+I G D E
Sbjct 361 ERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIE 418
> cel:Y65B4A.2 hypothetical protein
Length=421
Score = 97.4 bits (241), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 116/272 (42%), Gaps = 66/272 (24%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S +A A+ +DR CI S G + LS + CC + C GG P A ++ N
Sbjct 165 SCFAVAAAGVASDRACIHSNGTFKSLLSEEDIIGCCSVCG----NCYGGDPLKALTYWVN 220
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
G+VTGG C PY P E + C K C+ Y K
Sbjct 221 QGLVTGGR-------DGCRPYSFDLSCGVPCSPATFFEAEEKRT--CMKRCQNIYYQQK- 270
Query 125 KPFKDDLHFATSAYSV-----------------------------------EGRDQIKRE 149
+++D HFAT AYS+ E RD IK+E
Sbjct 271 --YEEDKHFATFAYSMYPRSMTVSPDGKERVKVPTIIGHFNDKKTEKLNVTEYRDIIKKE 328
Query 150 LMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGG--------HAVKVIGFG-NEDGRDY 200
++ G T AF V E+FL Y GV+ P G H V++IG+G ++DG Y
Sbjct 329 ILLYGPTTMAFPVPEEFLHYSSGVFR---PYPTDGFDDRIVYWHVVRLIGWGESDDGTHY 385
Query 201 WLAVNSWNEYWGDKGTFKI---EMGEAGIDKE 229
WLAVNS+ +WGD G FKI +M + G++ E
Sbjct 386 WLAVNSFGNHWGDNGLFKINTDDMEKYGLEYE 417
> cel:F26E4.3 hypothetical protein
Length=452
Score = 95.1 bits (235), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/232 (30%), Positives = 99/232 (42%), Gaps = 37/232 (15%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S W+ ++T +DR I S GR LS Q SC GC GG AW +
Sbjct 209 SSWSVSTTAISSDRLAIISEGRINSTLSSQQLLSCNQHRQ---KGCEGGYLDRAWWYIRK 265
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
GVV G C+PY R PK + + +C +++
Sbjct 266 LGVV----------GDHCYPYVSGQSREPGHCLIPKRDYTNRQGLRCPSGSQDSTAFKMT 315
Query 125 KPFKDDLHFATSAYSVEGRDQ-IKRELMENGTLTGAFLVYEDFLLYKEGVYHH------- 176
P+K V R++ I+ ELM NG + F+V+EDF +Y GVY H
Sbjct 316 PPYK-----------VSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQK 364
Query 177 -VTGMPMGGHAVKVIGFGNEDGR----DYWLAVNSWNEYWGDKGTFKIEMGE 223
+ + G H+V+V+G+G + YWL NSW WG+ G FK+ GE
Sbjct 365 GASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRGE 416
> dre:562116 tinagl1, si:dkey-158b13.1; tubulointerstitial nephritis
antigen-like 1
Length=471
Score = 95.1 bits (235), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/239 (28%), Positives = 106/239 (44%), Gaps = 32/239 (13%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
+ WAF++ +DR I S G LSPQ+ SC D H GC+GG+ AW +
Sbjct 225 ASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISC-DTRH--QDGCAGGRIDGAWWFMRR 281
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
GVVT + C+P+ P S +C + R + +
Sbjct 282 RGVVT----------QDCYPFSPP---EQSAVEVARCM--MQSRAVGRGKRQATAHCPNS 326
Query 125 KPFKDDLHFATSAYSVE-GRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHV------ 177
+ +D++ +T Y + ++I +E+M+NG + V+EDF +YK G++ H
Sbjct 327 HSYHNDIYQSTPPYRLSTNENEIMKEIMDNGPVQAIMEVHEDFFVYKSGIFRHTDVNYHK 386
Query 178 --TGMPMGGHAVKVIGFGNEDG-----RDYWLAVNSWNEYWGDKGTFKIEMGEAGIDKE 229
H+V++ G+G E R YW+ NSW + WG+ G F+I G D E
Sbjct 387 PSQYRKHATHSVRITGWGEERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVNECDIE 445
> xla:380203 ctsc, MGC69126; cathepsin C (EC:3.4.14.1); K01275
cathepsin C [EC:3.4.14.1]
Length=458
Score = 92.8 bits (229), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 75/245 (30%), Positives = 107/245 (43%), Gaps = 52/245 (21%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRM-AWRWFS 63
S +AFAS L R I S + LSPQ SC + S GC GG P + A ++ +
Sbjct 252 SCYAFASMGMLESRIQIQSQLSQKPILSPQQVVSCSNY----SQGCDGGFPYLIAGKYLN 307
Query 64 NDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSK 123
+ G+V D+ PY + P KD + YT+
Sbjct 308 DFGIVEESDF-----------------------PYIGSDSPC-----TLKDSYQRYYTA- 338
Query 124 VKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGM--- 180
+ H+ Y +K EL+ G L+ AF VY+DF+ Y+ GVYHH TG+
Sbjct 339 ------EYHYVGGFYGGCNEAYMKLELVLGGPLSVAFEVYDDFIHYRSGVYHH-TGLQDK 391
Query 181 ----PMGGHAVKVIGFG--NEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKEFCG 232
+ HAV ++G+G + G YW+ NSW E WG+KG F+I G E I+
Sbjct 392 FNPFQLTNHAVLLVGYGTDQQTGEKYWIVKNSWGESWGEKGFFRIRRGSDECAIESIAVS 451
Query 233 GEPKV 237
P +
Sbjct 452 ANPII 456
> hsa:27283 TINAG, TIN-AG; tubulointerstitial nephritis antigen
Length=476
Score = 91.3 bits (225), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 70/246 (28%), Positives = 104/246 (42%), Gaps = 48/246 (19%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
+ WAF++ DR I S GR+ LSPQ+ SCC GC+ G AW +
Sbjct 242 ASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNR---HGCNSGSIDRAWWYLRK 298
Query 65 DGVVTGG------DYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEA 118
G+V+ D N + G + + H+ P C + E++
Sbjct 299 RGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKP-------------CPNNVEKS 345
Query 119 EYTSKVKPFKDDLHFATSAYSVEGRD-QIKRELMENGTLTGAFLVYEDFLLYKEGVYHHV 177
+ P Y V + +I +E+M+NG + V EDF YK G+Y HV
Sbjct 346 NRIYQCSP----------PYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIYRHV 395
Query 178 TGM--------PMGGHAVKVIGFGNEDG-----RDYWLAVNSWNEYWGDKGTFKIEMG-- 222
T + HAVK+ G+G G +W+A NSW + WG+ G F+I G
Sbjct 396 TSTNKESEKYRKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVN 455
Query 223 EAGIDK 228
E+ I+K
Sbjct 456 ESDIEK 461
> mmu:26944 Tinag, AI452335, TIN-ag; tubulointerstitial nephritis
antigen
Length=475
Score = 90.1 bits (222), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 69/240 (28%), Positives = 101/240 (42%), Gaps = 36/240 (15%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
+ WAF++ DR I S GR+ LSPQ+ SCC GC+ G AW +
Sbjct 241 ASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNR---HGCNSGSIDRAWWFLRK 297
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
G+V+ Y + I S+G G C E++ +
Sbjct 298 RGLVSHACYPLFKDQNT--TNNICAMASRSDG-----RGKRHATKPCPNSFEKSNRIYQC 350
Query 125 KPFKDDLHFATSAYSVEGRD-QIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGM--- 180
P Y V + +I RE+++NG + V+EDF YK G+Y HV
Sbjct 351 SP----------PYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEE 400
Query 181 -----PMGGHAVKVIGFGNEDG-----RDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDK 228
+ HAVK+ G+G G +W+A NSW + WG+ G F+I G E+ I+K
Sbjct 401 PEKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIEK 460
> dre:368704 ctsc, cb912, ik:tdsubc_1h2, sb:cb146, wu:fb34g12,
wu:fj58d01; cathepsin C (EC:3.4.14.1); K01275 cathepsin C [EC:3.4.14.1]
Length=455
Score = 86.7 bits (213), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 71/246 (28%), Positives = 103/246 (41%), Gaps = 51/246 (20%)
Query 1 ATAXSGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWR 60
A S ++FA+ L R I + + SPQ SC S GC GG P + +
Sbjct 246 AQCGSCYSFATMGMLEARVRIQTNNTQQPVFSPQQVVSCSQY----SQGCDGGFPYLIGK 301
Query 61 WFSNDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEY 120
+ + G+V + C+PY S+ P C P KC
Sbjct 302 YIQDFGIVE----------EDCFPYT------GSDSP---CNLP----AKC--------- 329
Query 121 TSKVKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGM 180
K + D H+ Y + EL++NG + A VY DF+ YKEG+YHH TG+
Sbjct 330 ---TKYYASDYHYVGGFYGGCSESAMMLELVKNGPMGVALEVYPDFMNYKEGIYHH-TGL 385
Query 181 -------PMGGHAVKVIGFG--NEDGRDYWLAVNSWNEYWGDKGTFKIEMG--EAGIDKE 229
+ HAV ++G+G ++ G YW+ NSW WG+ G F+I G E I+
Sbjct 386 RDANNPFELTNHAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRGTDECAIESI 445
Query 230 FCGGEP 235
P
Sbjct 446 AVAATP 451
> mmu:13032 Ctsc, AI047818, DPP1, DPPI; cathepsin C (EC:3.4.14.1);
K01275 cathepsin C [EC:3.4.14.1]
Length=462
Score = 78.6 bits (192), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 100/228 (43%), Gaps = 50/228 (21%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S ++FAS L R I + LSPQ SC GC GG P + ++
Sbjct 256 SCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSPYAQ----GCDGGFPYLIAGKYAQ 311
Query 65 D-GVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSK 123
D GVV +SC+PY + P K R++C
Sbjct 312 DFGVVE----------ESCFPYTAK-------------DSPC----KPRENC-------- 336
Query 124 VKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMP-- 181
++ + D ++ Y +K EL+++G + AF V++DFL Y G+YHH TG+
Sbjct 337 LRYYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHH-TGLSDP 395
Query 182 -----MGGHAVKVIGFGNE--DGRDYWLAVNSWNEYWGDKGTFKIEMG 222
+ HAV ++G+G + G +YW+ NSW WG+ G F+I G
Sbjct 396 FNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 443
> xla:100036949 ctsh; cathepsin H (EC:3.4.22.16)
Length=319
Score = 73.2 bits (178), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 60/235 (25%), Positives = 95/235 (40%), Gaps = 59/235 (25%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAFA+ + C+ G LS Q C L GC GG P +A + ++
Sbjct 125 SCWAFATVGVIEALHCL--AGNELSNLSEQQLVDC----DHLDSGCCGGFPVLAMDYIAH 178
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
G++ DY YE K C+ D + A +
Sbjct 179 RGIMKTEDY----------EYE-------------------AKQSTCQYDSDNAIRLN-- 207
Query 125 KPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGG 184
+ Y + + + + ++G +T F V EDF+ Y +G++ P
Sbjct 208 ---------VSKYYILPDEENMASSVAKDGPITVGFAVAEDFMFYSKGIFDGECA-PSPN 257
Query 185 HAVKVIGFGN-------EDGRDYWLAVNSWNEYWGDKGTFKIEMGEAGIDKEFCG 232
HA+ V+G+G +DG DYW+ NSW E+WG++G KI+ +K+ CG
Sbjct 258 HAIIVVGYGTLHCEDGEDDGEDYWIIKNSWGEHWGEEGFGKIQR-----NKDMCG 307
> ath:AT5G60360 AALP; AALP (Arabidopsis aleurain-like protease);
cysteine-type peptidase; K01366 cathepsin H [EC:3.4.22.16]
Length=357
Score = 72.4 bits (176), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 93/223 (41%), Gaps = 47/223 (21%)
Query 5 SGWAFASTEALNDRF-CIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFS 63
S W F++T AL + F G +LS Q C + ++GC+GG P A+ +
Sbjct 164 SCWTFSTTGALEAAYHQAFGKGI---SLSEQQLVDCAGAFN--NYGCNGGLPSQAFEYIK 218
Query 64 NDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSK 123
++G L T K+ +PY K C+ E
Sbjct 219 SNG--------GLDTEKA-YPYT-------------------GKDETCKFSAENVGVQV- 249
Query 124 VKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVY--HHVTGMP 181
+ ++ D++K + ++ AF V F LYK GVY H P
Sbjct 250 ---------LNSVNITLGAEDELKHAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTP 300
Query 182 MG-GHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGE 223
M HAV +G+G EDG YWL NSW WGDKG FK+EMG+
Sbjct 301 MDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKMEMGK 343
> cpv:cgd4_2110 preprocathepsin c precursor ; K01275 cathepsin
C [EC:3.4.14.1]
Length=635
Score = 68.9 bits (167), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 64/252 (25%), Positives = 97/252 (38%), Gaps = 49/252 (19%)
Query 22 FSGGRHRE---ALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSNDGVVTGGDYNELHT 78
+ G RE LSPQ SC GC GG P + R G+ +
Sbjct 379 LTNGASREEKILLSPQSVLSCSPFNQ----GCEGGYPFLVGRQAEEIGI----------S 424
Query 79 GKSCWPYEIPFCRHHSEGPY--PKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATS 136
+ C Y + + P+ P+ E R CEE E + + ++ +
Sbjct 425 SEKCMGYYADSNQECNFSPFITPEIED--------RIYCEEGE-----RMYAEEYGYVGG 471
Query 137 AYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHV---------------TGMP 181
Y D++K E+ +NG + A + L+Y GVY + G
Sbjct 472 CYGCCDEDRMKEEIFKNGPIAVAMHIDTSLLVYDNGVYDSIPNDHTKYCDLPNKQLNGWE 531
Query 182 MGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGE--AGIDKEFCGGEPKVPN 239
HA+ ++G+G E+G YW+ NSW WG KG KI G+ GI+ + +P
Sbjct 532 YTNHAIAIVGWGEENGIPYWIIRNSWGANWGKKGYAKIRRGKNIGGIENQAVFIDPDFTR 591
Query 240 DKNASLLPSAQD 251
SLL Q+
Sbjct 592 GMGLSLLNKYQN 603
> mmu:13036 Ctsh, AL022844; cathepsin H (EC:3.4.22.16); K01366
cathepsin H [EC:3.4.22.16]
Length=333
Score = 68.6 bits (166), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/238 (26%), Positives = 95/238 (39%), Gaps = 53/238 (22%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRW-FS 63
S W F++T AL I SG +L+ Q C + + GC GG P A+ +
Sbjct 138 SCWTFSTTGALESAVAIASG--KMLSLAEQQLVDCAQAFN--NHGCKGGLPSQAFEYILY 193
Query 64 NDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSK 123
N G++ Y PY + K CR + ++A
Sbjct 194 NKGIMEEDSY-----------------------PY------IGKDSSCRFNPQKA----- 219
Query 124 VKPFKDDLHFATSAYSVEGRDQ--IKRELMENGTLTGAFLVYEDFLLYKEGVYH----HV 177
+ F + ++ D+ + + ++ AF V EDFL+YK GVY H
Sbjct 220 -------VAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKSCHK 272
Query 178 TGMPMGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGEAGIDKEFCGGEP 235
T + HAV +G+G ++G YW+ NSW WG+ G F IE G+ C P
Sbjct 273 TPDKVN-HAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGLAACASYP 329
> hsa:1522 CTSZ, CTSX, FLJ17088; cathepsin Z (EC:3.4.18.1); K08568
cathepsin X [EC:3.4.18.1]
Length=303
Score = 67.4 bits (163), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 83/221 (37%), Gaps = 42/221 (19%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFG-CSGGQPRMAWRWFS 63
S WA AST A+ DR I R+ P S +++ C + G C GG W +
Sbjct 91 SCWAHASTSAMADRINI-----KRKGAWPSTLLSVQNVIDCGNAGSCEGGNDLSVWDYAH 145
Query 64 NDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRK-----DCEEA 118
G+ P C K +C K C E
Sbjct 146 QHGI-----------------------------PDETCNNYQAKDQECDKFNQCGTCNEF 176
Query 119 EYTSKVKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVT 178
+ ++ + L S+ GR+++ E+ NG ++ + E Y G+Y
Sbjct 177 KECHAIRNYT--LWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYAEYQ 234
Query 179 GMPMGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKI 219
H V V G+G DG +YW+ NSW E WG++G +I
Sbjct 235 DTTYINHVVSVAGWGISDGTEYWIVRNSWGEPWGERGWLRI 275
> dre:100333521 Cathepsin Z-like
Length=267
Score = 67.4 bits (163), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 85/220 (38%), Gaps = 39/220 (17%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFG-CSGGQPRMAWRWFS 63
S WA ST AL DR I R+ P S +++ C G C GG + + +
Sbjct 53 SCWAMGSTSALADRINI-----KRKGAWPSAYLSVQNVIDCGKAGSCFGGDHLGVYAYAN 107
Query 64 NDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCR--KDCEEAEYT 121
G+ P C + KC C +
Sbjct 108 EHGI-----------------------------PDETCNNYQARNQKCDPFNQCGTCSFF 138
Query 122 SKVKPFKDDLHFATSAY-SVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGM 180
K+ + Y + GRD++K E+ +NG ++ A + + Y GV+ +
Sbjct 139 GSCSIIKNYTVWKVGDYGDISGRDRMKAEIFKNGPISCAIMATKGLEAYDGGVFAEFHIL 198
Query 181 PMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKI 219
M H + V G+G EDG +YW+ NSW E+WG+ G +I
Sbjct 199 SMPNHIISVAGWGVTEDGTEYWIVRNSWGEFWGESGWARI 238
> cel:M04G12.2 cpz-2; CathePsin Z family member (cpz-2); K08568
cathepsin X [EC:3.4.18.1]
Length=467
Score = 67.0 bits (162), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 89/227 (39%), Gaps = 54/227 (23%)
Query 5 SGWAFASTEALNDRFCIFSGGR-HREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFS 63
S W F +T ALNDRF + GR LSPQ C C GG+
Sbjct 250 SCWVFGTTGALNDRFNVARKGRWPMTQLSPQEIIDCNG-----KGNCQGGEIGNVLEHAK 304
Query 64 NDGVV---------TGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKD 114
G+V T G+ N H SCWP E +++
Sbjct 305 IQGLVEEGCNVYRATNGECNPYHRCGSCWPNECFSLTNYTR------------------- 345
Query 115 CEEAEYTSKVKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLL-YKEGV 173
++ V+GRD+I E+ + G + A + F Y +GV
Sbjct 346 -----------------YYVKDYGQVQGRDKIMSEIKKGGPIACAIGATKKFEYEYVKGV 388
Query 174 YHHVTGMPMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKI 219
Y + + H + + G+G +E+G +YW+A NSW E WG+ G F++
Sbjct 389 YSEKSDLE-SNHIISLTGWGVDENGVEYWIARNSWGEAWGELGWFRV 434
> ath:AT3G45310 cysteine proteinase, putative; K01366 cathepsin
H [EC:3.4.22.16]
Length=357
Score = 65.9 bits (159), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 65/223 (29%), Positives = 94/223 (42%), Gaps = 47/223 (21%)
Query 5 SGWAFASTEALNDRF-CIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFS 63
S W F++T AL + F G +LS Q C + +FGC GG P A+ +
Sbjct 164 SCWTFSTTGALEAAYHQAFGKGI---SLSEQQLVDCAGTFN--NFGCHGGLPSQAFEYIK 218
Query 64 NDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSK 123
+G G D E + PY +G C+ +
Sbjct 219 YNG---GLDTEEAY-------------------PYTGKDG----------GCKFSAKNIG 246
Query 124 VKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVT--GMP 181
V+ +D ++ A D++K + ++ AF V +F YK+GV+ T P
Sbjct 247 VQ-VRDSVNITLGA-----EDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCGNTP 300
Query 182 MG-GHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGE 223
M HAV +G+G ED YWL NSW WGD G FK+EMG+
Sbjct 301 MDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGK 343
> dre:450022 ctsz, wu:fj81f10, zgc:103420; cathepsin Z (EC:3.4.18.1);
K08568 cathepsin X [EC:3.4.18.1]
Length=301
Score = 65.5 bits (158), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 88/217 (40%), Gaps = 33/217 (15%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFG-CSGGQPRMAWRWFS 63
S WA ST AL DR I R+A P S +++ C G CSGG W +
Sbjct 83 SCWAHGSTSALADRINI-----KRKAAWPSAYLSVQNVIDCGDAGSCSGGDHSGVWEYAH 137
Query 64 NDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSK 123
N G+ ++C Y+ + P+ +C C +
Sbjct 138 NKGI----------PDETCNNYQ---AKDQDCKPFNQC-----------GTCTTFGVCNI 173
Query 124 VKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMG 183
VK F L S G D++K E+ G ++ + + Y G+Y P
Sbjct 174 VKNFT--LWKVGDYGSASGLDKMKAEIYSGGPISCGIMATDKLDAYTGGLYSEYVQEPYI 231
Query 184 GHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKI 219
H V V G+G +E+G ++W+ NSW E WG+KG +I
Sbjct 232 NHIVSVAGWGVDENGVEFWVVRNSWGEPWGEKGWLRI 268
> xla:494800 ctsz; cathepsin Z (EC:3.4.18.1); K08568 cathepsin
X [EC:3.4.18.1]
Length=296
Score = 64.3 bits (155), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 57/220 (25%), Positives = 84/220 (38%), Gaps = 39/220 (17%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREA-LSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFS 63
S WA ST A+ DR I G A LS QH C + + C GG W + +
Sbjct 82 SCWAHGSTSAMADRINIKRNGVWPSAYLSVQHVIDCAN-----AGSCEGGDHGGVWEYAN 136
Query 64 NDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCR--KDCEEAEYT 121
+ G+ P C K KC C
Sbjct 137 SHGI-----------------------------PDETCNNYQAKDQKCDTFNQCGTCVTF 167
Query 122 SKVKPFKDDLHFATSAY-SVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGM 180
K + + + SV GR+++ E+ +NG ++ + E Y G+Y
Sbjct 168 GKCFNISNYTLWKVGDFGSVSGREKMMAEIYKNGPISCGIMATEKLDAYTGGLYAEFQPS 227
Query 181 PMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKI 219
M H V V G+G +E+G +YW+ NSW E WG++G +I
Sbjct 228 AMINHIVSVAGWGLDENGVEYWIVRNSWGEPWGERGWLRI 267
> tpv:TP03_0283 cysteine proteinase (EC:3.4.22.-); K01376 [EC:3.4.22.-]
Length=441
Score = 63.5 bits (153), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 61/230 (26%), Positives = 93/230 (40%), Gaps = 54/230 (23%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAF+S ++ + ++ + LS Q +C S GCSGG P A + +
Sbjct 251 SCWAFSSIGSVESLYRLYKNKSY--FLSEQELVNC----DKSSMGCSGGLPITAMEYIHS 304
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
G+ SE PY + P CR K
Sbjct 305 KGI-----------------------SFESEIPYIGIDAP------CRPSI-------KN 328
Query 125 KPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPMGG 184
K F D + ++G D + + L+ + T+ A + LY+ GVY G +
Sbjct 329 KVFVDSISI------LKGNDVVNKSLVISPTVV-AIAATRELKLYQGGVYTGKCGDAL-N 380
Query 185 HAVKVI--GFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGEAGIDKEFCG 232
HAV ++ G+ E G YW+ NSW E WG+ G ++E + G+DK CG
Sbjct 381 HAVLLVGEGYDEETGLRYWIIKNSWGEDWGENGFLRLERTKKGLDK--CG 428
> xla:432187 hypothetical protein MGC82409; K08568 cathepsin X
[EC:3.4.18.1]
Length=296
Score = 63.5 bits (153), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 86/221 (38%), Gaps = 41/221 (18%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREA-LSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFS 63
S WA ST A+ DR I G + LS QH C D C
Sbjct 82 SCWAHGSTSAMADRINIKRNGVWPSSYLSVQHVIDCADAGSC------------------ 123
Query 64 NDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEG-PYPKCEGPLPKAPKCRK--DCEEAEY 120
GGD+ + W Y HS G P C + KC K C
Sbjct 124 -----EGGDHGGV------WEYA------HSHGIPDETCNNYQARDQKCDKFNQCGTCVT 166
Query 121 TSKVKPFKDDLHFATSAY-SVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTG 179
K + + + SV GR+++ E+ +NG ++ + + Y G+Y
Sbjct 167 FGKCFNLSNYTLWKVGDFGSVSGREKMMAEIYKNGPISCGIMATDKLDAYTGGLYAEYQP 226
Query 180 MPMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKI 219
M H + V G+G +E+G +YW+ NSW E WG++G +I
Sbjct 227 RAMINHIISVAGWGLDENGVEYWIVRNSWGEPWGERGWLRI 267
> xla:380516 ctss-a, MGC69026; cathepsin S (EC:3.4.22.27); K01368
cathepsin S [EC:3.4.22.27]
Length=333
Score = 62.4 bits (150), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 60/223 (26%), Positives = 89/223 (39%), Gaps = 43/223 (19%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S WAF++ AL + + +G +LSPQ+ C + GCSGG A+++ +
Sbjct 141 SCWAFSAVGALEGQLMLKTG--KLVSLSPQNLVDCASKYG--NKGCSGGFMTSAFQYVID 196
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
+ + Y H YE+ KA C K YT V
Sbjct 197 NNGIDSDSYYPYHAMDEKCHYELA-----------------GKASSCVK------YTEIV 233
Query 125 KPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFL-VYEDFLLYKEGVYHHVTGMPMG 183
+D+L K+ L G ++ A F LYK GVY +
Sbjct 234 PGTEDNL---------------KQALGTIGPISVAIDGTRPTFFLYKSGVYSDPSCSQEV 278
Query 184 GHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGEAGI 226
H V IG+G +G+D+WL NSW Y+GDKG +I + +
Sbjct 279 NHGVLAIGYGTLNGQDFWLLKNSWGTYYGDKGFVRIARNKGNL 321
> dre:324818 ctsh, fc44c02, wu:fc44c02, zgc:85774; cathepsin H
(EC:3.4.22.16); K01366 cathepsin H [EC:3.4.22.16]
Length=330
Score = 62.0 bits (149), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 64/234 (27%), Positives = 93/234 (39%), Gaps = 56/234 (23%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFS- 63
S W F++T L I +G + L+ Q C + GC+GG P A+ +
Sbjct 135 SCWTFSTTGCLESVTAIATGKLLQ--LAEQQLIDCAGDFD--NHGCNGGLPSHAFEYIMY 190
Query 64 NDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSK 123
N G++T DY PY+ K +CR + A
Sbjct 191 NKGLMTEDDY----------PYQ-------------------AKGGQCRFKPQLA----- 216
Query 124 VKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVY-----HHVT 178
F ++ T + D + R ++ A+ V DF+ YK+G+Y H+ T
Sbjct 217 -AAFVKEVVNITKYDEMGMVDAVARL----NPVSFAYEVTSDFMHYKDGIYTSTECHNTT 271
Query 179 GMPMGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGEAGIDKEFCG 232
M HAV +G+ E+G YW+ NSW WG KG F IE G K CG
Sbjct 272 DMV--NHAVLAVGYAEENGTPYWIVKNSWGTNWGIKGYFYIERG-----KNMCG 318
> hsa:1512 CTSH, ACC-4, ACC-5, CPSB, DKFZp686B24257, MGC1519,
minichain; cathepsin H (EC:3.4.22.16); K01366 cathepsin H [EC:3.4.22.16]
Length=335
Score = 61.6 bits (148), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 90/234 (38%), Gaps = 45/234 (19%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRWFSN 64
S W F++T AL I +G +L+ Q C + + GC GG P A+ +
Sbjct 140 SCWTFSTTGALESAIAIATG--KMLSLAEQQLVDCAQDFN--NHGCQGGLPSQAFEYIL- 194
Query 65 DGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKV 124
YN+ G+ +PY+ G C+ KA KD
Sbjct 195 --------YNKGIMGEDTYPYQ---------GKDGYCKFQPGKAIGFVKD---------- 227
Query 125 KPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMPM-- 182
+ ++ + + + ++ AF V +DF++Y+ G+Y +
Sbjct 228 ----------VANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPD 277
Query 183 -GGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKIEMGEAGIDKEFCGGEP 235
HAV +G+G ++G YW+ NSW WG G F IE G+ C P
Sbjct 278 KVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYP 331
> cel:F32B5.8 cpz-1; CathePsin Z family member (cpz-1); K08568
cathepsin X [EC:3.4.18.1]
Length=306
Score = 61.2 bits (147), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 91/220 (41%), Gaps = 39/220 (17%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFG--CSGGQPRMAWRWF 62
S WAF +T AL DR I R+ PQ S +++ C G GG+P +++
Sbjct 94 SCWAFGATSALADRINI-----KRKNAWPQAYLSVQEVIDCSGAGTCVMGGEPGGVYKYA 148
Query 63 SNDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTS 122
G+ ++C Y+ R PY +C P
Sbjct 149 HEHGI----------PHETCNNYQ---ARDGKCDPYNRCGSCWP---------------G 180
Query 123 KVKPFKDDLHFATSAY-SVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVTGMP 181
+ K+ + S Y +V G +++K E+ G + + F Y G+Y VT
Sbjct 181 ECFSIKNYTLYKVSEYGTVHGYEKMKAEIYHKGPIACGIAATKAFETYAGGIYKEVTDED 240
Query 182 MGGHAVKVIGFG--NEDGRDYWLAVNSWNEYWGDKGTFKI 219
+ H + V G+G +E G +YW+ NSW E WG+ G FKI
Sbjct 241 ID-HIISVHGWGVDHESGVEYWIGRNSWGEPWGEHGWFKI 279
> pfa:PFB0360c SERA-1; serine repeat antigen 1 (SERA-1)
Length=994
Score = 60.8 bits (146), Expect = 4e-09, Method: Composition-based stats.
Identities = 30/83 (36%), Positives = 50/83 (60%), Gaps = 8/83 (9%)
Query 146 IKRELMENGTLTGAFLVYEDFLLYKEG--VYHHVTGMPMGGHAVKVIGFGN-----EDGR 198
IK E+M NG++ A++ E+ L Y+ ++ G HAV ++G+GN ++ +
Sbjct 673 IKDEIMNNGSVI-AYVKAENVLGYELNGKNVQNLCGDKTPDHAVNIVGYGNYINDEDEKK 731
Query 199 DYWLAVNSWNEYWGDKGTFKIEM 221
YW+ NSW +YWGD+G FK++M
Sbjct 732 SYWIVRNSWGKYWGDEGYFKVDM 754
> mmu:64138 Ctsz, AI787083, AU019819, CTSX, D2Wsu143e; cathepsin
Z (EC:3.4.18.1); K08568 cathepsin X [EC:3.4.18.1]
Length=306
Score = 59.7 bits (143), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/222 (24%), Positives = 83/222 (37%), Gaps = 43/222 (19%)
Query 5 SGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFG-CSGGQPRMAWRWFS 63
S WA ST A+ DR I R+ P S +++ C + G C GG W +
Sbjct 93 SCWAHGSTSAMADRINI-----KRKGAWPSILLSVQNVIDCGNAGSCEGGNDLPVWEYAH 147
Query 64 NDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCEGPLPKAPKCRK-----DCEEA 118
G+ P C K C K C E
Sbjct 148 KHGI-----------------------------PDETCNNYQAKDQDCDKFNQCGTCTEF 178
Query 119 EYTSKVKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVT 178
+ ++ + L S+ GR+++ E+ NG ++ + E Y G+Y
Sbjct 179 KECHTIQNYT--LWRVGDYGSLSGREKMMAEIYANGPISCGIMATEMMSNYTGGIYAEHQ 236
Query 179 GMPMGGHAVKVIGFG-NEDGRDYWLAVNSWNEYWGDKGTFKI 219
+ H + V G+G + DG +YW+ NSW E WG+KG +I
Sbjct 237 DQAVINHIISVAGWGVSNDGIEYWIVRNSWGEPWGEKGWMRI 278
> pfa:PFB0330c SERA-7; serine repeat antigen 7 (SERA-7)
Length=946
Score = 58.9 bits (141), Expect = 2e-08, Method: Composition-based stats.
Identities = 34/84 (40%), Positives = 50/84 (59%), Gaps = 10/84 (11%)
Query 146 IKRELMENGTLTGAFLVYE---DFLLYKEGVYHHVTGMPMGGHAVKVIGFGN---EDG-- 197
IKRE+ G++ A++ E DF +GV H++ G HA +IG+GN E+G
Sbjct 678 IKREIQNKGSVI-AYIKTENVIDFDFNGKGV-HNMCGDKEPDHAANIIGYGNYIDEEGEK 735
Query 198 RDYWLAVNSWNEYWGDKGTFKIEM 221
+ YWL NSW YWGD+G F+++M
Sbjct 736 KSYWLIRNSWGYYWGDEGNFRVDM 759
> xla:398927 hypothetical protein MGC68723; K01368 cathepsin S
[EC:3.4.22.27]
Length=333
Score = 58.9 bits (141), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 96/220 (43%), Gaps = 44/220 (20%)
Query 2 TAXSGWAFASTEALNDRFCIFSGGRHREALSPQHTTSCCDLLHCLSFGCSGGQPRMAWRW 61
+ S WAF+S A+ + G+ E+LS Q+ C + + GC GG ++R+
Sbjct 137 SCMSSWAFSSIGAMECQNMRKRTGK-LESLSVQNLLDCSQ--NYGNNGCKGGWAVSSFRY 193
Query 62 FSNDGVVTGGDYNELHTGKSCWPYEIPFCRHHSEGPYPKCE-GPLPKAPKCRKDCEEAEY 120
++G+ EL +S +PY+ G KC P+ KAP+C + Y
Sbjct 194 IIDNGI-------EL---ESIYPYQ---------GKDGKCSYTPVKKAPRC-TSYRQLPY 233
Query 121 TSKVKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLLYKEGVYHHVT-G 179
++ + + ++EG + F +YK GVY+ G
Sbjct 234 GNEATLKQVVGLMGPVSVAIEGSRKT-------------------FRMYKSGVYYDPNCG 274
Query 180 MPMGGHAVKVIGFGNEDGRDYWLAVNSWNEYWGDKGTFKI 219
H+V V+G+G EDG +YWL NSW +GD+G K+
Sbjct 275 GSTVDHSVLVVGYGAEDGVEYWLVKNSWGTSFGDEGYIKM 314
Lambda K H
0.319 0.138 0.460
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 9084709576
Database: egene_temp_file_orthology_annotation_similarity_blast_database_866
Posted date: Sep 17, 2011 2:57 PM
Number of letters in database: 82,071,388
Number of sequences in database: 164,496
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40