BLASTP 2.2.24 [Aug-08-2010]

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Eten_2433_orf3
         (341 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           14,777,732 sequences; 5,058,227,080 total letters

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gi|325119637|emb|CBZ55190.1| conserved hypothetical protein [Neo...   462   e-128
gi|221487562|gb|EEE25794.1| conserved hypothetical protein [Toxo...   444   e-123
gi|237830375|ref|XP_002364485.1| hypothetical protein TGME49_112...   443   e-122
gi|83273805|ref|XP_729560.1| hypothetical protein [Plasmodium yo...   317   1e-84
gi|296005560|ref|XP_002809096.1| conserved Plasmodium protein, u...   316   4e-84
gi|70954151|ref|XP_746136.1| hypothetical protein [Plasmodium ch...   305   5e-81
gi|221057526|ref|XP_002261271.1| hypothetical protein, conserved...   304   1e-80
gi|156101413|ref|XP_001616400.1| hypothetical protein [Plasmodiu...   285   5e-75
gi|68076717|ref|XP_680278.1| hypothetical protein [Plasmodium be...   245   7e-63
gi|83616157|gb|ABC25603.1| anonymous antigen-1 [Babesia bovis]        229   6e-58
gi|156083505|ref|XP_001609236.1| hypothetical protein [Babesia b...   228   1e-57
gi|209882397|ref|XP_002142635.1| hypothetical protein [Cryptospo...   209   5e-52
gi|67605795|ref|XP_666706.1| hypothetical protein [Cryptosporidi...   194   1e-47
gi|32399022|emb|CAD98262.1| hypothetical predicted Armadillo/bet...   192   6e-47
gi|66475922|ref|XP_627777.1| hypothetical protein [Cryptosporidi...   192   6e-47
gi|84999792|ref|XP_954617.1| hypothetical protein [Theileria ann...   166   3e-39
gi|71031955|ref|XP_765619.1| hypothetical protein [Theileria par...   160   3e-37
gi|326432426|gb|EGD77996.1| hypothetical protein PTSG_12905 [Sal...    40   0.56
gi|254467462|ref|ZP_05080872.1| methyl-accepting chemotaxis sens...    39   1.5
gi|104781550|ref|YP_608048.1| hypothetical protein PSEEN2440 [Ps...    38   2.1
gi|121256|sp|P02231.1|GLBT_CHITH RecName: Full=Globin CTT-IIIA         38   2.7
gi|242007443|ref|XP_002424549.1| conserved hypothetical protein ...    37   3.6

>gi|325119637|emb|CBZ55190.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 2723

 Score = 462 bits (1188), Expect = e-128, Method: Compositional matrix adjust.
Identities = 210/310 (67%), Positives = 252/310 (81%), Gaps = 1/310 (0%) Query: 33 EELANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGVAAHFVSEQMG 92 +ELANAAYGGWY MGMDEVMI AI+QAVCACA E+HAKQLRLQR+ LG+AA+F SEQMG Sbjct: 2414 DELANAAYGGWYQMGMDEVMIDAILQAVCACATVETHAKQLRLQRVCLGLAAYFASEQMG 2473 Query: 93 QESFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKTRELVSAFKGS 152 S VGSG E V+ Q + F GE + QLSC+I+NSIAMTSGDM+D IKT EL++A K S Sbjct: 2474 TSSLVGSGIEQVLTQIMTNFAGEATTMQLSCVIINSIAMTSGDMFDEIKTPELLAALKSS 2533 Query: 153 ISKIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWNVDPYPNGVHD 212 + K+ KK EEKA+++ C TL+A S DP DAF+ TVTELD T+WNVDPYPNGVHD Sbjct: 2534 VGKMATKKAEEKALKESCAMTLEAATSGADPFDAFSKTVTELDFKFTEWNVDPYPNGVHD 2593 Query: 213 LPVHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGNETDWNNRVPIVRIRNV 272 LP +VKEALRKGG++KV++ KE+E+I+WRSSQDLN FEW +GN+TD+NNR+PIVRIRNV Sbjct: 2594 LPSNVKEALRKGGKMKVFLPGKESEEIKWRSSQDLNVFEWCMGNDTDFNNRIPIVRIRNV 2653 Query: 273 AKGLSHPALVKANKKS-RSVTSKICLCIFGPPNEDNPAGIELPLRAKSQKERDHLVEMLV 331 AKGL HPAL A KK R +T K LC+FGPPN+D P G+ELP++AK+QKERD VEM+V Sbjct: 2654 AKGLVHPALQAAAKKEPRKITPKFTLCLFGPPNDDFPNGVELPMKAKTQKERDSFVEMMV 2713 Query: 332 QWRDAATYNF 341 QWRDAATYNF Sbjct: 2714 QWRDAATYNF 2723 >gi|221487562|gb|EEE25794.1| conserved hypothetical protein [Toxoplasma gondii GT1] Length = 2705 Score = 444 bits (1142), Expect = e-123, Method: Compositional matrix adjust. 
Identities = 212/310 (68%), Positives = 250/310 (80%), Gaps = 1/310 (0%) Query: 33 EELANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGVAAHFVSEQMG 92 +ELANAAYGGWY MGMDEVMI AI+QAVCACA E+HAKQLRLQR+ LG+AA+F SEQMG Sbjct: 2396 DELANAAYGGWYQMGMDEVMIDAILQAVCACAAVEAHAKQLRLQRVCLGLAAYFASEQMG 2455 Query: 93 QESFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKTRELVSAFKGS 152 S VGSG E V+ Q + F GE + QLSC+I+NSIAMTSGDMY+ IKT L+SA K S Sbjct: 2456 TSSLVGSGIEQVLTQIMTNFAGEGTTMQLSCVIINSIAMTSGDMYEEIKTSALLSALKTS 2515 Query: 153 ISKIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWNVDPYPNGVHD 212 + K+ KKPEEKA+++ C TL+A S DP DAF+ TVTELD T+WNVDPYPNGVHD Sbjct: 2516 VGKMATKKPEEKALKETCAATLEAASSGEDPFDAFSKTVTELDFKFTEWNVDPYPNGVHD 2575 Query: 213 LPVHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGNETDWNNRVPIVRIRNV 272 LP +VKEALRKGG+LKV++ +KE E+IRWRSSQDLN FEW +GN+ D+NNR+PIVRIRNV Sbjct: 2576 LPSNVKEALRKGGKLKVFLPEKEKEEIRWRSSQDLNVFEWCMGNDQDYNNRIPIVRIRNV 2635 Query: 273 AKGLSHPALVKANKKS-RSVTSKICLCIFGPPNEDNPAGIELPLRAKSQKERDHLVEMLV 331 AKGL HPAL A KK R V +K +C+FGPPN+D P G+ELP+ AKSQKERD VEM+V Sbjct: 2636 AKGLVHPALKAAAKKEPRKVAAKFTMCLFGPPNDDFPEGVELPMVAKSQKERDAFVEMMV 2695 Query: 332 QWRDAATYNF 341 QWRDAATYNF Sbjct: 2696 QWRDAATYNF 2705 >gi|237830375|ref|XP_002364485.1| hypothetical protein TGME49_112630 [Toxoplasma gondii ME49] gi|211962149|gb|EEA97344.1| hypothetical protein TGME49_112630 [Toxoplasma gondii ME49] gi|221507356|gb|EEE32960.1| conserved hypothetical protein [Toxoplasma gondii VEG] Length = 2638 Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust. 
Identities = 212/310 (68%), Positives = 250/310 (80%), Gaps = 1/310 (0%) Query: 33 EELANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGVAAHFVSEQMG 92 +ELANAAYGGWY MGMDEVMI AI+QAVCACA E+HAKQLRLQR+ LG+AA+F SEQMG Sbjct: 2329 DELANAAYGGWYQMGMDEVMIDAILQAVCACAAVEAHAKQLRLQRVCLGLAAYFASEQMG 2388 Query: 93 QESFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKTRELVSAFKGS 152 S VGSG E V+ Q + F GE + QLSC+I+NSIAMTSGDMY+ IKT L+SA K S Sbjct: 2389 TSSLVGSGIEQVLTQIMTNFAGEGTTMQLSCVIINSIAMTSGDMYEEIKTSALLSALKTS 2448 Query: 153 ISKIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWNVDPYPNGVHD 212 + K+ KKPEEKA+++ C TL+A S DP DAF+ TVTELD T+WNVDPYPNGVHD Sbjct: 2449 VGKMATKKPEEKALKETCAATLEAASSGEDPFDAFSKTVTELDFKFTEWNVDPYPNGVHD 2508 Query: 213 LPVHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGNETDWNNRVPIVRIRNV 272 LP +VKEALRKGG+LKV++ +KE E+IRWRSSQDLN FEW +GN+ D+NNR+PIVRIRNV Sbjct: 2509 LPSNVKEALRKGGKLKVFLPEKEKEEIRWRSSQDLNVFEWCMGNDQDYNNRIPIVRIRNV 2568 Query: 273 AKGLSHPALVKANKKS-RSVTSKICLCIFGPPNEDNPAGIELPLRAKSQKERDHLVEMLV 331 AKGL HPAL A KK R V +K +C+FGPPN+D P G+ELP+ AKSQKERD VEM+V Sbjct: 2569 AKGLVHPALKAAAKKEPRKVAAKFTMCLFGPPNDDFPEGVELPMVAKSQKERDAFVEMMV 2628 Query: 332 QWRDAATYNF 341 QWRDAATYNF Sbjct: 2629 QWRDAATYNF 2638 >gi|83273805|ref|XP_729560.1| hypothetical protein [Plasmodium yoelii yoelii str. 17XNL] gi|23487693|gb|EAA21125.1| hypothetical protein [Plasmodium yoelii yoelii] Length = 2598 Score = 317 bits (813), Expect = 1e-84, Method: Compositional matrix adjust. 
Identities = 146/320 (45%), Positives = 215/320 (67%), Gaps = 2/320 (0%) Query: 23 GNIPDAEPDPEELANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGV 82 G + + + EEL NA GGWY++ MD+ MI I+Q+V C+ E+H KQLRLQ++SLG+ Sbjct: 2280 GVVNEYSEEDEELFNARLGGWYNISMDKEMIDVILQSVLTCSNDENHQKQLRLQKVSLGL 2339 Query: 83 AAHFVSEQMGQESFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKT 142 A+F ++G S SG +++ LN F G+ I QL I +++IA+ S ++YD T Sbjct: 2340 LAYFAYHRLGIISMTASGFDILAKNNLNHFGGDMVIMQLLAICIDNIAINSAEIYDMTIT 2399 Query: 143 RELVSAFKGSISKIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWN 202 R++V FK SISKI NKK + K + +KTL+A+GS+ DP+D F T+ D ++++++ Sbjct: 2400 RDIVKLFKSSISKIQNKK-DHKQIVQSIEKTLEAMGSDGDPLDTFKDTILTFDFSLSEFD 2458 Query: 203 VDPYPNGVHDLPVHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGNETDWNN 262 DPY NGVHDLP +VKEALR GG+ K+Y + +W++SQDL EW +G+ D Sbjct: 2459 KDPYVNGVHDLPQNVKEALRTGGQYKIYHKSDKRTLFKWKASQDLGTLEWTVGDNLDRIF 2518 Query: 263 RVPIVRIRNVAKGLSHPALVKANK-KSRSVTSKICLCIFGPPNEDNPAGIELPLRAKSQK 321 ++ +VRI+N++KGL HP L ANK + R V SK+ LC++GPP ED P G+ELP++ KS K Sbjct: 2519 KISVVRIKNISKGLVHPLLKAANKYEPRKVNSKVVLCVYGPPTEDFPEGLELPIKTKSNK 2578 Query: 322 ERDHLVEMLVQWRDAATYNF 341 ERD ++L+ WRDAA+YN+ Sbjct: 2579 ERDAFADLLILWRDAASYNY 2598 >gi|296005560|ref|XP_002809096.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum 3D7] gi|225632044|emb|CAX64377.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum 3D7] Length = 2584 Score = 316 bits (809), Expect = 4e-84, Method: Compositional matrix adjust. 
Identities = 144/308 (46%), Positives = 217/308 (70%), Gaps = 2/308 (0%) Query: 35 LANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGVAAHFVSEQMGQE 94 L NA GGWY++ MD+ MI +IIQAV CA +H KQLRLQ++SLG+ A+F ++G Sbjct: 2278 LYNARLGGWYNISMDKEMIDSIIQAVLTCAYDVNHQKQLRLQKVSLGLLAYFAYHRLGII 2337 Query: 95 SFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKTRELVSAFKGSIS 154 S SG + + + LN F G+ I QL I +++IAM S ++YD+ TR+++ FK ++S Sbjct: 2338 SMTASGFDSLTRELLNNFGGDAVIMQLLAICIDNIAMYSVEVYDTTITRDIIKCFKSALS 2397 Query: 155 KIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWNVDPYPNGVHDLP 214 K+ NKK E+K + + Q TL+A+ S DP++AF +T+ D N+++++ DPY NGVHDL Sbjct: 2398 KMNNKK-EDKQLWQKVQLTLEAMNSADDPLEAFKNTLLIFDFNLSEFDKDPYINGVHDLA 2456 Query: 215 VHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGNETDWNNRVPIVRIRNVAK 274 ++K+ LRKGG K+Y + +W++SQDLN EW IG++T+ ++ +VRI+N++K Sbjct: 2457 SNIKDCLRKGGHSKIYYQSDQRLLFKWKASQDLNTLEWTIGDDTERVFKISVVRIKNISK 2516 Query: 275 GLSHPALVKANKKS-RSVTSKICLCIFGPPNEDNPAGIELPLRAKSQKERDHLVEMLVQW 333 GLSHP L+ ANK+ R V++K+ LCI+GPP ED P G+ELP++ K+QKERD V+++V W Sbjct: 2517 GLSHPILISANKREPRKVSAKVTLCIYGPPTEDFPEGLELPIKTKTQKERDAFVDLIVLW 2576 Query: 334 RDAATYNF 341 RDAA+YN+ Sbjct: 2577 RDAASYNY 2584 >gi|70954151|ref|XP_746136.1| hypothetical protein [Plasmodium chabaudi chabaudi] gi|56526658|emb|CAH78411.1| hypothetical protein PC104863.00.0 [Plasmodium chabaudi chabaudi] Length = 501 Score = 305 bits (782), Expect = 5e-81, Method: Compositional matrix adjust. 
Identities = 140/308 (45%), Positives = 209/308 (67%), Gaps = 2/308 (0%) Query: 35 LANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGVAAHFVSEQMGQE 94 L NA GGWY++ MD+ MI AI+Q+V C+ E+H KQLRLQ++SLG+ A+F ++G Sbjct: 195 LFNARLGGWYNISMDKEMIDAILQSVLTCSNDENHQKQLRLQKVSLGLIAYFAYHRLGII 254 Query: 95 SFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKTRELVSAFKGSIS 154 S SG + + LN F G+ I QL I +++IA+ S ++YD TR+++ +FK S++ Sbjct: 255 SMTASGFDTLAKNNLNHFGGDIVIMQLLSICIDNIAINSAEIYDMTITRDIIKSFKTSLT 314 Query: 155 KIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWNVDPYPNGVHDLP 214 K+ NKK E K + +KTL+A+ S DP+D F T+ D ++++++ DPY NGVHDLP Sbjct: 315 KMQNKK-ENKQIIQSVEKTLEAMSSEGDPLDTFKDTLLTFDFSLSEFDKDPYVNGVHDLP 373 Query: 215 VHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGNETDWNNRVPIVRIRNVAK 274 ++KEALR GG+ K+Y ++ +W++SQDL EW IG+ D ++ +VRI+N++K Sbjct: 374 QNIKEALRTGGQYKIYHKSEKRTMFKWKASQDLGTLEWTIGDNADRIFKISVVRIKNISK 433 Query: 275 GLSHPALVKANK-KSRSVTSKICLCIFGPPNEDNPAGIELPLRAKSQKERDHLVEMLVQW 333 GL HP L ANK + R V SK+ LCI+GPP ED P G+ELP++ K+ KERD ++L+ W Sbjct: 434 GLVHPLLKAANKYEPRKVHSKVVLCIYGPPTEDFPEGLELPIKTKTNKERDAFADLLILW 493 Query: 334 RDAATYNF 341 RDAA+YN+ Sbjct: 494 RDAASYNY 501 >gi|221057526|ref|XP_002261271.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium knowlesi strain H] gi|194247276|emb|CAQ40676.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium knowlesi strain H] Length = 2609 Score = 304 bits (778), Expect = 1e-80, Method: Compositional matrix adjust. 
Identities = 139/308 (45%), Positives = 212/308 (68%), Gaps = 2/308 (0%) Query: 35 LANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGVAAHFVSEQMGQE 94 L NA GGWY++ MD+ MI IIQ+V CA +H KQLRLQ++SLG+ A+F +G Sbjct: 2303 LYNAKLGGWYNISMDKEMIDTIIQSVLTCANDNNHLKQLRLQKVSLGLLAYFAYHNLGII 2362 Query: 95 SFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKTRELVSAFKGSIS 154 S SG + + + LN F G+ I QL I +++IAM S ++YD +R+++ +FK S+S Sbjct: 2363 SMTASGFDTLTRENLNNFGGDGVIMQLLSICIDNIAMYSAEVYDMTISRDIIKSFKSSVS 2422 Query: 155 KIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWNVDPYPNGVHDLP 214 K+ NKK E+K + + + TL+A+ S DP+DAF +T+ D ++++++ DPY NGVHDL Sbjct: 2423 KMSNKK-EDKPIIQKVELTLEAMNSAEDPLDAFKNTILIFDFSLSEFDKDPYVNGVHDLS 2481 Query: 215 VHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGNETDWNNRVPIVRIRNVAK 274 ++K+ALRKGG K+Y + +W++SQDL EW +G++ + ++ +VRI+N++K Sbjct: 2482 SNIKDALRKGGVHKIYHNSDVRKPFKWKASQDLATLEWIVGDDAEHIFKISVVRIKNISK 2541 Query: 275 GLSHPALVKANKKS-RSVTSKICLCIFGPPNEDNPAGIELPLRAKSQKERDHLVEMLVQW 333 GL+HP L ANK+ R V +K+ +CI+GPP ED P G+ELP++ K+QKERD V++LV W Sbjct: 2542 GLTHPLLKGANKREPRKVNAKVTVCIYGPPTEDFPEGLELPIKTKTQKERDAFVDLLVLW 2601 Query: 334 RDAATYNF 341 RDAA+YN+ Sbjct: 2602 RDAASYNY 2609 >gi|156101413|ref|XP_001616400.1| hypothetical protein [Plasmodium vivax SaI-1] gi|148805274|gb|EDL46673.1| hypothetical protein, conserved [Plasmodium vivax] Length = 2577 Score = 285 bits (730), Expect = 5e-75, Method: Compositional matrix adjust. 
Identities = 136/308 (44%), Positives = 210/308 (68%), Gaps = 2/308 (0%) Query: 35 LANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGVAAHFVSEQMGQE 94 L NA GGWY++ MD+ MI IIQ+V CA +H KQLRLQ++SLG+ A+F ++G Sbjct: 2271 LFNAKLGGWYNISMDKEMIDTIIQSVLTCANDFNHQKQLRLQKVSLGLLAYFAYHRLGII 2330 Query: 95 SFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKTRELVSAFKGSIS 154 S SG + + + L+ F G+ I QL I +++IAM S ++YD +R+++ FK S+S Sbjct: 2331 SMTASGFDSLTREILSNFGGDVVIMQLLAICIDNIAMYSAEVYDMTVSRDIIKGFKSSVS 2390 Query: 155 KIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWNVDPYPNGVHDLP 214 K + K E+K + + + T++A+ S DP+DAF T+ D +++++ DPY NGVHDL Sbjct: 2391 K-MSSKKEDKQIVQKVELTVEAMNSAEDPLDAFKDTLLFFDFGLSEFDKDPYVNGVHDLS 2449 Query: 215 VHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGNETDWNNRVPIVRIRNVAK 274 ++K+ALRKGG K+Y + + +W++SQDL EW +G +++ ++ +VRI+N++K Sbjct: 2450 SNIKDALRKGGVTKIYHNSDKRKPFKWKASQDLGTLEWTVGEDSEHIFKISVVRIKNISK 2509 Query: 275 GLSHPALVKANKKS-RSVTSKICLCIFGPPNEDNPAGIELPLRAKSQKERDHLVEMLVQW 333 GL+HP L +NK+ R V +K+ LCI+GPP ED P G+ELP++AK+QKERD V++LV W Sbjct: 2510 GLAHPLLRASNKREPRKVNAKVTLCIYGPPTEDFPEGLELPIKAKTQKERDAFVDLLVLW 2569 Query: 334 RDAATYNF 341 RDAA+YN+ Sbjct: 2570 RDAASYNY 2577 >gi|68076717|ref|XP_680278.1| hypothetical protein [Plasmodium berghei strain ANKA] gi|56501189|emb|CAI00354.1| conserved hypothetical protein [Plasmodium berghei] Length = 2491 Score = 245 bits (625), Expect = 7e-63, Method: Compositional matrix adjust. 
Identities = 116/278 (41%), Positives = 178/278 (64%), Gaps = 7/278 (2%) Query: 23 GNIPDAEPDPEELANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGV 82 G + + + EEL NA GGWY++ MD+ MI I+Q+V C+ E+H KQLRLQ++SLG+ Sbjct: 2214 GVVNEYSEEDEELFNARLGGWYNISMDKEMIDVILQSVLTCSNDENHQKQLRLQKVSLGL 2273 Query: 83 AAHFVSEQMGQESFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKT 142 A+F ++G S SG +++ LN F G+ I QL I +++IAM S ++YD T Sbjct: 2274 LAYFAYHRLGIISMTASGFDILAKNNLNHFGGDMVIMQLLAICIDNIAMNSAEIYDMTIT 2333 Query: 143 RELVSAFKGSISKIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWN 202 R+++ FK SISKI NKK ++ V+ +KT++A+GS+ DP+DAF T+ D ++++++ Sbjct: 2334 RDIIKLFKSSISKIPNKKDNKQIVQ-SIEKTVEAMGSDGDPLDAFKDTILTFDFSLSEFD 2392 Query: 203 VDPYPNGVHDLPVHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGNETDWNN 262 DPY NGVHDLP ++KEALR GG+ K+Y + +W++SQDL EW +G D Sbjct: 2393 KDPYVNGVHDLPQNIKEALRTGGQYKIYHKSDKRTLFKWKASQDLGTLEWTVGENVDRIF 2452 Query: 263 RVPIVRIRNVAKGLSHPALVKANKKSRSVTSKICLCIF 300 ++ +VRI+N++KGL HP + S SK +CI+ Sbjct: 2453 KISVVRIKNISKGLVHPL------EKISFISKTSICIY 2484 >gi|83616157|gb|ABC25603.1| anonymous antigen-1 [Babesia bovis] Length = 1356 Score = 229 bits (583), Expect = 6e-58, Method: Compositional matrix adjust. 
Identities = 117/302 (38%), Positives = 175/302 (57%), Gaps = 6/302 (1%) Query: 42 GWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGVAAHFVSEQMGQESFVGSGG 101 W ++GM++ I ++ C G+E K +RLQ+I V +F+S+ +G E + + Sbjct: 1059 AWENIGMEKEDIENLLNITFVCGGNEQAQKMIRLQKIVFSVIGYFMSQGLGSEVLIMNNF 1118 Query: 102 ELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKTRELVSAFKGSISKIGNKKP 161 + L FPG + L +++ + ++ ++I T+E++ ++ S + NKKP Sbjct: 1119 SSLGHIYLTNFPGTVEMVVLMTVVLENTFTVPAEVRNNILTKEIMKKYRDVASSLPNKKP 1178 Query: 162 EEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWNVDPYPNGVHDLPVHVKEAL 221 E+KA+ + C + AL S+ + + + ++ WNVDPYP+G HDLP VK+ L Sbjct: 1179 EDKALYNRCHALVTALASSENKT---LESTGHFNFELSGWNVDPYPHGTHDLPEAVKQGL 1235 Query: 222 RKGGELKVYIGDKEAE-KIRWRSSQDLNAFEWNIGNETDWNNRVPIVRIRNVAKGLSHPA 280 R GG +K YI D IRWRSSQDLN EW E D+ R+ + RIRN+A+GL HP Sbjct: 1236 RTGGRVKGYIRDNPKRVGIRWRSSQDLNYLEWG-PEEEDYPYRIAVRRIRNIARGLRHPI 1294 Query: 281 LVKAN-KKSRSVTSKICLCIFGPPNEDNPAGIELPLRAKSQKERDHLVEMLVQWRDAATY 339 L AN K+ R VT+ C CI G ED P G LP++ K+ KERD +VE+LVQWR+AATY Sbjct: 1295 LEAANAKEPRKVTNNTCFCIMGSATEDFPDGFALPIKCKNIKERDAVVELLVQWREAATY 1354 Query: 340 NF 341 N+ Sbjct: 1355 NY 1356 >gi|156083505|ref|XP_001609236.1| hypothetical protein [Babesia bovis T2Bo] gi|154796487|gb|EDO05668.1| conserved hypothetical protein [Babesia bovis] Length = 2591 Score = 228 bits (581), Expect = 1e-57, Method: Compositional matrix adjust. 
Identities = 117/302 (38%), Positives = 175/302 (57%), Gaps = 6/302 (1%) Query: 42 GWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGVAAHFVSEQMGQESFVGSGG 101 W ++GM++ I ++ C G+E K +RLQ+I V +F+S+ +G E + + Sbjct: 2294 AWENIGMEKEDIENLLNITFVCGGNEQAQKMIRLQKIVFSVIGYFMSQGLGSEVLIMNNF 2353 Query: 102 ELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKTRELVSAFKGSISKIGNKKP 161 + L FPG + L +++ + ++ ++I T+E++ ++ S + NKKP Sbjct: 2354 SSLGHIYLTNFPGTVEMVVLMTVVLENTFTVPAEVRNNILTKEIMKKYRDVASSLPNKKP 2413 Query: 162 EEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWNVDPYPNGVHDLPVHVKEAL 221 E+KA+ + C + AL S+ + + + ++ WNVDPYP+G HDLP VK+ L Sbjct: 2414 EDKALYNRCHALVTALASSENKT---LESTGHFNFELSGWNVDPYPHGTHDLPEAVKQGL 2470 Query: 222 RKGGELKVYIGDKEAE-KIRWRSSQDLNAFEWNIGNETDWNNRVPIVRIRNVAKGLSHPA 280 R GG +K YI D IRWRSSQDLN EW E D+ R+ + RIRN+A+GL HP Sbjct: 2471 RTGGRVKGYIRDNPKRVGIRWRSSQDLNYLEWG-PEEEDYPYRIAVRRIRNIARGLRHPI 2529 Query: 281 LVKAN-KKSRSVTSKICLCIFGPPNEDNPAGIELPLRAKSQKERDHLVEMLVQWRDAATY 339 L AN K+ R VT+ C CI G ED P G LP++ K+ KERD +VE+LVQWR+AATY Sbjct: 2530 LEAANAKEPRKVTNNTCFCIMGSATEDFPDGFALPIKCKNIKERDAVVELLVQWREAATY 2589 Query: 340 NF 341 N+ Sbjct: 2590 NY 2591 >gi|209882397|ref|XP_002142635.1| hypothetical protein [Cryptosporidium muris RN66] gi|209558241|gb|EEA08286.1| hypothetical protein, conserved [Cryptosporidium muris RN66] Length = 2560 Score = 209 bits (532), Expect = 5e-52, Method: Compositional matrix adjust. 
Identities = 119/312 (38%), Positives = 168/312 (53%), Gaps = 10/312 (3%) Query: 30 PDPEELANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGVAAHFVSE 89 PDP A A G+ MD +I ++ SES+ + LRL R+ G+ +F+SE Sbjct: 2256 PDPSYSAPDAPRGYQVCQMDVNDCNGVIASINKAVQSESNERHLRLMRVGFGLMTYFLSE 2315 Query: 90 QMGQESFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKTRELVSAF 149 M ES S I + ++ F + I ++C I ++ + D+ + L + Sbjct: 2316 NMCIESVATSENIATITKVIDMFSSDSDIVVVACEIFTYLSKYAPDIVPGLFNANLQTVI 2375 Query: 150 KGSISKIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWNVDPYPNG 209 + S K+ K E+K L+ SN + A T D IT W+ +PYPNG Sbjct: 2376 EASAQKM---KGEQKNFVTNVSTALET--SNTSSLAILAPT---FDFAITHWDEEPYPNG 2427 Query: 210 VHDLPVHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIG-NETDWNNRVPIVR 268 V DLP +KE LR GG LK+ + K E+ +WR+SQDL EW +G ET++N +PIV+ Sbjct: 2428 VQDLPKEIKEMLRNGGRLKIVLDGKVREEFKWRASQDLYKLEWKVGAKETEYNQSLPIVK 2487 Query: 269 IRNVAKGLSHPALVKANK-KSRSVTSKICLCIFGPPNEDNPAGIELPLRAKSQKERDHLV 327 IRN+ KGL L AN + R VTS +C I GPP ED P GIEL L+AKS+ ERD L+ Sbjct: 2488 IRNIWKGLQSTILKAANMVEPRKVTSSVCFVIVGPPTEDQPQGIELSLKAKSKGERDTLI 2547 Query: 328 EMLVQWRDAATY 339 E LV WR+A++Y Sbjct: 2548 ENLVMWREASSY 2559 >gi|67605795|ref|XP_666706.1| hypothetical protein [Cryptosporidium hominis TU502] gi|54657754|gb|EAL36480.1| hypothetical protein Chro.60511 [Cryptosporidium hominis] Length = 1757 Score = 194 bits (494), Expect = 1e-47, Method: Compositional matrix adjust. 
Identities = 114/320 (35%), Positives = 167/320 (52%), Gaps = 10/320 (3%) Query: 23 GNIPDAEPDPEELANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGV 82 G + PD E A A G+ +D AI+++V E + + LRL R G+ Sbjct: 1446 GLLYSGSPDSEYSAADAPKGYQVCQLDVNDCNAIVKSVNHAIHKEENGRHLRLMRAGFGI 1505 Query: 83 AAHFVSEQMGQESFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKT 142 A+ +SE + ES S V+ + + F + L C V+ ++ + D+ I Sbjct: 1506 MAYLLSENLCIESIASSETVNVMSKVMTIFASDMDSTALICQYVSFLSKFALDLVPGIVN 1565 Query: 143 RELVSAFKGSISKIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWN 202 + +A + S SK + A +D A+ S A + E D +IT WN Sbjct: 1566 DDFRNALENSASK------AKGARKDFVTSVSTAIMSG--DYSALSVLCGEFDFDITHWN 1617 Query: 203 VDPYPNGVHDLPVHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGN-ETDWN 261 V+PYPNGV DLP K+ LR GG+LK+ + K ++ WR+SQDL EW IG + D+N Sbjct: 1618 VEPYPNGVQDLPKETKDFLRNGGKLKIVLDGKSRDEFTWRASQDLYKLEWKIGTKDNDFN 1677 Query: 262 NRVPIVRIRNVAKGLSHPALVKANK-KSRSVTSKICLCIFGPPNEDNPAGIELPLRAKSQ 320 N +PI +IRN+ KGL L AN + R +T C + GPP+ED P G+EL L+AKS+ Sbjct: 1678 NSLPIGKIRNIWKGLQSTVLKAANMVEPRKITGPTCFVVVGPPSEDQPQGMELSLKAKSK 1737 Query: 321 KERDHLVEMLVQWRDAATYN 340 ERD ++E V WR+AATY+ Sbjct: 1738 SERDGIIENFVMWREAATYH 1757 >gi|32399022|emb|CAD98262.1| hypothetical predicted Armadillo/beta-catenin-like repeat protein, unknown function [Cryptosporidium parvum] Length = 2564 Score = 192 bits (488), Expect = 6e-47, Method: Compositional matrix adjust. 
Identities = 112/320 (35%), Positives = 167/320 (52%), Gaps = 10/320 (3%) Query: 23 GNIPDAEPDPEELANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGV 82 G + PD E A A G+ +D AI+++V E + + LRL R G+ Sbjct: 2253 GLLYSGSPDSEYSAADAPKGYQVCQLDVNDCNAIVKSVNHAIHKEENGRHLRLMRAGFGI 2312 Query: 83 AAHFVSEQMGQESFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKT 142 A+ +SE + ES S V+ + + F + L C ++ ++ + D+ I Sbjct: 2313 MAYLLSENLCIESIANSETVNVMSKVMTIFASDMDSTALICQYISFLSKYALDLVPGIVN 2372 Query: 143 RELVSAFKGSISKIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWN 202 + +A + S SK + A +D A+ S A + E D +IT WN Sbjct: 2373 DDFRNALENSASK------AKGARKDFVTGVSTAIMSG--DYSALSVLCGEFDFDITHWN 2424 Query: 203 VDPYPNGVHDLPVHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGN-ETDWN 261 V+PYPNGV DLP K+ LR GG+LK+ + K ++ WR+SQDL EW +G + D+N Sbjct: 2425 VEPYPNGVQDLPKETKDFLRNGGKLKIVLDGKSRDEFTWRASQDLYKLEWKVGTKDNDFN 2484 Query: 262 NRVPIVRIRNVAKGLSHPALVKANK-KSRSVTSKICLCIFGPPNEDNPAGIELPLRAKSQ 320 N +PI +IRN+ KGL L AN + R +T C + GPP+ED P G+EL L+AKS+ Sbjct: 2485 NSLPIGKIRNIWKGLQSTVLKAANMVEPRKITGPTCFVVVGPPSEDQPQGMELSLKAKSK 2544 Query: 321 KERDHLVEMLVQWRDAATYN 340 ERD ++E V WR+AATY+ Sbjct: 2545 SERDGIIENFVMWREAATYH 2564 >gi|66475922|ref|XP_627777.1| hypothetical protein [Cryptosporidium parvum Iowa II] gi|46229316|gb|EAK90165.1| large protein with ARM repeats [Cryptosporidium parvum Iowa II] Length = 2558 Score = 192 bits (488), Expect = 6e-47, Method: Compositional matrix adjust. 
Identities = 112/320 (35%), Positives = 167/320 (52%), Gaps = 10/320 (3%) Query: 23 GNIPDAEPDPEELANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRISLGV 82 G + PD E A A G+ +D AI+++V E + + LRL R G+ Sbjct: 2247 GLLYSGSPDSEYSAADAPKGYQVCQLDVNDCNAIVKSVNHAIHKEENGRHLRLMRAGFGI 2306 Query: 83 AAHFVSEQMGQESFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKT 142 A+ +SE + ES S V+ + + F + L C ++ ++ + D+ I Sbjct: 2307 MAYLLSENLCIESIANSETVNVMSKVMTIFASDMDSTALICQYISFLSKYALDLVPGIVN 2366 Query: 143 RELVSAFKGSISKIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDLNITQWN 202 + +A + S SK + A +D A+ S A + E D +IT WN Sbjct: 2367 DDFRNALENSASK------AKGARKDFVTGVSTAIMSG--DYSALSVLCGEFDFDITHWN 2418 Query: 203 VDPYPNGVHDLPVHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGN-ETDWN 261 V+PYPNGV DLP K+ LR GG+LK+ + K ++ WR+SQDL EW +G + D+N Sbjct: 2419 VEPYPNGVQDLPKETKDFLRNGGKLKIVLDGKSRDEFTWRASQDLYKLEWKVGTKDNDFN 2478 Query: 262 NRVPIVRIRNVAKGLSHPALVKANK-KSRSVTSKICLCIFGPPNEDNPAGIELPLRAKSQ 320 N +PI +IRN+ KGL L AN + R +T C + GPP+ED P G+EL L+AKS+ Sbjct: 2479 NSLPIGKIRNIWKGLQSTVLKAANMVEPRKITGPTCFVVVGPPSEDQPQGMELSLKAKSK 2538 Query: 321 KERDHLVEMLVQWRDAATYN 340 ERD ++E V WR+AATY+ Sbjct: 2539 SERDGIIENFVMWREAATYH 2558 >gi|84999792|ref|XP_954617.1| hypothetical protein [Theileria annulata] gi|65305615|emb|CAI73940.1| hypothetical protein, conserved [Theileria annulata] Length = 2637 Score = 166 bits (421), Expect = 3e-39, Method: Compositional matrix adjust. 
Identities = 105/335 (31%), Positives = 172/335 (51%), Gaps = 36/335 (10%) Query: 20 DENGNIPDAEPDPEELANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRIS 79 DE+ + D E D GW ++GM I +I+ C A ES K RLQ Sbjct: 2325 DESTGLSDGESDY---------GWENIGMTATSIVEVIKFSCYVASLESCLKMSRLQSSV 2375 Query: 80 LGVAAHFVSEQMGQESFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSG---DM 136 + + +F+S + E +G LV+ +++F + L+ + ++++ + D+ Sbjct: 2376 VSLCVYFMSCGLCGEELAMNGFSLVLENFISSFC--LTAPNLALLAISALEASFNYPPDL 2433 Query: 137 YDSIKTRELVSAFKGSISKIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELDL 196 +SI T+ + + + +K+ + K K L+ + +N P S + + DL Sbjct: 2434 RNSILTKPIQKKLRDLTLVVTDKQSKAKLT-----KLLEHVSNNTSP-----SVIGKFDL 2483 Query: 197 NITQWNVDPYPNGVHDLPVHVKEALRKGGELKVYIGDKEAEKIR----------WRSSQD 246 +++WNVDPYPNGVHDLP +KE LR GG+ ++ +E K R WRSSQD Sbjct: 2484 GLSEWNVDPYPNGVHDLPESMKEMLRNGGKFQLITEGEEEFKRRLRRGKEFEYSWRSSQD 2543 Query: 247 LNAFEWNIGNETDWNNRVPIVRIRNVAKGLSHPALVKANKKS-RSVTSKICLCIFGPPNE 305 L EW + + N+V +R+RN+A+GL H LVKAN+K + V++ L + G E Sbjct: 2544 LLTLEW-MHDGLQEKNKVAFMRVRNIARGLKHDLLVKANQKDYKRVSNTNTLVLLGSSTE 2602 Query: 306 DNPAGIELPLRAKSQKERDHLVEMLVQWRDAATYN 340 + P G LP+ K+ ER+ + E +QWRDA+++N Sbjct: 2603 EFPQGFALPMVFKNNHEREAVAEAFIQWRDASSFN 2637 >gi|71031955|ref|XP_765619.1| hypothetical protein [Theileria parva strain Muguga] gi|68352576|gb|EAN33336.1| hypothetical protein, conserved [Theileria parva] Length = 2607 Score = 160 bits (404), Expect = 3e-37, Method: Compositional matrix adjust. 
Identities = 105/336 (31%), Positives = 164/336 (48%), Gaps = 38/336 (11%) Query: 20 DENGNIPDAEPDPEELANAAYGGWYHMGMDEVMIGAIIQAVCACAGSESHAKQLRLQRIS 79 DE+ + DAE D GW ++GM I +I+ C A ES K RLQ Sbjct: 2295 DESTGVSDAESDY---------GWENIGMTATTIVEVIKFSCYVASQESCLKMSRLQSSV 2345 Query: 80 LGVAAHFVSEQMGQESFVGSGGELVIMQALNTF----PGEHSIAQLSCIIVNSIAMTSGD 135 + + +F+S + E +G L++ ++ F P +A + + S D Sbjct: 2346 VSLCVYFMSCGLCGEELAMNGFSLILENFISNFCLTAPNLALLAVAA---LESSFNYPPD 2402 Query: 136 MYDSIKTRELVSAFKGSISKIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTELD 195 + SI T+ + + I +K+ + K K L+ +N P S + + D Sbjct: 2403 LRSSILTKPIQKKLRDLTLVITDKQTKAKLT-----KLLEHFSNNTQP-----SVIGKFD 2452 Query: 196 LNITQWNVDPYPNGVHDLPVHVKEALRKGGELKVYI-GDKEAEK---------IRWRSSQ 245 L +++WNVDPYPNGVHDLP +KE LR GG+ + GD+E ++ WR+SQ Sbjct: 2453 LGLSEWNVDPYPNGVHDLPESMKEMLRNGGKFHLVTEGDEEVKRRLKRGKEFEYSWRASQ 2512 Query: 246 DLNAFEWNIGNETDWNNRVPIVRIRNVAKGLSHPALVKANKKSRSVTSKI-CLCIFGPPN 304 DL EW + N++ +R+RN+A+GL H L KAN+K S + L + G Sbjct: 2513 DLLTLEWT-HDALQEKNKIAFMRVRNIARGLKHDLLAKANQKDYKRVSNVNTLVLLGSCT 2571 Query: 305 EDNPAGIELPLRAKSQKERDHLVEMLVQWRDAATYN 340 E+ P G LP+ K+ ER+ + E +QWRDA+++N Sbjct: 2572 EEFPQGFALPMVFKNNHEREAVAEAFIQWRDASSFN 2607 >gi|326432426|gb|EGD77996.1| hypothetical protein PTSG_12905 [Salpingoeca sp. ATCC 50818] Length = 538 Score = 40.0 bits (92), Expect = 0.56, Method: Compositional matrix adjust. Identities = 22/73 (30%), Positives = 38/73 (52%) Query: 63 CAGSESHAKQLRLQRISLGVAAHFVSEQMGQESFVGSGGELVIMQALNTFPGEHSIAQLS 122 CA E H + RL I + EQ G+ +F+G+ G ++ + + T P +I + + Sbjct: 131 CATMERHIGEDRLADIGCCTLYCLIEEQEGRLAFMGADGVALLGRVMQTHPYSRAIQEHA 190 Query: 123 CIIVNSIAMTSGD 135 C IV+++A T D Sbjct: 191 CWIVDALARTDKD 203 >gi|254467462|ref|ZP_05080872.1| methyl-accepting chemotaxis sensory transducer [Rhodobacterales bacterium Y4I] gi|206684463|gb|EDZ44946.1| methyl-accepting chemotaxis sensory transducer [Rhodobacterales bacterium Y4I] Length = 747 Score = 38.5 bits (88), Expect = 1.5, Method: Compositional matrix adjust. 
Identities = 31/98 (31%), Positives = 43/98 (43%), Gaps = 5/98 (5%) Query: 203 VDPYPNGVHDLPVHVKEALRKGGELKVYIGDKEAEKIRWRSSQDLNAFEWNIGNETDWNN 262 V P + D P + R+GG+ + + I DL F+W++ ETD + Sbjct: 275 VFPVLTALPDTPQILAAKAREGGQFEAVVSSLGERAIALVMPLDLPGFDWSLVLETDEHT 334 Query: 263 RVPIV-RIRNVAKGLSHPALVKANKKS----RSVTSKI 295 +V RIR A GL AL+ A S RSVT I Sbjct: 335 AFAVVERIRLTAAGLIGAALLAAVGVSWLAARSVTRPI 372 >gi|104781550|ref|YP_608048.1| hypothetical protein PSEEN2440 [Pseudomonas entomophila L48] gi|95110537|emb|CAK15245.1| conserved hypothetical protein; putative signal peptide [Pseudomonas entomophila L48] Length = 866 Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust. Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 3/66 (4%) Query: 188 ASTVTELDLNITQWNVDPYPN-GVHDLPVHVKEALRKGG--ELKVYIGDKEAEKIRWRSS 244 A+ V + L +T+WN +P+P H LP++V R G L Y ++A WR++ Sbjct: 561 AAQVVGMPLTVTEWNAEPFPTPDRHSLPLYVAATARHQGWDALMQYAYSQQALTEGWRTA 620 Query: 245 QDLNAF 250 + +A+ Sbjct: 621 DNWHAY 626 >gi|121256|sp|P02231.1|GLBT_CHITH RecName: Full=Globin CTT-IIIA Length = 151 Score = 37.7 bits (86), Expect = 2.7, Method: Compositional matrix adjust. Identities = 35/130 (26%), Positives = 55/130 (42%), Gaps = 16/130 (12%) Query: 94 ESFVGSGGELVIMQALNTFPGEHSIAQLSCIIVNSIAMTSGDMYDSIKTRELVSAFKGSI 153 E GSG E++ LN FPG + + + N +A G + ++++ +G I Sbjct: 22 EKIKGSGVEILYF-FLNKFPGNFPMFKK---LGNDLAAAKGTAEFKDQADKIIAFLQGVI 77 Query: 154 SKIGNKKPEEKAVRDECQKTLDALGSNLDPMDAFASTVTEL------DLNITQWNVDPYP 207 K+G+ KA+ ++ + A+G D D F +TEL NI WN Sbjct: 78 EKLGSDMGGAKALLNQLGTSHKAMGITKDQFDQFRQALTELLGNLGFGGNIGAWNA---- 133 Query: 208 NGVHDLPVHV 217 DL HV Sbjct: 134 --TVDLMFHV 141 >gi|242007443|ref|XP_002424549.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212507992|gb|EEB11811.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 2452 Score = 37.4 bits (85), Expect = 3.6, Method: Compositional matrix adjust. 
Identities = 26/115 (22%), Positives = 55/115 (47%), Gaps = 3/115 (2%) Query: 119 AQLSCIIVNSIAMTSGDMYDSIKTRELVSAFKGSISKIGNKKPEEKAVRDECQKTLDALG 178 AQLS + A+ S + + +EL ++ + + + EEK + ++ ++ L+ + Sbjct: 858 AQLSDTVKECTALKSALKEEKTRFKELAASLERQTAIAKERMLEEKKLAEKAREQLEIVN 917 Query: 179 SNLDPMDAFASTVTELDLNITQWNVDPYPNGVHDLPVHVKEALRKGGELKVYIGD 233 L+ + + EL L I + N P N + + V++ E+ +K EL+ + D Sbjct: 918 QELELKNK---KIEELSLKIRELNNLPTKNVIESVSVNLDESTKKNKELEKTVRD 969

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
  excluding environmental samples from WGS projects
    Posted date:  Jul 22, 2011  4:42 PM
  Number of letters in database: 5,058,227,080
  Number of sequences in database:  14,777,732

Lambda     K      H
   0.315    0.132    0.394

Gapped
Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 14777732
Number of Hits to DB: 3,291,444,602
Number of extensions: 131817872
Number of successful extensions: 314227
Number of sequences better than 10.0: 28
Number of HSP's gapped: 315601
Number of HSP's successfully gapped: 28
Length of query: 341
Length of database: 5,058,227,080
Length adjustment: 140
Effective length of query: 201
Effective length of database: 2,989,344,600
Effective search space: 600858264600
Effective search space used: 600858264600
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 81 (35.8 bits)
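
The statistics footer can be cross-checked against the scores reported for the hits. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2, and E = m*n*2^(-S')) and uses the rounded gapped Lambda and K printed in this report, so results can differ from the reported values by rounding, particularly since compositional matrix adjustment was applied. The function names are illustrative only.

```python
import math

# Values copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
QUERY_LENGTH = 341
DB_LETTERS = 5_058_227_080
DB_SEQUENCES = 14_777_732
LENGTH_ADJUSTMENT = 140


def bit_score(raw_score):
    """Convert a raw score S to bits: S' = (lambda * S - ln K) / ln 2."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def effective_lengths():
    """Return (effective query length, effective database length, search space).

    The length adjustment is subtracted once from the query and once per
    database sequence.
    """
    eff_query = QUERY_LENGTH - LENGTH_ADJUSTMENT
    eff_db = DB_LETTERS - DB_SEQUENCES * LENGTH_ADJUSTMENT
    return eff_query, eff_db, eff_query * eff_db


def expect_value(bits, search_space):
    """Approximate E-value from a bit score: E = search_space * 2 ** (-bits)."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_query, eff_db, space = effective_lengths()
    print("effective query length:", eff_query)
    print("effective database length:", eff_db)
    print("effective search space:", space)
    # Top hit: raw score 1188, reported as 462 bits.
    print("bits for raw score 1188:", round(bit_score(1188), 1))
    # Weakest listed hits: 40.0 bits reported with Expect = 0.56.
    print("approximate E for 40.0 bits:", round(expect_value(40.0, space), 2))
```

With the footer values, `effective_lengths()` reproduces the reported effective query length (201), effective database length (2,989,344,600) and effective search space (600858264600) exactly. The bit score and E-value functions only approximate the reported figures, because the printed Lambda, K and bit scores are rounded.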