BLASTP 2.2.24 [Aug-08-2010]

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Eten_7773_orf1
         (108 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           14,777,732 sequences; 5,058,227,080 total letters

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gi|37699770|emb|CAE52295.1| surface antigen 12 [Eimeria tenella]      184   5e-45
gi|37699780|emb|CAE52300.1| surface antigen 9 [Eimeria tenella]       134   4e-30
gi|37699806|emb|CAE52313.1| surface antigen 10 [Eimeria tenella]      129   2e-28
gi|149980826|gb|ABR53732.1| surface antigen 10 [Eimeria tenella]      129   2e-28
gi|166034393|gb|ABY78897.1| surface antigen 10 [Eimeria tenella]      127   5e-28
gi|37699768|emb|CAE52294.1| surface antigen 7 [Eimeria tenella]       120   6e-26
gi|37699774|emb|CAE52297.1| surface antigen 6 [Eimeria tenella]       117   6e-25
gi|37699766|emb|CAE52293.1| surface antigen 5 [Eimeria tenella]       117   7e-25
gi|151303303|gb|ABR92920.1| surface antigen 7 [Eimeria tenella]       117   7e-25
gi|37699776|emb|CAE52298.1| surface antigen 8 [Eimeria tenella]       115   2e-24
gi|149389603|gb|ABR26258.1| merozoite surface antigen 5 [Eimeria...   115   2e-24
gi|37699778|emb|CAE52299.1| surface antigen 11 [Eimeria tenella]      114   4e-24
gi|37699772|emb|CAE52296.1| surface antigen 4 [Eimeria tenella]        48   5e-04
gi|37699808|emb|CAE52314.1| surface antigen 3 [Eimeria tenella]        47   0.001
gi|2507143|sp|P13399.2|TA4_EIMTE RecName: Full=Sporulated oocyst...    45   0.003
gi|84783094|gb|ABC61818.1| TA4 antigen protein [Eimeria tenella]       45   0.003
gi|158877|gb|AAA29075.1| TA4 antigen protein [Eimeria tenella]         45   0.003
gi|169793986|gb|ACA81533.1| NA4 antigen [Eimeria necatrix]             44   0.010
gi|37699782|emb|CAE52301.1| surface antigen 2 [Eimeria tenella]        42   0.038
gi|151303301|gb|ABR92919.1| surface antigen 2 [Eimeria tenella]        42   0.038
gi|336268487|ref|XP_003349008.1| hypothetical protein SMAC_09044...    38   0.47
gi|116278579|gb|ABJ94605.1| pol protein [Human immunodeficiency ...    35   3.3
gi|224111132|ref|XP_002315759.1| predicted protein [Populus tric...    35   3.6

>gi|37699770|emb|CAE52295.1| surface antigen 12 [Eimeria tenella]
          Length = 260

 Score =  184 bits (466),  Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 86/86 (100%), Positives = 86/86 (100%) Query: 23 MQVETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELY 82 MQVETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELY Sbjct: 93 MQVETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELY 152 Query: 83 NSQDNVSFVALYNPSANASADCRVVT 108 NSQDNVSFVALYNPSANASADCRVVT Sbjct: 153 NSQDNVSFVALYNPSANASADCRVVT 178 >gi|37699780|emb|CAE52300.1| surface antigen 9 [Eimeria tenella] Length = 259 Score = 134 bits (337), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 62/85 (72%), Positives = 70/85 (82%) Query: 24 QVETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELYN 83 + ET A TNPF GTYAFK LT EQ +CKETV YWKAA+KNFTGLPPS+ +AG LYN Sbjct: 93 KTETAAQSKSTNPFEAGTYAFKSLTAEQPNCKETVAYWKAAYKNFTGLPPSRKEAGTLYN 152 Query: 84 SQDNVSFVALYNPSANASADCRVVT 108 QDNVSFVA+YNPS+NA+ADCRVVT Sbjct: 153 KQDNVSFVAVYNPSSNATADCRVVT 177 >gi|37699806|emb|CAE52313.1| surface antigen 10 [Eimeria tenella] Length = 261 Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 58/84 (69%), Positives = 69/84 (82%) Query: 24 QVETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELYN 83 Q ET A + NPF GTYAFK LT EQ +CKET+DYWKAA++NFTGLPPSK + G LY+ Sbjct: 93 QTETAAKTSSANPFEKGTYAFKSLTAEQPNCKETIDYWKAAYENFTGLPPSKKEGGTLYD 152 Query: 84 SQDNVSFVALYNPSANASADCRVV 107 QDNVSFVA+YNPS++A+ADCRVV Sbjct: 153 DQDNVSFVAVYNPSSSATADCRVV 176 >gi|149980826|gb|ABR53732.1| surface antigen 10 [Eimeria tenella] Length = 261 Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 59/84 (70%), Positives = 69/84 (82%) Query: 24 QVETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELYN 83 Q ET A + NPF GTYAFK LT EQ +CKETVDYWKAA+KNFTGLPPS+ + G LY+ Sbjct: 93 QAETAAKTSSANPFEKGTYAFKSLTAEQPNCKETVDYWKAAYKNFTGLPPSRKEDGTLYD 152 Query: 84 SQDNVSFVALYNPSANASADCRVV 107 QDNVSFVA+YNPS++A+ADCRVV Sbjct: 153 DQDNVSFVAVYNPSSSATADCRVV 176 >gi|166034393|gb|ABY78897.1| surface antigen 10 [Eimeria tenella] Length = 261 Score = 127 bits (319), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 58/84 (69%), Positives = 68/84 (80%) Query: 24 QVETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELYN 83 Q ET A + NPF GTYAFK LT EQ +CKETVDYWKAA+KNFTGLPPS+ + G LY+ Sbjct: 93 QTETAAKTSSANPFEKGTYAFKSLTAEQPNCKETVDYWKAAYKNFTGLPPSRKEDGTLYD 152 Query: 84 SQDNVSFVALYNPSANASADCRVV 107 QDN SFVA+YNPS++A+ADCRVV Sbjct: 153 DQDNASFVAVYNPSSSATADCRVV 176 >gi|37699768|emb|CAE52294.1| surface antigen 7 [Eimeria tenella] Length = 253 Score = 120 bits (301), Expect = 6e-26, Method: Compositional matrix adjust. Identities = 56/85 (65%), Positives = 68/85 (80%) Query: 24 QVETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELYN 83 Q + +A TNPF GTYAFK LT + +CKETVD+WKAAF+NFTGLPPSK + LY Sbjct: 91 QKDPVAAAGATNPFQDGTYAFKSLTAAEPNCKETVDHWKAAFENFTGLPPSKTEGANLYK 150 Query: 84 SQDNVSFVALYNPSANASADCRVVT 108 +QDNVSFVALYNPS++A+ADC+VVT Sbjct: 151 NQDNVSFVALYNPSSDATADCKVVT 175 >gi|37699774|emb|CAE52297.1| surface antigen 6 [Eimeria tenella] Length = 256 Score = 117 bits (292), Expect = 6e-25, Method: Compositional matrix adjust. 
Identities = 56/85 (65%), Positives = 68/85 (80%) Query: 24 QVETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELYN 83 Q E + + TNPF GTYAFK LT + +CKE V+YWKAAFKNFTGLPPS++ AG+LY Sbjct: 93 QKEPVEATSGTNPFEKGTYAFKSLTTAEPNCKEIVNYWKAAFKNFTGLPPSESQAGDLYK 152 Query: 84 SQDNVSFVALYNPSANASADCRVVT 108 S +NVSFVALYN S+NA+ADC+VVT Sbjct: 153 SYNNVSFVALYNTSSNATADCQVVT 177 >gi|37699766|emb|CAE52293.1| surface antigen 5 [Eimeria tenella] Length = 257 Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust. Identities = 55/84 (65%), Positives = 66/84 (78%) Query: 24 QVETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELYN 83 Q E + PF G YAFK LT + +CKE VDYWK+AFKNF+GLPPSK+ AG+LYN Sbjct: 94 QTEPAKAASAAKPFEQGIYAFKSLTTAEPNCKEAVDYWKSAFKNFSGLPPSKSQAGQLYN 153 Query: 84 SQDNVSFVALYNPSANASADCRVV 107 SQDNVSFVALYNPS++A+ADCRV+ Sbjct: 154 SQDNVSFVALYNPSSDATADCRVI 177 >gi|151303303|gb|ABR92920.1| surface antigen 7 [Eimeria tenella] Length = 253 Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust. Identities = 55/85 (64%), Positives = 67/85 (78%) Query: 24 QVETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELYN 83 Q + +A TNPF GTYAFK LT + +CKETVD+WKAAF+NFTGL PSK + LY Sbjct: 91 QKDPVAAAGATNPFQDGTYAFKSLTAAEPNCKETVDHWKAAFENFTGLRPSKTEGANLYK 150 Query: 84 SQDNVSFVALYNPSANASADCRVVT 108 +QDNVSFVALYNPS++A+ADC+VVT Sbjct: 151 NQDNVSFVALYNPSSDATADCKVVT 175 >gi|37699776|emb|CAE52298.1| surface antigen 8 [Eimeria tenella] Length = 265 Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust. 
Identities = 54/82 (65%), Positives = 64/82 (78%) Query: 26 ETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELYNSQ 85 +T+A + PF GTYAFK LT + CKETVDYWKAA+KNFTGLPP K D LY +Q Sbjct: 97 KTMAASSSLKPFKDGTYAFKSLTAAKPDCKETVDYWKAAYKNFTGLPPPKTDNETLYKNQ 156 Query: 86 DNVSFVALYNPSANASADCRVV 107 DNVSFV+LYNPS++A+ADCRVV Sbjct: 157 DNVSFVSLYNPSSSATADCRVV 178 >gi|149389603|gb|ABR26258.1| merozoite surface antigen 5 [Eimeria tenella] Length = 257 Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 54/84 (64%), Positives = 66/84 (78%) Query: 24 QVETLADQAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELYN 83 Q E + PF G YAFK LT + +CKE VDYWK+AFKNF+GLPPSK+ AG+LYN Sbjct: 94 QTEPAKAASAAKPFEQGIYAFKSLTTAEPNCKEAVDYWKSAFKNFSGLPPSKSQAGQLYN 153 Query: 84 SQDNVSFVALYNPSANASADCRVV 107 SQ+NVSFVALYNPS++A+ADCRV+ Sbjct: 154 SQENVSFVALYNPSSDATADCRVI 177 >gi|37699778|emb|CAE52299.1| surface antigen 11 [Eimeria tenella] Length = 263 Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 52/78 (66%), Positives = 61/78 (78%) Query: 31 QAPTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPSKNDAGELYNSQDNVSF 90 QA PF GTYAFK LT EQ C++TVDYWKAA+KNFTG+PP K++ +Y+ QDNVSF Sbjct: 98 QATAEPFKDGTYAFKSLTAEQPDCEKTVDYWKAAYKNFTGMPPPKSEDTSIYSKQDNVSF 157 Query: 91 VALYNPSANASADCRVVT 108 VALYNP A+ADCRVVT Sbjct: 158 VALYNPIPKATADCRVVT 175 >gi|37699772|emb|CAE52296.1| surface antigen 4 [Eimeria tenella] Length = 253 Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 4/68 (5%) Query: 40 GTYAFKKLTDEQHSCKETVDYWKAAFKNFTG-LPPSKNDAGE--LYNSQDNVSFVALYNP 96 GT+A+ + D + C V+YWK F F +PP +A + +YN + VSFVALYNP Sbjct: 101 GTFAYYPVADGKKDCNAAVEYWKGGFSLFKNEIPPEFTEANKTTVYNDRA-VSFVALYNP 159 Query: 97 SANASADC 104 + C Sbjct: 160 KPDPVVSC 167 >gi|37699808|emb|CAE52314.1| surface antigen 3 [Eimeria tenella] Length = 255 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 30/72 (41%), Positives = 39/72 (54%), Gaps = 7/72 (9%) Query: 40 GTYAFKKLTDEQHSCKETVDYWKAAFKNFTG-LPPSKNDAGE--LYNSQDNVSFVALYNP 96 GT+A+ + ++ CK V YWK F F LPP+ N + +Y Q SFVALYNP Sbjct: 98 GTFAYYR---GENDCKAAVQYWKDGFPLFKNELPPTYNALNDPKIYTDQAT-SFVALYNP 153 Query: 97 SANASADCRVVT 108 A+ A C VT Sbjct: 154 QASPVASCAFVT 165 >gi|2507143|sp|P13399.2|TA4_EIMTE RecName: Full=Sporulated oocyst TA4 antigen; AltName: Full=Major sporozoite surface antigen; Contains: RecName: Full=Sporulated oocyst TA4 antigen 17 kDa subunit; Contains: RecName: Full=Sporulated oocyst TA4 antigen 8 kDa subunit; Flags: Precursor gi|158873|gb|AAA29074.1| sporozoite surface antigen precursor [Eimeria tenella] gi|38602661|emb|CAE52292.2| surface antigen 1 [Eimeria tenella] Length = 253 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 4/68 (5%) Query: 40 GTYAFKKLTDEQHSCKETVDYWKAAFKNFTG-LPPSKNDAGE--LYNSQDNVSFVALYNP 96 G +A+ +TD + C + V+YWK F +PP+ + +YN + VSFVALYNP Sbjct: 103 GNFAYYPVTDGKKECSDAVEYWKGGLSQFNDTIPPTFQALNDPVVYNDR-AVSFVALYNP 161 Query: 97 SANASADC 104 + C Sbjct: 162 KTSPVVSC 169 >gi|84783094|gb|ABC61818.1| TA4 antigen protein [Eimeria tenella] Length = 229 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 4/68 (5%) Query: 40 GTYAFKKLTDEQHSCKETVDYWKAAFKNFTG-LPPSKNDAGE--LYNSQDNVSFVALYNP 96 G +A+ +TD + C + V+YWK F +PP+ + +YN + VSFVALYNP Sbjct: 79 GNFAYYPVTDGKKECSDAVEYWKGGLSQFNDTIPPTFQALNDPVVYNDR-AVSFVALYNP 137 Query: 97 SANASADC 104 + C Sbjct: 138 KTSPVVSC 145 >gi|158877|gb|AAA29075.1| TA4 antigen protein [Eimeria tenella] Length = 231 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 4/68 (5%) Query: 40 GTYAFKKLTDEQHSCKETVDYWKAAFKNFTG-LPPSKNDAGE--LYNSQDNVSFVALYNP 96 G +A+ +TD + C + V+YWK F +PP+ + +YN + VSFVALYNP Sbjct: 81 GNFAYYPVTDGKKECSDAVEYWKGGLSQFNDTIPPTFQALNDPVVYNDR-AVSFVALYNP 139 Query: 97 SANASADC 104 + C Sbjct: 140 KTSPVVSC 147 >gi|169793986|gb|ACA81533.1| NA4 antigen [Eimeria necatrix] Length = 248 Score = 43.5 bits (101), Expect = 0.010, Method: Compositional matrix adjust. Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 8/70 (11%) Query: 40 GTYAFKKLTDEQHSCKETVDYWKAAFKNFT-GLPPS----KNDAGELYNSQDNVSFVALY 94 G +A+ +TD + C + ++YWK F +PP+ N A +YN + VSFVALY Sbjct: 103 GNFAYYPVTDGKRECSDALEYWKGGLSQFNDKIPPTFQALNNPA--VYNDR-AVSFVALY 159 Query: 95 NPSANASADC 104 NP + C Sbjct: 160 NPKPSPVVSC 169 >gi|37699782|emb|CAE52301.1| surface antigen 2 [Eimeria tenella] Length = 270 Score = 41.6 bits (96), Expect = 0.038, Method: Compositional matrix adjust. Identities = 28/72 (38%), Positives = 38/72 (52%), Gaps = 7/72 (9%) Query: 40 GTYAFKKLTDEQHSCKETVDYWKAAFKNFTG-LPPS--KNDAGELYNSQDNVSFVALYNP 96 GT+A+ + CK V YWK F F LPP+ ++ +Y + VSFVALYNP Sbjct: 98 GTFAYY---PDGKDCKAAVQYWKEGFSLFKNELPPTYTASNTPAVYTDR-AVSFVALYNP 153 Query: 97 SANASADCRVVT 108 + A C +VT Sbjct: 154 QPSPLASCALVT 165 >gi|151303301|gb|ABR92919.1| surface antigen 2 [Eimeria tenella] Length = 270 Score = 41.6 bits (96), Expect = 0.038, Method: Compositional matrix adjust. 
Identities = 28/72 (38%), Positives = 38/72 (52%), Gaps = 7/72 (9%) Query: 40 GTYAFKKLTDEQHSCKETVDYWKAAFKNFTG-LPPS--KNDAGELYNSQDNVSFVALYNP 96 GT+A+ + CK V YWK F F LPP+ ++ +Y + VSFVALYNP Sbjct: 98 GTFAYY---PDGKDCKAAVQYWKEGFSLFKNELPPTYTASNTPAVYTDR-AVSFVALYNP 153 Query: 97 SANASADCRVVT 108 + A C +VT Sbjct: 154 QPSPLASCALVT 165 >gi|336268487|ref|XP_003349008.1| hypothetical protein SMAC_09044 [Sordaria macrospora k-hell] gi|289618561|emb|CBI54892.1| unnamed protein product [Sordaria macrospora] Length = 1350 Score = 38.1 bits (87), Expect = 0.47, Method: Composition-based stats. Identities = 26/88 (29%), Positives = 38/88 (43%), Gaps = 1/88 (1%) Query: 16 WLSFSSQMQVETLADQ-APTNPFSGGTYAFKKLTDEQHSCKETVDYWKAAFKNFTGLPPS 74 W F + + ++DQ +P NP+ G A KK T+E K+ + F ++ P Sbjct: 1203 WWEFKTPPDLYYVSDQNSPFNPYKGRREAMKKETEEAARAKKVAEEHSKKFWHYFKKRPD 1262 Query: 75 KNDAGELYNSQDNVSFVALYNPSANASA 102 DAG NS DN V P +A Sbjct: 1263 AMDAGCRENSPDNEGHVWCLAPGGGETA 1290 >gi|116278579|gb|ABJ94605.1| pol protein [Human immunodeficiency virus 1] Length = 499 Score = 35.0 bits (79), Expect = 3.3, Method: Composition-based stats. Identities = 32/116 (27%), Positives = 52/116 (44%), Gaps = 22/116 (18%) Query: 3 FPLTSSRALVLPGWLSFSSQMQVE-TLADQAPTNPFSGGTYAFKK--------LTDEQHS 53 +PLT + L + ++M+ E ++ P NP++ +A KK +TD + Sbjct: 123 WPLTKEKIKAL---IEICTEMEKEGKISKIGPENPYNTPVFAIKKKNSDXWRKITDFREL 179 Query: 54 CKETVDYWKA--AFKNFTGLPPSKN----DAGELYNS----QDNVSFVALYNPSAN 99 K T D+W+ + GLP SK+ D G+ Y S +D + A PS N Sbjct: 180 NKRTQDFWEVQLGIPHPAGLPKSKSVTVLDVGDAYFSIPLDEDFRKYTAFTIPSTN 235 >gi|224111132|ref|XP_002315759.1| predicted protein [Populus trichocarpa] gi|222864799|gb|EEF01930.1| predicted protein [Populus trichocarpa] Length = 198 Score = 35.0 bits (79), Expect = 3.6, Method: Compositional matrix adjust. 
 Identities = 26/90 (28%), Positives = 41/90 (45%), Gaps = 7/90 (7%)

Query: 1   YVFPLTSSRALVLPGWLSFSSQMQVETLADQAPTNPFSGGTYAFKKLTDEQHSC-----K 55
           + FPL SR + LP S SS + +++L +P P G A D+ H
Sbjct: 85  FGFPLDESRVIALPSACSLSSPVSLDSLCSGSPALPPLRGRTASMPGPDDHHPLAPSLPP 144

Query: 56  ETVDYWKAAFKNFTGLPPSKNDAGELYNSQ 85
           E+VD A+ + L P+ + + E +NS
Sbjct: 145 ESVDGSPAS--PVSPLAPASHSSAEKHNSN 172

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
  excluding environmental samples from WGS projects
    Posted date:  Jul 22, 2011  4:42 PM
  Number of letters in database: 5,058,227,080
  Number of sequences in database:  14,777,732

Lambda     K      H
   0.314    0.128    0.387

Gapped
Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 14777732
Number of Hits to DB: 1,105,743,910
Number of extensions: 40495743
Number of successful extensions: 92069
Number of sequences better than 10.0: 24
Number of HSP's gapped: 100294
Number of HSP's successfully gapped: 24
Length of query: 108
Length of database: 5,058,227,080
Length adjustment: 76
Effective length of query: 32
Effective length of database: 3,935,119,448
Effective search space: 125923822336
Effective search space used: 125923822336
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)
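
The footer statistics above are enough to re-derive the effective search space and, approximately, the bit score and E-value of the top hit. The following is a minimal sketch, assuming the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2 and E = search_space * 2^-S'). The function names are hypothetical and are not part of BLAST. Because the report prints rounded lambda and K values, and compositional matrix adjustment is applied per alignment, the results should only be expected to agree with the reported figures after rounding.

```python
import math

# Values copied from the report footer above.
QUERY_LEN = 108
DB_LETTERS = 5_058_227_080
DB_SEQS = 14_777_732
LENGTH_ADJUSTMENT = 76
GAPPED_LAMBDA = 0.267   # rounded as printed in the report
GAPPED_K = 0.0410       # rounded as printed in the report


def effective_search_space(query_len, db_letters, db_seqs, adjustment):
    """Effective query length times effective database length."""
    eff_query = query_len - adjustment
    eff_db = db_letters - db_seqs * adjustment
    return eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


space = effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJUSTMENT)
top_bits = bit_score(466, GAPPED_LAMBDA, GAPPED_K)  # raw score of the top hit
top_e = expect_value(top_bits, space)

print(f"effective search space: {space}")
print(f"top hit: {top_bits:.1f} bits, E = {top_e:.1e}")
```

The effective search space computed this way matches the footer exactly, which is a quick sanity check that the length adjustment of 76 was applied to both the query and every database sequence.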