BLASTP 2.2.24 [Aug-08-2010] Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= Eten_1315_orf1 (228 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 14,777,732 sequences; 5,058,227,080 total letters
Score E Sequences producing significant alignments: (bits) Value gi|237838611|ref|XP_002368603.1| hypothetical protein TGME49_064... 140 1e-31 gi|221505521|gb|EEE31166.1| conserved hypothetical protein [Toxo... 140 1e-31 gi|325115414|emb|CBZ50969.1| conserved hypothetical protein [Neo... 137 1e-30 gi|299473303|emb|CBN77702.1| conserved unknown protein [Ectocarp... 39 0.84 gi|302844759|ref|XP_002953919.1| hypothetical protein VOLCADRAFT... 37 2.3 gi|159463700|ref|XP_001690080.1| importin beta [Chlamydomonas re... 37 2.3 gi|32476119|ref|NP_869113.1| hypothetical protein RB9867 [Rhodop... 37 3.2 gi|327539416|gb|EGF26032.1| hypothetical protein RBWH47_01227 [R... 36 4.1 gi|119714672|ref|YP_921637.1| methylthioribose kinase [Nocardioi... 36 4.7 >gi|237838611|ref|XP_002368603.1| hypothetical protein TGME49_064740 [Toxoplasma gondii ME49] gi|211966267|gb|EEB01463.1| hypothetical protein TGME49_064740 [Toxoplasma gondii ME49] Length = 878 Score = 140 bits (354), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 93/248 (37%), Positives = 127/248 (51%), Gaps = 26/248 (10%) Query: 2 KLALPATYMSTLTDVNVPKYAGKVGSQNLGVVDQLKDGIRLIDVRLWRDERVADPLSGWF 61 +LALPAT STL DVN P+Y +QN+ V QL DGIR DVR WR+ R W+ Sbjct: 89 ELALPATVHSTLNDVNAPRYMELTHAQNVDVTRQLYDGIRSFDVRPWRNSRDGK----WY 144 Query: 62 TEVPWRLFEDAELLK------SLANKKTIGTLSAAEAEALNVHSRTDLVEGLFKPVLQFL 115 +E PWRL ED L+ S + I L A +++HSRT L E LF+PV +FL Sbjct: 145 SEAPWRLEEDLVSLRKAVSAFSQLSTSPIPALPNALQSIIDLHSRTSLEESLFRPVQRFL 204 Query: 116 RKSRTEVAVVVFSAVNGNTHTKRNAIVKGSELGESL------------AHAEKQLESFLQ 163 R + +EVAVVVFS +NGN N++ + S LG L A L +F + Sbjct: 205 RSNGSEVAVVVFSTLNGNRRGAINSVEQASSLGAMLKRYAEYNQPGAKADPTNPLPAFNE 264 Query: 164 SGGPYSDNEH---DVEPQNQPAAPSVVRVGPHSAYYSPLLDSFVKKSVTFNHFAEITELI 220 P H DV+ + + +P+ V P S + S V+K + + + ELI Sbjct: 265 LASPKFVEGHPGGDVDRKGKGKSPTFEVVSPSDPTAS-FMTSAVRKPLMYEDVQALLELI 323 Query: 221 DKFWGALL 228 D++WG LL Sbjct: 324 DRYWGQLL 331 >gi|221505521|gb|EEE31166.1| conserved hypothetical protein [Toxoplasma gondii VEG] Length = 878 Score = 140 bits (354), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 93/248 (37%), Positives = 127/248 (51%), Gaps = 26/248 (10%) Query: 2 KLALPATYMSTLTDVNVPKYAGKVGSQNLGVVDQLKDGIRLIDVRLWRDERVADPLSGWF 61 +LALPAT STL DVN P+Y +QN+ V QL DGIR DVR WR+ R W+ Sbjct: 89 ELALPATVHSTLNDVNAPRYMELTHAQNVDVTRQLYDGIRSFDVRPWRNSRDGK----WY 144 Query: 62 TEVPWRLFEDAELLK------SLANKKTIGTLSAAEAEALNVHSRTDLVEGLFKPVLQFL 115 +E PWRL ED L+ S + I L A +++HSRT L E LF+PV +FL Sbjct: 145 SEAPWRLEEDLVSLRKAVSAFSQLSTSPIPALPNALQSIIDLHSRTSLEESLFRPVQRFL 204 Query: 116 RKSRTEVAVVVFSAVNGNTHTKRNAIVKGSELGESL------------AHAEKQLESFLQ 163 R + +EVAVVVFS +NGN N++ + S LG L A L +F + Sbjct: 205 RSNGSEVAVVVFSTLNGNRRGAINSVEQASSLGAMLKRYAEYNQPGAKADPTNPLPAFNE 264 Query: 164 SGGPYSDNEH---DVEPQNQPAAPSVVRVGPHSAYYSPLLDSFVKKSVTFNHFAEITELI 220 P H DV+ + + +P+ V P S + S V+K + + + ELI Sbjct: 265 LASPKFVEGHPGGDVDRKGKGKSPTFEVVSPSDPTAS-FMTSAVRKPLMYEDVQALLELI 323 Query: 221 DKFWGALL 228 D++WG LL Sbjct: 324 DRYWGQLL 331 >gi|325115414|emb|CBZ50969.1| conserved hypothetical protein [Neospora caninum Liverpool] Length = 880 Score = 137 bits (344), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 91/252 (36%), Positives = 127/252 (50%), Gaps = 34/252 (13%) Query: 2 KLALPATYMSTLTDVNVPKYAGKVGSQNLGVVDQLKDGIRLIDVRLWRDERVADPLSGWF 61 +LALPAT STLTDV P+Y +QN+ + QL DGIR DVR WR+ R W+ Sbjct: 90 ELALPATVHSTLTDVTAPRYMDLTHAQNVDINRQLYDGIRSFDVRPWRNSRDGR----WY 145 Query: 62 TEVPWRLFEDAELLKSLANKKT------IGTLSAAEAEALNVHSRTDLVEGLFKPVLQFL 115 +E PWRL ED L+ + + I L + +++HSRT L + LF+PV +FL Sbjct: 146 SEAPWRLEEDLVTLRKAVSSFSQLSSSPIAALPHSLQNTIDLHSRTTLEDSLFRPVQRFL 205 Query: 116 RKSRTEVAVVVFSAVNGNTHTKRNAIVKGSELGESLAHAEKQLESFLQSGG--------- 166 R + +EVAVVVFSA+NGN N++ + S LG L K+ + Q G Sbjct: 206 RSNGSEVAVVVFSALNGNRRGGVNSVEQASSLGAML----KRYADYNQPGAKADPSSPLP 261 Query: 167 PYSDNEHDVEPQNQPAA----------PSVVRVGPHSAYYSPLLDSFVKKSVTFNHFAEI 216 P+SD + P P+ V P S + S V+K +T+ + Sbjct: 262 PFSDLSSPTFVEGHPGGDTNRKGKGKPPTFEVVSPSDPSAS-FMTSAVRKPLTYEDVQAL 320 Query: 217 TELIDKFWGALL 228 ELID++WG LL Sbjct: 321 LELIDEYWGQLL 332 >gi|299473303|emb|CBN77702.1| conserved unknown protein [Ectocarpus siliculosus] Length = 495 Score = 38.5 bits (88), Expect = 0.84, Method: Compositional matrix adjust. Identities = 17/46 (36%), Positives = 30/46 (65%) Query: 2 KLALPATYMSTLTDVNVPKYAGKVGSQNLGVVDQLKDGIRLIDVRL 47 +++LP T+ S ++ P + +QNLGV+ QLK G+R++D+R Sbjct: 197 EMSLPGTHDSGTAKMHNPVVSPWATTQNLGVLAQLKAGVRVLDLRC 242 >gi|302844759|ref|XP_002953919.1| hypothetical protein VOLCADRAFT_82561 [Volvox carteri f. nagariensis] gi|300260731|gb|EFJ44948.1| hypothetical protein VOLCADRAFT_82561 [Volvox carteri f. nagariensis] Length = 857 Score = 37.0 bits (84), Expect = 2.3, Method: Compositional matrix adjust. Identities = 36/132 (27%), Positives = 57/132 (43%), Gaps = 14/132 (10%) Query: 32 VVDQLKDGIRLIDVRLWRDERVADPLSGWFTEVPWRLFEDAELLKSLANKKTIGTLSAAE 91 VV + D +R + + E PLS + ++ L+E AE +S A + G A Sbjct: 471 VVYYVCDALRFLALGFQSSEGETTPLSPYLKDLVQNLYETAERFRS-APCENSGKAQVAA 529 Query: 92 AEALNVHSRTDLVEGLFKPVLQFLRKSRTEVAVVVFSAVNGNTHTKRNAIVKGSELGESL 151 EA+N DLV + L+F+ + VV A+N N T+ + + E + Sbjct: 530 YEAIN-----DLVRSSARDTLEFVGT----LLQVVLGAINTNLETQ----ITSHQAAEKM 576 Query: 152 AHAEKQLESFLQ 163 A + QL LQ Sbjct: 577 ATLQGQLCGMLQ 588 >gi|159463700|ref|XP_001690080.1| importin beta [Chlamydomonas reinhardtii] gi|158284068|gb|EDP09818.1| importin beta [Chlamydomonas reinhardtii] Length = 832 Score = 37.0 bits (84), Expect = 2.3, Method: Composition-based stats. Identities = 35/127 (27%), Positives = 59/127 (46%), Gaps = 15/127 (11%) Query: 38 DGIRLIDVRLWRDERVADPLSGWFTEVPWRLFEDAELLKSLANKKTIGTLSAAEAEALNV 97 D IRL+ + + PLS + ++ +L+E AE+ ++ +++ G A EA+N Sbjct: 477 DAIRLLALGFNTGDADTSPLSPFLKDLIQKLYEAAEVFRNHPSQENSGKAQIAAYEAIN- 535 Query: 98 HSRTDLVEGLFKPVLQFLRKSRTEVAVVVFSAVNGNTHTKRNAIVKGS-ELGESLAHAEK 156 DLV + L+F+ T VV +A+N N A+ GS + E +A + Sbjct: 536 ----DLVRTSARDTLEFVGTLLT----VVLNAINANL-----AVPLGSQQAAEKMATLQG 582 Query: 157 QLESFLQ 163 Q LQ Sbjct: 583 QFCGMLQ 589 >gi|32476119|ref|NP_869113.1| hypothetical protein RB9867 [Rhodopirellula baltica SH 1] gi|32446663|emb|CAD76499.1| hypothetical protein RB9867 [Rhodopirellula baltica SH 1] Length = 209 Score = 36.6 bits (83), Expect = 3.2, Method: Compositional matrix adjust. Identities = 36/121 (29%), Positives = 57/121 (47%), Gaps = 11/121 (9%) Query: 45 VRLWRDERVADPLSGWFTEVPWRLFEDAELL--KSLANKKTIGTLSAAEA----EALNVH 98 VR W DE +A+P SGW V + + EL + +A ++ AE + LN Sbjct: 29 VRRWHDEPMANPYSGWLRWVCQQHAFNFELWHEEDVARNPSVTDARIAEVKRSIDRLN-Q 87 Query: 99 SRTDLVEGLFKPVLQFLRKSRTEVAVVVFSAVNGNTHTKRNAIVKGSELGESLAHAEKQL 158 R D +E L + + +++S +VVV NT T +AI + S + L H +QL Sbjct: 88 QRNDWIEKLDDAITESIQQS----SVVVAEDAPINTETPGSAIDRLSIMSLRLFHYAEQL 143 Query: 159 E 159 + Sbjct: 144 Q 144 >gi|327539416|gb|EGF26032.1| hypothetical protein RBWH47_01227 [Rhodopirellula baltica WH47] Length = 209 Score = 36.2 bits (82), Expect = 4.1, Method: Compositional matrix adjust. Identities = 36/121 (29%), Positives = 56/121 (46%), Gaps = 11/121 (9%) Query: 45 VRLWRDERVADPLSGWFTEVPWRLFEDAELL--KSLANKKTIGTLSAAEA----EALNVH 98 VR W DE +A+P SGW V + + EL + +A + AE + LN Sbjct: 29 VRRWHDEPMANPYSGWLRWVCQQHAFNFELWHEEDVARNPNVTDARIAEVKRSIDRLN-Q 87 Query: 99 SRTDLVEGLFKPVLQFLRKSRTEVAVVVFSAVNGNTHTKRNAIVKGSELGESLAHAEKQL 158 R D +E L + + +++S +VVV NT T +AI + S + L H +QL Sbjct: 88 QRNDWIEKLDDAITESIQQS----SVVVAENAPINTETPGSAIDRLSIMSLRLFHYAEQL 143 Query: 159 E 159 + Sbjct: 144 Q 144 >gi|119714672|ref|YP_921637.1| methylthioribose kinase [Nocardioides sp. JS614] gi|119535333|gb|ABL79950.1| 5'-methylthioribose kinase [Nocardioides sp. JS614] Length = 409 Score = 35.8 bits (81), Expect = 4.7, Method: Compositional matrix adjust. Identities = 28/101 (27%), Positives = 44/101 (43%), Gaps = 16/101 (15%) Query: 30 LGVVDQLKDGIR---------LIDVRLWRDERVADPLSGWFTEVPWRLFEDAELLKSLAN 80 LG+V Q DG +D R+WRD+ + D L W E W LF A++ + Sbjct: 296 LGLVQQTWDGFEGEFRRRWPSRLDARVWRDDTLQDLLREWENEA-W-LFAAAKM-----S 348 Query: 81 KKTIGTLSAAEAEALNVHSRTDLVEGLFKPVLQFLRKSRTE 121 ++ +G A+ E L R G+ + +R S +E Sbjct: 349 RRIVGAAKTADIETLPEELREGAARGVLRAARTLVRHSTSE 389 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jul 22, 2011 4:42 PM Number of letters in database: 5,058,227,080 Number of sequences in database: 14,777,732 Lambda K H 0.316 0.133 0.383 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 14777732 Number of Hits to DB: 2,162,311,395 Number of extensions: 85950744 Number of successful extensions: 242702 Number of sequences better than 10.0: 18 Number of HSP's gapped: 244363 Number of HSP's successfully gapped: 18 Length of query: 228 Length of database: 5,058,227,080 Length adjustment: 134 Effective length of query: 94 Effective length of database: 3,078,010,992 Effective search space: 289333033248 Effective search space used: 289333033248 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 79 (35.0 bits)