BLASTP 2.2.24 [Aug-08-2010] Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= Eten_4714_orf1 (166 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 14,777,732 sequences; 5,058,227,080 total letters
Score E Sequences producing significant alignments: (bits) Value gi|281200925|gb|EFA75139.1| hypothetical protein PPL_11213 [Poly... 44 0.009 gi|254167499|ref|ZP_04874351.1| OB-fold nucleic acid binding dom... 43 0.013 gi|254166808|ref|ZP_04873662.1| OB-fold nucleic acid binding dom... 43 0.015 gi|66815421|ref|XP_641727.1| hypothetical protein DDB_G0279209 [... 41 0.047 gi|301102097|ref|XP_002900136.1| conserved hypothetical protein ... 41 0.057 gi|328871273|gb|EGG19644.1| inositol-3-phosphate synthase [Dicty... 39 0.27 gi|224117112|ref|XP_002317479.1| predicted protein [Populus tric... 39 0.35 gi|118481894|gb|ABK92882.1| unknown [Populus trichocarpa] 39 0.35 gi|330843157|ref|XP_003293528.1| hypothetical protein DICPUDRAFT... 37 0.66 gi|13542067|ref|NP_111755.1| single-stranded DNA-binding protein... 36 1.6 gi|325183338|emb|CCA17796.1| conserved hypothetical protein [Alb... 35 2.9 gi|134037090|gb|ABO47885.1| hypothetical protein Tb10.6k15.3460 ... 34 5.6 >gi|281200925|gb|EFA75139.1| hypothetical protein PPL_11213 [Polysphondylium pallidum PN500] Length = 140 Score = 43.5 bits (101), Expect = 0.009, Method: Compositional matrix adjust. Identities = 26/89 (29%), Positives = 45/89 (50%), Gaps = 3/89 (3%) Query: 78 FVLVKDLLPQSGCCCCLVRVLQYLSHSLLKLPSGSLVQTTEAVVADETGAVLLLLETDAQ 137 F V+ L P + +++VL S ++ +EAVV DETG ++L ++ + Q Sbjct: 24 FTKVEHLKPMTSGHNLVLKVLS--SKVVIDRNKDRKEFISEAVVGDETGTIILTVKNE-Q 80 Query: 138 RQLCQPGTEVLLLNVTVSCVGGFVCLLLR 166 + QPG ++L N T+ GF+ L + Sbjct: 81 NDVVQPGNTIILRNGTIRVFNGFMRLYVN 109 >gi|254167499|ref|ZP_04874351.1| OB-fold nucleic acid binding domain protein [Aciduliprofundum boonei T469] gi|197623762|gb|EDY36325.1| OB-fold nucleic acid binding domain protein [Aciduliprofundum boonei T469] Length = 137 Score = 43.1 bits (100), Expect = 0.013, Method: Compositional matrix adjust. Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 1/83 (1%) Query: 81 VKDLLPQSGCCCCLVRVLQYLSHSLLKLPSGSLVQTTEAVVADETGAVLLLLETDAQRQL 140 +KDL P + L +VL + G TEAVV DETG +++ L + Q + Sbjct: 7 IKDLNPNAKRVNVLAKVLHKGEPKEINTKYGETKHVTEAVVGDETGTIIMSL-WNEQADM 65 Query: 141 CQPGTEVLLLNVTVSCVGGFVCL 163 G + + N +S V G + L Sbjct: 66 IDEGETIYVDNGYISLVRGHMRL 88 >gi|254166808|ref|ZP_04873662.1| OB-fold nucleic acid binding domain protein [Aciduliprofundum boonei T469] gi|289596472|ref|YP_003483168.1| nucleic acid binding OB-fold tRNA/helicase-type [Aciduliprofundum boonei T469] gi|197624418|gb|EDY36979.1| OB-fold nucleic acid binding domain protein [Aciduliprofundum boonei T469] gi|289534259|gb|ADD08606.1| nucleic acid binding OB-fold tRNA/helicase-type [Aciduliprofundum boonei T469] Length = 137 Score = 42.7 bits (99), Expect = 0.015, Method: Compositional matrix adjust. Identities = 25/83 (30%), Positives = 39/83 (46%), Gaps = 1/83 (1%) Query: 81 VKDLLPQSGCCCCLVRVLQYLSHSLLKLPSGSLVQTTEAVVADETGAVLLLLETDAQRQL 140 +KDL P + L +VL + G TEAVV DETG +++ L + Q + Sbjct: 7 IKDLNPNAKRVNVLAKVLHKGEPKEINTKYGETKHVTEAVVGDETGTIIMSL-WNEQADM 65 Query: 141 CQPGTEVLLLNVTVSCVGGFVCL 163 + G + + N +S V G + L Sbjct: 66 IEEGEIIYVDNGYISLVRGHMRL 88 >gi|66815421|ref|XP_641727.1| hypothetical protein DDB_G0279209 [Dictyostelium discoideum AX4] gi|60469766|gb|EAL67753.1| hypothetical protein DDB_G0279209 [Dictyostelium discoideum AX4] Length = 132 Score = 41.2 bits (95), Expect = 0.047, Method: Compositional matrix adjust. Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 1/50 (2%) Query: 117 TEAVVADETGAVLLLLETDAQRQLCQPGTEVLLLNVTVSCVGGFVCLLLR 166 +EAVV DETG ++L ++ + Q + QPG ++L N + GF+ L + Sbjct: 52 SEAVVGDETGTIILTVKNE-QNDVVQPGNTIILRNGIIKVFNGFMRLYVN 100 >gi|301102097|ref|XP_002900136.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262102288|gb|EEY60340.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 146 Score = 40.8 bits (94), Expect = 0.057, Method: Compositional matrix adjust. Identities = 28/88 (31%), Positives = 44/88 (50%), Gaps = 1/88 (1%) Query: 76 APFVLVKDLLPQSGCCCCLVRVLQYLSHSLLKLPSGSLVQTTEAVVADETGAVLLLLETD 135 A +V V+DL P S +++V+ K G++ + EAV+ADETG V L D Sbjct: 14 ATYVKVQDLTPGSQGHNLVLQVVSIAPTVEKKRYDGTISRIAEAVLADETGCVTLTARND 73 Query: 136 AQRQLCQPGTEVLLLNVTVSCVGGFVCL 163 Q + + G +V++ N GF+ L Sbjct: 74 -QIDMLKEGLDVVVRNSNADIFNGFMRL 100 >gi|328871273|gb|EGG19644.1| inositol-3-phosphate synthase [Dictyostelium fasciculatum] Length = 686 Score = 38.9 bits (89), Expect = 0.27, Method: Composition-based stats. Identities = 18/47 (38%), Positives = 30/47 (63%), Gaps = 1/47 (2%) Query: 117 TEAVVADETGAVLLLLETDAQRQLCQPGTEVLLLNVTVSCVGGFVCL 163 +EAVV DETG ++L ++ + Q ++ QPG ++L N + GF+ L Sbjct: 597 SEAVVGDETGTIVLTVKNE-QNEVVQPGNTIILRNGIIKVFNGFMRL 642 >gi|224117112|ref|XP_002317479.1| predicted protein [Populus trichocarpa] gi|222860544|gb|EEE98091.1| predicted protein [Populus trichocarpa] Length = 155 Score = 38.5 bits (88), Expect = 0.35, Method: Compositional matrix adjust. Identities = 27/90 (30%), Positives = 41/90 (45%), Gaps = 5/90 (5%) Query: 78 FVLVKDLLPQSGCCCCLVRVLQYLSHSLLKLPSGSL----VQTTEAVVADETGAVLLLLE 133 F+ V L P +G V+VL + + S SL + E ++ DETG ++ Sbjct: 30 FIKVDQLKPGTGGHTLTVKVLNFNTVPQKDRRSVSLHARQTRIAECLIGDETGTIIFTAR 89 Query: 134 TDAQRQLCQPGTEVLLLNVTVSCVGGFVCL 163 D Q L +PGT V+L N + G + L Sbjct: 90 ND-QVDLMKPGTTVILRNAKIDMFKGSMRL 118 >gi|118481894|gb|ABK92882.1| unknown [Populus trichocarpa] Length = 155 Score = 38.5 bits (88), Expect = 0.35, Method: Compositional matrix adjust. Identities = 27/90 (30%), Positives = 41/90 (45%), Gaps = 5/90 (5%) Query: 78 FVLVKDLLPQSGCCCCLVRVLQYLSHSLLKLPSGSL----VQTTEAVVADETGAVLLLLE 133 F+ V L P +G V+VL + + S SL + E ++ DETG ++ Sbjct: 30 FIKVDQLKPGTGGHTLTVKVLNFNTVPQKDRRSVSLHVRQTRIAECLIGDETGTIIFTAR 89 Query: 134 TDAQRQLCQPGTEVLLLNVTVSCVGGFVCL 163 D Q L +PGT V+L N + G + L Sbjct: 90 ND-QVDLMKPGTTVILRNAKIDMFKGSMRL 118 >gi|330843157|ref|XP_003293528.1| hypothetical protein DICPUDRAFT_42156 [Dictyostelium purpureum] gi|325076134|gb|EGC29947.1| hypothetical protein DICPUDRAFT_42156 [Dictyostelium purpureum] Length = 116 Score = 37.4 bits (85), Expect = 0.66, Method: Compositional matrix adjust. Identities = 17/47 (36%), Positives = 28/47 (59%), Gaps = 1/47 (2%) Query: 117 TEAVVADETGAVLLLLETDAQRQLCQPGTEVLLLNVTVSCVGGFVCL 163 +EAVV DETG + L ++ + Q + QPG ++L N + G++ L Sbjct: 36 SEAVVGDETGVITLTVKNE-QNDVVQPGNTIILRNGLIKVFNGYMRL 81 >gi|13542067|ref|NP_111755.1| single-stranded DNA-binding protein [Thermoplasma volcanium GSS1] gi|14325498|dbj|BAB60402.1| hypothetical protein [Thermoplasma volcanium GSS1] Length = 145 Score = 36.2 bits (82), Expect = 1.6, Method: Compositional matrix adjust. Identities = 26/83 (31%), Positives = 38/83 (45%), Gaps = 1/83 (1%) Query: 81 VKDLLPQSGCCCCLVRVLQYLSHSLLKLPSGSLVQTTEAVVADETGAVLLLLETDAQRQL 140 +KDL P S L +VL ++ G TE + D+TG V+L L D Q Sbjct: 7 IKDLTPSSRRVNVLGKVLSVGEPKTIQTKFGDQRSVTEVTIGDDTGKVILSLWGDQAAQ- 65 Query: 141 CQPGTEVLLLNVTVSCVGGFVCL 163 + G +++ N VS V G + L Sbjct: 66 AKTGDTLMIGNGYVSLVRGHIRL 88 >gi|325183338|emb|CCA17796.1| conserved hypothetical protein [Albugo laibachii Nc14] Length = 147 Score = 35.4 bits (80), Expect = 2.9, Method: Compositional matrix adjust. Identities = 24/88 (27%), Positives = 41/88 (46%), Gaps = 1/88 (1%) Query: 76 APFVLVKDLLPQSGCCCCLVRVLQYLSHSLLKLPSGSLVQTTEAVVADETGAVLLLLETD 135 A +V V+D+ P + ++R++ K G+ + EAVV DETG V L + Sbjct: 13 ASYVKVEDVTPGTHGHNLVLRIVSVTPLVAKKRQDGNAPRMAEAVVGDETGIVTLTARNE 72 Query: 136 AQRQLCQPGTEVLLLNVTVSCVGGFVCL 163 L + G +V++ N G++ L Sbjct: 73 QIDSLKE-GGDVVIRNCNADVYNGYLRL 99 >gi|134037090|gb|ABO47885.1| hypothetical protein Tb10.6k15.3460 [Alexandrium fundyense] Length = 433 Score = 34.3 bits (77), Expect = 5.6, Method: Compositional matrix adjust. Identities = 14/51 (27%), Positives = 29/51 (56%) Query: 115 QTTEAVVADETGAVLLLLETDAQRQLCQPGTEVLLLNVTVSCVGGFVCLLL 165 + E +V D+TG V+L + +D +C+ G + + N V + G++ ++L Sbjct: 348 EVKEVLVGDDTGLVMLSIRSDTHADVCKAGAALRVQNAHVKMIKGYIRVVL 398 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jul 22, 2011 4:42 PM Number of letters in database: 5,058,227,080 Number of sequences in database: 14,777,732 Lambda K H 0.326 0.141 0.425 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 14777732 Number of Hits to DB: 807,099,115 Number of extensions: 24586033 Number of successful extensions: 74512 Number of sequences better than 10.0: 13 Number of HSP's gapped: 74863 Number of HSP's successfully gapped: 13 Length of query: 166 Length of database: 5,058,227,080 Length adjustment: 127 Effective length of query: 39 Effective length of database: 3,181,455,116 Effective search space: 124076749524 Effective search space used: 124076749524 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.7 bits) S2: 76 (33.9 bits)