BLASTP 2.2.24 [Aug-08-2010] Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= Eten_4523_orf1 (157 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 14,777,732 sequences; 5,058,227,080 total letters
Score E Sequences producing significant alignments: (bits) Value gi|336368965|gb|EGN97307.1| hypothetical protein SERLA73DRAFT_16... 36 2.2 gi|218529327|ref|YP_002420143.1| alkanesulfonate monooxygenase [... 36 2.4 gi|242815206|ref|XP_002486524.1| MGMT family protein [Talaromyce... 35 4.6 gi|327311664|ref|YP_004338561.1| flavin reductase domain-contain... 35 4.8 gi|296170317|ref|ZP_06851908.1| acyl-CoA dehydrogenase domain pr... 34 6.3 gi|163759191|ref|ZP_02166277.1| indolepyruvate ferredoxin oxidor... 34 8.6 >gi|336368965|gb|EGN97307.1| hypothetical protein SERLA73DRAFT_169686 [Serpula lacrymans var. lacrymans S7.3] gi|336381744|gb|EGO22895.1| hypothetical protein SERLADRAFT_450619 [Serpula lacrymans var. lacrymans S7.9] Length = 1261 Score = 35.8 bits (81), Expect = 2.2, Method: Compositional matrix adjust. Identities = 30/110 (27%), Positives = 46/110 (41%), Gaps = 6/110 (5%) Query: 7 NLVKQFESLRDRQWGDFPVIHATLSELASESPPTQALKEKVLGQPVGQCTTASAASVGLE 66 NLV+ FE+++ P + T + + E P Q++ V P + + Sbjct: 801 NLVQWFEAIKSTLVD--PSVSPTWTPVEFEDKPLQSILHAV---PESEDKARPGIVIRQA 855 Query: 67 GDSAEPFSFADARLDFHKLRVRVAPWLQVVGASGFLSEWGPGARVKDAAI 116 GDSA + D+RLDFH +R RV VG G W ++ I Sbjct: 856 GDSAVLVEYGDSRLDFH-MRARVHALETEVGKRGVKGVWAFAPCIRSTMI 904 >gi|218529327|ref|YP_002420143.1| alkanesulfonate monooxygenase [Methylobacterium chloromethanicum CM4] gi|218521630|gb|ACK82215.1| Alkanesulfonate monooxygenase [Methylobacterium chloromethanicum CM4] Length = 365 Score = 35.8 bits (81), Expect = 2.4, Method: Compositional matrix adjust. Identities = 34/104 (32%), Positives = 44/104 (42%), Gaps = 16/104 (15%) Query: 2 PKQLLNLVKQFESLRDRQWGDFPVIHA------TLSELASESPPTQALKEKVL-----GQ 50 P+ L+L +R W I A T LAS +PP ++L GQ Sbjct: 225 PRFSLSLRPILAETEERAWARAEAILAQTRRLRTARGLASAAPPQNEGSRRLLAAAAEGQ 284 Query: 51 PVGQC-TTASAASVGLEGDSA----EPFSFADARLDFHKLRVRV 89 + +C TA AA G G+S P ADA LD+H L VR Sbjct: 285 RLDKCLYTAIAAETGASGNSTALVGTPEQVADALLDYHDLGVRT 328 >gi|242815206|ref|XP_002486524.1| MGMT family protein [Talaromyces stipitatus ATCC 10500] gi|218714863|gb|EED14286.1| MGMT family protein [Talaromyces stipitatus ATCC 10500] Length = 165 Score = 34.7 bits (78), Expect = 4.6, Method: Compositional matrix adjust. Identities = 27/111 (24%), Positives = 46/111 (41%), Gaps = 13/111 (11%) Query: 23 FPVIHATLSELASESPPTQALKEKVLGQP-----VGQC--------TTASAASVGLEGDS 69 F ++ + E+ T ++LG P VG C T ++++ G +S Sbjct: 11 FNAVYEAVQEIPLGCVTTYGHIARLLGYPRRARQVGICLKHLPSNRTNRTSSAKGGGDES 70 Query: 70 AEPFSFADARLDFHKLRVRVAPWLQVVGASGFLSEWGPGARVKDAAILAHQ 120 AE + +H PW +VV + G +S GPG+ + A L H+ Sbjct: 71 AEEDENDEQGEIYHHFHDDNVPWQRVVNSKGTISHRGPGSASRQAVALRHE 121 >gi|327311664|ref|YP_004338561.1| flavin reductase domain-containing protein [Thermoproteus uzoniensis 768-20] gi|326948143|gb|AEA13249.1| flavin reductase domain protein, FMN-binding protein [Thermoproteus uzoniensis 768-20] Length = 188 Score = 34.7 bits (78), Expect = 4.8, Method: Compositional matrix adjust. Identities = 22/62 (35%), Positives = 32/62 (51%), Gaps = 2/62 (3%) Query: 64 GLEGDSAEPFSFADARLDFHKLRVRVAPWLQVVGASGFLSEWGPGARVKDAAILAHQIGR 123 GL+G A + A+LD ++R+ V L+V A G + EWGP + +L H GR Sbjct: 112 GLDGSIAVYEARVTAKLDIGEVRLYVFEVLKVKVAQGLVDEWGP--VLDKTNLLLHGAGR 169 Query: 124 RF 125 F Sbjct: 170 VF 171 >gi|296170317|ref|ZP_06851908.1| acyl-CoA dehydrogenase domain protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295895040|gb|EFG74759.1| acyl-CoA dehydrogenase domain protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 352 Score = 34.3 bits (77), Expect = 6.3, Method: Compositional matrix adjust. Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 1/48 (2%) Query: 106 GPGARVKDAAILAHQIGRRFAAFPSADPECCADFISRLNGGWLDGAAG 153 G GA + D ++A +IGR A P D C A + RL G +GAAG Sbjct: 69 GDGATLVDLTLVAEEIGRSLAPVPWIDHVCTARLLGRL-GALAEGAAG 115 >gi|163759191|ref|ZP_02166277.1| indolepyruvate ferredoxin oxidoreductase [Hoeflea phototrophica DFL-43] gi|162283595|gb|EDQ33880.1| indolepyruvate ferredoxin oxidoreductase [Hoeflea phototrophica DFL-43] Length = 1139 Score = 33.9 bits (76), Expect = 8.6, Method: Compositional matrix adjust. Identities = 34/121 (28%), Positives = 54/121 (44%), Gaps = 20/121 (16%) Query: 13 ESLRDRQWGDFPVIHATLSELASESPPTQALK-EKVLGQPVGQCTTASAASVGLEGDSAE 71 +SL+DR GDF ++H + S T+ ++ G +G + A GL G + + Sbjct: 1009 QSLQDRFEGDFRIMHHFAPPMLSSGTDTRGRPLKRAFGPAIGPVLSLLARLKGLRGTAFD 1068 Query: 72 PFSF-ADARLD------FHKLRVRVAP---------WLQVVGASGFLSEWGPGARVKDAA 115 PFS+ A+ R++ F L R W QV+ A + +GP VK AA Sbjct: 1069 PFSYTAERRMERGLIVWFEALMGRCEHEIHSGNADIWRQVLSAPMEIRGYGP---VKHAA 1125 Query: 116 I 116 + Sbjct: 1126 V 1126 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jul 22, 2011 4:42 PM Number of letters in database: 5,058,227,080 Number of sequences in database: 14,777,732 Lambda K H 0.319 0.134 0.416 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 14777732 Number of Hits to DB: 1,521,145,581 Number of extensions: 55341838 Number of successful extensions: 129364 Number of sequences better than 10.0: 13 Number of HSP's gapped: 130477 Number of HSP's successfully gapped: 13 Length of query: 157 Length of database: 5,058,227,080 Length adjustment: 119 Effective length of query: 38 Effective length of database: 3,299,676,972 Effective search space: 125387724936 Effective search space used: 125387724936 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 76 (33.9 bits)