BLASTP 2.2.24 [Aug-08-2010] Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= Eten_8591_orf1 (132 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 14,777,732 sequences; 5,058,227,080 total letters
Score E Sequences producing significant alignments: (bits) Value gi|196002721|ref|XP_002111228.1| hypothetical protein TRIADDRAFT... 35 4.8 gi|13236678|gb|AAK16196.1| AfaA [Escherichia coli] >gi|291291770... 34 6.6 gi|332799789|ref|YP_004461288.1| riboflavin biosynthesis protein... 34 7.7 >gi|196002721|ref|XP_002111228.1| hypothetical protein TRIADDRAFT_10225 [Trichoplax adhaerens] gi|190587179|gb|EDV27232.1| hypothetical protein TRIADDRAFT_10225 [Trichoplax adhaerens] Length = 1266 Score = 34.7 bits (78), Expect = 4.8, Method: Compositional matrix adjust. Identities = 28/108 (25%), Positives = 50/108 (46%), Gaps = 13/108 (12%) Query: 32 ILNRAELNSDSVWVVGCGPLKIVGSLQ-------NSITVGSARSEESS--CTAASKSNKN 82 +LNRA D + V+ G + +GS + N + + E+S+ C + + N Sbjct: 576 LLNRA----DKIIVISNGTIAAMGSYKSLLQSSRNFVELLPPSDEDSNNKCAESDGYDSN 631 Query: 83 LRSPYQKIRKNLNTPICSCVPHATAGLHREQREEGSSKLRRRIQYVVS 130 K +L+ S + +A + +E+R+EGS ++ IQY VS Sbjct: 632 SYLGVTKSYSSLSIASASMIFNADVKMDQEERQEGSVTMKTYIQYFVS 679 >gi|13236678|gb|AAK16196.1| AfaA [Escherichia coli] gi|291291770|gb|ADD91741.1| putative transcriptional regulator AfaA-VIII [Escherichia coli] gi|299929320|gb|ADJ58044.1| AfaA [Escherichia coli] Length = 127 Score = 34.3 bits (77), Expect = 6.6, Method: Compositional matrix adjust. Identities = 27/114 (23%), Positives = 52/114 (45%), Gaps = 10/114 (8%) Query: 17 NQVLKNEIC---TLSSSWILNRAELNSDSVWVV----GCGPLKIVGSLQNSITVGSARSE 69 N V+ +E+C S IL E++ W++ G K++ +L+ + +G R E Sbjct: 11 NGVMNSEVCLHLVKERSGILLPGEVSERHFWLLIEISGMHSEKVINALREHLVMGYTRRE 70 Query: 70 ESSCTAASKSNKNLRSPYQKIRKNLNTPICSCVPHATAGLHREQREEGSSKLRR 123 +C S L +I ++L+ + + +P+ +GL EEG ++R Sbjct: 71 --ACEQHGVSQGYLSGALVRI-QHLSQTVNNLLPYYVSGLAMNHEEEGDRSVKR 121 >gi|332799789|ref|YP_004461288.1| riboflavin biosynthesis protein RibD [Tepidanaerobacter sp. Re1] gi|332697524|gb|AEE91981.1| riboflavin biosynthesis protein RibD [Tepidanaerobacter sp. Re1] Length = 361 Score = 33.9 bits (76), Expect = 7.7, Method: Compositional matrix adjust. Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 12/88 (13%) Query: 19 VLKNEICTLSSSWILNRAELNSDSVWVVGCGPLKIVGSLQNSITVGSARSEESSCTAASK 78 VLKN+ L+ +I N +E + +V+ KI S+ I + S +S+ +C AA K Sbjct: 127 VLKNKASKLNEVFIKNISE---QTPFVIA----KIAQSVDGKIALSSGKSKWITCDAARK 179 Query: 79 SNKNLRSPYQKIRKNLNT-----PICSC 101 +RS Y I + T PI C Sbjct: 180 KGHEIRSMYDAILVGIGTVLADDPILDC 207 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jul 22, 2011 4:42 PM Number of letters in database: 5,058,227,080 Number of sequences in database: 14,777,732 Lambda K H 0.314 0.127 0.384 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 14777732 Number of Hits to DB: 1,256,473,373 Number of extensions: 41355932 Number of successful extensions: 101140 Number of sequences better than 10.0: 6 Number of HSP's gapped: 101987 Number of HSP's successfully gapped: 6 Length of query: 132 Length of database: 5,058,227,080 Length adjustment: 97 Effective length of query: 35 Effective length of database: 3,624,787,076 Effective search space: 126867547660 Effective search space used: 126867547660 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 76 (33.9 bits)