Bit score color key: <40, 40-50, 50-80, 80-200, >200
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-12_CDS_annotation_glimmer3.pl_2_1
Length=354

                                                                     Score     E
Sequences producing significant alignments:                         (Bits)   Value

gi|494822881|ref|WP_007558289.1| hypothetical protein                56.2    1e-05
gi|575094344|emb|CDL65728.1| unnamed protein product                 53.9    7e-05
gi|496050831|ref|WP_008775338.1| predicted protein                   52.8    2e-04
gi|490418711|ref|WP_004291034.1| hypothetical protein                50.1    0.001
gi|575094319|emb|CDL65706.1| unnamed protein product                 47.8    0.008
gi|575094372|emb|CDL65753.1| unnamed protein product                 47.0    0.012
gi|575094301|emb|CDL65691.1| unnamed protein product                 45.8    0.033
gi|547226428|ref|WP_021963491.1| putative uncharacterized protein    43.5    0.16
gi|547836612|ref|WP_022244433.1| putative uncharacterized protein    41.6    0.45
gi|649555290|gb|KDS61827.1| hypothetical protein M095_3809           41.6    0.61

>gi|494822881|ref|WP_007558289.1| hypothetical protein [Bacteroides plebeius]
 gi|198272097|gb|EDY96366.1| hypothetical protein BACPLE_00802 [Bacteroides plebeius DSM 17135]
Length=344

 Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
 Identities = 58/178 (33%), Positives = 87/178 (49%), Gaps = 37/178 (21%)

Query  31   KHQLEMQRIQNEWASSESQKSRDFAKSMFDATNEWNSAKNQRARLEEAGLNPYLMMNggs  90
            + QLE Q I+ EW M++A NE+NSA +QR RLEEAGLNPY+MM+GGS
Sbjct  59   REQLERQ-IEQEW-------------DMWNAENEYNSASSQRKRLEEAGLNPYMMMDGGS  104

Query  91   agtasstsantvsgasgsggsPYQFTPTNMIG-----DVAS-YAAAMKSMSDARK---TN  141
            AG+ASS ++ A P +M G +AS + A +K+ D R N
Sbjct  105  AGSASSMTSPAAQPAVVPQMQGATMQPADMSGLSGLRGIASEFIATLKAQEDIRGQQLIN  164

Query  142  TESDLLDQYGAATYESRIKKTMADTYFTQRQADVAIAQRANLLLRNDAQEVLNMYLPE  199
            ++ +QY A + ++KT ++ F + Q Q+++N + PE
Sbjct  165  EGQEIENQYKADKLLADLEKTRTESGFVRSQT--------------KGQDIMNRFRPE  208

>gi|575094344|emb|CDL65728.1| unnamed protein product [uncultured bacterium]
Length=368

 Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
 Identities = 46/146 (32%), Positives = 68/146 (47%), Gaps = 13/146 (9%)

Query  40   QNEWASSESQKSRDFAKSMFDATNEWNSAKNQRARLEEAGLNPYLMM-Nggsagtassts  98
            Q EW S E+QK RDF M++ NE+N Q RLEEAG+NP+ M N A SS S
Sbjct  49   QMEWQSQEAQKQRDFQLDMWNRNNEYNKPDEQMKRLEEAGINPWQSMGNSSVASGNSSLS  108

Query  99   antvsgasgsggsPYQFTPTNMIGDV-ASYAAAMKSMSDARKTNTESDLLDQYGAATYES  157
            + S + + +P + DV A A K+ +D + NT ++
Sbjct  109  QPSGFVPSPAHAASSSLSPLGLAADVLGKIAQANKAGADTNRVNT-----------LLQT  157

Query  158  RIKKTMADTYFTQRQADVAIAQRANL  183
            ++K +A+ F QA + +NL
Sbjct  158  ELEKLIAEKNFVSLQAARQAIENSNL  183

>gi|496050831|ref|WP_008775338.1| predicted protein [Bacteroides sp. 2_2_4]
 gi|229448895|gb|EEO54686.1| hypothetical protein BSCG_01611 [Bacteroides sp. 2_2_4]
Length=381

 Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
 Identities = 53/144 (37%), Positives = 79/144 (55%), Gaps = 10/144 (7%)

Query  37   QRIQNEWASSESQKSRDFAKSMFDATNEWNSAKNQRARLEEAGLNPYLMMNggsagtass  96
            Q++ ++W+ K A MF+ATNE+NSA QR R E AGLNPY+MMN GSAGTA++
Sbjct  66   QQVSDQWSFYNDAKQN--AWDMFNATNEYNSASAQRERYEAAGLNPYVMMNTGSAGTAAA  123

Query  97   tsantvsgasgsggsPYQFTP-----TNMIGDVASYAAAMKSMSDARKTNTESDLLD---  148
            TSA + + + G +P +P + ++ + + S+ D KT E+ L
Sbjct  124  TSATSATAPTKQGITPPTASPYSADYSGIMQGLGQAIDQLSSIPDKAKTIAETGNLKIEG  183

Query  149  QYGAATYESRIKKTMADTYFTQRQ  172
            +Y AA +RI ADT+ + Q
Sbjct  184  KYKAAEAIARIANIKADTHSKKEQ  207

>gi|490418711|ref|WP_004291034.1| hypothetical protein [Bacteroides eggerthii]
 gi|217986638|gb|EEC52972.1| hypothetical protein BACEGG_02723 [Bacteroides eggerthii DSM 20697]
Length=368

 Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
 Identities = 74/251 (29%), Positives = 103/251 (41%), Gaps = 50/251 (20%)

Query  41   NEWASSESQKSRDFAKSMFDATNEWNSAKNQRARLEEAGLNPYLMMNggsagtasstsan  100
            N W E K+ + M++ NE+N QRARLE AGLNPY+MMNGGSAG A S S
Sbjct  77   NAWKLYEDNKA--YQTEMWNKQNEYNDPSAQRARLEAAGLNPYMMMNGGSAGVAGSVSGT  134

Query  101  tvsgasgsggsPYQFTPTNMIGDVASYAAAMKSMSDARKT----------NTESDLL---  147
            S S S P A Y+ M+ + A T N ++D L
Sbjct  135  QGSAPSAGSPSAQGVQPPTATPYSADYSGVMQGLGHAIDTIMTGSQRNIQNAQADNLRIE  194

Query  148  DQYGAA--------TYE---------------SRIKKTMADTYFT-------QRQADVAI  177
            +Y A+ TY S I+K ++ + Q QA I
Sbjct  195  GKYIASKAIAELYKTYNEAKNDDERVAIQRVLSSIQKDLSASQVAVNNENVRQIQAQTKI  254

Query  178  AQRANLLLRNDAQEVLNMYLPEEKRIQLQMNGAQYWNMIREGVISEEQAKNLIASRLEIE  237
            A NLL + +LP E+R QL + A + ++E+QA++ I E
Sbjct  255  AVTENLLREQQLK-----FLPYEQRTQLALGAADIALKYAQKNLTEKQARHEIEKLAETI  309

Query  238  ARTQGQHISNK  248
            R GQ + N+
Sbjct  310  VRANGQAMQNQ  320

>gi|575094319|emb|CDL65706.1| unnamed protein product [uncultured bacterium]
Length=396

 Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
 Identities = 76/276 (28%), Positives = 112/276 (41%), Gaps = 51/276 (18%)

Query  31   KHQLEMQRIQNEWASSESQKSRDFAKSMFDATNEWNSAKNQRARLEEAGLNPYLMMNggs  90
            K+QL+ R N+ + ++ F + M++ NE+N QRARLE AGLNPYLMM+GGS
Sbjct  50   KYQLQAVRETNQANREIADQNNKFNERMWNLQNEYNRPDMQRARLEAAGLNPYLMMDGGS  109

Query  91   agtasstsantvsgas------gsggsPYQFTPTNMIGDVASYAAAMKSMSDARKTNT--  142
            AG A S SG + YQ N I AS A M D +K N
Sbjct  110  AGIAESAPTADTSGTQIAPDIGNTIAGGYQ-AMGNSISSAASQIAQMTFQDDLQKANVAK  168

Query  143  ------ESDLLDQYGAATYESRIKKTMADTYFTQRQADVAIAQRANLLLRNDAQEVLNM-  195
            + L +Q+ E + + + Q+Q D++ + AN LR+ Q+ L+
Sbjct  169  TVAEAKNAHLQNQFDELRNEFAVANFLVNLRLKQKQGDISDYE-AN-YLRDSMQDRLDSV  226

Query  196  ---------------------------------YLPEEKRIQLQMNGAQYWNMIREGVIS  222
            +LP+EK+ L M+ E ++
Sbjct  227  KFQNTLSGSQSSYYSQMAGLTDVQRQIEQTNLDWLPQEKQAGLAATLQNIRTMVSEMGLN  286

Query  223  EEQAKNLIASRLEIEARTQGQHISNKIARSTADAVI  258
            QAKN A A +G I N++ ST D +
Sbjct  287  YAQAKNAFAMASLNYANEEGLRIDNRLKESTFDLSV  322

>gi|575094372|emb|CDL65753.1| unnamed protein product [uncultured bacterium]
Length=385

 Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 32/44 (73%), Gaps = 0/44 (0%)

Query  39   IQNEWASSESQKSRDFAKSMFDATNEWNSAKNQRARLEEAGLNP  82
            +QNE+ +SE++K+R F KS+++ + WNS NQ + +AGLNP
Sbjct  69   LQNEFNASEAEKNRAFQKSLYERSLSWNSPSNQLKMMADAGLNP  112

>gi|575094301|emb|CDL65691.1| unnamed protein product [uncultured bacterium]
Length=437

 Score = 45.8 bits (107), Expect = 0.033, Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 36/57 (63%), Gaps = 3/57 (5%)

Query  31   KHQLEMQRIQNEWASSESQKSRDFAKSMFDATNEWNSAKNQRARLEEAGLNPYLMMN  87
            +HQ E R + E+ RDFA+ M+ TN++N+ Q+ RLE+AG+NPY+ M+
Sbjct  45   RHQAEDAR---NFTHQENALQRDFARQMWKDTNDYNTPIAQKQRLEQAGMNPYVNMD  98

>gi|547226428|ref|WP_021963491.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
 gi|524103380|emb|CCY83991.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=416

 Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
 Identities = 19/22 (86%), Positives = 20/22 (91%), Gaps = 0/22 (0%)

Query  65   WNSAKNQRARLEEAGLNPYLMM  86
            +NSAK QRARLE AGLNPYLMM
Sbjct  91   YNSAKAQRARLEAAGLNPYLMM  112

>gi|547836612|ref|WP_022244433.1| putative uncharacterized protein [Roseburia sp. CAG:45]
 gi|524471221|emb|CDC12112.1| putative uncharacterized protein [Roseburia sp. CAG:45]
Length=225

 Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 33/62 (53%), Gaps = 0/62 (0%)

Query  281  DVGFRTGKMDRWSQDPVKARWDRGINNAGKFIDGLSNIVGAVTKFGSFRREGRSVIEQFD  340
            +V RT MD W+ K R+ + INN+ + D L ++ +T+ ++ R +IE D
Sbjct  161  EVQLRTIAMDFWASLEHKLRYKKHINNSEEIADKLKHVADVITEMDCEMQDIRHMIEFID  220

Query  341  DN  342
            DN
Sbjct  221  DN  222

>gi|649555290|gb|KDS61827.1| hypothetical protein M095_3809 [Parabacteroides distasonis str. 3999B T(B) 4]
 gi|649557306|gb|KDS63785.1| hypothetical protein M095_3404 [Parabacteroides distasonis str. 3999B T(B) 4]
 gi|649559158|gb|KDS65545.1| hypothetical protein M096_4689 [Parabacteroides distasonis str. 3999B T(B) 6]
 gi|649560567|gb|KDS66875.1| hypothetical protein M095_2448 [Parabacteroides distasonis str. 3999B T(B) 4]
 gi|649561016|gb|KDS67303.1| hypothetical protein M095_2410 [Parabacteroides distasonis str. 3999B T(B) 4]
 gi|649562727|gb|KDS68911.1| hypothetical protein M096_3341 [Parabacteroides distasonis str. 3999B T(B) 6]
Length=288

 Score = 41.6 bits (96), Expect = 0.61, Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 33/55 (60%), Gaps = 0/55 (0%)

Query  31   KHQLEMQRIQNEWASSESQKSRDFAKSMFDATNEWNSAKNQRARLEEAGLNPYLM  85
            K +E+ + Q +W E++K+ + M++ NE+NS Q AR+ AGLNP L+
Sbjct  33   KANMEIAKYQAQWQQQENEKAYQRSLKMWNLQNEYNSPTQQMARIRAAGLNPNLV  87

Lambda      K        H        a         alpha
   0.315    0.127    0.358    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6

Effective search space used: 2103429244710
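
Note on the score columns: each "Score =" line reports a bit score followed by a raw score in parentheses. The minimal sketch below checks that the two agree under the standard Karlin-Altschul conversion S' = (lambda * S - ln K) / ln 2, assuming the gapped Lambda and K values printed in the footer of this report are the ones that apply. The function name and the choice of sample hits are illustrative and are not part of the BLAST output.

```python
import math

# Gapped Karlin-Altschul parameters copied from the footer of this report.
# Assumption: these are the parameters used for the bit scores shown above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score: int, lam: float = GAPPED_LAMBDA, k: float = GAPPED_K) -> float:
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw score (the number in parentheses) mapped to the bit score reported
# on the same "Score =" line, for a few of the hits in this report.
reported = {134: 56.2, 128: 53.9, 125: 52.8, 118: 50.1, 96: 41.6}

for raw, bits in reported.items():
    print(f"raw={raw:3d}  computed={bit_score(raw):.1f}  reported={bits}")
```

The footer values are rounded to three significant figures, so agreement is only expected to the one decimal place that the report prints.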