bitscore colors: <40, 40-50 , 50-80, 80-200, >200
BLASTP 2.2.30+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 49,011,213 sequences; 17,563,301,199 total letters Query= Contig-5_CDS_annotation_glimmer3.pl_2_7 Length=378 Score E Sequences producing significant alignments: (Bits) Value gi|547226428|ref|WP_021963491.1| putative uncharacterized protein 77.0 4e-12 gi|490418711|ref|WP_004291034.1| hypothetical protein 75.5 7e-12 gi|494822881|ref|WP_007558289.1| hypothetical protein 73.9 2e-11 gi|496050831|ref|WP_008775338.1| predicted protein 64.7 3e-08 gi|575094344|emb|CDL65728.1| unnamed protein product 61.2 4e-07 gi|575094358|emb|CDL65740.1| unnamed protein product 59.3 1e-06 gi|575094319|emb|CDL65706.1| unnamed protein product 58.2 4e-06 gi|492501772|ref|WP_005867312.1| hypothetical protein 53.1 1e-04 gi|649555290|gb|KDS61827.1| hypothetical protein M095_3809 52.4 2e-04 gi|639237431|ref|WP_024568108.1| hypothetical protein 50.4 8e-04 >gi|547226428|ref|WP_021963491.1| putative uncharacterized protein [Prevotella sp. CAG:1185] gi|524103380|emb|CCY83991.1| putative uncharacterized protein [Prevotella sp. CAG:1185] Length=416 Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 106/417 (25%), Positives = 170/417 (41%), Gaps = 75/417 (18%) Query 8 SGLFGGLGSVISGAIGAKTTADTNKTNLKIAQMNNDFNAREAQKARDFQLDMWNKENE-- 65 S L G S + G+K+ +DTNKTNL+IAQMNN++N R K ++ DM+N++ E Sbjct 7 SALIGAGASFLGNIFGSKSQSDTNKTNLQIAQMNNEYNERMFNKQLEYNQDMFNQQVEYD 66 Query 66 ------------------------YNSASSQRERLENAGYNPYM------NEaqagtatg 95 YNSA +QR RLE AG NPY+ A + + Sbjct 67 QKKMEQQNNFNARMQNEAIGAQQVYNSAKAQRARLEAAGLNPYLMMSGGNAGAVSAVSGS 126 Query 96 msgtsaasaagaaPQI-------PYTPDFQSV--------------GVNLASALKMMSEK 134 + S G P + PDF V GV A A + + Sbjct 127 SGSGGSPSPMGVNPPTASSAVMQAFRPDFSGVTGIIQTLLDIQAQKGVRDAQAFSLGEQA 186 Query 135 KKTDIENLNMSDLLRSQIWQNLGATDWRNASPEARAYNLSQGRRAAELG--MASLEENLS 192 IEN ++ L I+ + + +N+ N+S R A ++ + Sbjct 187 SGFKIENKYKAEKLLWDIYNSKADYNLKNSQESLN--NMSFARLQAMFSSDVSKAQREAE 244 Query 193 NQRWSNNLLVANIANSLLDADTKTILNKYLDQQQQAELNVKAANYEYLVMSGQLKRQEVN 252 N +++ L+ A A L KY DQ+ EL + +A LV +G+ + Sbjct 245 NAQFTGELIRAQTACQQLQGLLGAKELKYYDQKVLQELAIMSAQQYSLVAAGKASEAQAR 304 Query 253 NLIAEEIETYARANGYNLQNRILRETSDGLIR-ATNNTNFYFGSYYHSRAFNAGADAFHD 311 I + + G + N + ++T++ LI+ A NN N SY++S+ H+ Sbjct 305 QAIENALNLVEQREGIKVDNYVKQKTANALIKTARNNCN---TSYWNSK-------TAHN 354 Query 312 SSILRSRAGSASESYKQSAFDTKLQPWREALNSTNMIFNG-IGSGLDSYTNFQNGRY 367 S+ R +S+ Q F+ + + L S IF G +G GL+ Y + R+ Sbjct 355 QSL---RPSVFEDSFSQ-GFNKFINTYIAPLGSA--IFGGAVGYGLNIYNKNSDDRH 405 >gi|490418711|ref|WP_004291034.1| hypothetical protein [Bacteroides eggerthii] gi|217986638|gb|EEC52972.1| hypothetical protein BACEGG_02723 [Bacteroides eggerthii DSM 20697] Length=368 Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 98/329 (30%), Positives = 149/329 (45%), Gaps = 63/329 (19%) Query 7 ASGLFGGLGSVISGAI----GAKTTADTNKTNLKIAQMNNDFNAR--------------- 47 A+ + G +GS I GA TT N+ N +IAQMNN FN + Sbjct 3 AAAMTGIVGSAIGAGTSLIGGASTTHMQNQANKEIAQMNNAFNEKMFDKQIAYNKEMYQT 62 Query 48 ---------EAQKA---------RDFQLDMWNKENEYNSASSQRERLENAGYNPY--MNE 87 + QKA + +Q +MWNK+NEYN S+QR RLE AG NPY MN Sbjct 63 QLGDQWKFYDDQKANAWKLYEDNKAYQTEMWNKQNEYNDPSAQRARLEAAGLNPYMMMNG 122 Query 88 aqagtatgmsgtsaasaagaaPQ---------IPYTPDFQSVGVNLASALKMMSEKKKTD 138 AG A +SGT ++ + +P PY+ D+ V L A+ + + + Sbjct 123 GSAGVAGSVSGTQGSAPSAGSPSAQGVQPPTATPYSADYSGVMQGLGHAIDTIMTGSQRN 182 Query 139 IENLNMSDLLRSQIWQNLGATDWRNASPEA-RAYNLSQG---RRAAELGMASLEENLS-N 193 I+N +D LR + G A E + YN ++ R A + ++S++++LS + Sbjct 183 IQNAQ-ADNLRIE-----GKYIASKAIAELYKTYNEAKNDDERVAIQRVLSSIQKDLSAS 236 Query 194 QRWSNNLLVANIANSLLDADTKTILN----KYLDQQQQAELNVKAANYEYLVMSGQLKRQ 249 Q NN V I A T+ +L K+L +Q+ +L + AA+ L + Sbjct 237 QVAVNNENVRQIQAQTKIAVTENLLREQQLKFLPYEQRTQLALGAADIALKYAQKNLTEK 296 Query 250 EVNNLIAEEIETYARANGYNLQNRILRET 278 + + I + ET RANG +QN+ ET Sbjct 297 QARHEIEKLAETIVRANGQAMQNQYDAET 325 >gi|494822881|ref|WP_007558289.1| hypothetical protein [Bacteroides plebeius] gi|198272097|gb|EDY96366.1| hypothetical protein BACPLE_00802 [Bacteroides plebeius DSM 17135] Length=344 Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 33/56 (59%), Positives = 45/56 (80%), Gaps = 0/56 (0%) Query 30 TNKTNLKIAQMNNDFNAREAQKARDFQLDMWNKENEYNSASSQRERLENAGYNPYM 85 TN+ N++IAQM+N++N + ++ + + DMWN ENEYNSASSQR+RLE AG NPYM Sbjct 43 TNQANIQIAQMSNEYNREQLERQIEQEWDMWNAENEYNSASSQRKRLEEAGLNPYM 98 >gi|496050831|ref|WP_008775338.1| predicted protein [Bacteroides sp. 2_2_4] gi|229448895|gb|EEO54686.1| hypothetical protein BSCG_01611 [Bacteroides sp. 2_2_4] Length=381 Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 80/283 (28%), Positives = 125/283 (44%), Gaps = 45/283 (16%) Query 37 IAQMNNDFNAREAQKARDFQL----------------------DMWNKENEYNSASSQRE 74 IAQMNN+FN R QK D+ DM+N NEYNSAS+QRE Sbjct 41 IAQMNNEFNERMLQKQMDYNTLAYDQQVSDQWSFYNDAKQNAWDMFNATNEYNSASAQRE 100 Query 75 RLENAGYNPYMNEaqagtatgmsgtsaasaagaaPQI------PYTPDFQSVGVNLASAL 128 R E AG NPY+ T + ++ ++ A I PY+ D+ + L A+ Sbjct 101 RYEAAGLNPYVMMNTGSAGTAAATSATSATAPTKQGITPPTASPYSADYSGIMQGLGQAI 160 Query 129 KMMS---EKKKTDIENLNMSDLLRSQIWQNLGATDWRNASPEARAYNLSQGRRAAELGMA 185 +S +K KT E N+ + + + + R A+ +A ++ + +L M Sbjct 161 DQLSSIPDKAKTIAETGNLKIEGKYKAAEAIA----RIANIKADTHSKKEQVALNKL-MY 215 Query 186 SLEENLSNQRWSNNLLVANIANSLLDADTKTILN-------KYLDQQQQAELNVKAANYE 238 S++++L++ + N NIAN + K I ++D Q+ EL KAAN + Sbjct 216 SIQKDLASSTMAVN--SQNIANMRAEEKFKNIQTLIADKQLSFMDATQKMELAEKAANIQ 273 Query 239 YLVMSGQLKRQEVNNLIAEEIETYARANGYNLQNRILRETSDG 281 + G L R + + I + ET AR N Q + E + G Sbjct 274 LKLAQGALTRNQAAHEIKKISETEARTTLINEQTSLTIEQNTG 316 >gi|575094344|emb|CDL65728.1| unnamed protein product [uncultured bacterium] Length=368 Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 32/72 (44%), Positives = 44/72 (61%), Gaps = 5/72 (7%) Query 13 GLGSVISGAIGAKTTADTNKTNLKIAQMNNDFNAREAQKARDFQLDMWNKENEYNSASSQ 72 GLG ++ G G ++ + QM ++ ++EAQK RDFQLDMWN+ NEYN Q Sbjct 26 GLG-IVDGVAG--MFGQSSDQRFALGQM--EWQSQEAQKQRDFQLDMWNRNNEYNKPDEQ 80 Query 73 RERLENAGYNPY 84 +RLE AG NP+ Sbjct 81 MKRLEEAGINPW 92 >gi|575094358|emb|CDL65740.1| unnamed protein product [uncultured bacterium] Length=328 Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 78/300 (26%), Positives = 142/300 (47%), Gaps = 33/300 (11%) Query 6 FASGLFGGLGSVISGAIGAKTTAD-TNKTNLKIAQMNNDFNAREAQKARDFQLDMWNKEN 64 A G +G+ + AI K D TN TN++IAQMNN+++ R +K + +MW K Sbjct 1 MAVGTMIDIGAGLYTAISNKNAVDNTNSTNMQIAQMNNEWSERMMEKQMAYNTEMWEKVA 60 Query 65 EYNSASSQRERLENAGYNPYMNEaqagtatgmsgtsaasaagaaPQIPYTP---DFQSV- 120 +YNS ++ ++ +AG NPYM + + + ++ + + + Q+ P DF SV Sbjct 61 DYNSLPNKMQQARDAGVNPYMALSGNAFGSISAPSANSVSLPSPSQVQAQPAQYDFSSVS 120 Query 121 -----GVNLASALKMMSEKK-----KTD---IENLNMSDLLRSQIWQNLGATDWRNASPE 167 G++L ++M ++ TD IEN + L S+I + + T + Sbjct 121 NSIIAGMDLFQKAQLMKSQQSNIDASTDQLRIENKYHAMKLVSEIAEKMANTK----DSQ 176 Query 168 ARAYNLSQGRRAAELGMAS-LE---ENLSNQRWSNNLLVANIANSLLDADTKTILNKYLD 223 A+A AE G+ + LE + LSN + + LV L +A T L ++ Sbjct 177 AKAVYQQIINEYAEQGIKTDLEIKNQTLSNMKETFRGLV------LENAMTSEQL-RFFP 229 Query 224 QQQQAELNVKAANYEYLVMSGQLKRQEVNNLIAEEIETYARANGYNLQNRILRETSDGLI 283 +Q +A+L + A+ + +L +Q++ I + +T + G + N ILR+ + ++ Sbjct 230 EQVRAQLGLTASQILLNQSNSKLSQQKMVESIYNQWKTDSERQGIQINNSILRKAAKDIV 289 >gi|575094319|emb|CDL65706.1| unnamed protein product [uncultured bacterium] Length=396 Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 37/91 (41%), Positives = 46/91 (51%), Gaps = 22/91 (24%) Query 6 FASGLFGGLGSVISGAI---GAKTTA--------DTNKTNLKIAQMNNDFNAREAQKARD 54 +GL G GS+I+G G+K A +TN+ N +IA NN FN R Sbjct 24 LGAGLIAGAGSLINGLFSSNGSKQAAKYQLQAVRETNQANREIADQNNKFNER------- 76 Query 55 FQLDMWNKENEYNSASSQRERLENAGYNPYM 85 MWN +NEYN QR RLE AG NPY+ Sbjct 77 ----MWNLQNEYNRPDMQRARLEAAGLNPYL 103 >gi|492501772|ref|WP_005867312.1| hypothetical protein [Parabacteroides distasonis] gi|409230405|gb|EKN23269.1| hypothetical protein HMPREF1059_03254 [Parabacteroides distasonis CL09T03C24] Length=288 Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 27/63 (43%), Positives = 38/63 (60%), Gaps = 0/63 (0%) Query 21 AIGAKTTADTNKTNLKIAQMNNDFNAREAQKARDFQLDMWNKENEYNSASSQRERLENAG 80 A+ K DTNK N++IA+ + +E +KA L+MWN +NEYNS + Q R+ AG Sbjct 22 AMNNKAVQDTNKANMEIAKYQAQWQQQENEKAYQRSLNMWNLQNEYNSPTQQMARIRAAG 81 Query 81 YNP 83 NP Sbjct 82 LNP 84 >gi|649555290|gb|KDS61827.1| hypothetical protein M095_3809 [Parabacteroides distasonis str. 3999B T(B) 4] gi|649557306|gb|KDS63785.1| hypothetical protein M095_3404 [Parabacteroides distasonis str. 3999B T(B) 4] gi|649559158|gb|KDS65545.1| hypothetical protein M096_4689 [Parabacteroides distasonis str. 3999B T(B) 6] gi|649560567|gb|KDS66875.1| hypothetical protein M095_2448 [Parabacteroides distasonis str. 3999B T(B) 4] gi|649561016|gb|KDS67303.1| hypothetical protein M095_2410 [Parabacteroides distasonis str. 3999B T(B) 4] gi|649562727|gb|KDS68911.1| hypothetical protein M096_3341 [Parabacteroides distasonis str. 3999B T(B) 6] Length=288 Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 27/63 (43%), Positives = 37/63 (59%), Gaps = 0/63 (0%) Query 21 AIGAKTTADTNKTNLKIAQMNNDFNAREAQKARDFQLDMWNKENEYNSASSQRERLENAG 80 A+ K DTNK N++IA+ + +E +KA L MWN +NEYNS + Q R+ AG Sbjct 22 AMNNKAVQDTNKANMEIAKYQAQWQQQENEKAYQRSLKMWNLQNEYNSPTQQMARIRAAG 81 Query 81 YNP 83 NP Sbjct 82 LNP 84 >gi|639237431|ref|WP_024568108.1| hypothetical protein [Elizabethkingia anophelis] Length=287 Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 23/44 (52%), Positives = 31/44 (70%), Gaps = 0/44 (0%) Query 40 MNNDFNAREAQKARDFQLDMWNKENEYNSASSQRERLENAGYNP 83 MNN N + A++ R F LDMWN+ NEYN+ +Q +RL+ AG NP Sbjct 18 MNNSSNKKIARENRAFALDMWNRNNEYNTPLAQMQRLKEAGLNP 61 Lambda K H a alpha 0.312 0.127 0.357 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 2349684375798