bitscore colors: <40, 40-50, 50-80, 80-200, >200
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-13_CDS_annotation_glimmer3.pl_2_4

Length=438

                                                             Score     E
Sequences producing significant alignments:                  (Bits)  Value

gi|547226431|ref|WP_021963494.1|  predicted protein           102     2e-20
gi|496050828|ref|WP_008775335.1|  hypothetical protein        102     2e-20
gi|490418708|ref|WP_004291031.1|  hypothetical protein        98.6    3e-19
gi|575094340|emb|CDL65724.1|  unnamed protein product         79.7    8e-13
gi|494822887|ref|WP_007558295.1|  hypothetical protein        78.2    3e-12
gi|575094322|emb|CDL65709.1|  unnamed protein product         77.0    7e-12
gi|647452984|ref|WP_025792805.1|  hypothetical protein        75.1    3e-11
gi|565841285|ref|WP_023924566.1|  hypothetical protein        72.8    1e-10
gi|494610270|ref|WP_007368516.1|  hypothetical protein        71.2    4e-10
gi|546189465|ref|WP_021825245.1|  hypothetical protein        65.1    4e-08

>gi|547226431|ref|WP_021963494.1| predicted protein [Prevotella sp. CAG:1185]
 gi|524103383|emb|CCY83995.1| predicted protein [Prevotella sp. CAG:1185]
Length=498

 Score = 102 bits (255),  Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/141 (43%), Positives = 79/141 (56%), Gaps = 10/141 (7%) Query 74 AFPQFKGLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFF 133 A G L Y + RD QLF KR+RK LSK EKI Y+VSEY PKTFR H+H+LFF Sbjct 122 AKCNLDGYLSYTSKRDAQLFLKRVRKNLSKYSD--EKIRYYIVSEYGPKTFRAHYHVLFF 179 Query 134 FDSDEIAKNFRQAVYQSWRLGRVDTQLAREQANSYVANYLNSVVSIPFVYKAKKSIRPRS 193 +D + K + + Q+W+ GRVD L+R + NSYVA Y+N +P + S +P S Sbjct 180 YDEVKTQKVMSKVIRQAWQFGRVDCSLSRGKCNSYVARYVNCNYCLP-RFLGDMSTKPFS 238 Query 194 RFSNLFGY-------EEVKKG 207 S F EE+ KG Sbjct 239 CHSIRFALGIHQSQKEEIYKG 259 >gi|496050828|ref|WP_008775335.1| hypothetical protein [Bacteroides sp. 2_2_4] gi|229448892|gb|EEO54683.1| hypothetical protein BSCG_01608 [Bacteroides sp. 2_2_4] Length=497 Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 105/395 (27%), Positives = 167/395 (42%), Gaps = 51/395 (13%) Query 80 GLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEI 139 G + Y+ D QLF KRLR Y++K+ EK+ + V EY P FRPH+H+L F SDE Sbjct 115 GDVPYLRKTDLQLFLKRLRYYVTKQKPS-EKVRYFAVGEYGPVHFRPHYHLLLFLQSDEA 173 Query 140 AKNFRQAVYQSWRLGRVDTQLAREQANSYVANYLNSVVSIPFVYKAKKSIRPRSRFSNLF 199 + + + ++W GRVD Q+++ Q ++YVA+Y+NS +IP V+KA S+ P F Sbjct 174 LQICSENISKAWTFGRVDCQVSKGQCSNYVASYVNSSCTIPKVFKA-SSVCP-------F 225 Query 200 GYEEVKKGIQHASDKRSALFDGVP-------YISNQKFVRYVPSGSHIDRLFPRFTHYDG 252 K G +R ++ P + N K+ + S +PR + Sbjct 226 SVHSQKLGQGFLDCQREKIYSLTPENFIRSSIVLNGKYKEFDVWRSCYSFFYPRCKGFVT 285 Query 253 SFLRRSSQIYEVVQRVLRLFARNEPFKKATPRNVSEFVCWWCEYNFRRGCQIKDFPDYMQ 312 R + Y + LF P K T E + ++ + + D Y Sbjct 286 KSSRERAYSYSIYDTARLLF----PDAKTTFSLAKEIAIYIYYFHNPKETYLLDLYGYCS 341 Query 313 EFLHIVRLDGYSFLNWDVPI-----GKISRFFYR----------FNRFEAMKGSL---RS 354 + + L Y F + DV + G+ SR+ +R F F +L +S Sbjct 342 DQSKLYELSQY-FYDSDVLLHSFNSGEFSRYVHRIYTELLISKHFLYFVCTHNTLAERKS 400 Query 355 KLKAVSLFYDYRDYQSLKNQLSLQELVFAE--LGYSDELFDSFYVKPHIKVLKNAYID-- 410 K + + FY DY L Q+L + +G D D++ + N Y D Sbjct 401 
KQRLIEEFYSRLDYMHLTKFFEAQQLFYESDLIGDDDLCTDNWDNSYYPYFYNNVYTDTN 460 Query 411 --------KWKDVNYKEVHYFRVKHKVLNDENNIF 437 + + K++ R+KHK LND N +F Sbjct 461 LFEKTPVYRLYSSDVKKLFNDRIKHKKLNDANKVF 495 >gi|490418708|ref|WP_004291031.1| hypothetical protein [Bacteroides eggerthii] gi|217986635|gb|EEC52969.1| hypothetical protein BACEGG_02720 [Bacteroides eggerthii DSM 20697] Length=422 Score = 98.6 bits (244), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 104/395 (26%), Positives = 171/395 (43%), Gaps = 49/395 (12%) Query 80 GLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEI 139 G L Y+ D QLF KR R Y++K+ K EK+ + + EY P FRPH+HIL F SDE Sbjct 39 GYLPYLRKFDLQLFFKRFRYYVAKRFPK-EKVRYFAIGEYGPVHFRPHYHILLFLQSDEA 97 Query 140 AKNFRQAVYQSWRLGRVDTQLAREQANSYVANYLNSVVSIPFVYKAKKSIRPRSRFSNLF 199 + + V ++W GRVD QL++ + +SYVA Y+NS V +P V ++ P F Sbjct 98 LQVCSKVVSEAWPFGRVDCQLSKGKCSSYVAGYVNSSVLVPKVLTL-PTLCP-------F 149 Query 200 GYEEVKKGIQHASDKRSALFDGVP-------YISNQKFVRYVPSGSHIDRLFPRFTHYDG 252 K G +R+ ++ P + N ++ + S FP+ + Sbjct 150 CVHSQKLGQGFLQSERAKVYSLTPEQFVKRSIVINGRYKEFDVWRSAYAYFFPKCKGFAD 209 Query 253 SFLRRSSQIYEVVQRVLRLFARNEPFKKATPRNVSEFVCWWCEYNFRRGCQIKDFPDYMQ 312 R + Y + RLF P + T E V + ++ ++ D + Sbjct 210 KSSRERAYSYGLYDTARRLF----PSAETTFALAKEIVGYIYYFHNKKDTYCLDIFGEVS 265 Query 313 EFLHIVRLDGYSF----LNWDVPIGKISRFFYR----------FNRFEAMKGSL---RSK 355 + + + Y F +N+ + ++ R+ +R F F + +L + K Sbjct 266 DQSDLYQFSQYFFEPEIVNYSLDSIEMCRYVHRVYTELLLSKHFLYFVCDRPTLSEQKRK 325 Query 356 LKAVSLFYDYRDYQSLKNQLSLQELVFAE--LGYSDELFDS--------FYVKPHI--KV 403 LK + FY DY LK Q+L + +G D + D+ FY + +V Sbjct 326 LKLIEEFYSRLDYMHLKTFFENQQLFYESDLVGDLDLMSDAWENSYYPFFYDNVYFSSEV 385 Query 404 LKNAYIDKWKDVNYKEVHYFRVKHKVLNDENNIFL 438 K + + D+ ++ R+KHK LND N IF+ Sbjct 386 YKKTPVYRLYDMQISKLFSDRIKHKKLNDLNKIFV 420 >gi|575094340|emb|CDL65724.1| unnamed protein product [uncultured bacterium] Length=486 Score = 79.7 bits (195), Expect = 8e-13, Method: Compositional matrix adjust. 
Identities = 49/122 (40%), Positives = 73/122 (60%), Gaps = 12/122 (10%) Query 88 RDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEIA-KNFRQA 146 +D+ F KRLR L++ KI + SEY P T RPHFH +F+FDS ++ +FR A Sbjct 157 KDFVNFVKRLRINLTRNYNYEGKITYFKCSEYGPTTNRPHFHGIFWFDSRALSFDSFRSA 216 Query 147 VYQSWRLGRVDTQ-----LAREQANSYVANYLNSVVSIP--FVYKAKKSIRPRSRFSNLF 199 V +SW++ D Q +ARE A +YVA+Y+N + S+P F++K +RP+ S F Sbjct 217 VVESWKMCDKDKQYENVEIAREPA-TYVASYVNCLTSVPPLFLFKG---LRPKHSHSKGF 272 Query 200 GY 201 G+ Sbjct 273 GF 274 >gi|494822887|ref|WP_007558295.1| hypothetical protein [Bacteroides plebeius] gi|198272100|gb|EDY96369.1| hypothetical protein BACPLE_00805 [Bacteroides plebeius DSM 17135] Length=545 Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 59/184 (32%), Positives = 82/184 (45%), Gaps = 41/184 (22%) Query 55 MLPEIAESLKKKNNTDASGAFPQFKGLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSY 114 M P++ +K+ N + +KG Y++ R+ QLF KRLRKYL K G +KI + Sbjct 96 MTPQLMNEYQKRVNYRIN-----YKGRFPYLSKRELQLFMKRLRKYLDKYEG--QKIRFF 148 Query 115 VVSEYSPKTFRPHFHILFFFDS-----------------------------DEIAKNFRQ 145 EY P +FRPHFHIL F D + Sbjct 149 ATGEYGPLSFRPHFHILLFVDDPSLFLPSVHTLGEYPYPYWSKYQKAHCGKGTLLSKLEY 208 Query 146 AVYQSWRLGRVDTQ-LAREQANSYVANYLNSVVSIPFVYK--AKKSIRPRSRF--SNLFG 200 + +SW G +D Q + + +SYVA Y+NS V +P K A KS SRF +FG Sbjct 209 YIRESWPFGGIDAQSVEQGSCSSYVAGYVNSSVPLPSCLKVDAVKSFSQHSRFLGRKIFG 268 Query 201 YEEV 204 E + Sbjct 269 TELI 272 >gi|575094322|emb|CDL65709.1| unnamed protein product [uncultured bacterium] Length=499 Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust. 
Identities = 80/318 (25%), Positives = 130/318 (41%), Gaps = 59/318 (19%) Query 18 LRYPNFISKFRPFILRSIPRVSKLQNFKDEYFEELVWMLPEIAESLKKKNNTDASGAFPQ 77 LR +FIS F S L NF +++ +++ + + K + + G Sbjct 90 LRNDSFISDF----------CSDLHNFDNDFVDKMDYYSDYVINYESKYHKSCVYG---- 135 Query 78 FKGLLKYVNIRDYQLFAKRLRKYLSKKIGKYEKIHSYVVSEYSPKTFRPHFHILFFFDSD 137 GL + RD QLF KRLRK++ K G EKI Y++ EY K+ RPH+H L FF+S Sbjct 136 -HGLYALLYYRDIQLFLKRLRKHIYKYYG--EKIRFYIIGEYGTKSLRPHWHCLLFFNSS 192 Query 138 EIAKNFRQAVYQS---------------WRLGRVDTQLAREQANSYVANYLNSVVSIP-- 180 +++ F V W+ G D++ +A +YV++Y+N + P Sbjct 193 SLSQAFEDCVNVGTTSRPCSCPRFLRPFWQFGICDSKRTNGEAYNYVSSYVNQSANFPKL 252 Query 181 FVYKAKKSIRPRSRFSNLFGYEEVKKGIQHAS---DKRSALFDGVPYISNQKFVRYVPSG 237 V + + + + + + IQ +R D ++ R Sbjct 253 LVLLSNQKAYHSIQLGQILSEQSIVSAIQKGDFSFFERQFYLDTFGAANSYSVWR----- 307 Query 238 SHIDRLFPRFTHYDGSFLRRSSQI-YEVVQRVLRLFARNEPFKKATPRNVSEFVCWWCEY 296 S+ R FP+FT SSQ+ YE RVL + E + + +C Y Sbjct 308 SYYSRFFPKFTC--------SSQLTYEQTYRVLTCY---ETLRDLFDTDSVGVICRRLFY 356 Query 297 NFRRGCQIKDFPDYMQEF 314 ++ G +PDY F Sbjct 357 HYHFG-----YPDYHDIF 369 >gi|647452984|ref|WP_025792805.1| hypothetical protein [Prevotella histicola] Length=480 Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust. 
Identities = 41/138 (30%), Positives = 66/138 (48%), Gaps = 11/138 (8%) Query 88 RDYQLFAKRLRKYLSKKI---GKYEKIHSYVVSEYSPKTFRPHFHILFFFDSDEIAKNFR 144 +D Q F KRLR + K+ G +I ++ SEY P TFRPH+H + ++DS+ + Sbjct 125 KDVQDFFKRLRSKIDYKLKPRGNEYRIRYFICSEYGPNTFRPHYHAILWYDSEILHNELN 184 Query 145 QAVYQSWRLGRVDTQLAREQANSYVANYLNSVVSIPFVYKAKKSIRPRSRFSNLFGYEEV 204 + ++W+ G D L A+ YVA Y+N +P R+ F++ F Sbjct 185 VLIRETWKNGNTDFSLVNSSASQYVAKYVNGDCDLPSFL--------RTEFTSTFHLASK 236 Query 205 KKGIQHASDKRSALFDGV 222 I + D AL++ V Sbjct 237 HPCIGYGKDDEEALYENV 254 >gi|565841285|ref|WP_023924566.1| hypothetical protein [Prevotella nigrescens] gi|564729906|gb|ETD29850.1| hypothetical protein HMPREF1173_00032 [Prevotella nigrescens CC14M] Length=484 Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 64/237 (27%), Positives = 103/237 (43%), Gaps = 12/237 (5%) Query 51 ELVWMLPEIAESLKKKNNTDASGAFPQFKGLLKYVNIRDYQLFAKRLRKYLSKKIGKY-- 108 E+VW + + N D + Y D F KRLR LS K+ Sbjct 81 EMVWTSNRLCDEKVIVGNYDFIKVSNSDVQAVAYCCKSDIVKFFKRLRSKLSYYFKKHHI 140 Query 109 ---EKIHSYVVSEYSPKTFRPHFHILFFFDSDEIAKNFRQAVYQSWRLGRVDTQLAREQA 165 EKI +V SEY PKT RPH+H + +FDS+E+A+ + + SW G D + A Sbjct 141 ITNEKIRYFVCSEYGPKTLRPHYHAIIWFDSEEVARVIEKMLSSSWSNGFTDFEYVNSTA 200 Query 166 NSYVANYL--NSVVSIPFVYKAKKSIRPRSRFSNLFGY--EEVKKGIQHASDKRSALFDG 221 YVA Y+ NSV+ + A ++ +S+ ++ GY ++ +K + D F+ Sbjct 201 PQYVAKYVSGNSVLPEILQHDACRTFHLQSQAPSV-GYRSDDYEKFEKEVIDGCYGHFEY 259 Query 222 VPYISNQKFVRYVPSGSHIDRLFPRFTHYDGSFLRRSSQIYEVVQRVLRLFARNEPF 278 + FV+ P G+ R FP+ Y +IY + + ++ + P Sbjct 260 DSSSQSSVFVQ--PPGTLETRCFPKCREYRSLSRIEKLRIYAYKRDICSIYGIDTPI 314 >gi|494610270|ref|WP_007368516.1| hypothetical protein [Prevotella multiformis] gi|324988542|gb|EGC20505.1| hypothetical protein HMPREF9141_0984 [Prevotella multiformis DSM 16608] Length=479 Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust. 
Identities = 40/135 (30%), Positives = 71/135 (53%), Gaps = 7/135 (5%) Query 53 VWMLPEIAESLKKKNNTDASGAFPQ---FKGLLKYVNIRDYQLFAKRLRKYLSKKIGKYE 109 VW ++ES K +++ PQ + Y +D Q + KRLR + ++ K + Sbjct 84 VWFSNRLSESGKFLSDSVCRSLPPQKMEDEVCFAYPCKKDVQDWFKRLRSAVDYQLNKNK 143 Query 110 ----KIHSYVVSEYSPKTFRPHFHILFFFDSDEIAKNFRQAVYQSWRLGRVDTQLAREQA 165 +I ++ SEY P+TFRPH+H + ++DS+E+ +N + + ++W+ G L A Sbjct 144 SNEFRIRYFICSEYGPRTFRPHYHAILWYDSEELQRNIGRLIRETWKNGNSVFSLVNNSA 203 Query 166 NSYVANYLNSVVSIP 180 + YVA Y+N +P Sbjct 204 SQYVAKYVNGDTRLP 218 >gi|546189465|ref|WP_021825245.1| hypothetical protein [Prevotella salivae] gi|544001993|gb|ERK01417.1| hypothetical protein HMPREF9145_2741 [Prevotella salivae F0493] Length=586 Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 54/190 (28%), Positives = 89/190 (47%), Gaps = 15/190 (8%) Query 73 GAFPQFKGLLKYVNIRDYQLFAKRLRKYLS----KKIGKYEKIHSYVVSEYSPKTFRPHF 128 G+ P FK L ++ Y L+ + YL+ KK + + ++ SEY+P TFRPHF Sbjct 177 GSIP-FKEWLDDLDTETYDLYYSVYQYYLTDYEKKKESCKQSVRYFICSEYTPTTFRPHF 235 Query 129 HILFFFDSDEIAKNFRQAVYQSWRLG---RVDTQLAREQANSYVANYLNSVVSIPFVYKA 185 H LF+FD ++ + ++++W++ ++ Q A++YV+ Y+ ++P V +A Sbjct 236 HGLFWFDDEKAFSYAPRCIFKAWKMCAEININVQPVSGDASAYVSKYVTGNSNLPPVLQA 295 Query 186 KKSIRPRSRFSN--LFGYEEVKKGIQHASDKRSALFDGVPYISNQ---KFVRYVPSGSHI 240 KS R S GY+ R +F IS + V VPS S + Sbjct 296 -KSTRTFCLASKGPAIGYKSFSDKEVLEMFTRRCIFRSYETISKKGKLSGVSAVPS-SAV 353 Query 241 DRLFPRFTHY 250 R FP+ Y Sbjct 354 GRYFPKCYQY 363 Lambda K H a alpha 0.325 0.140 0.426 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 2916668506332