bitscore colors: <40, 40-50 , 50-80, 80-200, >200
BLASTP 2.2.30+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 49,011,213 sequences; 17,563,301,199 total letters Query= Contig-6_CDS_annotation_glimmer3.pl_2_3 Length=382 Score E Sequences producing significant alignments: (Bits) Value gi|575094324|emb|CDL65715.1| unnamed protein product 97.4 2e-19 gi|639237431|ref|WP_024568108.1| hypothetical protein 61.2 2e-07 gi|494610273|ref|WP_007368519.1| hypothetical protein 62.0 3e-07 gi|547920047|ref|WP_022322418.1| putative uncharacterized protein 49.7 0.001 gi|575094372|emb|CDL65753.1| unnamed protein product 48.5 0.005 gi|575094301|emb|CDL65691.1| unnamed protein product 47.4 0.011 gi|649555290|gb|KDS61827.1| hypothetical protein M095_3809 47.0 0.012 gi|492501772|ref|WP_005867312.1| hypothetical protein 46.6 0.017 gi|647452992|ref|WP_025792810.1| hypothetical protein 43.5 0.23 gi|575094659|emb|CDL66002.1| unnamed protein product 42.0 0.31 >gi|575094324|emb|CDL65715.1| unnamed protein product [uncultured bacterium] Length=370 Score = 97.4 bits (241), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 87/316 (28%), Positives = 155/316 (49%), Gaps = 39/316 (12%) Query 58 KFAREERLAQQQWIEQMYE-----------KNNSYNSPAAQMQRLKDAGLNPDLMYSRGD 106 K REE + W ++M E +YN+P+A M+RLKDAGLNPDLMY G Sbjct 32 KAQREENEKARNWQQKMAEWQVGIERENLADERAYNNPSAVMKRLKDAGLNPDLMYGSGA 91 Query 107 VGNatapeapaqaptpRYNVIPTNTYGQ---TAQIAADAGLKAAQAR-LANSESKKTDT- 161 G ++ NV P + G T + AA A+ +A +++ K DT Sbjct 92 SG---LVDSNVAGSASVGNVPPADVAGPIMGTPTMMESLFQGAAYAKTVAETKNIKADTS 148 Query 162 ----EESLLTADYLLRKARTESDIELNNSTIYVNHELGQLNHAEAEVAAKKLQEIDVAMS 217 E + L D ++ A +++ I+L+ + QL A+AE +++ ++ ++ Sbjct 149 KKEGEVTSLNIDNFVKAASSDNAIKLSGVEV-------QLTKAQAEYTSEQKTKLISEIN 201 Query 218 EARERISTMKAQQSQ-------IDENMVQLKFDRYLRSKEFELLCKRTYQDMKESNSRIN 270 + E ++ +KAQ S+ +D + V + L ++ F+L C+ + ++E+++++N Sbjct 202 DINEHVNLLKAQISETWARTSNLDSSTVLNRTTAILNNRRFDLECEEFARRVRETDAKVN 261 Query 271 LNAAEVQDMMATQLARVMNLNASTYMQKKQGILASEQ-TMTELYKQTGIDISNQHAKFNF 329 L+ AE + ++ T A+V N++A T +++ L Q T E Y + IDI A F Sbjct 262 LSEAEAKSILVTMYAKVNNIDADTALKQANIRLTDAQKTQVEHYTNS-IDIHRDAAVFKL 320 Query 330 DQAKNWDSTERFTNVA 345 Q + +D +R +VA Sbjct 321 QQDQKYDDAQRIVSVA 336 >gi|639237431|ref|WP_024568108.1| hypothetical protein [Elizabethkingia anophelis] Length=287 Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 29/52 (56%), Positives = 36/52 (69%), Gaps = 4/52 (8%) Query 58 KFAREERLAQQQWIEQMYEKNNSYNSPAAQMQRLKDAGLNPDLMYSRGDVGN 109 K ARE R + M+ +NN YN+P AQMQRLK+AGLNP+LMY +G GN Sbjct 25 KIARENRA----FALDMWNRNNEYNTPLAQMQRLKEAGLNPNLMYGQGTTGN 72 >gi|494610273|ref|WP_007368519.1| hypothetical protein [Prevotella multiformis] gi|324988545|gb|EGC20508.1| hypothetical protein HMPREF9141_0987 [Prevotella multiformis DSM 16608] Length=437 Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 93/347 (27%), Positives = 157/347 (45%), Gaps = 41/347 (12%) Query 40 FGNRSNRKASREAFERESKFAREERLAQQQWI-------EQMYEKNNSYNSPAAQMQRLK 92 FG SN+ A A E + ARE Q Q E+M+ + N YNSPAAQMQR Sbjct 84 FGGHSNKTAQDRANETNLQIAREANQNQYQMFQEQNAFNERMWNQMNQYNSPAAQMQRYT 143 Query 93 DAGLNPDLMYSRGDVGNatapeapaqaptpRY-NVIPTNTYGQTAQIA----ADAGLKAA 147 DAG+NP + GNA + A AP V+P G Q + + + A Sbjct 144 DAGINPYIAAGNVQTGNAQSALQSAPAPQQHVAQVMPATGMGDAVQNSFAQIGNVISQFA 203 Query 148 QARLANSESKKTDTEESLLTADYL----LRKARTES-DIELNNSTIYVNHELGQLNHAEA 202 Q +LA +++KKTD E S + D L + K E+ +I NS + +++++ ++ Sbjct 204 QNQLALAQAKKTDAEASWI--DRLNSAQMGKLGAETLNIHNQNSLLGLDYQI----KSDT 257 Query 203 EVAAKKLQEIDVAMSEARERISTMKAQQSQIDENMVQLKFDRYLRSKEFELLCKRTYQDM 262 K L ++ V + + + +++ + ++ + + ++++K E K+ Q++ Sbjct 258 LGNYKLLSDLSVQQAALTNNLVDAQTRKALFESDLAMV--ESHIKAKYGE---KQVLQEI 312 Query 263 KESNSRINLN--AAEVQDMMATQLARVMNLNASTYMQKKQGILASEQTMTELYKQTGIDI 320 ES SR LN +A + + T +M A T ++K+ GI T ++ +D Sbjct 313 SESVSRQYLNYVSAHNSEKLTTAQCNLMFEQAKTEVEKRFGIRLDNDTSRKISNFV-VDS 371 Query 321 SNQHAKFNFDQAK---------NWDSTERFTNVATTWINS-VSFAVG 357 + +AK + +QA+ D N WI+S V FA G Sbjct 372 YHSNAKIDANQARISFFDALKAGKDDKHYNANNWIRWISSFVPFASG 418 >gi|547920047|ref|WP_022322418.1| putative uncharacterized protein [Parabacteroides merdae CAG:48] gi|524592959|emb|CDD13571.1| putative uncharacterized protein [Parabacteroides merdae CAG:48] Length=259 Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust. Identities = 21/37 (57%), Positives = 26/37 (70%), Gaps = 0/37 (0%) Query 73 QMYEKNNSYNSPAAQMQRLKDAGLNPDLMYSRGDVGN 109 +M+ N YNSP AQM RL+ AGLNP+L+Y G GN Sbjct 24 EMWNMQNQYNSPTAQMSRLRQAGLNPNLVYGSGVTGN 60 >gi|575094372|emb|CDL65753.1| unnamed protein product [uncultured bacterium] Length=385 Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust. Identities = 52/225 (23%), Positives = 104/225 (46%), Gaps = 46/225 (20%) Query 74 MYEKNNSYNSPAAQMQRLKDAGLNPDLMYSRGDVGNatapeapaqaptpRYNVIPTNTYG 133 +YE++ S+NSP+ Q++ + DAGLNP+ +S G + +P+ + Sbjct 88 LYERSLSWNSPSNQLKMMADAGLNPN-NFSNGVTSAPS---------------VPSGSAA 131 Query 134 QTAQIAADAGLKAAQARLAN--------------SESKKTDTE----ESLLTADYLLRKA 175 + ++ A + + N +E+KKT++E ++L + LLR Sbjct 132 SGSALSGPAASASGPIAMQNPFNFEAVTQSIKNLAEAKKTESETKQVDALTATENLLRDG 191 Query 176 RTESDIELNNSTIYVNHELGQLNHAEAEVAAKKLQEIDVAMSEARERISTMKAQQ----S 231 + ++L NS I +N + +E + K+L +DV++ A++ I A + + Sbjct 192 K----VQLQNSEISLNIVDAHMRKSEMDKIGKELIALDVSIDAAKQSIRNSIASEAYTKA 247 Query 232 QIDENMVQLKFDRYLRSKEFELLCKRTYQDMKESNSRINLNAAEV 276 D +++ + R + K L +R ++KES SRI N +++ Sbjct 248 MADYQVLKSEEQRKINEK----LAERLRLELKESQSRIAKNYSDI 288 >gi|575094301|emb|CDL65691.1| unnamed protein product [uncultured bacterium] Length=437 Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust. Identities = 23/59 (39%), Positives = 33/59 (56%), Gaps = 0/59 (0%) Query 40 FGNRSNRKASREAFERESKFAREERLAQQQWIEQMYEKNNSYNSPAAQMQRLKDAGLNP 98 +G S +K R E F +E Q+ + QM++ N YN+P AQ QRL+ AG+NP Sbjct 35 WGGSSQQKFERHQAEDARNFTHQENALQRDFARQMWKDTNDYNTPIAQKQRLEQAGMNP 93 >gi|649555290|gb|KDS61827.1| hypothetical protein M095_3809 [Parabacteroides distasonis str. 3999B T(B) 4] gi|649557306|gb|KDS63785.1| hypothetical protein M095_3404 [Parabacteroides distasonis str. 3999B T(B) 4] gi|649559158|gb|KDS65545.1| hypothetical protein M096_4689 [Parabacteroides distasonis str. 3999B T(B) 6] gi|649560567|gb|KDS66875.1| hypothetical protein M095_2448 [Parabacteroides distasonis str. 3999B T(B) 4] gi|649561016|gb|KDS67303.1| hypothetical protein M095_2410 [Parabacteroides distasonis str. 3999B T(B) 4] gi|649562727|gb|KDS68911.1| hypothetical protein M096_3341 [Parabacteroides distasonis str. 3999B T(B) 6] Length=288 Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust. Identities = 27/72 (38%), Positives = 41/72 (57%), Gaps = 7/72 (10%) Query 45 NRKASREAFERESKFAREERLAQQQWIEQMYEKN-------NSYNSPAAQMQRLKDAGLN 97 N KA ++ + + A+ + QQQ E+ Y+++ N YNSP QM R++ AGLN Sbjct 24 NNKAVQDTNKANMEIAKYQAQWQQQENEKAYQRSLKMWNLQNEYNSPTQQMARIRAAGLN 83 Query 98 PDLMYSRGDVGN 109 P+L+Y G GN Sbjct 84 PNLVYGNGVTGN 95 >gi|492501772|ref|WP_005867312.1| hypothetical protein [Parabacteroides distasonis] gi|409230405|gb|EKN23269.1| hypothetical protein HMPREF1059_03254 [Parabacteroides distasonis CL09T03C24] Length=288 Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust. Identities = 27/72 (38%), Positives = 41/72 (57%), Gaps = 7/72 (10%) Query 45 NRKASREAFERESKFAREERLAQQQWIEQMYEKN-------NSYNSPAAQMQRLKDAGLN 97 N KA ++ + + A+ + QQQ E+ Y+++ N YNSP QM R++ AGLN Sbjct 24 NNKAVQDTNKANMEIAKYQAQWQQQENEKAYQRSLNMWNLQNEYNSPTQQMARIRAAGLN 83 Query 98 PDLMYSRGDVGN 109 P+L+Y G GN Sbjct 84 PNLVYGNGVTGN 95 >gi|647452992|ref|WP_025792810.1| hypothetical protein [Prevotella histicola] Length=424 Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust. Identities = 23/68 (34%), Positives = 39/68 (57%), Gaps = 4/68 (6%) Query 42 NRSNRKASREAFERESKFAREERLAQQQWIEQMYEKNNSYNSPAAQMQRLKDAGLNPDLM 101 N++N + +RE + + +E + E+MY + N+YN+P+AQMQR +AG+NP + Sbjct 30 NQTNLQIARETNQMNYQLFQESN----AFNEKMYHEANAYNTPSAQMQRYAEAGINPYIA 85 Query 102 YSRGDVGN 109 GN Sbjct 86 AGNVQTGN 93 >gi|575094659|emb|CDL66002.1| unnamed protein product [uncultured bacterium] Length=204 Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust. Identities = 17/29 (59%), Positives = 22/29 (76%), Gaps = 0/29 (0%) Query 79 NSYNSPAAQMQRLKDAGLNPDLMYSRGDV 107 N YN+P QM+RL+ AGLNP+L+Y G V Sbjct 39 NDYNNPINQMKRLQAAGLNPNLVYGSGSV 67 Lambda K H a alpha 0.312 0.124 0.340 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 2369095665768