BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-12_CDS_annotation_glimmer3.pl_2_5
Length=623

                                                                     Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|575094321|emb|CDL65708.1|  unnamed protein product                 753    0.0
gi|490418709|ref|WP_004291032.1|  hypothetical protein                298    1e-88
gi|575094354|emb|CDL65742.1|  unnamed protein product                 284    4e-83
gi|547226430|ref|WP_021963493.1|  putative uncharacterized protein    275    8e-80
gi|496050829|ref|WP_008775336.1|  hypothetical protein                264    6e-76
gi|494822885|ref|WP_007558293.1|  hypothetical protein                227    6e-62
gi|647452987|ref|WP_025792807.1|  hypothetical protein                166    8e-41
gi|517172762|ref|WP_018361580.1|  hypothetical protein                162    1e-39
gi|494308783|ref|WP_007173938.1|  hypothetical protein                161    2e-39
gi|496521299|ref|WP_009229582.1|  capsid protein                      157    7e-38

>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642

 Score = 753 bits (1945),  Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/655 (61%), Positives = 469/655 (72%), Gaps = 45/655 (7%) Query 1 MANRSNIMGLHGLKNKTSRNSFDLSHRNLFTAKVGELLPCFVQEVNPGDSIKLDSSYFTR 60 MANRSNIMGLHGLKNK SRNSFDLSHRN+FTAKVGELLPCFVQE+NPGDS+K+ SSYFTR Sbjct 1 MANRSNIMGLHGLKNKPSRNSFDLSHRNMFTAKVGELLPCFVQELNPGDSVKVSSSYFTR 60 Query 61 TAPLQTAAFTRLRENVQYFFVPYQCLWKYFEGQVKNMTKNANGGDISQIATSPFANAKVS 120 TAPLQ+ AFTRLRENVQYFFVPY LWKYF+ QV NMTKNANGGDIS+IA+S N KV+ Sbjct 61 TAPLQSNAFTRLRENVQYFFVPYSALWKYFDSQVLNMTKNANGGDISRIASSLVGNQKVT 120 Query 121 TEMPFISYTALHAYLNKLLNY----VDSSANPTELSNPFLYNNGCWRHAESAKLLQLLGY 176 T+MP ++Y LHAYL K +N D S P +N GC+RHAESAKLLQLLGY Sbjct 121 TQMPCVNYKTLHAYLLKFINRSTVGSDGSVGPE-------FNRGCYRHAESAKLLQLLGY 173 Query 177 GNFVQQFKNFS--------ASKPYSLLHVENAPALSVFRLLAYQKICNDFYTYRQWQPYN 228 GNF +QF NF + + + + N+P LS+FRLLAY KICND Y YRQWQPYN Sbjct 174 GNFPEQFANFKVNNDKHNQSGQNFKDVTYNNSPYLSIFRLLAYHKICNDHYLYRQWQPYN 233 Query 229 ASLCNIDYITPDssssmdlsskfssisVSDLG--KSNMLDMRFSNLPLDYFNGVLPTPQF 286 ASLCN+DY+TP+SSS + + SI + K N+LDMRFSNLPLDYF GVLPT QF Sbjct 234 ASLCNVDYLTPNSSSLLSIDDALLSIPDDSIKAEKLNLLDMRFSNLPLDYFTGVLPTSQF 293 Query 287 GSESVVSL-------SQNADVYTGFDKSQWQTLDG--safpsgsvsssnsdrsltANGKS 337 GSESVV+L S + T D +W+T G + S++ + + +NG Sbjct 294 GSESVVNLNLGNASGSAVLNGTTSKDSGRWRTTTGEWEMEQRVASSANGNLKLDNSNGTF 353 Query 338 IEHVHILPSG-----SITSSLSIAALRQATALQKYKEIQLANDPDFESQIEAHFGIKPKH 392 I H H S++ +LSI ALR A A QKYKEIQLAND DF+SQ+EAHFGIKP Sbjct 354 ISHDHTFSGNVAINTSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQVEAHFGIKPDE 413 Query 393 DMHKSRFIGGSSSMIDINPVVNQNLGAGQNQDNQAVTKAAPTGQGGASFKFTADTFGVVI 452 S FIGGSSSMI+IN +NQNL DN+A AAP G G AS KFTA T+GVVI Sbjct 414 KNENSLFIGGSSSMININEQINQNLSG----DNKATYGAAPQGNGSASIKFTAKTYGVVI 469 Query 453 GIYRCTPVLDYSHVGIDRTLLKTDASDFVIPELDSIGMQQTFQCELFAPT---SQMTA-S 508 GIYRCTPVLD++H+GIDRTL KTDASDFVIPE+DSIGMQQTF+CE+ AP + A Sbjct 470 GIYRCTPVLDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEVAAPAPYNDEFKAFR 529 Query 509 APDKRKYDMSRTFGYAPRYSEYKVSFDRYNGAFCDTLKSWVTGFNTHIFDSDRWNDRSYF 568 D DMS T+GYAPRYSE+K S+DRYNGAFC +LKSWVTG N ++ WN ++ 
Sbjct 530 VGDGSSPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGINFDAIQNNVWN--TWA 587 Query 569 SISVPQLFVCRPDIVKDIFALQTYHDSNDDNLYVGMVNMCYATRNLSRYGLPYSN 623 I+ P +F CRPDIVK++F + + ++S+DD LYVGMVNMCYATRNLSRYGLPYSN Sbjct 588 GINAPNMFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLSRYGLPYSN 642 >gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii] gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 20697] Length=578 Score = 298 bits (762), Expect = 1e-88, Method: Compositional matrix adjust. Identities = 215/631 (34%), Positives = 302/631 (48%), Gaps = 68/631 (11%) Query 5 SNIMGLHGLKNKTSRNSFDLSHRNLFTAKVGELLPCFVQEVNPGDSIKLDSSYFTRTAPL 64 +NIM L ++NK SRN FDLS + FTAK GELLP V+EV PGD+ K++ FTRT P+ Sbjct 2 ANIMSLKSIRNKPSRNGFDLSFKKNFTAKAGELLPVMVKEVLPGDTFKINLKAFTRTQPV 61 Query 65 QTAAFTRLRENVQYFFVPYQCLWKYFEGQVKNMTKNANGGDISQIATSPFANAKVSTEMP 124 TAAF R+RE +FFVPY LW + M N ++ P N +S EMP Sbjct 62 NTAAFARIREYYDFFFVPYDLLWNKANTVLTQMYDNPQHA----VSIDPTRNFVLSGEMP 117 Query 125 FISYTALHAYLNKLLNYVDSSANPTELSNPFLYNNGCWRHAESAKLLQLLGYGNFVQQFK 184 +++ A+ +Y+N L +SA SN F YN R S KLL+ LGYGN+ Sbjct 118 YMTSEAIASYINALST---ASALADYKSNYFGYN----RSKSSVKLLEYLGYGNYESFLT 170 Query 185 NFSASKPY--SLLHVENAPALSVFRLLAYQKICNDFYTYRQWQPYNASLCNIDYITPDss 242 + + P +L H ++F LLAYQKI +DFY QW+ + S N+DY+ S Sbjct 171 DDWNTAPLMANLNH-------NIFGLLAYQKIYSDFYRDSQWERVSPSTFNVDYLDGSSM 223 Query 243 ssmdlsskfssisVSDLGKSNMLDMRFSNLPLDYFNGVLPTPQFGSESVVSLSQNADVYT 302 + + S N D+R+ N D F+GVLP Q+G +V S++ DV Sbjct 224 NLDNAYSTEFYQ------NYNFFDLRYCNWQKDLFHGVLPHQQYGETAVASIT--PDVTG 275 Query 303 GFDKSQWQTLDGsafpsgsvsssnsdrsltANGKSIEHVHILPSGSITSSLSIAALRQAT 362 S + T+ TA+G + ++ LP+ LSI LRQA Sbjct 276 KLTLSNFSTV--------------GTSPTTASGTATKN---LPAFDTVGDLSILVLRQAE 318 Query 363 ALQKYKEIQLANDPDFESQIEAHFGIKPKHDMHK-SRFIGGSSSMIDINPVVNQNLGAGQ 421 LQK+KEI + + D++ Q+E H+G+ + ++GG SS IDIN V+N N+ Sbjct 319 FLQKWKEITQSGNKDYKDQLEKHWGVSVGDGFSELCTYLGGVSSSIDINEVINTNITGSA 378 Query 422 
NQDNQAVTKAAPTGQGGASFKFTADTFGVVIGIYRCTPVLDYSHVGIDRTLLKTDASDFV 481 D K G +F + +G+++ IY C P+LDY+ +D LK +++D+ Sbjct 379 AAD--IAGKGVGVANGEINFN-SNGRYGLIMCIYHCLPLLDYTTDMLDPAFLKVNSTDYA 435 Query 482 IPELDSIGMQQTFQCELFAPTSQMTASAPDKRKYDMSRTFGYAPRYSEYKVSFDRYNGAF 541 IPE D +GMQ +L P ++ GY PRY +YK S D+ G F Sbjct 436 IPEFDRVGMQSMPLVQLMNPLRSFANAS--------GLVLGYVPRYIDYKTSVDQSVGGF 487 Query 542 CDTLKSWVTGF-NTHIFDSDRW-NDRSYFSISVP---------QLFVCRPDIVKDIFALQ 590 TL SWV + N + ND S P F PD + IFA+Q Sbjct 488 KRTLNSWVISYGNISVLKQVTLPNDAPPIEPSEPVPSVAPMNFTFFKVNPDCLDPIFAVQ 547 Query 591 TYHDSNDDNLYVGMVNMCYATRNLSRYGLPY 621 D+N D A RNL GLPY Sbjct 548 AGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY 578 >gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium] Length=615 Score = 284 bits (727), Expect = 4e-83, Method: Compositional matrix adjust. Identities = 216/662 (33%), Positives = 310/662 (47%), Gaps = 95/662 (14%) Query 8 MGLHGLKNKTSRNSFDLSHRNLFTAKVGELLPCFVQEVNPGDSIKLDSSYFTRTAPLQTA 67 M + +KN+ SRN FDLS + FTAK GELLP + V PGDS ++ FTRT PL T+ Sbjct 1 MSMADIKNRPSRNGFDLSFKKNFTAKAGELLPVMTKVVLPGDSFNINLRSFTRTQPLNTS 60 Query 68 AFTRLRENVQYFFVPYQCLWKYFEGQVKNMTKNANGGDISQIATSPFA--NAKVSTEMPF 125 AF R+RE ++FVP++ +W F+ + M N Q A+ P N +S MP+ Sbjct 61 AFARMREYYDFYFVPFEQMWNKFDSCITQMNANV------QHASGPTLDDNTPLSGRMPY 114 Query 126 ISYTALHAYLNKLLNYVDSSANPTELSNPFLYNNGCWRHAESAKLLQLLGYGNF--VQQF 183 + + YLN NPF +N R + KLLQ LGYG++ Sbjct 115 FTSEQIADYLNDQAT--------AARKNPFGFN----RSTLTCKLLQYLGYGDYNSFDSE 162 Query 184 KNFSASKPYSLLHVENAPALSVFRLLAYQKICNDFYTYRQWQPYNASLCNIDYITPDsss 243 N ++KP L ++E LS F LLAYQKI +DFY Y QW+ N S N+DYI Sbjct 163 TNTWSAKPL-LYNLE----LSPFPLLAYQKIYSDFYRYTQWEKTNPSTFNLDYI-----K 212 Query 244 smdlsskfssisVSDLGKSNMLDMRFSNLPLDYFNGVLPTPQFGSESVVSLSQNADVYTG 303 + SD +N D+R+ N D F+GVLP Q+GS SVV ++ +V + Sbjct 213 GTSDLQMDLTGLPSD--DNNFFDIRYCNYQKDMFHGVLPVAQYGSASVVPINGQLNVISN 270 Query 304 FDK---------------SQWQTLDGsafpsgsvsssnsdrsltANGKSIE-HVHILPSG 347 D + + T+ G+ + GKS + + PS Sbjct 271 
GDSGPIFKTSTPDPGTPGTSYVTVGGNIGVDNRSFGVSGSTLNV--GKSADPSGYGFPSN 328 Query 348 SITSSL-----------------SIAALRQATALQKYKEIQLANDPDFESQIEAHFGIKP 390 + T SL I ALRQA LQK+KE+ ++ + D++SQIE H+GIK Sbjct 329 ASTRSLLWENPNLIIENNQGFYVPILALRQAEFLQKWKEVSVSGEEDYKSQIEKHWGIKV 388 Query 391 KHDM-HKSRFIGGSSSMIDINPVVNQNLGAGQNQDNQAVTKAAPTGQGGASFKFTAD-TF 448 + H++R++GG ++ +DIN V+N N+ DN A T G S +F + + Sbjct 389 SDFLSHQARYLGGCATSLDINEVINNNITG----DNAADIAGKGTFTGNGSIRFESKGEY 444 Query 449 GVVIGIYRCTPVLDYSHVGIDRTLLKTDASDFVIPELDSIGMQQTFQCELFAPTSQMTAS 508 G+++ IY P++DY G+D + DA+ F IPELD IGM+ P + Sbjct 445 GIIMCIYHVLPIVDYVGSGVDHSCTLVDATSFPIPELDQIGMESVPLVRAMNPVKESDTP 504 Query 509 APDKRKYDMSRTF-GYAPRYSEYKVSFDRYNGAFCDTLKSW--------VTGFNTHIFDS 559 + D TF GYAPRY ++K S DR G F D+L++W +T N+ F S Sbjct 505 SAD--------TFLGYAPRYIDWKTSVDRSVGDFADSLRTWCLPVGDKELTSANSLNFPS 556 Query 560 DRWNDRSYFSISVPQLFVCRPDIVKDIFALQTYHDSNDDNLYVGMVNMCYATRNLSRYGL 619 + + + F P IV +FA+ D RNL GL Sbjct 557 NPNVEPDSIAAG---FFKVNPSIVDPLFAVVADSTVKTDEFLCSSFFDVKVVRNLDVNGL 613 Query 620 PY 621 PY Sbjct 614 PY 615 >gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185] gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185] Length=573 Score = 275 bits (702), Expect = 8e-80, Method: Compositional matrix adjust. 
Identities = 202/623 (32%), Positives = 292/623 (47%), Gaps = 57/623 (9%) Query 5 SNIMGLHGLKNKTSRNSFDLSHRNLFTAKVGELLPCFVQEVNPGDSIKLDSSYFTRTAPL 64 S++M L LKN RN FDLS +N FTAKVGELLP +EV PGD + FTRT P+ Sbjct 2 SSVMSLTALKNSVKRNGFDLSFKNAFTAKVGELLPIMCKEVYPGDKFNIRGQAFTRTQPV 61 Query 65 QTAAFTRLRENVQYFFVPYQCLWKYFEGQVKNMTKNANGGDISQIATSPFANAKVSTEMP 124 +AA++RLRE ++FVPY+ LW NM + D+ ++ +S P Sbjct 62 NSAAYSRLREYYDFYFVPYRLLWNMAPTFFTNMPDPHHAADL-------VSSVNLSQRHP 114 Query 125 FISYTALHAYLNKLLNYVDSSANPTELSNPFLYNNGCWRHAESAKLLQLLGYGNFVQQFK 184 + ++ + YL L + S A N F G R S KLL L YG F + ++ Sbjct 115 WFTFFDIMEYLGNLNSL--SGAYEKYQKNFF----GFSRVELSVKLLNYLNYG-FGKDYE 167 Query 185 NFSASKPYSLLHVENAPALSVFRLLAYQKICNDFYTYRQWQPYNASLCNIDYITPDssss 244 + + LS F LLAYQKIC D++ QWQ N+DY+ Sbjct 168 SVKVPSD------SDDIVLSPFPLLAYQKICEDYFRDDQWQSAAPYRYNLDYL----YGK 217 Query 245 mdlsskfssisVSDLGKS-NMLDMRFSNLPLDYFNGVLPTPQFGSESVVSLSQNADVYTG 303 S +D K+ M D+ + N DYF G+LP Q+G SV S ++ Sbjct 218 SSGFHIPMSSFTNDAFKNPTMFDLNYCNFQKDYFTGMLPRAQYGDVSVAS-----PIFGD 272 Query 304 FDKSQWQTLDGsafpsgsvsssnsdrsltANGKSIEHVHILPSGSITSSLSIAALRQATA 363 D +L + S AN + + + + T+ LS+ ALRQA Sbjct 273 LDIGDSSSL-----------TFASAPQQGANTIQSGVLVVNNNSNTTAGLSVLALRQAEC 321 Query 364 LQKYKEIQLANDPDFESQIEAHFGIKPKHDMH-KSRFIGGSSSMIDINPVVNQNLGAGQN 422 LQK++EI + D+++Q++ HF + P + +++GG +S +DI+ VVN NL Sbjct 322 LQKWREIAQSGKMDYQTQMQKHFNVSPSATLSGHCKYLGGWTSNLDISEVVNTNLTG--- 378 Query 423 QDNQAVTKAAPTGQ-GGASFKFTADTFGVVIGIYRCTPVLDYSHVGIDRTLLKTDASDFV 481 DNQA + TG G F + G+++ IY C P+LD+S I R KT +D+ Sbjct 379 -DNQADIQGKGTGTLNGNKVDFESSEHGIIMCIYHCLPLLDWSINRIARQNFKTTFTDYA 437 Query 482 IPELDSIGMQQTFQCELFAPTSQMTASAPDKRKYDMSRTFGYAPRYSEYKVSFDRYNGAF 541 IPE DS+GMQQ + E+ + + S GY PRY++ K S D +G+F Sbjct 438 IPEFDSVGMQQLYPSEMIFGLEDLPSDPS-------SINMGYVPRYADLKTSIDEIHGSF 490 Query 542 CDTLKSWVTGFNTHIFDSDR--WNDRSYFSISVP-QLFVCRPDIVKDIFALQTYHDSNDD 598 DTL SWV+ + R D + I++ F P IV +IF ++ N D Sbjct 491 IDTLVSWVSPLTDSYISAYRQACKDAGFSDITMTYNFFKVNPHIVDNIFGVKADSTINTD 550 Query 599 
NLYVGMVNMCYATRNLSRYGLPY 621 L + A RN GLPY Sbjct 551 QLLINSYFDIKAVRNFDYNGLPY 573 >gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4] gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4] Length=580 Score = 264 bits (675), Expect = 6e-76, Method: Compositional matrix adjust. Identities = 207/625 (33%), Positives = 306/625 (49%), Gaps = 54/625 (9%) Query 5 SNIMGLHGLKNKTSRNSFDLSHRNLFTAKVGELLPCFVQEVNPGDSIKLDSSYFTRTAPL 64 +NIM L L+NKTSRN FDLS + FTAK GELLP EV PGD +D FTRT PL Sbjct 2 ANIMSLKSLRNKTSRNGFDLSSKRNFTAKPGELLPVKCWEVLPGDKWSIDLKSFTRTQPL 61 Query 65 QTAAFTRLRENVQYFFVPYQCLWKYFEGQVKNMTKNANGGDISQIATS--PFANAKVSTE 122 TAAF R+RE ++FVPY LW + M N Q ATS P AN ++ Sbjct 62 NTAAFARMREYYDFYFVPYNLLWNKANTVLTQMYDNP------QHATSYIPSANQALAGV 115 Query 123 MPFISYTALHAYLNKLLNYVDSSANPTELSNPFLYNNGCWRHAESAKLLQLLGYGNFVQQ 182 MP ++ + YLN L D + + N F Y+ R +AKLL+ LGYGNF Sbjct 116 MPNVTCKGIADYLN--LVAPDVTTTNSYEKNYFGYS----RSLGTAKLLEYLGYGNFYTY 169 Query 183 FKNFSASKPYSLLHVENAPALSVFRLLAYQKICNDFYTYRQWQPYNASLCNIDYITPDss 242 S + ++ + + L+++ +LAYQKI D QW+ + S N+DY++ Sbjct 170 AT--SKNNTWTKSPLSSNLQLNIYGVLAYQKIYADHIRDSQWEKVSPSCFNVDYLSGTVD 227 Query 243 ssmdlsskfssisVSDLGKSNMLDMRFSNLPLDYFNGVLPTPQFGSESVVSLSQNADVYT 302 S+M + S + + NM D+R+ N D F+GVLP Q+G + V+++ ++V + Sbjct 228 SAMTIDSMITGQGFAPF--YNMFDLRYCNWQKDLFHGVLPRQQYGDTAAVNVNL-SNVLS 284 Query 303 GFDKSQWQTLDGsafpsgsvsssnsdrsltANGKSIEHVHILPSGSITSSLSIAALRQAT 362 + QT DG ++ G +++ V+ SG+ T + ALRQA Sbjct 285 A--QYMVQTPDG---------DPVGGSPFSSTGVNLQTVN--GSGTFT----VLALRQAE 327 Query 363 ALQKYKEIQLANDPDFESQIEAHFGIKPKHDMHK-SRFIGGSSSMIDINPVVNQNLGAGQ 421 LQK+KEI + + D++ QIE H+ + + S ++GG+++ +DIN VVN N+ Sbjct 328 FLQKWKEITQSGNKDYKDQIEKHWNVSVGEAYSEMSLYLGGTTASLDINEVVNNNITGSN 387 Query 422 NQDNQAVTKAAPTGQGGASFKFTADTFGVVIGIYRCTPVLDYSHVGIDRTLLKTDASDFV 481 D K G G SF + +G+++ IY P+LDY+ ++ K +++DF Sbjct 388 AAD--IAGKGVVVGNGRISFD-AGERYGLIMCIYHSLPLLDYTTDLVNPAFTKINSTDFA 444 Query 482 
IPELDSIGMQQTFQCELFAPTSQMTASAPDKRKYDM-SRTFGYAPRYSEYKVSFDRYNGA 540 IPE D +GM+ L P + Y++ S GYAPRY YK D GA Sbjct 445 IPEFDRVGMESVPLVSLMNPL---------QSSYNVGSSILGYAPRYISYKTDVDSSVGA 495 Query 541 FCDTLKSWVTGF-NTHIFDSDRWND---RSYFSISVPQLFVCRPDIVKDIFALQTYHDSN 596 F TLKSWV + N + + + D S ++ F P+ V +FA+ + + Sbjct 496 FKTTLKSWVMSYDNQSVINQLNYQDDPNNSPGTLVNYTNFKVNPNCVDPLFAVAASNSID 555 Query 597 DDNLYVGMVNMCYATRNLSRYGLPY 621 D RNL GLPY Sbjct 556 TDQFLCSSFFDVKVVRNLDTDGLPY 580 >gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius] gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 17135] Length=613 Score = 227 bits (578), Expect = 6e-62, Method: Compositional matrix adjust. Identities = 197/644 (31%), Positives = 294/644 (46%), Gaps = 66/644 (10%) Query 5 SNIMGLHGLKNKTSRNSFDLSHRNLFTAKVGELLPCFVQEVNPGDSIKLDSSYFTRTAPL 64 +NIM + ++NK +R +DL+ + FTAK G L+P + V P D + F RT PL Sbjct 9 ANIMSMKSVRNKPTRAGYDLTQKINFTAKAGSLIPVWWTPVLPFDDLNATVKSFVRTQPL 68 Query 65 QTAAFTRLRENVQYFFVPYQCLWKYFEGQVKNMTKN---ANGGDISQIATSPFANAKVST 121 TAAF R+R ++FVP++ +W F + M N A+G ++ N +S Sbjct 69 NTAAFARMRGYFDFYFVPFRQMWNKFPTAITQMRTNLLHASGPVLAD-------NVPLSD 121 Query 122 EMPFISYTALHAYLNKLLNYVDSSANPTELSNPFLYNNGCWRHAESAKLLQLLGYGNFVQ 181 E+P+ + + Y+ L + N F G +R +L+ LGYG+F Sbjct 122 ELPYFTAEQVADYIVSL----------ADSKNQF----GYYRAWLVCIILEYLGYGDFYP 167 Query 182 QFKNFSASK--PYSLLHVENAPALSVFRLLAYQKICNDFYTYRQWQPYNASLCNIDYITP 239 + + ++ + N S F L AYQKI DF Y QW+ N S NIDYI+ Sbjct 168 YIVEAAGGEGATWATRPMLNNLKFSPFPLFAYQKIYADFNRYTQWERSNPSTFNIDYIS- 226 Query 240 DssssmdlsskfssisVSDLGKS-NMLDMRFSNLPLDYFNGVLPTPQFGSESVVSLSQNA 298 S +V S N+ DMR+SN D +G +P Q+G S V +S + Sbjct 227 -----GSADSLQLDFTVEGFKDSFNLFDMRYSNWQRDLLHGTIPQAQYGEASAVPVSGSM 281 Query 299 DVYTGFDKSQWQT-LDGsafpsgsvsssnsdrsltANGKSIEHVHILPSGSITSSL---- 353 V G + T DG AF +G+V+ S L A S+ IL + S L Sbjct 282 QVVEGPTPPAFTTGQDGVAFLNGNVTIQGSSGYLQAQ-TSVGESRILRFNNTNSGLIVEG 340 Query 354 
------SIAALRQATALQKYKEIQLANDPDFESQIEAHFGI---KPKHDMHKSRFIGGSS 404 SI ALR+A A QK+KE+ LA++ D+ SQIEAH+G K DM +++G + Sbjct 341 DSSFGVSILALRRAEAAQKWKEVALASEEDYPSQIEAHWGQSVNKAYSDM--CQWLGSIN 398 Query 405 SMIDINPVVNQNLGAGQNQDNQAVTKAAPTGQGGASFKFTADTFGVVIGIYRCTPVLDYS 464 + IN VVN N+ G+N + A K +G G +F +G+V+ ++ P LDY Sbjct 399 IDLSINEVVNNNI-TGENAADIA-GKGTMSGNGSINFN-VGGQYGIVMCVFHVLPQLDYI 455 Query 465 HVGIDRTLLKTDASDFVIPELDSIGMQQTFQCELFAPTSQMTASAPDKRKYDMSRT--FG 522 T+ DF IPE D IGM+Q P P + +S FG Sbjct 456 TSAPHFGTTLTNVLDFPIPEFDKIGMEQVPVIRGLNPVK------PKDGDFKVSPNLYFG 509 Query 523 YAPRYSEYKVSFDRYNGAFCDTLKSWVTGFNTHIF---DSDRWNDRSYFSISVPQ--LFV 577 YAP+Y +K + D+ G F +LK+W+ F+ DS + D + F Sbjct 510 YAPQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLAADSVDFPDNPNVEADSVKAGFFK 569 Query 578 CRPDIVKDIFALQTYHDSNDDNLYVGMVNMCYATRNLSRYGLPY 621 P ++ ++FA++ D N D + R+L GLPY Sbjct 570 VSPSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSLDPNGLPY 613 >gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola] Length=584 Score = 166 bits (420), Expect = 8e-41, Method: Compositional matrix adjust. 
Identities = 159/574 (28%), Positives = 242/574 (42%), Gaps = 112/574 (20%) Query 14 KNKTSRNSFDLSHRNLFTAKVGELLPCFVQEVNPGDSIKLDSSYFTRTAPLQTAAFTRLR 73 K + +RN FDLS R +F+AK G+LLP EVNP + K RT L TA++ R++ Sbjct 5 KPRLARNGFDLSSRRIFSAKAGQLLPIGCWEVNPSEHFKFSVQDLVRTTTLNTASYARMK 64 Query 74 ENVQYFFVPYQCLWKYFE--------------GQVKNMTKNANGGDISQIATSPFANAKV 119 E +FFV Y+ LW++F+ G KN T N N ++ Sbjct 65 EYYHFFFVSYRSLWQWFDQFIVGTNNPHSALNGVKKNGTTNYN---------------QI 109 Query 120 STEMPFISYTALHAYLNKLLNYVDSSANPTELSNPFLYNNGCWRHAESAKLLQLLGYG-- 177 + +P L KL+ + +S ++ F Y+ G +AKLL +L YG Sbjct 110 CSSVPTFD-------LGKLITRLKTSDMDSQ---GFNYSEG------AAKLLNMLNYGVT 153 Query 178 --NFVQQFKNFSASKPYSLLHVENAPA------LSVFRLLAYQKICNDFYTYRQWQPYNA 229 +N S Y + P+ +S FRLLAYQKI NDFY + W P + Sbjct 154 NKGKFMNLENLITSTSYLPSKDDKEPSSIYACKVSPFRLLAYQKIFNDFYRNQDWTPSDV 213 Query 230 SLCNIDYITPDssssmdlsskfssisVSDLGKSNMLDMRFSNLPLDYFNGVLPTPQFGSE 289 N+D DS+ +++ MR+ D+ + PTP + S+ Sbjct 214 RSFNVDDYADDSNLTIEPDVAL-----------KFCQMRYRPYAKDWLTSMKPTPNY-SD 261 Query 290 SVVSLSQ----NADVYTGFDKSQWQTLDGsafpsgsvsssnsdrsltANGKSIEHVHILP 345 + +L + N +V +KS +LD Sbjct 262 GIFNLPEYVRGNGNVILTNNKSGSVSLD-------------------------------- 289 Query 346 SGSIT-SSLSIAALRQATALQKYKE-IQLANDPDFESQIEAHFGIK-PKHDMHKSRFIGG 402 SG+++ SS S+ LR A AL K E + AN D+ SQIEAHFG K P+ + +RF+GG Sbjct 290 SGTVSPSSFSVNDLRAAFALDKMLEATRRANGLDYASQIEAHFGFKVPESRANDARFLGG 349 Query 403 SSSMIDINPVVNQNLGAGQNQDNQAVTKAAPTGQGGAS---FKFTADTFGVVIGIYRCTP 459 + I ++ VV+ N A + + ++ G G S +F + G+++ IY P Sbjct 350 FDNSIVVSEVVSTNGNAASDGSHASIGDLGGKGIGSMSSGTIEFDSTEHGIIMCIYSVAP 409 Query 460 VLDYSHVGIDRTLLKTDASDFVIPELDSIGMQQTFQCELFAPTSQMTASAPDKRKYDMSR 519 +Y+ +D K F PE +G Q +L T M +++ Sbjct 410 QSEYNASYLDPFNRKLTREQFYQPEFADLGYQALIGSDLICSTLGMNEKQAGFSDIELNN 469 Query 520 T-FGYAPRYSEYKVSFDRYNGAF--CDTLKSWVT 550 GY RY+EYK + D G F +L W T Sbjct 470 NLLGYQVRYNEYKTARDLVFGDFESGKSLSYWCT 503 >gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis] Length=568 Score = 162 bits (410), Expect 
= 1e-39, Method: Compositional matrix adjust. Identities = 161/613 (26%), Positives = 256/613 (42%), Gaps = 75/613 (12%) Query 19 RNSFDLSHRNLFTAKVGELLPCFVQEVNPGDSIKLDSSYFTRTAPLQTAAFTRLRENVQY 78 RN+FD+S R+LFTA G LLP ++ P D +++++S F RT P+ +AAF +R ++ Sbjct 18 RNAFDISQRHLFTAPAGALLPVLSLDLLPHDHVEINASDFMRTLPMNSAAFMSMRGVYEF 77 Query 79 FFVPYQCLWKYFEGQVKNMTKNANGGDISQIATSPFANAKVSTEMPFISYTALHAYLNKL 138 +FVPY+ LW F+ + M S +S K T +S+ + KL Sbjct 78 YFVPYKQLWSGFDQFITGM---------SDYKSSFMYAFKGKTPPSCVSFD-----VQKL 123 Query 139 LNYVDSSANPTELSNPFLYNNGCWRHAESAKLLQLLGYGNFVQQFKNFSASKPYSLLHVE 198 +++ + N + + F N G +R +L LLGYG + SA PY+ Sbjct 124 VDWCKT--NTAKDIHGFDKNKGVYR------ILDLLGYGKYAN-----SAGVPYTNPTST 170 Query 199 NAPALSVFRLLAYQKICNDFYTYRQWQPYNASLCNIDYITPDssssmdlsskfssisVSD 258 + FR LAYQKI NDFY ++ Y N+D S K ++ Sbjct 171 TMGKCTPFRGLAYQKIYNDFYRNTTYEEYQLESFNVDMF--------YGSGKVKETIPNE 222 Query 259 LGKSNMLDMRFSNLPLDYFNGVLPTPQFGSESVVSLSQNADVYTGFDKSQWQTLDGsafp 318 + +R+ N D V PTP F + N +TG GS Sbjct 223 PWDYDWFTLRYRNAQKDLLTNVRPTPLFSIDDF-----NPQFFTG----------GSDIV 267 Query 319 sgsvsssnsdrsltANGKSIEHVHILPSG--SITSSLSIAALRQATALQKYKEIQLANDP 376 + + I ++ +G S + +S+A +R A AL+K + + Sbjct 268 MEKGPNVTGGTHEYRDSVVIVGKNLKENGVDSKRTMISVADIRNAFALEKLASVTMRAGK 327 Query 377 DFESQIEAHFGIKPKHDMH-KSRFIGGSSSMIDINPVVNQNLGAGQNQDNQAV------T 429 ++ Q+EAHFGI + + +IGG S I + V + + + T Sbjct 328 TYKEQMEAHFGISVEEGRDGRCTYIGGFDSNIQVGDVTQSSGTTVTGTKDTSFGGYLGRT 387 Query 430 KAAPTGQGGASFKFTADTFGVVIGIYRCTPVLDYSHVGIDRTLLKTDASDFVIPELDSIG 489 TG G +F A G+++ IY P + Y +D + K + DF +PE +++G Sbjct 388 TGKATGSGSGHIRFDAKEHGILMCIYSLVPDVQYDSKRVDPFVQKIERGDFFVPEFENLG 447 Query 490 MQQTFQCELFAPTSQMTASAPDKRKYDMSRTFGYAPRYSEYKVSFDRYNGAFC--DTLKS 547 MQ F + + TA++ K FG+ PRYSEYK + D +G F + L Sbjct 448 MQPLFAKNISYKYNNNTANSRIKN----LGAFGWQPRYSEYKTALDINHGQFVHQEPLSY 503 Query 548 WVTGFNTHIFDSDRWNDRSYFSISVPQLFVCRPDIVKDIFALQTYHDSNDDNLYVGMVNM 607 W R S F+IS F P + D+FA+ D ++ G Sbjct 504 WTVA-------RARGESMSNFNIST---FKINPKWLDDVFAVNYNGTELTDQVFGGCYFN 553 Query 
608 CYATRNLSRYGLP 620 ++S G+P Sbjct 554 IVKVSDMSIDGMP 566 >gi|494308783|ref|WP_007173938.1| hypothetical protein [Prevotella bergensis] gi|270333035|gb|EFA43821.1| putative capsid protein (F protein) [Prevotella bergensis DSM 17361] Length=553 Score = 161 bits (408), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 159/619 (26%), Positives = 258/619 (42%), Gaps = 99/619 (16%) Query 18 SRNSFDLSHRNLFTAKVGELLPCFVQEVNPGDSIKLDSSYFTRTAPLQTAAFTRLRENVQ 77 +RN+FDLS R+LFTA G LLP ++ P D +++++ F RT P+ TAAF +R + Sbjct 16 NRNAFDLSQRHLFTAHAGMLLPVLNLDLIPHDHVEINAQDFMRTLPMNTAAFASMRGVYE 75 Query 78 YFFVPYQCLWKYFEGQVKNMTKNANGGDIS-QIATSPFANAKVSTEMPFISYTALHAYLN 136 +FFVPY LW F+ + M + + S Q TSP ++P+ + ++ LN Sbjct 76 FFFVPYHQLWAQFDQFITGMNDFHSSANKSIQGGTSPL-------QVPYFNVDSVFNSLN 128 Query 137 KLLNYVDSSANPTELSNPFLYNNGCWRHAESAKLLQLLGYGNFVQQFKNFSASKPYSLLH 196 S + +L F Y G +R LL LLGYG ++F +F + P ++ Sbjct 129 T--GKESGSGSTDDLQYKFKY--GAFR------LLDLLGYG---RKFDSFGTAYPDNVSG 175 Query 197 VENAPA--LSVFRLLAYQKICNDFYTYRQWQPYNASLCNIDYITPDssssmdlsskfssi 254 ++N SVFR+LAY KI D+Y ++ ++ N D + Sbjct 176 LKNNLDYNCSVFRILAYNKIYQDYYRNSNYENFDTDSFNFDKFK---------GGLVDAK 226 Query 255 sVSDLGKSNMLDMRFSNLPLDYFNGVLPTPQFGSESVVSLSQNADVYTGFDKSQWQTLDG 314 V+DL K +R+ N DYF + + F + N ++ Sbjct 227 VVADLFK-----LRYRNAQTDYFTNLRQSQLFSFTTAFEDVDNINI-------------- 267 Query 315 safpsgsvsssnsdrsltANGKSIEHVHI-LPSGSITSSLSIAALRQATALQKYKEIQLA 373 + ++G + V+ + + S S+++LR A A+ K + + Sbjct 268 -----------APRDYVKSDGSNFTRVNFGVDTDSSEGDFSVSSLRAAFAVDKLLSVTMR 316 Query 374 NDPDFESQIEAHFGIK-PKHDMHKSRFIGGSSSMIDINPVVNQNLGAGQNQDNQA----V 428 F+ Q+ AH+G++ P + ++GG S + ++ V + +A Sbjct 317 AGKTFQDQMRAHYGVEIPDSRDGRVNYLGGFDSDMQVSDVTQTSGTTATEYKPEAGYLGR 376 Query 429 TKAAPTGQGGASFKFTADTFGVVIGIYRCTPVLDYSHVGIDRTLLKTDASDFVIPELDSI 488 TG G F A GV++ IY P + Y +D + K D D+ PE +++ Sbjct 377 VAGKGTGSGRGRIVFDAKEHGVLMCIYSLVPQIQYDCTRLDPMVDKLDRFDYFTPEFENL 436 Query 489 GMQQTFQCELFAPTSQMTASAPDKRKYDMSRTFGYAPRYSEYKVSFDRYNGAFC--DTLK 546 GMQ + S + P + GY PRYSEYK 
+ D +G F D L Sbjct 437 GMQPLNSSYI----SSFCTTDP------KNPVLGYQPRYSEYKTALDVNHGQFAQSDALS 486 Query 547 SW-VTGFNTHIFDSDRWNDRSYFSISVPQL----FVCRPDIVKDIFALQTYHDSNDDNLY 601 SW V+ F RW + PQL F P + IF + +D +Y Sbjct 487 SWSVSRFR-------RW-------TTFPQLEIADFKIDPGCLNSIFPVDYNGTEANDCVY 532 Query 602 VGMVNMCYATRNLSRYGLP 620 G ++S G+P Sbjct 533 GGCNFNIVKVSDMSVDGMP 551 >gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317] gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon 317 str. F0108] Length=541 Score = 157 bits (396), Expect = 7e-38, Method: Compositional matrix adjust. Identities = 147/544 (27%), Positives = 233/544 (43%), Gaps = 99/544 (18%) Query 19 RNSFDLSHRNLFTAKVGELLPCFVQEVNPGDSIKLDSSYFTRTAPLQTAAFTRLRENVQY 78 R++FDLS ++L+TA G LLP ++ D I++ + F RT P+ +AAF +R ++ Sbjct 18 RSAFDLSQKHLYTAPAGALLPVLSVDLMFHDHIRIQAQDFMRTMPMNSAAFISMRGVYEF 77 Query 79 FFVPYQCLWKYFEGQVKNMTKNANGGDISQIATSPFANAKVSTEMPFISYTALHAYLNKL 138 FFVPY LW ++ + +M D S A K +P + ++ ++ + Sbjct 78 FFVPYSQLWHPYDQFITSMN------DYRSSVVSSAAGDKALDSVPNVKLADMYKFVRER 131 Query 139 LNYVDSSANPTELSNPFLYNNGCWRHAESAKLLQLLGYGNFVQQFKNFSASKPYSLLHVE 198 + D P NN C +L+ LLGYG + S+ P LL+ Sbjct 132 TD-KDIFGYPHS-------NNSC-------RLMDLLGYGKPIT-----SSKTPVPLLYTG 171 Query 199 NAPALSVFRLLAYQKICNDFYTYRQWQPYNASLCNIDYITPDssssmdlsskfssisVSD 258 N +++FRLLAY KI +D+Y ++ + NID+ K + + +D Sbjct 172 N---VNLFRLLAYNKIYSDYYRNTTYEGVDVYSFNIDH------------KKGTFVPTAD 216 Query 259 LGKSNMLDMRFSNLPLDYFNGVLPTPQF--GSESVVSLSQNADVYTGFDKSQWQTLDGsa 316 K L++ + N PLD++ + PTP F GS+S S+ Q +D TG Sbjct 217 EFK-KYLNLHYRNAPLDFYTNLRPTPLFTIGSDSFSSVLQLSDP-TG------------- 261 Query 317 fpsgsvsssnsdrsltANGKSIEHVHILPSGSITSSLSIAALRQATALQKYKEIQLANDP 376 +A+G S + P L+++A+R A AL K I + Sbjct 262 -----------SAGFSADGNSAKLNMASP-----DVLNVSAIRSAFALDKLLSISMRAGK 305 Query 377 DFESQIEAHFGIKPKHDMH-KSRFIGGSSSMIDINPVVNQNLGAGQNQDNQAVTKAA--- 432 + QIEAHFG+ + ++GG S + + V + N K A Sbjct 306 
TYAEQIEAHFGVTVSEGRDGQVYYLGGFDSNVQVGDVTQTSGTTNPNVSEVGNAKLAGYL 365 Query 433 ------PTGQGGASFKFTADTFGVVIGIYRCTPVLDYSHVGIDRTLLKTDASDFVIPELD 486 TG G +F A GV++ IY P + Y + +D + K D+ IPE + Sbjct 366 GKITGKGTGSGYGEIQFDAKEPGVLMCIYSVVPAMQYDCMRLDPFVAKQTRGDYFIPEFE 425 Query 487 SIGMQQTFQCELFAPTSQMTASAPDKRKYDMSRTFGYAPRYSEYKVSFDRYNGAFC--DT 544 ++GMQ P A D ++G+ PRYSEYK +FD +G F + Sbjct 426 NLGMQP------IVPAFVSLNRAKDN-------SYGWQPRYSEYKTAFDINHGQFANGEP 472 Query 545 LKSW 548 L W Sbjct 473 LSYW 476 Lambda K H a alpha 0.319 0.134 0.404 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 4697304392193
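
The gapped Lambda and K values in the statistics footer link each raw score (the number in parentheses after "bits") to its bit score through the standard Karlin-Altschul normalization, S' = (lambda * S - ln K) / ln 2. The following is a minimal sketch of that conversion, not part of the BLAST output itself. It assumes the rounded footer values (Lambda = 0.267, K = 0.0410) are the ones that apply to these alignments, and the function name `raw_to_bits` is hypothetical. Because the footer prints only three significant digits, the computed values should land within about one bit of the reported ones rather than match exactly.

```python
import math

# Gapped Karlin-Altschul parameters as printed (rounded) in the report footer.
# Assumption: these are the parameters behind the reported bit scores.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def raw_to_bits(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Normalize a raw alignment score: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, bit score) pairs copied from three alignment headers in the report.
reported = [(1945, 753), (762, 298), (396, 157)]

for raw, bits in reported:
    print(f"raw={raw:5d}  computed bits={raw_to_bits(raw):7.1f}  reported bits={bits}")
```

Bit scores are comparable across searches that use different scoring systems, which is why the hit table lists them rather than the raw scores.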