bitscore colors: <40, 40-50, 50-80, 80-200, >200
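The legend above defines five bit-score classes. A minimal sketch of how a score could be assigned to one of them follows. The function name `bitscore_bin` is hypothetical, and the handling of boundary values (a score of exactly 40, 50, 80 or 200 goes to the higher class) is an assumption, since the legend does not say which side the edges belong to.

```python
from bisect import bisect_right

# Bin edges and labels copied from the legend line.
# Assumption: a score equal to an edge falls into the higher class.
EDGES = [40, 50, 80, 200]
LABELS = ["<40", "40-50", "50-80", "80-200", ">200"]


def bitscore_bin(bits: float) -> str:
    """Return the legend class that a bit score falls into."""
    return LABELS[bisect_right(EDGES, bits)]


if __name__ == "__main__":
    # Bit scores taken from the hit list of this report.
    for score in (234, 157, 110):
        print(score, bitscore_bin(score))
```

With these edges, the five best hits of this report (234 down to 208 bits) fall into the top class and the remaining five (157 down to 110 bits) into the 80-200 class.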
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-2_CDS_annotation_glimmer3.pl_2_8

Length=342

                                                                  Score     E
Sequences producing significant alignments:                       (Bits)  Value

gi|575094354|emb|CDL65742.1|      unnamed protein product           234    1e-67
gi|496050829|ref|WP_008775336.1|  hypothetical protein              233    2e-67
gi|547226430|ref|WP_021963493.1|  putative uncharacterized protein  226    7e-65
gi|490418709|ref|WP_004291032.1|  hypothetical protein              218    5e-62
gi|494822885|ref|WP_007558293.1|  hypothetical protein              208    6e-58
gi|575094321|emb|CDL65708.1|      unnamed protein product           157    9e-40
gi|565841287|ref|WP_023924568.1|  hypothetical protein              131    3e-30
gi|647452987|ref|WP_025792807.1|  hypothetical protein              112    4e-24
gi|517172762|ref|WP_018361580.1|  hypothetical protein              110    1e-23
gi|496521299|ref|WP_009229582.1|  capsid protein                    110    2e-23

>gi|575094354|emb|CDL65742.1| unnamed protein product [uncultured bacterium]
Length=615

Score = 234 bits (598), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 145/373 (39%), Positives = 212/373 (57%), Gaps = 35/373 (9%)

Query  2    GVLPNSQFGDIAVIDIEGGLNIPASRIS--LSSNNRPTIGIKVGAQVSSPNNCSITNSSG  59
            GVLP +Q+G +V+ I G LN+ ++ S + + P G + V+ N + N S
Sbjct  246  GVLPVAQYGSASVVPINGQLNVISNGDSGPIFKTSTPDPGTPGTSYVTVGGNIGVDNRSF  305

Query  60   NLSTGDILSVGIPA--ASYKLQSSFN----------------------VLALRQAESLQK  95
            +S G L+VG A + Y S+ + +LALRQAE LQK
Sbjct  306  GVS-GSTLNVGKSADPSGYGFPSNASTRSLLWENPNLIIENNQGFYVPILALRQAEFLQK  364

Query  96   YREITQSVDTNYRDQIKAHFGVNVPASDSHMAQYIGGIARNLDISEVVNNNLQGDGEAVI  155
            ++E++ S + +Y+ QI+ H+G+ V SH A+Y+GG A +LDI+EV+NNN+ GD A I
Sbjct  365  WKEVSVSGEEDYKSQIEKHWGIKVSDFLSHQARYLGGCATSLDINEVINNNITGDNAADI  424

Query  156  YGKGVGTGTGSMRYTTGSKYCILMCIYHCMPVLDYDISGQHPQLLATSVDELPIPEFDNI  215
            GKG TG GS+R+ + +Y I+MCIYH +P++DY SG PIPE D I
Sbjct  425  GKGTFTGNGSIRFESKGEYGIIMCIYHVLPIVDYVGSGVDHSCTLVDATSFPIPELDQI  484

Query  216  GMEGVPLVQLVNSNLYKTNKSVKIDSILGYNPRYYAWKSNIDRIHGAFTTTLQDWVSPVD  275
            GME VPLV+ +N K + + D+ LGY PRY WK+++DR G F +L+ W PV
Sbjct  485  GMESVPLVRAMNP--VKESDTPSADTFLGYAPRYIDWKTSVDRSVGDFADSLRTWCLPVG  542

Query  276  DSFLYS--TFGTPSSGSF----VTWPFFKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGC  329
            D L S + PS+ + + FFKVNP+ +D +FAV +DST ++D+FL +S+
Sbjct  543  DKELTSANSLNFPSNPNVEPDSIAAGFFKVNPSIVDPLFAVVADSTVKTDEFLCSSFFDV  602

Query  330  KVVRPLSRDGVPY  342
            KVVR L +G+PY
Sbjct  603  KVVRNLDVNGLPY  615

>gi|496050829|ref|WP_008775336.1| hypothetical protein [Bacteroides sp. 2_2_4]
gi|229448893|gb|EEO54684.1| putative capsid protein (F protein) [Bacteroides sp. 2_2_4]
Length=580

Score = 233 bits (594), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/346 (40%), Positives = 193/346 (56%), Gaps = 32/346 (9%)

Query  2    GVLPNSQFGDIAVIDIEGGLNIPASRISLSSNNRPTIGIKVGAQVSSPNNCSITNSSGNL  61
            GVLP Q+GD A +++ + A + + + P G S+ N N SG
Sbjct  262  GVLPRQQYGDTAAVNVNLSNVLSAQYMVQTPDGDPVGGSPFS---STGVNLQTVNGSG--  316

Query  62   STGDILSVGIPAASYKLQSSFNVLALRQAESLQKYREITQSVDTNYRDQIKAHFGVNVPA  121
            +F VLALRQAE LQK++EITQS + +Y+DQI+ H+ V+V
Sbjct  317  -------------------TFTVLALRQAEFLQKWKEITQSGNKDYKDQIEKHWNVSVGE  357

Query  122  SDSHMAQYIGGIARNLDISEVVNNNLQGDGEAVIYGKGVGTGTGSMRYTTGSKYCILMCI  181
            + S M+ Y+GG +LDI+EVVNNN+ G A I GKGV G G + + G +Y ++MCI
Sbjct  358  AYSEMSLYLGGTTASLDINEVVNNNITGSNAADIAGKGVVVGNGRISFDAGERYGLIMCI  417

Query  182  YHCMPVLDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLYKTNKSVKIDS  241
            YH +P+LDY +P + + IPEFD +GME VPLV L+N N S
Sbjct  418  YHSLPLLDYTTDLVNPAFTKINSTDFAIPEFDRVGMESVPLVSLMNPLQSSYNVG---SS  474

Query  242  ILGYNPRYYAWKSNIDRIHGAFTTTLQDWVSPVDDSFL-----YSTFGTPSSGSFVTWPF  296
            ILGY PRY ++K+++D GAF TTL+ WV D+ + Y S G+ V +
Sbjct  475  ILGYAPRYISYKTDVDSSVGAFKTTLKSWVMSYDNQSVINQLNYQDDPNNSPGTLVNYTN  534

Query  297  FKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGVPY  342
            FKVNPN +D +FAV + ++ ++DQFL +S+ KVVR L DG+PY
Sbjct  535  FKVNPNCVDPLFAVAASNSIDTDQFLCSSFFDVKVVRNLDTDGLPY  580

>gi|547226430|ref|WP_021963493.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
gi|524103382|emb|CCY83994.1| putative uncharacterized protein [Prevotella sp. CAG:1185]
Length=573

Score = 226 bits (576), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 146/352 (41%), Positives = 202/352 (57%), Gaps = 42/352 (12%)

Query  2    GVLPNSQFGDIAVID-IEGGLNIPASRISLSSNNRPTIG---IKVGAQVSSPNNCSITNS  57
            G+LP +Q+GD++V I G L+I S SL+ + P G I+ G V + N +N+
Sbjct  253  GMLPRAQYGDVSVASPIFGDLDIGDSS-SLTFASAPQQGANTIQSGVLVVNNN----SNT  307

Query  58   SGNLSTGDILSVGIPAASYKLQSSFNVLALRQAESLQKYREITQSVDTNYRDQIKAHFGV  117
            + LS VLALRQAE LQK+REI QS +Y+ Q++ HF V
Sbjct  308  TAGLS---------------------VLALRQAECLQKWREIAQSGKMDYQTQMQKHFNV  346

Query  118  NVPASDSHMAQYIGGIARNLDISEVVNNNLQGDGEAVIYGKGVGTGTGSMRYTTGSKYCI  177
            + A+ S +Y+GG NLDISEVVN NL GD +A I GKG GT G+ S++ I
Sbjct  347  SPSATLSGHCKYLGGWTSNLDISEVVNTNLTGDNQADIQGKGTGTLNGNKVDFESSEHGI  406

Query  178  LMCIYHCMPVLDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLYKTNKSV  237
            +MCIYHC+P+LD+ I+ Q T+ + IPEFD++GM+ QL S + + +
Sbjct  407  IMCIYHCLPLLDWSINRIARQNFKTTFTDYAIPEFDSVGMQ-----QLYPSEMIFGLEDL  461

Query  238  KID--SI-LGYNPRYYAWKSNIDRIHGAFTTTLQDWVSPVDDSFLYSTFGTPSSGSF---  291
            D SI +GY PRY K++ID IHG+F TL WVSP+ DS++ + F
Sbjct  462  PSDPSSINMGYVPRYADLKTSIDEIHGSFIDTLVSWVSPLTDSYISAYRQACKDAGFSDI  521

Query  292  -VTWPFFKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGVPY  342
            +T+ FFKVNP+ +DNIF VK+DST +DQ L+NSY K VR +G+PY
Sbjct  522  TMTYNFFKVNPHIVDNIFGVKADSTINTDQLLINSYFDIKAVRNFDYNGLPY  573

>gi|490418709|ref|WP_004291032.1| hypothetical protein [Bacteroides eggerthii]
gi|217986636|gb|EEC52970.1| putative capsid protein (F protein) [Bacteroides eggerthii DSM 20697]
Length=578

Score = 218 bits (556), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 134/354 (38%), Positives = 195/354 (55%), Gaps = 42/354 (12%)

Query  2    GVLPNSQFGDIAVIDIEGGLNIPASRISLSSNNRPTIGIKVGAQVSSPNNCSITNSSGNL  61
            GVLP+ Q+G+ AV I P L+ +N T+G +SP S T ++ NL
Sbjct  254  GVLPHQQYGETAVASI-----TPDVTGKLTLSNFSTVG-------TSPTTASGT-ATKNL  300

Query  62   STGDILSVGIPAASYKLQSSFNVLALRQAESLQKYREITQSVDTNYRDQIKAHFGVNVPA  121
            D +VG ++L LRQAE LQK++EITQS + +Y+DQ++ H+GV+V
Sbjct  301  PAFD--TVG----------DLSILVLRQAEFLQKWKEITQSGNKDYKDQLEKHWGVSVGD  348

Query  122  SDSHMAQYIGGIARNLDISEVVNNNLQGDGEAVIYGKGVGTGTGSMRYTTGSKYCILMCI  181
            S + Y+GG++ ++DI+EV+N N+ G A I GKGVG G + + + +Y ++MCI
Sbjct  349  GFSELCTYLGGVSSSIDINEVINTNITGSAAADIAGKGVGVANGEINFNSNGRYGLIMCI  408

Query  182  YHCMPVLDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLYKTNKSVKIDS  241
            YHC+P+LDY P L + + IPEFD +GM+ +PLVQL+N N S
Sbjct  409  YHCLPLLDYTTDMLDPAFLKVNSTDYAIPEFDRVGMQSMPLVQLMNPLRSFANAS---GL  465

Query  242  ILGYNPRYYAWKSNIDRIHGAFTTTLQDWV-------------SPVDDSFLYSTFGTPSS  288
            +LGY PRY +K+++D+ G F TL WV P D + + PS
Sbjct  466  VLGYVPRYIDYKTSVDQSVGGFKRTLNSWVISYGNISVLKQVTLPNDAPPIEPSEPVPSV  525

Query  289  GSFVTWPFFKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGVPY  342
            + + FFKVNP+ LD IFAV++ +DQFL +S+ K VR L DG+PY
Sbjct  526  AP-MNFTFFKVNPDCLDPIFAVQAGDDTNTDQFLCSSFFDIKAVRNLDTDGLPY  578

>gi|494822885|ref|WP_007558293.1| hypothetical protein [Bacteroides plebeius]
gi|198272099|gb|EDY96368.1| putative capsid protein (F protein) [Bacteroides plebeius DSM 17135]
Length=613

Score = 208 bits (530), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 129/358 (36%), Positives = 197/358 (55%), Gaps = 23/358 (6%)

Query  2    GVLPNSQFGDIAVIDIEGGLNIPASRISLSSNNRPTIGIKVGAQVSSPNNCSITNSSGNL  61
            G +P +Q+G+ + + + G + + + P N +I SSG L
Sbjct  262  GTIPQAQYGEASAVPVSGSMQV------VEGPTPPAFTTGQDGVAFLNGNVTIQGSSGYL  315

Query  62   ----STGD--ILSVGIPAASYKLQ--SSFNV--LALRQAESLQKYREITQSVDTNYRDQI  111
            S G+ IL + ++ SSF V LALR+AE+ QK++E+ + + +Y QI
Sbjct  316  QAQTSVGESRILRFNNTNSGLIVEGDSSFGVSILALRRAEAAQKWKEVALASEEDYPSQI  375

Query  112  KAHFGVNVPASDSHMAQYIGGIARNLDISEVVNNNLQGDGEAVIYGKGVGTGTGSMRYTT  171
            +AH+G +V + S M Q++G I +L I+EVVNNN+ G+ A I GKG +G GS+ +
Sbjct  376  EAHWGQSVNKAYSDMCQWLGSINIDLSINEVVNNNITGENAADIAGKGTMSGNGSINFNV  435

Query  172  GSKYCILMCIYHCMPVLDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLY  231
            G +Y I+MC++H +P LDY S H T+V + PIPEFD IGME VP+++ +N
Sbjct  436  GGQYGIVMCVFHVLPQLDYITSAPHFGTTLTNVLDFPIPEFDKIGMEQVPVIRGLNPVKP  495

Query  232  KT-NKSVKIDSILGYNPRYYAWKSNIDRIHGAFTTTLQDWVSPVDDSFLYS--TFGTPS-  287
            K + V + GY P+YY WK+ +D+ G F +L+ W+ P DD L + + P
Sbjct  496  KDGDFKVSPNLYFGYAPQYYNWKTTLDKSMGEFRRSLKTWIIPFDDEALLAADSVDFPDN  555

Query  288  ---SGSFVTWPFFKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGVPY  342
            V FFKV+P+ LDN+FAVK++S +DQFL ++ VVR L +G+PY
Sbjct  556  PNVEADSVKAGFFKVSPSVLDNLFAVKANSDLNTDQFLCSTLFDVNVVRSLDPNGLPY  613

>gi|575094321|emb|CDL65708.1| unnamed protein product [uncultured bacterium]
Length=642

Score = 157 bits (398), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 111/366 (30%), Positives = 177/366 (48%), Gaps = 36/366 (10%)

Query  2    GVLPNSQFGDIAVIDIEGG-------LNIPASRISLSSNNRPTIG---IKVGAQVSSPNN  51
            GVLP SQFG +V+++ G LN S+ S R T G ++ S+ N
Sbjct  286  GVLPTSQFGSESVVNLNLGNASGSAVLNGTTSKDS--GRWRTTTGEWEMEQRVASSANGN  343

Query  52   CSITNSSGNLSTGDILSVGIPAASYKLQSSFNVLALRQAESLQKYREITQSVDTNYRDQI  111
            + NS+G + D G A + L + +++ALR A + QKY+EI + D +++ Q+
Sbjct  344  LKLDNSNGTFISHDHTFSGNVAINTSLSGNLSIIALRNALAAQKYKEIQLANDVDFQSQV  403

Query  112  KAHFGVNVPASDSHMAQYIGGIARNLDISEVVNNNLQGDGEAVIYGKGVGTGTGSMRYTT  171
            +AHFG+ P + + +IGG + ++I+E +N NL GD +A G G+ S+++T
Sbjct  404  EAHFGIK-PDEKNENSLFIGGSSSMININEQINQNLSGDNKATYGAAPQGNGSASIKFTA  462

Query  172  GSKYCILMCIYHCMPVLDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLY  231
            + Y +++ IY C PVLD+ G L T + IPE D+IGM+ ++ Y
Sbjct  463  KT-YGVVIGIYRCTPVLDFAHLGIDRTLFKTDASDFVIPEMDSIGMQQTFRCEVAAPAPY  521

Query  232  KTN---------KSVKIDSILGYNPRYYAWKSNIDRIHGAFTTTLQDWVSPVDDSFLYST  282
            S + GY PRY +K++ DR +GAF +L+ WV+ ++
Sbjct  522  NDEFKAFRVGDGSSPDMSETYGYAPRYSEFKTSYDRYNGAFCHSLKSWVTGIN-------  574

Query  283  FGTPSSGSFVTWP------FFKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLS  336
            F + + TW F P+ + N+F V S + + DQ V C R LS
Sbjct  575  FDAIQNNVWNTWAGINAPNMFACRPDIVKNLFLVSSTNNSDDDQLYVGMVNMCYATRNLS  634

Query  337  RDGVPY  342
            R G+PY
Sbjct  635  RYGLPY  640

>gi|565841287|ref|WP_023924568.1| hypothetical protein [Prevotella nigrescens]
gi|564729907|gb|ETD29851.1| hypothetical protein HMPREF1173_00033 [Prevotella nigrescens CC14M]
Length=656

Score = 131 bits (329), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 90/264 (34%), Positives = 143/264 (54%), Gaps = 26/264 (10%)

Query  87   LRQAESLQKYREITQSVD-TNYRDQIKAHFGVNVPASDSHMAQYIGGIARNLDISEVVN-  144
            +R +L+K E T++ + +Y +QI AHFG VP S + A +IGG + ISEVV
Sbjct  396  IRAMFALEKMLERTRAANGLDYSNQIAAHFGFKVPESRKNCASFIGGFDNQISISEVVTT  455

Query  145  NNLQGDGEAV-------IYGKGVGT-GTGSMRYTTGSKYCILMCIYHCMPVLDYDISGQH  196
            +N DG A ++GKG+G +G + Y ++ ++MCIY P +DYD
Sbjct  456  SNGSVDGTASTGSVVGQVFGKGIGAMNSGHISYDV-KEHGLIMCIYSIAPQVDYDARELD  514

Query  197  PQLLATSVDELPIPEFDNIGMEGVPLVQ---LVNSNLYKTNKSVKIDSILGYNPRYYAWK  253
            P S ++ PEF+N+GM+ P++Q + N K++ S + +++LGY+ RY +K
Sbjct  515  PFNRKFSREDYFQPEFENLGMQ--PVIQSDLCLCINSAKSDSSDQHNNVLGYSARYLEYK  572

Query  254  SNIDRIHGAFTT--TLQDWVSPVDDSFLYSTFGTPSSGSFVTWPFFKVNPNTLDNIFAVK  311
            + D I G F + +L W +P ++ FG ++ P V+P L+ IFAVK
Sbjct  573  TARDIIFGEFMSGGSLSAWATPKNNYTF--EFGK------LSLPDLLVDPKVLEPIFAVK  624

Query  312  SDSTWESDQFLVNSYVGCKVVRPL  335
            + + +DQFLVNSY K +RP+
Sbjct  625  YNGSMSTDQFLVNSYFDVKAIRPM  648

>gi|647452987|ref|WP_025792807.1| hypothetical protein [Prevotella histicola]
Length=584

Score = 112 bits (280), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 90/290 (31%), Positives = 141/290 (49%), Gaps = 33/290 (11%)

Query  80   SSFNVLALRQAESLQKYREITQSVD-TNYRDQIKAHFGVNVPASDSHMAQYIGGIARNLD  138
            SSF+V LR A +L K E T+ + +Y QI+AHFG VP S ++ A+++GG ++
Sbjct  296  SSFSVNDLRAAFALDKMLEATRRANGLDYASQIEAHFGFKVPESRANDARFLGGFDNSIV  355

Query  139  ISEVV--NNNLQGDGEAVIYGKGVGTGTGSMRYTT----GSKYCILMCIYHCMPVLDYDI  192
            +SEVV N N DG G G G GSM T +++ I+MCIY P +Y+
Sbjct  356  VSEVVSTNGNAASDGSHASIGDLGGKGIGSMSSGTIEFDSTEHGIIMCIYSVAPQSEYNA  415

Query  193  SGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLYKTNKSVKI------DSILGYN  246
            S P + ++ PEF ++G + + L+ S L K +++LGY
Sbjct  416  SYLDPFNRKLTREQFYQPEFADLGYQALIGSDLICSTLGMNEKQAGFSDIELNNNLLGYQ  475

Query  247  PRYYAWKSNIDRIHGAFTT--TLQDWVSPVDDSFLY--------------STFGTPSSGS  290
            RY +K+ D + G F + +L W +P D F Y + + + S
Sbjct  476  VRYNEYKTARDLVFGDFESGKSLSYWCTPRFD-FGYGDTEKKIAPENKGGADYRKKGNRS  534

Query  291  FVTWPFFKVNPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGV  340
            + F +NPN ++ IF S ++D F+VNS++ K VRP+S G+
Sbjct  535  HWSSRNFYINPNLVNPIFLT---SAVQADHFIVNSFLDVKAVRPMSVTGL  581

>gi|517172762|ref|WP_018361580.1| hypothetical protein [Prevotella nanceiensis]
Length=568

Score = 110 bits (276), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 81/282 (29%), Positives = 125/282 (44%), Gaps = 35/282 (12%)

Query  79   QSSFNVLALRQAESLQKYREITQSVDTNYRDQIKAHFGVNVPASDSHMAQYIGGIARNLD  138
            ++ +V +R A +L+K +T Y++Q++AHFG++V YIGG N+
Sbjct  301  RTMISVADIRNAFALEKLASVTMRAGKTYKEQMEAHFGISVEEGRDGRCTYIGGFDSNIQ  360

Query  139  ISEVVNNNLQGDGEAV--------------IYGKGVGTGTGSMRYTTGSKYCILMCIYHC  184
            + +V Q G V GK G+G+G +R+ ++ ILMCIY
Sbjct  361  VGDVT----QSSGTTVTGTKDTSFGGYLGRTTGKATGSGSGHIRF-DAKEHGILMCIYSL  415

Query  185  MPVLDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLVQLVNSNLYKTNKS---VKIDS  241
            +P + YD P + + +PEF+N+GM+ PL S Y N + +K
Sbjct  416  VPDVQYDSKRVDPFVQKIERGDFFVPEFENLGMQ--PLFAKNISYKYNNNTANSRIKNLG  473

Query  242  ILGYNPRYYAWKSNIDRIHGAFTTT--LQDWVSPVDDSFLYSTFGTPSSGSFVTWPFFKV  299
            G+ PRY +K+ +D HG F L W S F + FK+
Sbjct  474  AFGWQPRYSEYKTALDINHGQFVHQEPLSYWTVARARGESMSNFNIST---------FKI  524

Query  300  NPNTLDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGVP  341
            NP LD++FAV + T +DQ Y V +S DG+P
Sbjct  525  NPKWLDDVFAVNYNGTELTDQVFGGCYFNIVKVSDMSIDGMP  566

>gi|496521299|ref|WP_009229582.1| capsid protein [Prevotella sp. oral taxon 317]
gi|288330570|gb|EFC69154.1| putative capsid protein (F protein) [Prevotella sp. oral taxon 317 str. F0108]
Length=541

Score = 110 bits (275), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 94/338 (28%), Positives = 150/338 (44%), Gaps = 53/338 (16%)

Query  28   ISLSSNNRPTIGIKVGA-------QVSSPNNCSITNSSGNLSTGDILSVGIPAASYKLQS  80
            + +N RPT +G+ Q+S P + ++ GN + ++ S +
Sbjct  231  LDFYTNLRPTPLFTIGSDSFSSVLQLSDPTGSAGFSADGNSAKLNMASPDV---------  281

Query  81   SFNVLALRQAESLQKYREITQSVDTNYRDQIKAHFGVNVPASDSHMAQYIGGIARNLDIS  140
            NV A+R A +L K I+ Y +QI+AHFGV V Y+GG N+ +
Sbjct  282  -LNVSAIRSAFALDKLLSISMRAGKTYAEQIEAHFGVTVSEGRDGQVYYLGGFDSNVQVG  340

Query  141  EV------VNNNLQGDGEA-------VIYGKGVGTGTGSMRYTTGSKYCILMCIYHCMPV  187
            +V N N+ G A I GKG G+G G +++ +LMCIY +P
Sbjct  341  DVTQTSGTTNPNVSEVGNAKLAGYLGKITGKGTGSGYGEIQFDAKEP-GVLMCIYSVVPA  399

Query  188  LDYDISGQHPQLLATSVDELPIPEFDNIGMEGVPLV-QLVNSNLYKTNKSVKIDSILGYN  246
            + YD P + + + IPEF+N+GM+ P+V V+ N K N G+
Sbjct  400  MQYDCMRLDPFVAKQTRGDYFIPEFENLGMQ--PIVPAFVSLNRAKDNS-------YGWQ  450

Query  247  PRYYAWKSNIDRIHGAFTT--TLQDW-VSPVDDSFLYSTFGTPSSGSFVTWPFFKVNPNT  303
            PRY +K+ D HG F L W ++ S +TF + K+NP+
Sbjct  451  PRYSEYKTAFDINHGQFANGEPLSYWSIARARGSDTLNTFNVAA---------LKINPHW  501

Query  304  LDNIFAVKSDSTWESDQFLVNSYVGCKVVRPLSRDGVP  341
            LD++FAV + T +D ++ + V ++ DG+P
Sbjct  502  LDSVFAVNYNGTEVTDCMFGYAHFNIEKVSDMTEDGMP  539

Lambda      K        H        a         alpha
0.316       0.134    0.399    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
0.267       0.0410   0.140    1.90      42.6     43.6

Effective search space used: 2000070484950
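The hit list in this report uses entries of the form `gi|...| description bits E-value`. A minimal sketch of how one such entry could be split into its fields follows. The names `Hit` and `parse_hit_line` and the regular expression are hypothetical and were written against the entries shown in this report only; they are not part of BLAST, and other BLAST versions or output formats may need a different pattern.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    seq_id: str
    description: str
    bits: float
    evalue: float


# Assumption: the identifier has the four-field "gi|number|db|accession|" form
# seen in this report, and the last two tokens are bit score and E-value.
_HIT_RE = re.compile(
    r"^(?P<id>gi\|\d+\|\w+\|[\w.]+\|)\s+"
    r"(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d+(?:\.\d+)?(?:e[-+]?\d+)?)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Parse one hit-list entry; return None if the line does not match."""
    m = _HIT_RE.match(line.strip())
    if m is None:
        return None
    return Hit(m["id"], m["desc"], float(m["bits"]), float(m["evalue"]))


if __name__ == "__main__":
    example = "gi|496521299|ref|WP_009229582.1| capsid protein 110 2e-23"
    print(parse_hit_line(example))
```

Returning `None` for non-matching lines lets the same function be applied to every line of a report, keeping only the entries that parse.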