bitscore colors: <40, 40-50 , 50-80, 80-200, >200
BLASTP 2.2.30+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 49,011,213 sequences; 17,563,301,199 total letters Query= Contig-5_CDS_annotation_glimmer3.pl_2_3 Length=811 Score E Sequences producing significant alignments: (Bits) Value gi|575094326|emb|CDL65712.1| unnamed protein product 451 6e-143 gi|547920049|ref|WP_022322420.1| capsid protein VP1 144 6e-33 gi|649569140|gb|KDS75238.1| capsid family protein 137 2e-31 gi|649555287|gb|KDS61824.1| capsid family protein 137 6e-31 gi|492501782|ref|WP_005867318.1| hypothetical protein 130 1e-28 gi|639237429|ref|WP_024568106.1| hypothetical protein 125 9e-27 gi|649557305|gb|KDS63784.1| capsid family protein 117 1e-25 gi|609718276|emb|CDN73650.1| conserved hypothetical protein 120 3e-25 gi|12085136|ref|NP_073538.1| major capsid protein 111 2e-22 gi|530695351|gb|AGT39907.1| major capsid protein 105 2e-20 >gi|575094326|emb|CDL65712.1| unnamed protein product [uncultured bacterium] Length=758 Score = 451 bits (1161), Expect = 6e-143, Method: Compositional matrix adjust. Identities = 244/446 (55%), Positives = 314/446 (70%), Gaps = 22/446 (5%) Query 371 LTSDKPVDLTLGSS---PYYNSGSAN-KDKQIKISAYSFRAYEGIYNAYIRDNRNNPYYV 426 +T DK +D+ +GSS PYY GSAN DK IK+SAY FRAYE IYNAYIR+ RNNP+ + Sbjct 330 ITFDK-LDVFIGSSGKYPYY--GSANMSDKAIKLSAYPFRAYEAIYNAYIRNTRNNPFVL 386 Query 427 NGQVQYNKWIPTYDGGADQ-NIYELRYANWEKDFLTTAVQSPQQGTAPLVGIttytetve 485 NG+ YN+WI T GG+D +LR+ANW+ D TTA+ +PQQG APLVG+TTY Sbjct 387 NGKKTYNRWITTDAGGSDTLTPRDLRFANWQSDAYTTALTAPQQGVAPLVGLTTYEIRSV 446 Query 486 ttSDDGTPVTRELSRIALVDEDGKKYQVSFDSDSEGLKGVSYVELDNEVKLRQPRNLIDV 545 +D G VT A+VDE+G Y+V F+S+ E LKGV+Y L + ++L+ Sbjct 447 --NDAGHEVT--TVNTAIVDEEGNAYKVDFESNGEALKGVNYTPLKAGEAVNM-QSLVSP 501 Query 546 VTSGISINDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDI 605 VTSGISIND RNVNAYQ++LELN +G+SY++IIEGRF+V VRYD L MPE+ GG +RDI Sbjct 502 VTSGISINDFRNVNAYQRYLELNQFRGFSYKEIIEGRFDVNVRYDALNMPEYLGGITRDI 561 Query 606 EMHSISQTVDQDLDGSQTYAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLP 665 ++ I+QTV+ GS +Y +LGSQSG+A G++ ++ FCDEESIVMGI+ V P+P Sbjct 562 VVNPITQTVETT--GSGSYVGSLGSQSGLATCFGNTDGSISVFCDEESIVMGIMYVMPMP 619 Query 666 VYTQLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWY 725 VY LLPK TYR LD + PEF+HIG+QPI KE+ P+Q D D + VFGY RPWY Sbjct 620 VYDSLLPKWLTYRERLDSFNPEFDHIGYQPIYAKELGPMQCVQDDIDP-NTVFGYQRPWY 678 Query 726 EYVQKYDQAHGLFRTNLSNFLMHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELA 785 EYV K D+AHGLF ++L NF+M R F+ P+L QSF V+ P V +VF+V TE++ Sbjct 679 EYVAKPDRAHGLFLSSLRNFIMFRSFDNVPELGQSFTVMQPGSVNNVFSV------TEVS 732 Query 786 DKIYGQIWFDCTAKLPISRVAIPRLD 811 DKI GQI FDCTA+LPISRV +PRL+ Sbjct 733 DKILGQIHFDCTAQLPISRVVVPRLE 758 Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 64/129 (50%), Positives = 90/129 (70%), Gaps = 4/129 (3%) Query 5 AFDATLDVNNEIKVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMV 64 F+ D+ N++K N+FDWSH NN TT +GR+TPVF ELVP + S+RI P FGL+FMPM+ Sbjct 4 VFNKIGDIKNDVKRNSFDWSHDNNFTTDLGRITPVFTELVPPNSSIRIKPEFGLRFMPMM 63 Query 65 FPIQTRLRARMMFFKYPLRALWDGYRDFI-GNFREDLEEPYLDLNTV--TRLDAMAKTGS 121 FPIQT+++A + F+K PLR LW Y DFI + E+ + PY+ ++ + +A +G Sbjct 64 FPIQTKMKAYLSFYKVPLRTLWADYMDFISSDNTEEFQPPYMSFDSTDYSEGGTLAPSG- 122 Query 122 LGDYLGLPT 130 LGDY G+PT Sbjct 123 LGDYFGIPT 131 >gi|547920049|ref|WP_022322420.1| capsid protein VP1 [Parabacteroides merdae CAG:48] gi|524592961|emb|CDD13573.1| capsid protein VP1 [Parabacteroides merdae CAG:48] Length=553 Score = 144 bits (362), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 131/430 (30%), Positives = 192/430 (45%), Gaps = 53/430 (12%) Query 399 KISAYSFRAYEGIYNAYIRD-NRNNPYYVNGQVQYNKWIPTYDGGADQ--NIYELRYANW 455 ++SA FRAY+ IYN Y RD N P + + T GG DQ + LR W Sbjct 159 QVSALPFRAYQLIYNEYYRDQNLTEP------IDFTLGSGTTVGG-DQLMALMSLRRRAW 211 Query 456 EKDFLTTAVQSPQQG---TAPLVGIttytetvettSDDGTPVTRELSRIALVDEDGKKYQ 512 EKD+ T+A+ Q+G T P+ G + V D R E+G Y Sbjct 212 EKDYFTSALPWLQRGPEVTVPVQGAGGSMDVVYERQSDSQKWVDSSGREF---ENGHAYD 268 Query 513 VSF----DSDSEGLKGVS------YVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQ 562 ++ D +S + V+ ELD L+ ++V GI+INDLR NA Q Sbjct 269 ITMARANDPNSALMVAVNGGTNNRAPELDPNGTLK-----VNVDEMGININDLRTSNALQ 323 Query 563 KFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQ 622 ++ E N R G Y + I F V+ L P+F GG I + + QT D Q Sbjct 324 RWFERNARGGSRYIEQILSHFGVRSSDARLQRPQFLGGGRMPISVSEVLQTSSTDETSPQ 383 Query 623 TYAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLD 682 G +GI + + +E ++GI+ +TP Y Q +P+ FT +D Sbjct 384 ANMAGHGISAGI-------NNGFKHYFEEHGYIIGIMSITPRSGYQQGVPRDFTKFDNMD 436 Query 683 HYQPEFNHIGFQPILYKE--VCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRT 740 Y PEF H+ Q I +E V AY++G FGY + EY +AHG FR Sbjct 437 FYFPEFAHLSEQEIKNQELFVSEDAAYNNG------TFGYTPRYAEYKYHPSEAHGDFRG 490 Query 741 NLSNFLMHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKL 800 NLS + ++R+F KP L +F+ P+ VFA ++ +D DK + Q++ D A Sbjct 491 NLSFWHLNRIFEDKPNLNTTFVECKPS--NRVFATSETED-----DKFWVQMYQDVKALR 543 Query 801 PISRVAIPRL 810 + + P L Sbjct 544 LMPKYGTPML 553 Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 37/124 (30%), Positives = 60/124 (48%), Gaps = 10/124 (8%) Query 17 KVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMM 76 + N F+ S+ + LT +G + P+ C V + R+ ++ P+V P+ R+ Sbjct 14 RRNAFNLSYESKLTLNMGELVPIMCMPVVSGDKFRVKTESLVRLAPLVAPMMHRVNVFTH 73 Query 77 FFKYPLRALWDGYRDFI--GNFREDLEE-PYLDLNTVTRLDAMAKT-------GSLGDYL 126 +F P R +W+ + DFI G ED+ P + +N + L + A SL DYL Sbjct 74 YFFVPNRLVWNEWEDFITKGVDGEDMPMFPKIQINQDSHLVSSASLIKEYFGDSSLWDYL 133 Query 127 GLPT 130 GLPT Sbjct 134 GLPT 137 >gi|649569140|gb|KDS75238.1| capsid family protein, partial [Parabacteroides distasonis str. 3999B T(B) 6] Length=390 Score = 137 bits (345), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 115/417 (28%), Positives = 183/417 (44%), Gaps = 37/417 (9%) Query 399 KISAYSFRAYEGIYNAYIRDNRNNPYYVNGQVQYNKWIPTYDGGADQNIYELRYANWEKD 458 K+SA FRAY IYN Y RD + +++ Y + ++++L WEKD Sbjct 6 KVSALPFRAYHLIYNEYYRDQN-----LTSELEITLDSGNYQLPVNSSLWQLHRRAWEKD 60 Query 459 FLTTAVQSPQQGTAPLVGIttytetvettSDD--GTPVTRELSRIALVDEDGKKYQVSFD 516 + T+A+ Q+G V I E + +T R + + S Sbjct 61 YFTSALPWVQRGPEVTVPINGGGEIPVEMKEGFAAQKITTFPDRKPISGSEVLYSAPSVL 120 Query 517 SDSE--GLKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLELNMRKGYS 574 S + +KG + +E DN V ++ G++IND+R NA Q++ E N R G Sbjct 121 SYGQIGSIKGQALIEPDNFV--------VNTDQMGVNINDIRTSNALQRWFERNARSGSR 172 Query 575 YRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSGI 634 Y + I F V+ L P+F GG I + + QT D Q G +G+ Sbjct 173 YIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISAGV 232 Query 635 AGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQ 694 + +E +MGI+ + P Y Q +PK F +D Y PEF H+G Q Sbjct 233 -------NHGFTRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQ 285 Query 695 PILYKEVCPLQAYSDGPDTLSD-VFGYNRPWYEYVQKYDQAHGLFRTNLSNFLMHRVFNQ 753 I +E+ Y + D ++ FGY + EY ++ HG FR N++ + ++R+F + Sbjct 286 EIKNEEL-----YLNESDAANEGTFGYTPRYAEYKYSQNEVHGDFRGNMAFWHLNRIFKE 340 Query 754 KPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPISRVAIPRL 810 KP L +F+ +P+ VFA + D DK + QI+ D A + + P L Sbjct 341 KPNLNTTFVECNPS--NRVFATAETSD-----DKYWVQIYQDIKALRLMPKYGTPML 390 >gi|649555287|gb|KDS61824.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 4] gi|649560568|gb|KDS66876.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 4] gi|649561020|gb|KDS67307.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 4] gi|649562724|gb|KDS68908.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 6] Length=541 Score = 137 bits (346), Expect = 6e-31, Method: Compositional matrix adjust. Identities = 115/417 (28%), Positives = 183/417 (44%), Gaps = 37/417 (9%) Query 399 KISAYSFRAYEGIYNAYIRDNRNNPYYVNGQVQYNKWIPTYDGGADQNIYELRYANWEKD 458 K+SA FRAY IYN Y RD + +++ Y + ++++L WEKD Sbjct 157 KVSALPFRAYHLIYNEYYRDQN-----LTSELEITLDSGNYQLPVNSSLWQLHRRAWEKD 211 Query 459 FLTTAVQSPQQGTAPLVGIttytetvettSDD--GTPVTRELSRIALVDEDGKKYQVSFD 516 + T+A+ Q+G V I E + +T R + + S Sbjct 212 YFTSALPWVQRGPEVTVPINGGGEIPVEMKEGFAAQKITTFPDRKPISGSEVLYSAPSVL 271 Query 517 SDSE--GLKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLELNMRKGYS 574 S + +KG + +E DN V ++ G++IND+R NA Q++ E N R G Sbjct 272 SYGQIGSIKGQALIEPDNFV--------VNTDQMGVNINDIRTSNALQRWFERNARSGSR 323 Query 575 YRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSGI 634 Y + I F V+ L P+F GG I + + QT D Q G +G+ Sbjct 324 YIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSSTDSTSPQANMAGHGISAGV 383 Query 635 AGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGFQ 694 + +E +MGI+ + P Y Q +PK F +D Y PEF H+G Q Sbjct 384 -------NHGFTRYFEEHGYIMGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGEQ 436 Query 695 PILYKEVCPLQAYSDGPDTLSD-VFGYNRPWYEYVQKYDQAHGLFRTNLSNFLMHRVFNQ 753 I +E+ Y + D ++ FGY + EY ++ HG FR N++ + ++R+F + Sbjct 437 EIKNEEL-----YLNESDAANEGTFGYTPRYAEYKYSQNEVHGDFRGNMAFWHLNRIFKE 491 Query 754 KPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPISRVAIPRL 810 KP L +F+ +P+ VFA + D DK + QI+ D A + + P L Sbjct 492 KPNLNTTFVECNPS--NRVFATAETSD-----DKYWVQIYQDIKALRLMPKYGTPML 541 Score = 60.1 bits (144), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 37/120 (31%), Positives = 58/120 (48%), Gaps = 6/120 (5%) Query 17 KVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMM 76 + N F+ S+ N LT G + P+ C+ V R+N ++ P+V P+ R+ Sbjct 14 RRNVFNLSYENKLTVNAGELIPIMCKPVVPGDKFRVNTEMLVRLAPLVAPMMHRVDVFTH 73 Query 77 FFKYPLRALWDGYRDFIGNFREDLEEP----YLDLNTVTRLDAMAK--TGSLGDYLGLPT 130 +F P R +W+ + DFI + + P Y +TV +A GSL DYLGLP+ Sbjct 74 YFFVPNRLIWNKWEDFITKGVDGTDSPVFPTYSFPSTVDTANAHNSFGDGSLWDYLGLPS 133 >gi|492501782|ref|WP_005867318.1| hypothetical protein [Parabacteroides distasonis] gi|409230408|gb|EKN23272.1| hypothetical protein HMPREF1059_03257 [Parabacteroides distasonis CL09T03C24] Length=538 Score = 130 bits (328), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 117/417 (28%), Positives = 181/417 (43%), Gaps = 41/417 (10%) Query 399 KISAYSFRAYEGIYNAYIRD-NRNNPYYVNGQVQYNKWIPTYDGGADQNIYELRYANWEK 457 ++SA FRAY+ IYN Y RD N P + N I + LR WEK Sbjct 158 QVSALPFRAYQLIYNEYYRDQNLTKPI----EFSLNSGI-VLSADEVTRLLTLRRRTWEK 212 Query 458 DFLTTAVQSPQQG---TAPLVGIttytetvettSDDGTPVTRELSRIALVDEDGKKYQVS 514 D+ T+A+ Q+G T P+ G G + L A D + Sbjct 213 DYFTSALPWVQRGPEVTVPIQG-------------SGGNLDVTLKNDAHADTYRMPGTSN 259 Query 515 FDSDSEGLKGVSYVELDNEVKLRQPRNL-IDVVTSGISINDLRNVNAYQKFLELNMRKGY 573 + + L G + + + +P N ++V G+SINDLR NA Q++ E N R G Sbjct 260 RPAGAMQLVGGALIAGGTDGAYLEPDNFQVNVDELGVSINDLRTSNALQRWFERNARSGS 319 Query 574 SYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKALGSQSG 633 Y + I F V+ L P+F GG I + + QT D Q G +G Sbjct 320 RYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVSEVLQTSATDSTSPQANMAGHGISAG 379 Query 634 IAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQPEFNHIGF 693 + + + +E ++GI+ + P Y Q +PK F +D Y PEF H+G Sbjct 380 V-------NHGFKRYFEEHGYIIGIMSIRPRTGYQQGVPKDFRKFDNMDFYFPEFAHLGE 432 Query 694 QPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRTNLSNFLMHRVFNQ 753 Q I +EV Q P + + FGY + EY ++ HG FR N++ + ++R+F++ Sbjct 433 QEIKNEEVYLQQT----PASNNGTFGYTPRYAEYKYSMNEVHGDFRGNMAFWHLNRIFSE 488 Query 754 KPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPISRVAIPRL 810 P L +F+ +P+ VFA + D DK + Q++ D A + + P L Sbjct 489 SPNLNTTFVECNPS--NRVFATAETSD-----DKYWIQLYQDVKALRLMPKYGTPML 538 Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 37/121 (31%), Positives = 55/121 (45%), Gaps = 7/121 (6%) Query 17 KVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMM 76 + N F+ S+ N LT G + P+ C+ V R+N ++ P+V P+ R+ Sbjct 14 RRNVFNLSYENKLTANAGELVPIMCKPVVPGDKFRVNTEMLVRLAPLVAPMMHRVDVFTH 73 Query 77 FFKYPLRALWDGYRDFIGNFREDLEEPYL-------DLNTVTRLDAMAKTGSLGDYLGLP 129 +F P R LW+ + DFI + + P D T + GSL DYLGLP Sbjct 74 YFFVPNRLLWNQWEDFITKGVDGTDTPVFPKIALRPDWVNPTSAAVLLDDGSLWDYLGLP 133 Query 130 T 130 T Sbjct 134 T 134 >gi|639237429|ref|WP_024568106.1| hypothetical protein [Elizabethkingia anophelis] Length=546 Score = 125 bits (313), Expect = 9e-27, Method: Compositional matrix adjust. Identities = 119/427 (28%), Positives = 185/427 (43%), Gaps = 39/427 (9%) Query 399 KISAYSFRAYEGIYNAYIRD-NRNNPYYV--NGQVQ------YNKWIPTYDGGADQNIYE 449 ++S F AY+ I++ Y RD N + +V NG + N W P+ Q +++ Sbjct 138 RVSMLPFLAYQKIWDEYYRDENLIDSVFVDKNGDKRELFIDGINYWNPSLPYEFRQ-LFD 196 Query 450 LRYANWEKDFLTTAVQSPQQGTAPLVGIttytetvettSDDGTPVTRE----LSRIALVD 505 ++ W D+ T+A+ Q+G A V + + G ++ LS Sbjct 197 IKKRAWHHDYFTSALPFAQKGAA--VKMPLQMTADLFYNPGGNTFVKKPDGSLSHTGFRL 254 Query 506 EDGKKYQVSFDSDSEGLKGVSYVELDNEVKLRQPRNL-IDVVT-SGISINDLRNVNAYQK 563 EDG V D + S N V + NL +D+ T SG +INDLR Q+ Sbjct 255 EDG---SVPADGIGHLMVETSSTGNSNPVNIDNSSNLGVDLKTASGSTINDLRRAFKLQE 311 Query 564 FLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQT 623 +LE N R G Y + I F VK L PEF GG I + + Q D Q Sbjct 312 WLEKNARAGSRYAESILSFFGVKTSDGRLQRPEFLGGNKTPILISEVLQQSSTDSTTPQG 371 Query 624 YAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDH 683 G G G F +E V+G++ V P Y+Q +P+HF+ D+ Sbjct 372 NMAGHGISVGKEG-------GFSKFFEEHGYVIGLMSVIPKTSYSQGIPRHFSKFDKFDY 424 Query 684 YQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRTNLS 743 + P+F HIG QP+ KE+ A + G VFGY + EY HG F+ L Sbjct 425 FWPQFEHIGEQPVYNKEIF---AKNVGDYDSGGVFGYVPRYSEYKYSPSTIHGDFKDTLY 481 Query 744 NFLMHRVFNQK--PQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLP 801 + + R+F+ P+L + F+ ++ + ++ +FAV + +DK Y ++ TAK Sbjct 482 FWHLGRIFDSSAPPKLNRDFIEVNKSGLSRIFAV------EDNSDKFYCHLYQKITAKRK 535 Query 802 ISRVAIP 808 +S P Sbjct 536 MSYFGDP 542 Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 34/120 (28%), Positives = 55/120 (46%), Gaps = 4/120 (3%) Query 12 VNNEIKVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRL 71 V+ K + F+ S+ + G + P+ C+ + + INP+ + PM+ P+ + Sbjct 8 VSKAPKSSTFNMSYDRKFSMNFGDLVPIHCQEIVPGDKISINPQHMTRLAPMLAPVMHEV 67 Query 72 RARMMFFKYPLRALWDGYRDFIGNFREDLEEPYLDLNTVTRLDAMAKTGSLGDYLGLPTT 131 + +F P R LW + FI + L+ L + V L SLGDYLGLP T Sbjct 68 NVFIHYFFVPNRILWKNWEAFITGGQSGLDAHMLPV--VQNLP--VPKSSLGDYLGLPLT 123 >gi|649557305|gb|KDS63784.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 4] gi|649559156|gb|KDS65543.1| capsid family protein [Parabacteroides distasonis str. 3999B T(B) 6] Length=245 Score = 117 bits (292), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 79/263 (30%), Positives = 123/263 (47%), Gaps = 20/263 (8%) Query 549 GISINDLRNVNAYQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMH 608 G++IND+R NA Q++ E N R G Y + I F V+ L P+F GG I + Sbjct 2 GVNINDIRTSNALQRWFERNARSGSRYIEQILSHFGVRSSDARLQRPQFLGGGRTPISVS 61 Query 609 SISQTVDQDLDGSQTYAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYT 668 + QT D Q G +G+ + +E +MGI+ + P Y Sbjct 62 EVLQTSSTDSTSPQANMAGHGISAGV-------NHGFTRYFEEHGYIMGIMSIRPRTGYQ 114 Query 669 QLLPKHFTYRGLLDHYQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSD-VFGYNRPWYEY 727 Q +PK F +D Y PEF H+G Q I +E+ Y + D ++ FGY + EY Sbjct 115 QGVPKDFRKFDNMDFYFPEFAHLGEQEIKNEEL-----YLNESDAANEGTFGYTPRYAEY 169 Query 728 VQKYDQAHGLFRTNLSNFLMHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELADK 787 ++ HG FR N++ + ++R+F +KP L +F+ +P+ VFA + D DK Sbjct 170 KYSQNEVHGDFRGNMAFWHLNRIFKEKPNLNTTFVECNPS--NRVFATAETSD-----DK 222 Query 788 IYGQIWFDCTAKLPISRVAIPRL 810 + QI+ D A + + P L Sbjct 223 YWVQIYQDIKALRLMPKYGTPML 245 >gi|609718276|emb|CDN73650.1| conserved hypothetical protein [Elizabethkingia anophelis] Length=537 Score = 120 bits (301), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 119/426 (28%), Positives = 190/426 (45%), Gaps = 48/426 (11%) Query 400 ISAYSFRAYEGIYNAY----------IRDNRNNPYYVNGQVQYNKWIPTYDGGADQNIYE 449 ++ F AY+ I++ + RD+ NP + + +P Y + +++ Sbjct 139 VNLLPFLAYQKIWDEFYRDENLIQPLFRDSNGNPVKMFNDGINDHNLPPYSKFTE--LFK 196 Query 450 LRYANWEKDFLTTAVQSPQQGTAPLVGIttytetvettSDDGTPVTRELSRIALVDEDGK 509 +R W D+ T+A+ Q+G A + I P+T E+ + + Sbjct 197 MRKRAWHHDYFTSALPFAQKGNAVKIPIF---------PQGNVPLTYEMGSQTFIKDMAG 247 Query 510 KYQVSFD--SDSEG-LKGVSYVELDNEVKLRQPRNL-IDVVTSGIS-INDLRNVNAYQKF 564 + D SD G L+ VS + L +NL +++ + +S +NDLR Q++ Sbjct 248 NPAPNKDLRSDVNGNLQDVS----GQPLSLDPSKNLKLNMASENVSTVNDLRRAFKLQEW 303 Query 565 LELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTY 624 LE N R G Y + I F VK L PEF GG I IS+ + Q S T Sbjct 304 LEKNARAGSRYAESILSFFGVKTSDGRLQRPEFLGGNKSPI---MISEVLQQSATDSTTP 360 Query 625 AKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHY 684 + GI G+ D G F +E V+G++ V P Y+Q +P+HF+ D++ Sbjct 361 QGNMAGH-GI-GIGKDGG--FSRFFEEHGYVIGLMSVIPKTSYSQGIPRHFSKSDKFDYF 416 Query 685 QPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRTNLSN 744 P+F HIG QP+ KE+ D D+ + VFGY + EY HG F+ +L Sbjct 417 WPQFEHIGEQPVYNKEI--FAKNIDAFDSEA-VFGYLPRYSEYKFSPSTVHGDFKDDLYF 473 Query 745 FLMHRVF--NQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPI 802 + + R+F ++ P L QSF+ D ++ +FAV DD DK Y ++ TAK + Sbjct 474 WHLGRIFDTDKPPVLNQSFIECDKNALSRIFAV--EDD----TDKFYCHLYQKITAKRKM 527 Query 803 SRVAIP 808 S P Sbjct 528 SYFGDP 533 Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 37/136 (27%), Positives = 63/136 (46%), Gaps = 11/136 (8%) Query 17 KVNNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMM 76 K + F+ S+ + G + P+ C+ V + INP+ + PM+ P+ + + Sbjct 13 KSSTFNMSYDRKFSMNFGDLVPIHCQEVIPGDKISINPQHMTRLAPMIAPVMHEVNVFIH 72 Query 77 FFKYPLRALWDGYRDFIGNFREDLEEPYLDLNTVTRLDAM-AKTGSLGDYLGLPTTLFGK 135 +F P R +W + FI E LD + + R+ + GSL D+LGLP T G+ Sbjct 73 YFFVPNRIIWSNWEQFITG-----GESGLDQHLMPRVGNLPVSKGSLADHLGLPLTT-GR 126 Query 136 FGTTLSVATTGHIFGL 151 F +V G ++ L Sbjct 127 F----AVGNAGVLYNL 138 >gi|12085136|ref|NP_073538.1| major capsid protein [Bdellovibrio phage phiMH2K] gi|75089173|sp|Q9G059.1|F_BPPHM RecName: Full=Capsid protein VP1; Short=VP1 [Bdellovibrio phage phiMH2K] gi|12017984|gb|AAG45340.1|AF306496_1 Vp1 [Bdellovibrio phage phiMH2K] Length=533 Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 114/432 (26%), Positives = 180/432 (42%), Gaps = 36/432 (8%) Query 384 SPYYNSGSANKDKQIKISAYSFRAYEGIYNAYIRDNRNNPYYVNGQVQYNKWIPTYDGGA 443 S Y + G + ++I+A FRAY IYN + RD + G++ +P DG Sbjct 125 SIYDHFGIPTQVANLEINALPFRAYNLIYNDWFRDQN-----LIGKIA----VPKGDGPD 175 Query 444 DQNIYELRYANWEKDFLTTAVQSPQQGTAPLVGIttytetvettSDDGTPVTR--ELSRI 501 + Y+L A D+ T+A+ PQ+G A + I + P + Sbjct 176 NHADYQLLKAAKPHDYFTSALPWPQKGMAVEMPIGNSAPITYVPNAGNGPYPHFNWVQTP 235 Query 502 ALVDEDGKKYQVSFDSDSEGLKGVSYVELDNEVKLRQPRNLIDVVT-SGISINDLRNVNA 560 +G QV+F G K +S D Q + D+ + + +IN LR Sbjct 236 GGPGNNGALSQVTFG----GQKAISAAGNDPIGYDPQGTLIADLSSATAATINQLRQAMM 291 Query 561 YQKFLELNMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDG 620 Q LEL+ R G Y +I++ FNV L PE+ G + D++ + + QT D Sbjct 292 MQSLLELDARGGTRYVEILKSHFNVISLDFRLQRPEYLSGGTIDLQQNPVPQTSSSTTDS 351 Query 621 SQTYAKALGSQSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGL 680 Q A + S G S + F E V+G + Y Q L K ++ + Sbjct 352 PQGNLAAFSTASEFGNKIGFS----KSFV-EHGYVLGFIRARGQVTYQQGLHKMWSRQTR 406 Query 681 LDHYQPEFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRT 740 D + P+F +G Q IL KE+ Y+ G T S++FGY + EY + + G FR+ Sbjct 407 WDFFWPKFQELGEQAILNKEI-----YAQGNATDSEIFGYQERYGEYRFRPSEIKGQFRS 461 Query 741 NLSNFL----MHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDC 796 N + L + F KP L ++F+ + + VT+ D + G WFD Sbjct 462 NFAESLDVWHLAEYFTVKPSLNKTFIESN-TPIERSLVVTRPD-----YPDLIGDFWFDY 515 Query 797 TAKLPISRVAIP 808 T P+ +P Sbjct 516 THVRPMVTYGVP 527 Score = 41.2 bits (95), Expect = 3.0, Method: Compositional matrix adjust. Identities = 25/114 (22%), Positives = 50/114 (44%), Gaps = 0/114 (0%) Query 19 NNFDWSHANNLTTQIGRVTPVFCELVPAHGSVRINPRFGLQFMPMVFPIQTRLRARMMFF 78 + F+ S T + +TP+F + + ++ +N + ++ V P+ R+ FF Sbjct 23 SKFNRSFGTKDTFKFDDLTPIFIDEILPGDTINMNTKTFIRLATQVVPVMDRMMLDFYFF 82 Query 79 KYPLRALWDGYRDFIGNFREDLEEPYLDLNTVTRLDAMAKTGSLGDYLGLPTTL 132 P R +WD + F G + + T+T + S+ D+ G+PT + Sbjct 83 FVPCRLVWDNWEKFNGAQDNPSDSTDYLIPTITAPAGGFENMSIYDHFGIPTQV 136 >gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus] Length=539 Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 109/428 (25%), Positives = 172/428 (40%), Gaps = 35/428 (8%) Query 389 SGSANKDKQIKISAYSFRAYEGIYNAYIRDNRNNPYYVNGQVQYNKWIPTYDGGADQNIY 448 +G + I ++ RAY I+N + RD +Q + + DG Y Sbjct 137 AGQVDAGSSISHNSLFTRAYNLIWNEWFRDE---------NLQDSVVVDKGDGPDTYTDY 187 Query 449 ELRYANWEKDFLTTAVQSPQQGTAPLVGIttytetvettSDDGTPV-TRELSRIALVDED 507 L D+ T+A+ PQ+G A V + +D G P RE+S + Sbjct 188 TLLRRGKRHDYFTSALPWPQKGDA--VTLPLGGSANVVYNDTGDPAYIREVSTGNVWTTP 245 Query 508 GKKYQVSFDSDSEGLKGVSYVELDNEVKLRQPRNLIDVVTSGISINDLRNVNAYQKFLEL 567 ++ S ++ G V ++ + + +IN +R Q+ LE Sbjct 246 SRE---SVSKEANGNMSVPTGSVNAQYDPNGSLVADLSTATAATINAIRQSFQIQRLLER 302 Query 568 NMRKGYSYRDIIEGRFNVKVRYDELLMPEFFGGFSRDIEMHSISQTVDQDLDGSQTYAKA 627 + R G Y +I+ F V + PE+ GG S I ++ ++Q G+ T Sbjct 303 DARGGTRYTEIVRSHFGVISPDARMQRPEYLGGGSAPIIVNPVAQQSASGASGTDTPLGT 362 Query 628 LGS-QSGIAGVRGDSGRALECFCDEESIVMGILIVTPLPVYTQLLPKHFTYRGLLDHYQP 686 LG+ +G+A SG E +V+G+ V Y Q L + F+ D + P Sbjct 363 LGAVGTGLA-----SGHGFASSFTEHGVVVGLCSVRADLTYQQGLHRMFSRSTRYDFFFP 417 Query 687 EFNHIGFQPILYKEVCPLQAYSDGPDTLSDVFGYNRPWYEYVQKYDQAHGLFRTNLSNFL 746 F+H+G QPIL KE+ Y+ G T DVFGY W EY K Q GL R+ + L Sbjct 418 VFSHLGEQPILNKEL-----YATGTSTDDDVFGYQEAWAEYRYKPSQVTGLMRSTAAGTL 472 Query 747 ----MHRVFNQKPQLAQSFLVIDPAQVTDVFAVTKADDGTELADKIYGQIWFDCTAKLPI 802 + + F P L +F + D V V AV +G + + FD P+ Sbjct 473 DAWHLAQNFGSLPTLNSTF-IEDTPPVDRVVAVGSEANGQQFIFDAF----FDINMARPM 527 Query 803 SRVAIPRL 810 ++P L Sbjct 528 PMYSVPGL 535 Lambda K H a alpha 0.320 0.137 0.411 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 6454025655732