bitscore colors: <40, 40-50, 50-80, 80-200, >200
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-1_CDS_annotation_glimmer3.pl_2_6
Length=561

                                                                  Score     E
Sequences producing significant alignments:                      (Bits)   Value

gi|575094431|emb|CDL65804.1| unnamed protein product               479    2e-159
gi|575094544|emb|CDL65904.1| unnamed protein product               464    5e-154
gi|575096056|emb|CDL66947.1| unnamed protein product               457    9e-151
gi|575094572|emb|CDL65928.1| unnamed protein product               449    1e-147
gi|575094492|emb|CDL65859.1| unnamed protein product               438    9e-144
gi|575094496|emb|CDL65862.1| unnamed protein product               430    2e-140
gi|575094415|emb|CDL65790.1| unnamed protein product               420    2e-136
gi|557745632|ref|YP_008798242.1| major capsid protein              401    1e-129
gi|530695351|gb|AGT39907.1| major capsid protein                   384    1e-122
gi|313766927|gb|ADR80653.1| putative major coat protein            383    1e-122

>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560

Score = 479 bits (1232), Expect = 2e-159, Method: Compositional matrix adjust.
Identities = 264/583 (45%), Positives = 355/583 (61%), Gaps = 52/583 (9%) Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60 +NRN+ +F + P + SR+RFNR L TFD+G+++P YVDEVLPGDTF +D +AIIR Sbjct 1 MNRNSNFNFARNPGVSLSRSRFNRTSDRLDTFDTGEIVPIYVDEVLPGDTFELDMTAIIR 60 Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVI--NGK 118 +TP +PVMD++F+D Y+F+ PNR+ W+++++ MGE T W +YSVP++ G Sbjct 61 GSTPIFPVMDNSFLDVYFFFVPNRLTWEHWRELMGENRTTAWTQPVDYSVPQVTAPAGGW 120 Query 119 EKSPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDAS 178 E E S+ D+MGIPTKV + VNALP RAY I+NEFFR++N+ N ++ DA+ Sbjct 121 E------ELSLADHMGIPTKVDNI-SVNALPFRAYGLIYNEFFRNQNLTNPTQVEVTDAN 173 Query 179 IDYQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGN-APV 237 I ++ + ++ D AI G +CL KF DYFT LP PQ+G V + + + PV Sbjct 174 IAGKNPNDVKNSND----WAITGAKCLKSAKFFDYFTGALPQPQKGEPVEINLASSWLPV 229 Query 238 GMYKNDSLTEFGTINGNSEIFLNQALNGSAL---APKISNSSKEGARRALVT--GSTNPT 292 G+ G+ L++ N L +P ++K +V G NP Sbjct 230 GI-------------GDYHGPLDKVSNSDTLTWESPSSEGNTKRTYALGMVQQEGEVNPN 276 Query 293 T----------QVSDAAYLAA---NLGE---TTATTINDLRKAVAVQQYYEALARGGSRY 336 S++ +AA NL T A T+N LR+A VQ+ E ARGG+RY Sbjct 277 GLKNFETKAGGSFSESGAVAAYPTNLWASPVTAAATVNQLRQAFQVQKLLEKDARGGTRY 336 Query 337 REQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTD-TPIGETGAMSVTP 395 RE ++ + V SD +QIPEYLGG + +N++Q+VQTS +STD +P G T A+SVTP Sbjct 337 REILKNHFGVTTSDARMQIPEYLGGCKVPINVSQVVQTS---ASTDASPQGNTAAISVTP 393 Query 396 VNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKK 455 ++S FTKSF+EHGFIIGV R SYQQG+ER+WSR DRLDYY P AN+GEQ + K Sbjct 394 FSKSMFTKSFDEHGFIIGVATARTAQSYQQGIERMWSRKDRLDYYFPVLANIGEQAILNK 453 Query 456 EIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQG 515 EI G A D+E FGYQEAWADYR KPN + + RSNA +LD WHY +Y +PTLS Sbjct 454 EIYAQGNAKDDEAFGYQEAWADYRYKPNTICGRFRSNAQQSLDAWHYGQDYDKLPTLSTD 513 Query 516 WMAERKTEIARTLIVQDEPQFFGAIRVANKTTRRMPLYSVPGL 558 WM + E+ RTL VQ EP F R KT R MPLYS+PGL Sbjct 514 WMEQSDIEMKRTLAVQTEPDFIANFRFNCKTVRVMPLYSIPGL 
556 >gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium] Length=551 Score = 464 bits (1195), Expect = 5e-154, Method: Compositional matrix adjust. Identities = 248/567 (44%), Positives = 348/567 (61%), Gaps = 29/567 (5%) Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60 +NRN E HF+++P + SR++F+R ++ TTF+ G LIPFY+DEVLPGDTF+V +S +IR Sbjct 1 MNRNVESHFSRLPSVDISRSQFDRSSSLKTTFNVGDLIPFYIDEVLPGDTFNVKSSKVIR 60 Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVINGKEK 120 M + P+MD+ ++D YYF+ PNR++W +++QF GE E+ W+P EY VP++ Sbjct 61 MQSLVTPIMDNIYLDTYYFFVPNRLVWSHWQQFNGENTESAWLPTTEYQVPQVTAPANGW 120 Query 121 SPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASID 180 S +I DY GIPT V VNALP RAY I NE+FRDEN+ + I DA++ Sbjct 121 S----IGTIADYFGIPTGV--ACSVNALPFRAYALICNEWFRDENLSDPLNIPISDATVV 174 Query 181 YQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMY 240 +G D+ + + GG K+HDYFTSCLP PQ+GP+V LP+ ++PV + Sbjct 175 GSNG-------DNYITDIVKGGMPFKACKYHDYFTSCLPAPQKGPDVLLPL-SSSPVPVT 226 Query 241 KNDSLTE-----FGTINGNSEIFLNQALNGSALAPKISNSSKEGARRAL--VTGSTNPTT 293 +D++ + + G L+ L + + P EGA + TG PT Sbjct 227 TSDTMVDPLQYSKYPMAGVDSWNLSPTLMRNIIRPF---EGVEGANYQVHQFTGDI-PTI 282 Query 294 QVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTV 353 L ANL TA +IN LR A +Q+ YE ARGG+RY E +++ + V D + Sbjct 283 DAFRPLNLVANLQNATAASINQLRLAFQIQRLYERDARGGTRYIEILKSHFGVTSPDARL 342 Query 354 QIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEEHGFIIG 413 Q PEYLGG R +N+NQ++Q S ++++ +P G S+T + F KSF EHGF+IG Sbjct 343 QRPEYLGGNRIPININQVLQQS--ETTSTSPQGNPVGQSLTTDTNADFVKSFVEHGFVIG 400 Query 414 VCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQE 473 + R++H+YQQGLER WSR DR DYY P FA++GEQ V KEI +G A D+E FGYQE Sbjct 401 LMVARYDHTYQQGLERFWSRKDRFDYYWPVFAHIGEQAVLNKEIYTSGTAVDDEVFGYQE 460 Query 474 AWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQD- 532 A+ADYR KP+RV+ +MRS A +LD WH AD+Y S+P+LS W+ E + + R L V Sbjct 461 
AYADYRYKPSRVTGEMRSAAPQSLDVWHLADDYASLPSLSDSWIRESASTVDRVLAVSSN 520 Query 533 -EPQFFGAIRVANKTTRRMPLYSVPGL 558 Q F I + N++TR MP+YSVPGL Sbjct 521 VSAQLFCDIYIQNRSTRPMPMYSVPGL 547 >gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium] Length=570 Score = 457 bits (1175), Expect = 9e-151, Method: Compositional matrix adjust. Identities = 254/586 (43%), Positives = 356/586 (61%), Gaps = 49/586 (8%) Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60 +NRN E HF+ +P + SR+RF+R +I TTF++G ++PF+++EVLPGDTFSVD+S ++R Sbjct 2 MNRNTESHFSLLPHVDISRSRFDRSSSIKTTFNAGDVVPFFLEEVLPGDTFSVDSSKVVR 61 Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVINGKEK 120 M T P+MD+ ++D YYF+ PNR++W ++K+F GE E+ W+P+ EY++P++ K Sbjct 62 MQTLLTPMMDNVYLDTYYFFVPNRLVWQHWKEFCGENNESAWIPQTEYAIPQL------K 115 Query 121 SP-EPYE-DSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDAS 178 SP +E +I DY G+PT V + V+ALP RAY I NE+FRDEN+ + + TDDA+ Sbjct 116 SPVGGFEVGTIADYFGLPTGVANL-SVSALPFRAYALIMNEWFRDENLMDPLVVPTDDAT 174 Query 179 IDYQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQ--GNAP 236 + G + D+ K GG+ K+HDYFTS LP PQ+GP+V +P+ GN Sbjct 175 VT---GVNTGIFVTDVAK----GGKPFVAAKYHDYFTSALPAPQKGPDVVIPVASAGNYN 227 Query 237 V-----GMYKNDSLTEFGTING-------NSEIFLNQALNGSALAPKISNSSKEGARRAL 284 V G+ +D NG +E+F + L + S + Sbjct 228 VVGNGKGLALSDGSKMSIICNGLSGSNGQGTELFASGILGSQVGSSGGFGSGSSLRGDGI 287 Query 285 VTGSTNPTTQVSDAAYLAANLGET----------TATTINDLRKAVAVQQYYEALARGGS 334 + G V AA L NL + A TIN LR A +Q++YE ARGGS Sbjct 288 ILG-------VPTAAQLGNNLENSGLIAIASGNAAAATINQLRMAFQIQKFYEKQARGGS 340 Query 335 RYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVT 394 RY E +++ + V D +Q EYLGG R +N+NQ++Q SG S++ TP G MS T Sbjct 341 RYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQVIQQSGTGSASTTPQGTVVGMSQT 400 Query 395 PVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKK 454 S FTKSF EHGFIIGV C R++H+YQQG++R+WSR D+ DYY P F+N+GEQ +K Sbjct 401 TDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMWSRKDKFDYYWPVFSNIGEQAIKN 460 Query 455 
KEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQ 514 KEI G A+D+E FGYQEAWA+YR KP+RV+ +MRS+ +LD WH AD+Y +P+LS Sbjct 461 KEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMRSSYAQSLDVWHLADDYSKLPSLSD 520 Query 515 GWMAERKTEIARTLIVQDE--PQFFGAIRVANKTTRRMPLYSVPGL 558 W+ E + R L V D+ QFF I V N TR MP+YS+PGL Sbjct 521 EWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRPMPMYSIPGL 566 >gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium] Length=556 Score = 449 bits (1154), Expect = 1e-147, Method: Compositional matrix adjust. Identities = 245/575 (43%), Positives = 348/575 (61%), Gaps = 40/575 (7%) Query 1 VNRNNERHFNQIP-EMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAII 59 +NRN E HF + P + SR+ F+R ++ TF++G++IPF+++EVLPGDTF V TS +I Sbjct 1 MNRNVESHFAKNPTNIDISRSTFDRSSSVKLTFNTGEIIPFFIEEVLPGDTFKVKTSKVI 60 Query 60 RMTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVINGKE 119 R+ T P+MD+ ++D YYF+ PNR++W+++K+F GE ++ W+P+ EY +P++ Sbjct 61 RLQTLLTPMMDNIYLDTYYFFVPNRLVWEHWKEFNGENTQSAWIPEVEYQIPQLT----- 115 Query 120 KSPEPYED--SILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDA 177 +PE + ++ DY GIPT V + VNALP RAY + NE+FRD+N+ + I DA Sbjct 116 -APEGGWNIGTLADYFGIPTGVSGI-SVNALPFRAYALVCNEWFRDQNLSDPLNIPVGDA 173 Query 178 SIDYQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQG--NA 235 ++ + T + + GG K+HDYFTSCLP PQ+GP+VT+P+ N Sbjct 174 TV-------TGVNTGTFITDVVKGGLPYTAAKYHDYFTSCLPAPQKGPDVTIPVTSGHNL 226 Query 236 PVGMYKNDS-----LTEFGTINGNSEI---FLNQALNGSALAPKISNSSKEGARRALVTG 287 PV M+ N++ FG NSE+ + + + A + ++S+ E G Sbjct 227 PV-MFLNETHDAGPYKPFGVGIQNSELRNFYGFGSGSSGATSTSDTSSTVEVGSDGTGIG 285 Query 288 ST--NPTTQVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWD 345 PT A G+ TIN LR A +Q+ YE ARGG+RY E +++ + Sbjct 286 QNFWTPTNM------WAVESGDVGMATINQLRLAFQLQKLYEKDARGGTRYTEIIRSHFG 339 Query 346 VVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSF 405 VV D +Q PEYLGG R +N+NQI+Q S QS+ +P+G MSVT S F KSF Sbjct 340 VVSPDSRLQRPEYLGGNRIPINVNQIIQQS--QSTEQSPLGALAGMSVTTDKNSDFIKSF 397 Query 406 
EEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASD 465 EHG+IIG+ R++H+YQQGL+R+WSR DR D+Y P AN+GEQ V KEI + G +D Sbjct 398 VEHGYIIGLVVARYDHTYQQGLDRMWSRKDRFDFYWPVLANIGEQAVLNKEIYIDGSDTD 457 Query 466 EETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIA 525 +E FGYQEAWA+YR KPNRV +MRS+A +LD WH D+Y S+P LS W+ E KT + Sbjct 458 DEVFGYQEAWAEYRYKPNRVCGEMRSSAPQSLDVWHLGDDYSSLPYLSDSWIREDKTNVD 517 Query 526 RTLIVQD--EPQFFGAIRVANKTTRRMPLYSVPGL 558 R L V Q F I + NK TR MP+YS+PGL Sbjct 518 RVLAVTSSVSDQLFADIYICNKATRPMPMYSIPGL 552 >gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium] Length=551 Score = 438 bits (1127), Expect = 9e-144, Method: Compositional matrix adjust. Identities = 238/540 (44%), Positives = 327/540 (61%), Gaps = 34/540 (6%) Query 30 TTFDSGKLIPFYVDEVLPGDTFSVDTSAIIRMTTPKYPVMDDAFIDFYYFYCPNRILWDN 89 TTF+ G LIPFYVDE+LPGDTFS+DTS ++RM + PVMD+ ++D Y+F+ PNR+ W + Sbjct 31 TTFNVGDLIPFYVDEILPGDTFSIDTSKVVRMQSLLTPVMDNIYLDTYFFFVPNRLTWSH 90 Query 90 FKQFMGEVEETPWMPKKEYSVPKIVINGKEKSPEPYED--SILDYMGIPTKVKKVFKVNA 147 +++ MGE ++ W P+ EYSVP+I +PE + +I DYMGIPT V + VNA Sbjct 91 WRELMGENTQSAWTPQVEYSVPQIT------APEGGWNVGTIADYMGIPTGVSGL-SVNA 143 Query 148 LPIRAYVKIWNEFFRDENVDNAATIKTDDASIDYQDGGESEDKTDDILKKAIGGGRCLPV 207 +P RAY I NE+FRDEN+ + I DA++ G + D+ K GG Sbjct 144 MPFRAYALICNEWFRDENLTDPLNIPVGDATVA---GVNTGTYVTDVAK----GGLPFKA 196 Query 208 NKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMYKNDSLTEFGTINGNSEIFLNQALNGSA 267 K+HDYFTSCLP PQ+GP+V + G+ V + D+ + +N F+ + Sbjct 197 AKYHDYFTSCLPAPQKGPDVLISAVGSGIVPVTATDNDNDSLNVNSPGMRFVGNS----- 251 Query 268 LAPKISNSSKEGARRALVTGSTNPTTQVSDAAYLAANLGE--TTAT-----TINDLRKAV 320 + ++ + G +VT + P+T + + + NL +TAT TIN LR A Sbjct 252 -STSVNYLAFGGGDGYVVTDTPKPSTPIHGISMIPTNLWADLSTATDLPVATINQLRTAF 310 Query 321 AVQQYYEALARGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSS 380 +Q+ YE ARGG+RY E +++ + V D +Q PEYLGG R +N+NQ++Q+S + Sbjct 311 QIQKLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGSRVPININQVIQSS---ET 367 Query 381 
TDTPIGETGAMSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYY 440 TP G A S+T + S FTKSF EHGFIIG+ R++HSYQQGL+R WSR DR DYY Sbjct 368 GATPQGNAAAYSLTTDSHSEFTKSFVEHGFIIGLMVARYDHSYQQGLQRFWSRKDRFDYY 427 Query 441 VPQFANLGEQPVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFW 500 P FANLGE VK KEI G D+E FGYQEAWADYR KP+ V+ +MRS +LD W Sbjct 428 WPVFANLGEMAVKNKEIFAQGTDVDDEVFGYQEAWADYRYKPSVVTGEMRSQYAQSLDIW 487 Query 501 HYADNYKSVPTLSQGWMAERKTEIARTLIVQD--EPQFFGAIRVANKTTRRMPLYSVPGL 558 H AD+Y+++P+LS W+ E + + R L V D Q F I + TR MPLYS+PGL Sbjct 488 HLADDYENLPSLSDSWIREDSSTVNRVLAVSDSVSAQLFCDIYIRCLATRPMPLYSIPGL 547 >gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium] Length=568 Score = 430 bits (1106), Expect = 2e-140, Method: Compositional matrix adjust. Identities = 237/579 (41%), Positives = 339/579 (59%), Gaps = 39/579 (7%) Query 3 RNNERHFNQIP-EMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIRM 61 RN F++ P + R+ FNR T T+ + G+LIPFY DEVLPGDTF V T+ ++R+ Sbjct 2 RNENSRFSENPVTLDIQRSTFNRSSTYKTSANIGELIPFYYDEVLPGDTFQVKTNKVVRL 61 Query 62 TTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVINGKEKS 121 MD+ + D YYF+ PNR++W+++++FMGE ++ W+P+ EY++P+I + Sbjct 62 QPLVSAPMDNLYFDTYYFFVPNRLVWEHWEEFMGENKQGAWIPQTEYTIPQIT----SPA 117 Query 122 PEPYE-DSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASID 180 +E +I DY GIPT V + V+ALP RAY I +E+FRD+N+ I DD ++ Sbjct 118 STGFEIGTIADYFGIPTGVPNL-SVSALPFRAYALIVDEWFRDQNLQLPLNIPLDDTTLQ 176 Query 181 YQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNAPV--- 237 + G D + + GG+ K+HDYFTSCLP PQ+GP+VT+ G+ PV Sbjct 177 GVNTG-------DYVTDTVKGGKPFVAAKYHDYFTSCLPSPQKGPDVTIAAVGDFPVYTG 229 Query 238 ----------GMYKNDSLTEFGTINGNSEIFLNQALNGSALAPKISNSSKEGARRALVT- 286 ++ S G+++ + ++ ++ + + K A +T Sbjct 230 DPHNNNGSNKALHYGISNISSGSVSFSQGNYIIPSVLTTGSTQSVPAQGKLNASNITMTT 289 Query 287 --GSTNPTTQVSDAAY---LAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQ 341 GS + + + Y L A+ G TATTIN LR A +Q+ YE AR GSRYRE ++ Sbjct 290 SPGSPDSSFGSKLSVYPDNLYASSG--TATTINQLRMAFQIQKLYEKDARAGSRYRELIR 347 
Query 342 ALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSF 401 + + V D +Q+PEYLGG R +N+NQ+VQTS Q+S +P G S+T + F Sbjct 348 SHFSVTPLDARMQVPEYLGGNRIPININQVVQTS--QTSDVSPQGNVAGQSLTSDSHGDF 405 Query 402 TKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTG 461 KSF EHG +IGV R++H+YQQG+ +LWSR R DYY P AN+GEQ V KEI G Sbjct 406 IKSFTEHGMLIGVAVARYDHTYQQGVSKLWSRKTRFDYYWPVLANIGEQAVLNKEIYAQG 465 Query 462 EASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERK 521 A DEE FGYQEAWA+YR KP+ V+ +MRS+A +LD WH+AD+Y S+P LS W+ E K Sbjct 466 TAQDEEVFGYQEAWAEYRYKPSIVTGEMRSSARTSLDSWHFADDYNSLPKLSADWIKEDK 525 Query 522 TEIARTLIVQDEP--QFFGAIRVANKTTRRMPLYSVPGL 558 T I R L V Q+F + N+TTR +P YS+PGL Sbjct 526 TNIDRVLAVSSSVSNQYFADFYIENETTRALPFYSIPGL 564 >gi|575094415|emb|CDL65790.1| unnamed protein product [uncultured bacterium] Length=569 Score = 420 bits (1079), Expect = 2e-136, Method: Compositional matrix adjust. Identities = 237/596 (40%), Positives = 324/596 (54%), Gaps = 68/596 (11%) Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60 +NRN E H++QIP R +F RD + LTT + G L+P YVDEVLPGDT + +++R Sbjct 1 MNRNAEAHYSQIPHANIQRAKFKRDFSYLTTINEGDLVPIYVDEVLPGDTIKIKQRSLVR 60 Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVINGKEK 120 M+TP YPVMD+ ++D +YF+ P R++WD+++ MGE ++ W P +Y+ P Sbjct 61 MSTPLYPVMDNCYLDIWYFFVPCRLVWDHWQNLMGENTKSYWAPDVQYTTPLT----SAP 116 Query 121 SPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASID 180 S +I DYMGIPT V + KVN++P+RAY +IWNE+FRDEN+ T +DDA+ Sbjct 117 SGGWQVGTIADYMGIPTGVSGI-KVNSMPMRAYARIWNEWFRDENLQQPVTQHSDDATTT 175 Query 181 YQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRG----------PEV--- 227 + G L A GG L V KF DYFTSCLP PQ+G P+V Sbjct 176 GSNTGTE-------LTDAESGGLPLKVAKFKDYFTSCLPAPQKGEAIGFDFNQTPKVKGI 228 Query 228 --TLPMQGNAP---------------VGMYKNDSLTEFG------TINGNSEIFLNQALN 264 P++ N VG N S F T+NG F N Sbjct 229 GLVFPLETNTGHTATDILWRQPDAQLVGENYNTSYNNFNSITTQTTVNGKKAFFFNNG-K 287 Query 265 
GSALAPKISNSSKEGARRALVTGSTNPTTQVSDAAYLAANLGETTATTINDLRKAVAVQQ 324 G L+ + + G + +T +A N T +INDLR+A+A+Q Sbjct 288 GPMLSARFEDDYNGGVEQVELTA-------------VAEN--STNFLSINDLRQAIALQH 332 Query 325 YYEALARGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTP 384 EA ARGG+RY E ++ + V D +Q EY+GG R +N++Q++Q+S S T +P Sbjct 333 ILEADARGGTRYVEILKNEFGVSSPDARLQRSEYIGGERIPINVSQVIQSSA--SDTTSP 390 Query 385 IGETGAMSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQF 444 G A S+T + S EHG+I+G+ +R +HSYQQGL R+W+R+DR YY P Sbjct 391 QGNAAAYSLTTSANTIRAYSAVEHGYILGLAAIRVDHSYQQGLSRMWTRSDRFSYYHPML 450 Query 445 ANLGEQPVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYAD 504 ANLGEQ V +EI G +D E FGYQEAWADYR + N ++ +MRS +LD WHY D Sbjct 451 ANLGEQAVLNQEIYAQGTTADTEVFGYQEAWADYRYRTNMITGEMRSTYAQSLDAWHYGD 510 Query 505 NYKSVPTLSQGWMAERKTEIARTLIVQDE--PQFFGAIRVANKTTRRMPLYSVPGL 558 Y +P LS W+ E + I RTL VQ E QF + R MP+YSVPGL Sbjct 511 KYTDLPRLSNDWIKEGQENIDRTLAVQSENSHQFICNLYFDQTWVRPMPIYSVPGL 566 >gi|557745632|ref|YP_008798242.1| major capsid protein [Marine gokushovirus] gi|530695345|gb|AGT39902.1| major capsid protein [Marine gokushovirus] Length=538 Score = 401 bits (1031), Expect = 1e-129, Method: Compositional matrix adjust. 
Identities = 231/570 (41%), Positives = 320/570 (56%), Gaps = 61/570 (11%) Query 1 VNRNNERHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60 + + F+++P R+ F+R + TTF++G+L+P YVDE LPGDTFS + +A R Sbjct 13 IGSAKQHQFSEVPHADIQRSTFDRSHGLKTTFNAGQLVPIYVDEALPGDTFSCNLTAFSR 72 Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGE-----------VEETPWMPKKEYS 109 + TP +P MD+AF+D ++F P R++WD+F++FMGE ++ TP Sbjct 73 LATPIHPTMDNAFMDTHFFAVPVRLVWDDFEEFMGETKTYKAAGSDRLDGTPDFSVAAPV 132 Query 110 VPKIVINGKEKSPEPYEDSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNA 169 P I +G ++ E S+ DY GIPTKV + + +AL RAY +WN++FRDEN+ Sbjct 133 PPTITASGSGEA----EASLSDYFGIPTKVGGL-EFSALWHRAYTLVWNDWFRDENLQAP 187 Query 170 ATIKTDDASIDYQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTL 229 TI T +D A+ L K HDYFTS LP+PQ+G +VT+ Sbjct 188 KTIDTTSG--------------NDTTTYAL-----LNRGKKHDYFTSALPWPQKGADVTI 228 Query 230 PMQGNAPV--GMYKNDSLTEFGTINGNSEIFLNQALNGSALAPKISNSSKEGARRALVTG 287 P+ +APV N +T F GN+ FLN A + + P N+ + ARR Sbjct 229 PLGTSAPVTTANSSNQDVTIFTPNIGNTHRFLNSA--STNVYPGDENTDE--ARR----- 279 Query 288 STNPTTQVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVV 347 L A+L E T+ TIN LR A A Q++ E ARGGSRY E ++ ++V Sbjct 280 -------------LYADLSEATSATINQLRLAFATQKFLEIQARGGSRYIEVIKNHFNVT 326 Query 348 ISDKTVQIPEYLGGGRYHVNMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEE 407 D +Q PEYLGGG VN++ + QTS ++T P G A+ T ++ SFTKSF E Sbjct 327 SPDARLQRPEYLGGGSSPVNISPVAQTSSTDATT--PQGNLSAIGTTVLSGHSFTKSFTE 384 Query 408 HGFIIGVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEE 467 H +IG+ VR + +YQQGL R++SR DYY P + +GEQ VK KEI G A+DE Sbjct 385 HTIVIGMVSVRTDLTYQQGLNRMFSRETIYDYYWPTLSTIGEQAVKNKEIYAQGSAADET 444 Query 468 TFGYQEAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIART 527 TFGYQE +A+YR KP+ V+ K RSNATGTL+ WHYA Y S+P L W+ T + RT Sbjct 445 TFGYQERYAEYRYKPSSVTGKFRSNATGTLESWHYAQEYASLPLLGDSWIQVTDTNVQRT 504 Query 528 LIVQDEPQFFGAIRVANKTTRRMPLYSVPG 557 L V EPQF + TR MP+ S+PG Sbjct 505 LAVASEPQFIFDSLFKLRCTRPMPVNSIPG 534 >gi|530695351|gb|AGT39907.1| major capsid protein 
[Marine gokushovirus] Length=539 Score = 384 bits (985), Expect = 1e-122, Method: Compositional matrix adjust. Identities = 220/569 (39%), Positives = 324/569 (57%), Gaps = 50/569 (9%) Query 2 NRNNERH-FNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIR 60 N++ H F+ IP + R++F+ +T+ T FDSG L+P VDEVLPGD+ ++ +A R Sbjct 5 NKSASAHQFSMIPRAEIPRSKFDAQKTLKTAFDSGYLVPILVDEVLPGDSMNLRMTAFTR 64 Query 61 MTTPKYPVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVI-NGKE 119 + TP +PVMD+ ++D ++F+ PNR+LW N+++FMGE + P +Y++P + NG Sbjct 65 LATPLFPVMDNMYLDTFFFFVPNRLLWSNWQRFMGERDPDP-DSSIDYTIPTMTSPNGGY 123 Query 120 KSPEPYEDSILDYMGIPTK----VKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTD 175 +S+ DYMG+PT N+L RAY IWNE+FRDEN+ ++ + Sbjct 124 AV-----NSLQDYMGLPTAGQVDAGSSISHNSLFTRAYNLIWNEWFRDENLQDSVVVDKG 178 Query 176 DASIDYQDGGESEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNA 235 D Y D +L++ K HDYFTS LP+PQ+G VTLP+ G+A Sbjct 179 DGPDTYTD--------YTLLRRG----------KRHDYFTSALPWPQKGDAVTLPLGGSA 220 Query 236 PVGMYKNDSLTEFGTINGNSEIFLNQALNGSA-LAPKISNSSKEG-ARRALVTGSTNPTT 293 V ND+ ++ + G+ P + SKE ++ TGS N Sbjct 221 NV--VYNDT---------GDPAYIREVSTGNVWTTPSRESVSKEANGNMSVPTGSVN--A 267 Query 294 QVSDAAYLAANLGETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTV 353 Q L A+L TA TIN +R++ +Q+ E ARGG+RY E V++ + V+ D + Sbjct 268 QYDPNGSLVADLSTATAATINAIRQSFQIQRLLERDARGGTRYTEIVRSHFGVISPDARM 327 Query 354 QIPEYLGGGRYHVNMNQIVQTSGQQSS-TDTPIGETGAMSVTPVNESSFTKSFEEHGFII 412 Q PEYLGGG + +N + Q S +S TDTP+G GA+ + F SF EHG ++ Sbjct 328 QRPEYLGGGSAPIIVNPVAQQSASGASGTDTPLGTLGAVGTGLASGHGFASSFTEHGVVV 387 Query 413 GVCCVRHNHSYQQGLERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQ 472 G+C VR + +YQQGL R++SR+ R D++ P F++LGEQP+ KE+ TG ++D++ FGYQ Sbjct 388 GLCSVRADLTYQQGLHRMFSRSTRYDFFFPVFSHLGEQPILNKELYATGTSTDDDVFGYQ 447 Query 473 EAWADYRMKPNRVSSKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQD 532 EAWA+YR KP++V+ MRS A GTLD WH A N+ S+PTL+ ++ E + R + V Sbjct 448 EAWAEYRYKPSQVTGLMRSTAAGTLDAWHLAQNFGSLPTLNSTFI-EDTPPVDRVVAVGS 506 Query 533 EP---QFFGAIRVANKTTRRMPLYSVPGL 558 E QF R 
MP+YSVPGL Sbjct 507 EANGQQFIFDAFFDINMARPMPMYSVPGL 535 >gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae] Length=533 Score = 383 bits (983), Expect = 1e-122, Method: Compositional matrix adjust. Identities = 220/552 (40%), Positives = 320/552 (58%), Gaps = 52/552 (9%) Query 7 RHFNQIPEMKASRTRFNRDQTILTTFDSGKLIPFYVDEVLPGDTFSVDTSAIIRMTTPKY 66 F+++P+ R+ F+R + TTF+SG LIP YVDEVLPGDTF ++ + R+ TP Y Sbjct 15 HEFSRVPQADIQRSTFSRVHGLKTTFNSGDLIPIYVDEVLPGDTFQMNATGFGRLATPLY 74 Query 67 PVMDDAFIDFYYFYCPNRILWDNFKQFMGEVEETPWMPKKEYSVPKIVINGKEKSPEPYE 126 PVMD+ +++ ++FY PNRI+WDN+++F G ++ ++ VP+I +S E Sbjct 75 PVMDNMYVETFFFYVPNRIIWDNWEKFNGAQDDP--NDSTDFLVPQI------QSATVAE 126 Query 127 DSILDYMGIPTKVKKVFKVNALPIRAYVKIWNEFFRDENVDNAATIKTDDASIDYQDGGE 186 S+ DYMG+PT++ + N L RAY IWNE+FRDEN+ ++ + DD Y Sbjct 127 GSLFDYMGLPTQIAGI-DFNNLHGRAYNLIWNEWFRDENLQDSLGVPKDDGPDTY----- 180 Query 187 SEDKTDDILKKAIGGGRCLPVNKFHDYFTSCLPYPQRGPEVTLPMQGNAPVGMYKNDSLT 246 T ++K K HDYFTS LP+PQ+G V+LP+ +A + + Sbjct 181 ----TGYTIQKR---------GKRHDYFTSALPWPQKGDAVSLPLGTSADI-----HTAA 222 Query 247 EFGTINGNSEIFLNQALNGSALAPKISNSSKEGARRALVTGSTNPTTQVSDAAYLAANLG 306 GT G + GS+ +++ E A ++G T P T + A+L Sbjct 223 AAGTDIGIYSV-------GSSDFRLLTSDPVEVA----LSGGTPPETNK-----MFADLS 266 Query 307 ETTATTINDLRKAVAVQQYYEALARGGSRYREQVQALWDVVISDKTVQIPEYLGGGRYHV 366 TA TIN LR+A +Q+ YE ARGG+RY E +Q+ + V D +Q PEYLGG + V Sbjct 267 NATAATINQLREAFQIQRLYEKDARGGTRYTEILQSHFGVTSPDARLQRPEYLGGQKTEV 326 Query 367 NMNQIVQTSGQQSSTDTPIGETGAMSVTPVNESSFTKSFEEHGFIIGVCCVRHNHSYQQG 426 M + QTS S++ P G A+ T + F+KSF EHG +IG+ CV + +YQQG Sbjct 327 MMQTVPQTSSTDSTS--PQGNLAALG-TATSRGGFSKSFVEHGVLIGLACVFADLTYQQG 383 Query 427 LERLWSRTDRLDYYVPQFANLGEQPVKKKEIMLTGEASDEETFGYQEAWADYRMKPNRVS 486 + R+WSR DR D+Y P A+LGEQ V +EI G ++D +TFGYQE +A+YR KP++++ Sbjct 384 MNRMWSRRDRWDFYWPSLAHLGEQAVLNQEIYTQGTSADTQTFGYQERFAEYRYKPSQIT 443 Query 487 SKMRSNATGTLDFWHYADNYKSVPTLSQGWMAERKTEIARTLIVQDEPQFFGAIRVANKT 546 KMRSNATGTLD WH A ++ ++P L+ ++ E + R + V EP+F KT Sbjct 444 
GKMRSNATGTLDAWHLAQDFTALPALNASFI-EENPPVDRVIAVPSEPEFIWDWYFDLKT 502 Query 547 TRRMPLYSVPGL 558 TR MP+YSVPGL Sbjct 503 TRPMPVYSVPGL 514

Lambda      K        H        a        alpha
 0.316    0.133    0.397    0.792      4.96

Gapped
Lambda      K        H        a        alpha    sigma
 0.267    0.0410   0.140     1.90      42.6     43.6

Effective search space used: 4106350928880
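The gapped Lambda and K values in the statistics footer tie each raw score (the number in parentheses in a "Score =" line) to its bit score through the standard Karlin-Altschul conversion S' = (Lambda * S - ln K) / ln 2. The Python sketch below applies that conversion to a few raw scores copied from the hit headers in this report. The function name `bit_score` and the constant names are illustrative choices, not part of BLAST. Because the footer prints rounded parameters and the search used compositional matrix adjustment, the result should only be expected to agree with the reported bit scores to within about one bit.

```python
import math

# Gapped Karlin-Altschul parameters as printed (rounded) in the report footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to a bit score: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, reported bit score) pairs copied from the hit headers in this report.
REPORTED = [(1232, 479), (1195, 464), (1175, 457), (1154, 449), (1127, 438)]

if __name__ == "__main__":
    for raw, bits in REPORTED:
        # Print the recomputed value next to the value BLAST reported.
        print(f"raw={raw}  reported={bits}  recomputed={bit_score(raw):.2f}")
```

The same footer parameters do not by themselves reproduce the reported Expect values, since those also depend on length corrections applied by BLAST, so the sketch deliberately stops at the bit score.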