Bit-score colors: <40, 40-50, 50-80, 80-200, >200
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-25_CDS_annotation_glimmer3.pl_2_1

Length=449

                                                            Score   E
Sequences producing significant alignments:                 (Bits)  Value

gi|575094496|emb|CDL65862.1|  unnamed protein product       323     1e-100
gi|575094544|emb|CDL65904.1|  unnamed protein product       311     6e-96
gi|575094492|emb|CDL65859.1|  unnamed protein product       302     1e-92
gi|575094572|emb|CDL65928.1|  unnamed protein product       292     1e-88
gi|575094431|emb|CDL65804.1|  unnamed protein product       289     1e-87
gi|575096056|emb|CDL66947.1|  unnamed protein product       282     1e-84
gi|313766927|gb|ADR80653.1|   putative major coat protein   276     1e-82
gi|444298010|dbj|GAC77834.1|  major capsid protein          272     8e-82
gi|19387569|ref|NP_598320.1|  capsid protein                270     3e-80
gi|530695351|gb|AGT39907.1|   major capsid protein          269     5e-80

>gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium]
Length=568

Score = 323 bits (828), Expect = 1e-100, Method: Compositional matrix adjust.
Identities = 186/449 (41%), Positives = 256/449 (57%), Gaps = 31/449 (7%) Query 28 GSIADYLGLPIGSISQSSPVSVLPFRCFALIYDKYFRNENTTDEIYIQKKGFSLSELIGA 87 G+IADY G+P G + S VS LPFR +ALI D++FR++N + I +L + Sbjct 124 GTIADYFGIPTGVPNLS--VSALPFRAYALIVDEWFRDQNLQLPLNIPLDDTTLQGVNTG 181 Query 88 QNFSPNSYCGKLPKVNKYKDYFTSCVPNPQKGAPVTFNLGDQAVVRTSDSELVTGPQEQM 147 + GK KY DYFTSC+P+PQKG VT V T D G + + Sbjct 182 DYVTDTVKGGKPFVAAKYHDYFTSCLPSPQKGPDVTIAAVGDFPVYTGDPHNNNGSNKAL 241 Query 148 --ALTNSQSGSASVGEH----PLIVGLGGM-------RFDAAAFSGTVAAG--------- 185 ++N SGS S + P ++ G + +A+ + T + G Sbjct 242 HYGISNISSGSVSFSQGNYIIPSVLTTGSTQSVPAQGKLNASNITMTTSPGSPDSSFGSK 301 Query 186 --LYPNNLYADLSSANAISVDDLRLAFAYQKMLERDAIYGSRYNEYLYGHFGVHIPDAYI 243 +YP+NLYA SS A +++ LR+AF QK+ E+DA GSRY E + HF V DA + Sbjct 302 LSVYPDNLYA--SSGTATTINQLRMAFQIQKLYEKDARAGSRYRELIRSHFSVTPLDARM 359 Query 244 QFPQYLGGGRTPLNIVQVAQTSQGTEESPLGNVGAYSWTNGRTG-YSRKFKEHGIVMTVA 302 Q P+YLGG R P+NI QV QTSQ ++ SP GNV S T+ G + + F EHG+++ VA Sbjct 360 QVPEYLGGNRIPININQVVQTSQTSDVSPQGNVAGQSLTSDSHGDFIKSFTEHGMLIGVA 419 Query 303 CLRYRHTYQQGIAKKWRRKVREDFYDPLFSTIGQQPVYTTELYAQAFPQ--TVFGYREAW 360 RY HTYQQG++K W RK R D+Y P+ + IG+Q V E+YAQ Q VFGY+EAW Sbjct 420 VARYDHTYQQGVSKLWSRKTRFDYYWPVLANIGEQAVLNKEIYAQGTAQDEEVFGYQEAW 479 Query 361 SELRNIPNTISGEMRSGVTNSLDIWHFADNYSSSPTLSQSFTEETPSYVDRTLSVPSSSQ 420 +E R P+ ++GEMRS SLD WHFAD+Y+S P LS + +E + +DR L+V SS Sbjct 480 AEYRYKPSIVTGEMRSSARTSLDSWHFADDYNSLPKLSADWIKEDKTNIDRVLAVSSSVS 539 Query 421 NNFILNFYFDMSAVRKMPVYSMPSLIDHH 449 N + +FY + R +P YS+P LIDHH Sbjct 540 NQYFADFYIENETTRALPFYSIPGLIDHH 568 >gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium] Length=551 Score = 311 bits (796), Expect = 6e-96, Method: Compositional matrix adjust. 
Identities = 185/462 (40%), Positives = 262/462 (57%), Gaps = 22/462 (5%) Query 3 GNPNPSAYVNNVLEEIPMTYGEVNP---GSIADYLGLPIGSISQSSPVSVLPFRCFALIY 59 G SA++ ++P N G+IADY G+P G + V+ LPFR +ALI Sbjct 95 GENTESAWLPTTEYQVPQVTAPANGWSIGTIADYFGIPTGV---ACSVNALPFRAYALIC 151 Query 60 DKYFRNENTTDEIYIQKKGFSLSELIGA--QNFSPNSYCGKLP-KVNKYKDYFTSCVPNP 116 +++FR+EN +D + I S + ++G+ N+ + G +P K KY DYFTSC+P P Sbjct 152 NEWFRDENLSDPLNIP---ISDATVVGSNGDNYITDIVKGGMPFKACKYHDYFTSCLPAP 208 Query 117 QKGAPVTFNLGDQAVVRTSDSELVTGPQEQ---MALTNSQSGSASVGEHPL--IVGLGGM 171 QKG V L V T+ +V Q MA +S + S ++ + + G+ G Sbjct 209 QKGPDVLLPLSSSPVPVTTSDTMVDPLQYSKYPMAGVDSWNLSPTLMRNIIRPFEGVEGA 268 Query 172 RFDAAAFSGTVAA--GLYPNNLYADLSSANAISVDDLRLAFAYQKMLERDAIYGSRYNEY 229 + F+G + P NL A+L +A A S++ LRLAF Q++ ERDA G+RY E Sbjct 269 NYQVHQFTGDIPTIDAFRPLNLVANLQNATAASINQLRLAFQIQRLYERDARGGTRYIEI 328 Query 230 LYGHFGVHIPDAYIQFPQYLGGGRTPLNIVQVAQTSQGTEESPLGN-VGAYSWTNGRTGY 288 L HFGV PDA +Q P+YLGG R P+NI QV Q S+ T SP GN VG T+ + Sbjct 329 LKSHFGVTSPDARLQRPEYLGGNRIPININQVLQQSETTSTSPQGNPVGQSLTTDTNADF 388 Query 289 SRKFKEHGIVMTVACLRYRHTYQQGIAKKWRRKVREDFYDPLFSTIGQQPVYTTELYAQ- 347 + F EHG V+ + RY HTYQQG+ + W RK R D+Y P+F+ IG+Q V E+Y Sbjct 389 VKSFVEHGFVIGLMVARYDHTYQQGLERFWSRKDRFDYYWPVFAHIGEQAVLNKEIYTSG 448 Query 348 -AFPQTVFGYREAWSELRNIPNTISGEMRSGVTNSLDIWHFADNYSSSPTLSQSFTEETP 406 A VFGY+EA+++ R P+ ++GEMRS SLD+WH AD+Y+S P+LS S+ E+ Sbjct 449 TAVDDEVFGYQEAYADYRYKPSRVTGEMRSAAPQSLDVWHLADDYASLPSLSDSWIRESA 508 Query 407 SYVDRTLSVPSSSQNNFILNFYFDMSAVRKMPVYSMPSLIDH 448 S VDR L+V S+ + Y + R MP+YS+P LIDH Sbjct 509 STVDRVLAVSSNVSAQLFCDIYIQNRSTRPMPMYSVPGLIDH 550 >gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium] Length=551 Score = 302 bits (773), Expect = 1e-92, Method: Compositional matrix adjust. 
Identities = 184/466 (39%), Positives = 258/466 (55%), Gaps = 25/466 (5%) Query 1 VFGNPNPSAYVNNVLEEIPMTY---GEVNPGSIADYLGLPIGSISQSSPVSVLPFRCFAL 57 + G SA+ V +P G N G+IADY+G+P G S V+ +PFR +AL Sbjct 94 LMGENTQSAWTPQVEYSVPQITAPEGGWNVGTIADYMGIPTGVSGLS--VNAMPFRAYAL 151 Query 58 IYDKYFRNENTTDEIYIQKKGFSLSELIGAQNFSPNSYCGKLP-KVNKYKDYFTSCVPNP 116 I +++FR+EN TD + I G + + + + G LP K KY DYFTSC+P P Sbjct 152 ICNEWFRDENLTDPLNI-PVGDATVAGVNTGTYVTDVAKGGLPFKAAKYHDYFTSCLPAP 210 Query 117 QKGAPVTFNLGDQAVVRTS------DSELVTGPQEQMALTNSQSGSASVGEHPLIVGLGG 170 QKG V + +V + DS V P + + S SV G G Sbjct 211 QKGPDVLISAVGSGIVPVTATDNDNDSLNVNSPGMRFV----GNSSTSVNYLAFGGGDGY 266 Query 171 MRFDAAAFSGTV-AAGLYPNNLYADLSSANAI---SVDDLRLAFAYQKMLERDAIYGSRY 226 + D S + + P NL+ADLS+A + +++ LR AF QK+ ERDA G+RY Sbjct 267 VVTDTPKPSTPIHGISMIPTNLWADLSTATDLPVATINQLRTAFQIQKLYERDARGGTRY 326 Query 227 NEYLYGHFGVHIPDAYIQFPQYLGGGRTPLNIVQVAQTSQGTEESPLGNVGAYSWT-NGR 285 E L HFGV PDA +Q P+YLGG R P+NI QV Q+S+ T +P GN AYS T + Sbjct 327 IEILKSHFGVTSPDARLQRPEYLGGSRVPININQVIQSSE-TGATPQGNAAAYSLTTDSH 385 Query 286 TGYSRKFKEHGIVMTVACLRYRHTYQQGIAKKWRRKVREDFYDPLFSTIGQQPVYTTELY 345 + +++ F EHG ++ + RY H+YQQG+ + W RK R D+Y P+F+ +G+ V E++ Sbjct 386 SEFTKSFVEHGFIIGLMVARYDHSYQQGLQRFWSRKDRFDYYWPVFANLGEMAVKNKEIF 445 Query 346 AQA--FPQTVFGYREAWSELRNIPNTISGEMRSGVTNSLDIWHFADNYSSSPTLSQSFTE 403 AQ VFGY+EAW++ R P+ ++GEMRS SLDIWH AD+Y + P+LS S+ Sbjct 446 AQGTDVDDEVFGYQEAWADYRYKPSVVTGEMRSQYAQSLDIWHLADDYENLPSLSDSWIR 505 Query 404 ETPSYVDRTLSVPSSSQNNFILNFYFDMSAVRKMPVYSMPSLIDHH 449 E S V+R L+V S + Y A R MP+YS+P LIDHH Sbjct 506 EDSSTVNRVLAVSDSVSAQLFCDIYIRCLATRPMPLYSIPGLIDHH 551 >gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium] Length=556 Score = 292 bits (747), Expect = 1e-88, Method: Compositional matrix adjust. 
Identities = 177/468 (38%), Positives = 252/468 (54%), Gaps = 28/468 (6%) Query 3 GNPNPSAYVNNVLEEIPMTY---GEVNPGSIADYLGLPIGSISQSSPVSVLPFRCFALIY 59 G SA++ V +IP G N G++ADY G+P G S V+ LPFR +AL+ Sbjct 96 GENTQSAWIPEVEYQIPQLTAPEGGWNIGTLADYFGIPTGVSGIS--VNALPFRAYALVC 153 Query 60 DKYFRNENTTDEIYIQKKGFSLSELIGAQNFSPNSYCGKLP-KVNKYKDYFTSCVPNPQK 118 +++FR++N +D + I +++ + F + G LP KY DYFTSC+P PQK Sbjct 154 NEWFRDQNLSDPLNIPVGDATVTG-VNTGTFITDVVKGGLPYTAAKYHDYFTSCLPAPQK 212 Query 119 GAPVTF------NLGDQAVVRTSDSELVTGPQEQ--MALTNSQSGSASVGEHPLIVGLGG 170 G VT NL + T D+ GP + + + NS+ + Sbjct 213 GPDVTIPVTSGHNLPVMFLNETHDA----GPYKPFGVGIQNSELRNFYGFGSGSSGATST 268 Query 171 MRFDAAAFSGTVAAGL-----YPNNLYA-DLSSANAISVDDLRLAFAYQKMLERDAIYGS 224 + G+ G+ P N++A + +++ LRLAF QK+ E+DA G+ Sbjct 269 SDTSSTVEVGSDGTGIGQNFWTPTNMWAVESGDVGMATINQLRLAFQLQKLYEKDARGGT 328 Query 225 RYNEYLYGHFGVHIPDAYIQFPQYLGGGRTPLNIVQVAQTSQGTEESPLGNVGAYSWTNG 284 RY E + HFGV PD+ +Q P+YLGG R P+N+ Q+ Q SQ TE+SPLG + S T Sbjct 329 RYTEIIRSHFGVVSPDSRLQRPEYLGGNRIPINVNQIIQQSQSTEQSPLGALAGMSVTTD 388 Query 285 R-TGYSRKFKEHGIVMTVACLRYRHTYQQGIAKKWRRKVREDFYDPLFSTIGQQPVYTTE 343 + + + + F EHG ++ + RY HTYQQG+ + W RK R DFY P+ + IG+Q V E Sbjct 389 KNSDFIKSFVEHGYIIGLVVARYDHTYQQGLDRMWSRKDRFDFYWPVLANIGEQAVLNKE 448 Query 344 LYAQA--FPQTVFGYREAWSELRNIPNTISGEMRSGVTNSLDIWHFADNYSSSPTLSQSF 401 +Y VFGY+EAW+E R PN + GEMRS SLD+WH D+YSS P LS S+ Sbjct 449 IYIDGSDTDDEVFGYQEAWAEYRYKPNRVCGEMRSSAPQSLDVWHLGDDYSSLPYLSDSW 508 Query 402 TEETPSYVDRTLSVPSSSQNNFILNFYFDMSAVRKMPVYSMPSLIDHH 449 E + VDR L+V SS + + Y A R MP+YS+P LIDHH Sbjct 509 IREDKTNVDRVLAVTSSVSDQLFADIYICNKATRPMPMYSIPGLIDHH 556 >gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium] Length=560 Score = 289 bits (740), Expect = 1e-87, Method: Compositional matrix adjust. 
Identities = 171/444 (39%), Positives = 252/444 (57%), Gaps = 30/444 (7%) Query 29 SIADYLGLP--IGSISQSSPVSVLPFRCFALIYDKYFRNENTTDEIYIQKKGFSLSELIG 86 S+AD++G+P + +IS V+ LPFR + LIY+++FRN+N T+ ++ +++ Sbjct 124 SLADHMGIPTKVDNIS----VNALPFRAYGLIYNEFFRNQNLTNPTQVEVTDANIAGKNP 179 Query 87 AQNFSPNSYC---GKLPKVNKYKDYFTSCVPNPQKGAPVTFNL---------GD--QAVV 132 + N + K K K+ DYFT +P PQKG PV NL GD + Sbjct 180 NDVKNSNDWAITGAKCLKSAKFFDYFTGALPQPQKGEPVEINLASSWLPVGIGDYHGPLD 239 Query 133 RTSDSELVTGPQEQMALTNSQSGSASVGEHPLIVGLGGMR-FDAAA---FSGTVAAGLYP 188 + S+S+ +T ++ + + + V G++ F+ A FS + A YP Sbjct 240 KVSNSDTLTWESPSSEGNTKRTYALGMVQQEGEVNPNGLKNFETKAGGSFSESGAVAAYP 299 Query 189 NNLYADLSSANAISVDDLRLAFAYQKMLERDAIYGSRYNEYLYGHFGVHIPDAYIQFPQY 248 NL+A +A A +V+ LR AF QK+LE+DA G+RY E L HFGV DA +Q P+Y Sbjct 300 TNLWASPVTA-AATVNQLRQAFQVQKLLEKDARGGTRYREILKNHFGVTTSDARMQIPEY 358 Query 249 LGGGRTPLNIVQVAQTSQGTEESPLGNVGAYSWTN-GRTGYSRKFKEHGIVMTVACLRYR 307 LGG + P+N+ QV QTS T+ SP GN A S T ++ +++ F EHG ++ VA R Sbjct 359 LGGCKVPINVSQVVQTSASTDASPQGNTAAISVTPFSKSMFTKSFDEHGFIIGVATARTA 418 Query 308 HTYQQGIAKKWRRKVREDFYDPLFSTIGQQPVYTTELYAQ--AFPQTVFGYREAWSELRN 365 +YQQGI + W RK R D+Y P+ + IG+Q + E+YAQ A FGY+EAW++ R Sbjct 419 QSYQQGIERMWSRKDRLDYYFPVLANIGEQAILNKEIYAQGNAKDDEAFGYQEAWADYRY 478 Query 366 IPNTISGEMRSGVTNSLDIWHFADNYSSSPTLSQSFTEETPSYVDRTLSVPSSSQNNFIL 425 PNTI G RS SLD WH+ +Y PTLS + E++ + RTL+V ++ +FI Sbjct 479 KPNTICGRFRSNAQQSLDAWHYGQDYDKLPTLSTDWMEQSDIEMKRTLAV--QTEPDFIA 536 Query 426 NFYFDMSAVRKMPVYSMPSLIDHH 449 NF F+ VR MP+YS+P LIDH+ Sbjct 537 NFRFNCKTVRVMPLYSIPGLIDHN 560 >gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium] Length=570 Score = 282 bits (721), Expect = 1e-84, Method: Compositional matrix adjust. 
Identities = 180/479 (38%), Positives = 247/479 (52%), Gaps = 36/479 (8%) Query 3 GNPNPSAYVNNVLEEIPMT---YGEVNPGSIADYLGLPIGSISQSSPVSVLPFRCFALIY 59 G N SA++ IP G G+IADY GLP G + S VS LPFR +ALI Sbjct 96 GENNESAWIPQTEYAIPQLKSPVGGFEVGTIADYFGLPTGVANLS--VSALPFRAYALIM 153 Query 60 DKYFRNENTTDEIYIQKKGFSLSELIGAQNFSPNSYCGKLPKVNKYKDYFTSCVPNPQKG 119 +++FR+EN D + + +++ + + + GK KY DYFTS +P PQKG Sbjct 154 NEWFRDENLMDPLVVPTDDATVTGVNTGIFVTDVAKGGKPFVAAKYHDYFTSALPAPQKG 213 Query 120 APVTF---NLGDQAVVRTSDSELVTGPQEQMALTNSQSGSASVGEHPLIVGL-------- 168 V + G+ VV ++ + + N SGS G G+ Sbjct 214 PDVVIPVASAGNYNVVGNGKGLALSDGSKMSIICNGLSGSNGQGTELFASGILGSQVGSS 273 Query 169 ------GGMRFDAAAFSGTVAAGLYPNNL------YADLSSANAISVDDLRLAFAYQKML 216 +R D AA L NNL +A A +++ LR+AF QK Sbjct 274 GGFGSGSSLRGDGIILGVPTAAQL-GNNLENSGLIAIASGNAAAATINQLRMAFQIQKFY 332 Query 217 ERDAIYGSRYNEYLYGHFGVHIPDAYIQFPQYLGGGRTPLNIVQVAQTSQGT---EESPL 273 E+ A GSRY E + FGV PDA +Q +YLGG R P+NI QV Q S GT +P Sbjct 333 EKQARGGSRYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQVIQQS-GTGSASTTPQ 391 Query 274 GNV-GAYSWTNGRTGYSRKFKEHGIVMTVACLRYRHTYQQGIAKKWRRKVREDFYDPLFS 332 G V G T+ + +++ F EHG ++ V C RY HTYQQGI + W RK + D+Y P+FS Sbjct 392 GTVVGMSQTTDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMWSRKDKFDYYWPVFS 451 Query 333 TIGQQPVYTTELYAQ--AFPQTVFGYREAWSELRNIPNTISGEMRSGVTNSLDIWHFADN 390 IG+Q + E+YAQ A VFGY+EAW+E R P+ ++GEMRS SLD+WH AD+ Sbjct 452 NIGEQAIKNKEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMRSSYAQSLDVWHLADD 511 Query 391 YSSSPTLSQSFTEETPSYVDRTLSVPSSSQNNFILNFYFDMSAVRKMPVYSMPSLIDHH 449 YS P+LS + E ++R L+V + N F + Y R MP+YS+P LIDHH Sbjct 512 YSKLPSLSDEWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRPMPMYSIPGLIDHH 570 >gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae] Length=533 Score = 276 bits (705), Expect = 1e-82, Method: Compositional matrix adjust. 
Identities = 169/430 (39%), Positives = 240/430 (56%), Gaps = 42/430 (10%) Query 25 VNPGSIADYLGLP--IGSISQSSPVSVLPFRCFALIYDKYFRNENTTDEIYIQKKGFSLS 82 V GS+ DY+GLP I I ++ L R + LI++++FR+EN D + + K Sbjct 124 VAEGSLFDYMGLPTQIAGIDFNN----LHGRAYNLIWNEWFRDENLQDSLGVPKDD---- 175 Query 83 ELIGAQNFSPNSYCG-KLPKVNKYKDYFTSCVPNPQKGAPVTFNLGDQAVVRTSDSELVT 141 P++Y G + K K DYFTS +P PQKG V+ LG A + T+ Sbjct 176 --------GPDTYTGYTIQKRGKRHDYFTSALPWPQKGDAVSLPLGTSADIHTA------ 221 Query 142 GPQEQMALTNSQSGSASVGEHPLIVGLGGMRFDAAAFSGTVAAGLYPNNLYADLSSANAI 201 A + G SVG + L + A GT N ++ADLS+A A Sbjct 222 ------AAAGTDIGIYSVGSSDFRL-LTSDPVEVALSGGTPPE---TNKMFADLSNATAA 271 Query 202 SVDDLRLAFAYQKMLERDAIYGSRYNEYLYGHFGVHIPDAYIQFPQYLGGGRTPLNIVQV 261 +++ LR AF Q++ E+DA G+RY E L HFGV PDA +Q P+YLGG +T + + V Sbjct 272 TINQLREAFQIQRLYEKDARGGTRYTEILQSHFGVTSPDARLQRPEYLGGQKTEVMMQTV 331 Query 262 AQTSQGTEESPLGNVGAYSWTNGRTGYSRKFKEHGIVMTVACLRYRHTYQQGIAKKWRRK 321 QTS SP GN+ A R G+S+ F EHG+++ +AC+ TYQQG+ + W R+ Sbjct 332 PQTSSTDSTSPQGNLAALGTATSRGGFSKSFVEHGVLIGLACVFADLTYQQGMNRMWSRR 391 Query 322 VREDFYDPLFSTIGQQPVYTTELYAQ---AFPQTVFGYREAWSELRNIPNTISGEMRSGV 378 R DFY P + +G+Q V E+Y Q A QT FGY+E ++E R P+ I+G+MRS Sbjct 392 DRWDFYWPSLAHLGEQAVLNQEIYTQGTSADTQT-FGYQERFAEYRYKPSQITGKMRSNA 450 Query 379 TNSLDIWHFADNYSSSPTLSQSFTEETPSYVDRTLSVPSSSQNNFILNFYFDMSAVRKMP 438 T +LD WH A ++++ P L+ SF EE P VDR ++VPS + FI ++YFD+ R MP Sbjct 451 TGTLDAWHLAQDFTALPALNASFIEENPP-VDRVIAVPSEPE--FIWDWYFDLKTTRPMP 507 Query 439 VYSMPSLIDH 448 VYS+P LIDH Sbjct 508 VYSVPGLIDH 517 >gi|444298010|dbj|GAC77834.1| major capsid protein [uncultured marine virus] Length=480 Score = 272 bits (695), Expect = 8e-82, Method: Compositional matrix adjust. 
Identities = 167/441 (38%), Positives = 245/441 (56%), Gaps = 36/441 (8%) Query 18 IPMTYGEVNPGSIADYLGLPIGSISQSSPVSVLPFRCFALIYDKYFRNENTTDEIYIQKK 77 IP V GS+AD++GLP+G ++ LPFRC+ LIY+++FR+EN D I Sbjct 65 IPKMVAAVIEGSLADHMGLPLGG---QFSINALPFRCYNLIYNEWFRDENLQDSIPENTD 121 Query 78 GFSLSELIGAQNFSPNSYCG-KLPKVNKYKDYFTSCVPNPQKGAPVTFNLGDQA-VVRTS 135 P+S L + K DYFTSC+P PQKGA VT LG A + Sbjct 122 D------------GPDSVLDYLLRRRGKRHDYFTSCLPWPQKGAAVTLPLGTSAPITGIG 169 Query 136 DSELVTGPQEQMALTNSQSGSASVGEHPLIVGLGGMRFDAAAFSGTVAAGLYPNNLYADL 195 D V Q + T+ +G+ S H I F AF +PN + ADL Sbjct 170 DQLQVYAGQTDVHETDG-TGTVSYANH--IESATAASF---AFEEDPDNAGFPN-IRADL 222 Query 196 SSANAISVDDLRLAFAYQKMLERDAIYGSRYNEYLYGHFGVHIPDAYIQFPQYLGGGRTP 255 ++A A +++ LR AF QK+LERDA G+RY E + HF V PD+ +Q P+YLGGG + Sbjct 223 TNATAATINQLRQAFQIQKLLERDARGGTRYTEIIRAHFSVLSPDSRLQRPEYLGGGSSN 282 Query 256 LNIVQVAQTSQGT----EESPLGNVGAY--SWTNGRTGYSRKFKEHGIVMTVACLRYRHT 309 +NI +AQT + +E+P GN+ A S +G G+++ F EHG ++ + +R T Sbjct 283 INITPIAQTQRSDTTTPDETPQGNLAAIGTSAFSGH-GFTKSFTEHGYILGLCEVRADLT 341 Query 310 YQQGIAKKWRRKVREDFYDPLFSTIGQQPVYTTELYAQAFP--QTVFGYREAWSELRNIP 367 YQQGI + W R R DFY P S IG+Q V + E++A A + VFGY+E ++E R P Sbjct 342 YQQGIDRLWSRDTRYDFYWPALSHIGEQAVLSKEIFADATAGDEDVFGYQERFAEYRYKP 401 Query 368 NTISGEMRSGVTNSLDIWHFADNYSSSPTLSQSFTEETPSYVDRTLSVPSSSQNNFILNF 427 + IS RS SLD+WH + ++++ P L+ +F E+TP +DR ++V + + + +L+ Sbjct 402 SRISSLFRSNAAASLDVWHLSQDFAARPVLNSTFIEDTPP-IDRVIAV--TDEPHILLDA 458 Query 428 YFDMSAVRKMPVYSMPSLIDH 448 YF + R +P+Y +P LIDH Sbjct 459 YFKLRCARPLPLYGVPGLIDH 479 >gi|19387569|ref|NP_598320.1| capsid protein [Spiroplasma phage 4] gi|137968|sp|P11333.1|F_SPV4 RecName: Full=Capsid protein VP1; AltName: Full=VP1 [Spiroplasma phage 4] gi|334999|gb|AAA72621.1| capsid protein [Spiroplasma phage 4] Length=553 Score = 270 bits (689), Expect = 3e-80, Method: Compositional matrix adjust. 
Identities = 171/458 (37%), Positives = 248/458 (54%), Gaps = 21/458 (5%) Query 2 FGNPNPSAYVNNV--LEEIPMTYGEVNPGSIADYLGLPIGSISQSSP---VSVLPFRCFA 56 FG + S V N + +I G + G++AD+ G I+ P V L FR +A Sbjct 100 FGENSDSWDVKNAPPVPDIVAPSGGWDYGTLADHFG-----ITPKVPGIRVKSLRFRAYA 154 Query 57 LIYDKYFRNENTTDEIYIQKKGFSLSELIGAQNFSPNSYCGKLPKVNKYKDYFTSCVPNP 116 I + +FR++N + E + + G+ + GK NKY DYFTSC+P P Sbjct 155 KIINDWFRDQNLSSECALTLDSSNSQGSNGSNQVTDIQLGGKPYIANKYHDYFTSCLPAP 214 Query 117 QKGAPVTFNLGDQAVVRTSDSELVTGPQEQMALTNSQSGSASVGEHPLIVGLGGMRFDAA 176 QKGAP T N+G A V T ++ + +++ + G+ L +G F A Sbjct 215 QKGAPTTLNVGGMAPVTTKFRDVPNLSGTPLIFRDNKGRTIKTGQ--LGIGPVDAGFLVA 272 Query 177 AFSGTVAAG--LYPNNLYADLSSANAISVDDLRLAFAYQKMLERDAIYGSRYNEYLYGHF 234 + A G P+NL+ADLS+A IS+ DLRLA YQ E DA G+RY E+ HF Sbjct 273 QNTAQAANGERAIPSNLWADLSNATGISISDLRLAITYQHYKEMDARGGTRYVEFTLNHF 332 Query 235 GVHIPDAYIQFPQYLGGGRTPLNIVQVAQTSQGTEE-SPLGNVGAYSWTNGRTGY--SRK 291 GVH DA +Q ++LGG L + V QTS E+ +P GN+ A+S T + Y ++ Sbjct 333 GVHTADARLQRSEFLGGHSQSLLVQSVPQTSSTVEKMTPQGNLAAFSETMIQNNYLVNKT 392 Query 292 FKEHGIVMTVACLRYRHTYQQGIAKKW-RRKVREDFYDPLFSTIGQQPVYTTELYAQAFP 350 F EH ++ +A +RY+HTYQQGI W R + + D YDPL + I +QPV E+ Q Sbjct 393 FTEHSYIIVLAVVRYKHTYQQGIEADWFRGQDKFDMYDPLLANISEQPVKNREIMVQGNS 452 Query 351 Q--TVFGYREAWSELRNIPNTISGEMRSGVTNSLDIWHFADNYSSSPTLSQSFTEETPSY 408 Q +FG++EAW++LR PN+++G MRS SLD WHFAD+Y+ P LS + +E Sbjct 453 QDNEIFGFQEAWADLRFKPNSVAGVMRSSHPQSLDYWHFADHYAQLPKLSSEWLKEDYKN 512 Query 409 VDRTLSVPSSSQN-NFILNFYFDMSAVRKMPVYSMPSL 445 VDRTL++ +S ++F F+ A + MP+YS P L Sbjct 513 VDRTLALKASDNTPQLRVDFMFNTIAEKPMPLYSTPGL 550 >gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus] Length=539 Score = 269 bits (687), Expect = 5e-80, Method: Compositional matrix adjust. 
Identities = 174/466 (37%), Positives = 258/466 (55%), Gaps = 50/466 (10%) Query 4 NPNPSAYVNNVLEEIPMTYGEVNPGSIADYLGLPI-GSISQSSPVS--VLPFRCFALIYD 60 +P+P + ++ + + G S+ DY+GLP G + S +S L R + LI++ Sbjct 102 DPDPDSSIDYTIPTMTSPNGGYAVNSLQDYMGLPTAGQVDAGSSISHNSLFTRAYNLIWN 161 Query 61 KYFRNENTTDEIYIQKKGFSLSELIGAQNFSPNSYCG-KLPKVNKYKDYFTSCVPNPQKG 119 ++FR+EN D + + K P++Y L + K DYFTS +P PQKG Sbjct 162 EWFRDENLQDSVVVDKGD------------GPDTYTDYTLLRRGKRHDYFTSALPWPQKG 209 Query 120 APVTFNLGDQAVVRTSDS-------ELVTGPQEQMALTNSQSGSASVGEHPLIVGLGGMR 172 VT LG A V +D+ E+ TG S S A+ G M Sbjct 210 DAVTLPLGGSANVVYNDTGDPAYIREVSTGNVWTTPSRESVSKEAN----------GNM- 258 Query 173 FDAAAFSGTVAAGLYPN-NLYADLSSANAISVDDLRLAFAYQKMLERDAIYGSRYNEYLY 231 + +G+V A PN +L ADLS+A A +++ +R +F Q++LERDA G+RY E + Sbjct 259 ---SVPTGSVNAQYDPNGSLVADLSTATAATINAIRQSFQIQRLLERDARGGTRYTEIVR 315 Query 232 GHFGVHIPDAYIQFPQYLGGGRTPLNIVQVAQTS----QGTEESPLGNVGAY--SWTNGR 285 HFGV PDA +Q P+YLGGG P+ + VAQ S GT ++PLG +GA +G Sbjct 316 SHFGVISPDARMQRPEYLGGGSAPIIVNPVAQQSASGASGT-DTPLGTLGAVGTGLASGH 374 Query 286 TGYSRKFKEHGIVMTVACLRYRHTYQQGIAKKWRRKVREDFYDPLFSTIGQQPVYTTELY 345 G++ F EHG+V+ + +R TYQQG+ + + R R DF+ P+FS +G+QP+ ELY Sbjct 375 -GFASSFTEHGVVVGLCSVRADLTYQQGLHRMFSRSTRYDFFFPVFSHLGEQPILNKELY 433 Query 346 A--QAFPQTVFGYREAWSELRNIPNTISGEMRSGVTNSLDIWHFADNYSSSPTLSQSFTE 403 A + VFGY+EAW+E R P+ ++G MRS +LD WH A N+ S PTL+ +F E Sbjct 434 ATGTSTDDDVFGYQEAWAEYRYKPSQVTGLMRSTAAGTLDAWHLAQNFGSLPTLNSTFIE 493 Query 404 ETPSYVDRTLSVPSSSQ-NNFILNFYFDMSAVRKMPVYSMPSLIDH 448 +TP VDR ++V S + FI + +FD++ R MP+YS+P L+DH Sbjct 494 DTPP-VDRVVAVGSEANGQQFIFDAFFDINMARPMPMYSVPGLVDH 538

Lambda      K        H        a         alpha
   0.317    0.134    0.400    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6

Effective search space used: 3028457194728