bitscore colors: <40, 40-50, 50-80, 80-200, >200
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-29_CDS_annotation_glimmer3.pl_2_3

Length=334

                                                            Score    E
Sequences producing significant alignments:                 (Bits)   Value

gi|575094572|emb|CDL65928.1|  unnamed protein product        434     1e-145
gi|575094544|emb|CDL65904.1|  unnamed protein product        424     2e-141
gi|575094492|emb|CDL65859.1|  unnamed protein product        423     3e-141
gi|575094496|emb|CDL65862.1|  unnamed protein product        401     2e-132
gi|575096056|emb|CDL66947.1|  unnamed protein product        395     5e-130
gi|575094431|emb|CDL65804.1|  unnamed protein product        333     4e-106
gi|575094415|emb|CDL65790.1|  unnamed protein product        312     4e-98
gi|313766927|gb|ADR80653.1|  putative major coat protein     306     6e-96
gi|530695351|gb|AGT39907.1|  major capsid protein            303     1e-94
gi|444297964|dbj|GAC77857.1|  major capsid protein           290     4e-93

>gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium]
Length=556

Score = 434 bits (1116), Expect = 1e-145, Method: Compositional matrix adjust.
Identities = 205/291 (70%), Positives = 244/291 (84%), Gaps = 12/291 (4%) Query 46 DVESVATGVG--FDAPTSRDGSMYLDNLWAIQSGNVTAATINQLRMAFQIQKLYEKDARG 103 +V S TG+G F PT N+WA++SG+V ATINQLR+AFQ+QKLYEKDARG Sbjct 276 EVGSDGTGIGQNFWTPT---------NMWAVESGDVGMATINQLRLAFQLQKLYEKDARG 326 Query 104 GTRYIEILKSHFGVTSPDARLQRPEYLGGNRIPVNINQVVQSSATQSSGTPLGDTAAFSV 163 GTRY EI++SHFGV SPD+RLQRPEYLGGNRIP+N+NQ++Q S + + +PLG A SV Sbjct 327 GTRYTEIIRSHFGVVSPDSRLQRPEYLGGNRIPINVNQIIQQSQS-TEQSPLGALAGMSV 385 Query 164 TTDVHGDFIKSFVEHGFVIGIMVARYDHTYQQGLERFWSRRDRLDYYFPVFANIGEQPIL 223 TTD + DFIKSFVEHG++IG++VARYDHTYQQGL+R WSR+DR D+Y+PV ANIGEQ +L Sbjct 386 TTDKNSDFIKSFVEHGYIIGLVVARYDHTYQQGLDRMWSRKDRFDFYWPVLANIGEQAVL 445 Query 224 NKEIYAQGTVQDNEVFGYQEAWADYRYKPSRVAGEMRSKAPTSLDVWHLADEYTQLPKLS 283 NKEIY G+ D+EVFGYQEAWA+YRYKP+RV GEMRS AP SLDVWHL D+Y+ LP LS Sbjct 446 NKEIYIDGSDTDDEVFGYQEAWAEYRYKPNRVCGEMRSSAPQSLDVWHLGDDYSSLPYLS 505 Query 284 DAWIREDKTNVDRVLAVTSAVSNQMFADLYIQCKATRPMPMYSIPGLIDHH 334 D+WIREDKTNVDRVLAVTS+VS+Q+FAD+YI KATRPMPMYSIPGLIDHH Sbjct 506 DSWIREDKTNVDRVLAVTSSVSDQLFADIYICNKATRPMPMYSIPGLIDHH 556 >gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium] Length=551 Score = 424 bits (1089), Expect = 2e-141, Method: Compositional matrix adjust. 
Identities = 197/256 (77%), Positives = 225/256 (88%), Gaps = 1/256 (0%) Query 78 NVTAATINQLRMAFQIQKLYEKDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGNRIPV 137 N TAA+INQLR+AFQIQ+LYE+DARGGTRYIEILKSHFGVTSPDARLQRPEYLGGNRIP+ Sbjct 296 NATAASINQLRLAFQIQRLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGNRIPI 355 Query 138 NINQVVQSSATQSSGTPLGDTAAFSVTTDVHGDFIKSFVEHGFVIGIMVARYDHTYQQGL 197 NINQV+Q S T S+ +P G+ S+TTD + DF+KSFVEHGFVIG+MVARYDHTYQQGL Sbjct 356 NINQVLQQSETTST-SPQGNPVGQSLTTDTNADFVKSFVEHGFVIGLMVARYDHTYQQGL 414 Query 198 ERFWSRRDRLDYYFPVFANIGEQPILNKEIYAQGTVQDNEVFGYQEAWADYRYKPSRVAG 257 ERFWSR+DR DYY+PVFA+IGEQ +LNKEIY GT D+EVFGYQEA+ADYRYKPSRV G Sbjct 415 ERFWSRKDRFDYYWPVFAHIGEQAVLNKEIYTSGTAVDDEVFGYQEAYADYRYKPSRVTG 474 Query 258 EMRSKAPTSLDVWHLADEYTQLPKLSDAWIREDKTNVDRVLAVTSAVSNQMFADLYIQCK 317 EMRS AP SLDVWHLAD+Y LP LSD+WIRE + VDRVLAV+S VS Q+F D+YIQ + Sbjct 475 EMRSAAPQSLDVWHLADDYASLPSLSDSWIRESASTVDRVLAVSSNVSAQLFCDIYIQNR 534 Query 318 ATRPMPMYSIPGLIDH 333 +TRPMPMYS+PGLIDH Sbjct 535 STRPMPMYSVPGLIDH 550 >gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium] Length=551 Score = 423 bits (1087), Expect = 3e-141, Method: Compositional matrix adjust. 
Identities = 198/272 (73%), Positives = 231/272 (85%), Gaps = 4/272 (1%) Query 65 SMYLDNLWAIQSG--NVTAATINQLRMAFQIQKLYEKDARGGTRYIEILKSHFGVTSPDA 122 SM NLWA S ++ ATINQLR AFQIQKLYE+DARGGTRYIEILKSHFGVTSPDA Sbjct 282 SMIPTNLWADLSTATDLPVATINQLRTAFQIQKLYERDARGGTRYIEILKSHFGVTSPDA 341 Query 123 RLQRPEYLGGNRIPVNINQVVQSSATQSSGTPLGDTAAFSVTTDVHGDFIKSFVEHGFVI 182 RLQRPEYLGG+R+P+NINQV+QSS T TP G+ AA+S+TTD H +F KSFVEHGF+I Sbjct 342 RLQRPEYLGGSRVPININQVIQSSET--GATPQGNAAAYSLTTDSHSEFTKSFVEHGFII 399 Query 183 GIMVARYDHTYQQGLERFWSRRDRLDYYFPVFANIGEQPILNKEIYAQGTVQDNEVFGYQ 242 G+MVARYDH+YQQGL+RFWSR+DR DYY+PVFAN+GE + NKEI+AQGT D+EVFGYQ Sbjct 400 GLMVARYDHSYQQGLQRFWSRKDRFDYYWPVFANLGEMAVKNKEIFAQGTDVDDEVFGYQ 459 Query 243 EAWADYRYKPSRVAGEMRSKAPTSLDVWHLADEYTQLPKLSDAWIREDKTNVDRVLAVTS 302 EAWADYRYKPS V GEMRS+ SLD+WHLAD+Y LP LSD+WIRED + V+RVLAV+ Sbjct 460 EAWADYRYKPSVVTGEMRSQYAQSLDIWHLADDYENLPSLSDSWIREDSSTVNRVLAVSD 519 Query 303 AVSNQMFADLYIQCKATRPMPMYSIPGLIDHH 334 +VS Q+F D+YI+C ATRPMP+YSIPGLIDHH Sbjct 520 SVSAQLFCDIYIRCLATRPMPLYSIPGLIDHH 551 >gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium] Length=568 Score = 401 bits (1030), Expect = 2e-132, Method: Compositional matrix adjust. 
Identities = 193/270 (71%), Positives = 222/270 (82%), Gaps = 4/270 (1%) Query 65 SMYLDNLWAIQSGNVTAATINQLRMAFQIQKLYEKDARGGTRYIEILKSHFGVTSPDARL 124 S+Y DNL+A SG TA TINQLRMAFQIQKLYEKDAR G+RY E+++SHF VT DAR+ Sbjct 303 SVYPDNLYA-SSG--TATTINQLRMAFQIQKLYEKDARAGSRYRELIRSHFSVTPLDARM 359 Query 125 QRPEYLGGNRIPVNINQVVQSSATQSSGTPLGDTAAFSVTTDVHGDFIKSFVEHGFVIGI 184 Q PEYLGGNRIP+NINQVVQ+S T S +P G+ A S+T+D HGDFIKSF EHG +IG+ Sbjct 360 QVPEYLGGNRIPININQVVQTSQT-SDVSPQGNVAGQSLTSDSHGDFIKSFTEHGMLIGV 418 Query 185 MVARYDHTYQQGLERFWSRRDRLDYYFPVFANIGEQPILNKEIYAQGTVQDNEVFGYQEA 244 VARYDHTYQQG+ + WSR+ R DYY+PV ANIGEQ +LNKEIYAQGT QD EVFGYQEA Sbjct 419 AVARYDHTYQQGVSKLWSRKTRFDYYWPVLANIGEQAVLNKEIYAQGTAQDEEVFGYQEA 478 Query 245 WADYRYKPSRVAGEMRSKAPTSLDVWHLADEYTQLPKLSDAWIREDKTNVDRVLAVTSAV 304 WA+YRYKPS V GEMRS A TSLD WH AD+Y LPKLS WI+EDKTN+DRVLAV+S+V Sbjct 479 WAEYRYKPSIVTGEMRSSARTSLDSWHFADDYNSLPKLSADWIKEDKTNIDRVLAVSSSV 538 Query 305 SNQMFADLYIQCKATRPMPMYSIPGLIDHH 334 SNQ FAD YI+ + TR +P YSIPGLIDHH Sbjct 539 SNQYFADFYIENETTRALPFYSIPGLIDHH 568 >gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium] Length=570 Score = 395 bits (1014), Expect = 5e-130, Method: Compositional matrix adjust. 
Identities = 182/253 (72%), Positives = 207/253 (82%), Gaps = 1/253 (0%) Query 83 TINQLRMAFQIQKLYEKDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGNRIPVNINQV 142 TINQLRMAFQIQK YEK ARGG+RY E+++S FGVTSPDARLQR EYLGGNRIP+NINQV Sbjct 318 TINQLRMAFQIQKFYEKQARGGSRYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQV 377 Query 143 VQSSATQS-SGTPLGDTAAFSVTTDVHGDFIKSFVEHGFVIGIMVARYDHTYQQGLERFW 201 +Q S T S S TP G S TTD H DF KSF EHGF+IG+M ARYDHTYQQG++R W Sbjct 378 IQQSGTGSASTTPQGTVVGMSQTTDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMW 437 Query 202 SRRDRLDYYFPVFANIGEQPILNKEIYAQGTVQDNEVFGYQEAWADYRYKPSRVAGEMRS 261 SR+D+ DYY+PVF+NIGEQ I NKEIYAQG D+EVFGYQEAWA+YRYKPSRV GEMRS Sbjct 438 SRKDKFDYYWPVFSNIGEQAIKNKEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMRS 497 Query 262 KAPTSLDVWHLADEYTQLPKLSDAWIREDKTNVDRVLAVTSAVSNQMFADLYIQCKATRP 321 SLDVWHLAD+Y++LP LSD WIRED ++RVLAV+ SNQ FAD+Y++ TRP Sbjct 498 SYAQSLDVWHLADDYSKLPSLSDEWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRP 557 Query 322 MPMYSIPGLIDHH 334 MPMYSIPGLIDHH Sbjct 558 MPMYSIPGLIDHH 570 >gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium] Length=560 Score = 333 bits (853), Expect = 4e-106, Method: Compositional matrix adjust. 
Identities = 167/298 (56%), Positives = 202/298 (68%), Gaps = 17/298 (6%) Query 37 NFGSNLGGTDVESVATGVGFDAPTSRDGSMYLDNLWAIQSGNVTAATINQLRMAFQIQKL 96 NF + GG+ ES A Y NLWA S AAT+NQLR AFQ+QKL Sbjct 280 NFETKAGGSFSESGAVAA------------YPTNLWA--SPVTAAATVNQLRQAFQVQKL 325 Query 97 YEKDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGNRIPVNINQVVQSSATQSSGTPLG 156 EKDARGGTRY EILK+HFGVT+ DAR+Q PEYLGG ++P+N++QVVQ+SA+ + +P G Sbjct 326 LEKDARGGTRYREILKNHFGVTTSDARMQIPEYLGGCKVPINVSQVVQTSAS-TDASPQG 384 Query 157 DTAAFSVTTDVHGDFIKSFVEHGFVIGIMVARYDHTYQQGLERFWSRRDRLDYYFPVFAN 216 +TAA SVT F KSF EHGF+IG+ AR +YQQG+ER WSR+DRLDYYFPV AN Sbjct 385 NTAAISVTPFSKSMFTKSFDEHGFIIGVATARTAQSYQQGIERMWSRKDRLDYYFPVLAN 444 Query 217 IGEQPILNKEIYAQGTVQDNEVFGYQEAWADYRYKPSRVAGEMRSKAPTSLDVWHLADEY 276 IGEQ ILNKEIYAQG +D+E FGYQEAWADYRYKP+ + G RS A SLD WH +Y Sbjct 445 IGEQAILNKEIYAQGNAKDDEAFGYQEAWADYRYKPNTICGRFRSNAQQSLDAWHYGQDY 504 Query 277 TQLPKLSDAWIREDKTNVDRVLAVTSAVSNQMFADLYIQCKATRPMPMYSIPGLIDHH 334 +LP LS W+ + + R LAV + A+ CK R MP+YSIPGLIDH+ Sbjct 505 DKLPTLSTDWMEQSDIEMKRTLAVQT--EPDFIANFRFNCKTVRVMPLYSIPGLIDHN 560 >gi|575094415|emb|CDL65790.1| unnamed protein product [uncultured bacterium] Length=569 Score = 312 bits (800), Expect = 4e-98, Method: Compositional matrix adjust. 
Identities = 144/260 (55%), Positives = 188/260 (72%), Gaps = 1/260 (0%) Query 71 LWAIQSGNVTAATINQLRMAFQIQKLYEKDARGGTRYIEILKSHFGVTSPDARLQRPEYL 130 L A+ + +IN LR A +Q + E DARGGTRY+EILK+ FGV+SPDARLQR EY+ Sbjct 308 LTAVAENSTNFLSINDLRQAIALQHILEADARGGTRYVEILKNEFGVSSPDARLQRSEYI 367 Query 131 GGNRIPVNINQVVQSSATQSSGTPLGDTAAFSVTTDVHGDFIKSFVEHGFVIGIMVARYD 190 GG RIP+N++QV+QSSA+ ++ +P G+ AA+S+TT + S VEHG+++G+ R D Sbjct 368 GGERIPINVSQVIQSSASDTT-SPQGNAAAYSLTTSANTIRAYSAVEHGYILGLAAIRVD 426 Query 191 HTYQQGLERFWSRRDRLDYYFPVFANIGEQPILNKEIYAQGTVQDNEVFGYQEAWADYRY 250 H+YQQGL R W+R DR YY P+ AN+GEQ +LN+EIYAQGT D EVFGYQEAWADYRY Sbjct 427 HSYQQGLSRMWTRSDRFSYYHPMLANLGEQAVLNQEIYAQGTTADTEVFGYQEAWADYRY 486 Query 251 KPSRVAGEMRSKAPTSLDVWHLADEYTQLPKLSDAWIREDKTNVDRVLAVTSAVSNQMFA 310 + + + GEMRS SLD WH D+YT LP+LS+ WI+E + N+DR LAV S S+Q Sbjct 487 RTNMITGEMRSTYAQSLDAWHYGDKYTDLPRLSNDWIKEGQENIDRTLAVQSENSHQFIC 546 Query 311 DLYIQCKATRPMPMYSIPGL 330 +LY RPMP+YS+PGL Sbjct 547 NLYFDQTWVRPMPIYSVPGL 566 >gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae] Length=533 Score = 306 bits (783), Expect = 6e-96, Method: Compositional matrix adjust. 
Identities = 153/256 (60%), Positives = 188/256 (73%), Gaps = 5/256 (2%) Query 78 NVTAATINQLRMAFQIQKLYEKDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGNRIPV 137 N TAATINQLR AFQIQ+LYEKDARGGTRY EIL+SHFGVTSPDARLQRPEYLGG + V Sbjct 267 NATAATINQLREAFQIQRLYEKDARGGTRYTEILQSHFGVTSPDARLQRPEYLGGQKTEV 326 Query 138 NINQVVQSSATQSSGTPLGDTAAFSVTTDVHGDFIKSFVEHGFVIGIMVARYDHTYQQGL 197 + V Q+S+T S+ +P G+ AA T G F KSFVEHG +IG+ D TYQQG+ Sbjct 327 MMQTVPQTSSTDST-SPQGNLAALGTATS-RGGFSKSFVEHGVLIGLACVFADLTYQQGM 384 Query 198 ERFWSRRDRLDYYFPVFANIGEQPILNKEIYAQGTVQDNEVFGYQEAWADYRYKPSRVAG 257 R WSRRDR D+Y+P A++GEQ +LN+EIY QGT D + FGYQE +A+YRYKPS++ G Sbjct 385 NRMWSRRDRWDFYWPSLAHLGEQAVLNQEIYTQGTSADTQTFGYQERFAEYRYKPSQITG 444 Query 258 EMRSKAPTSLDVWHLADEYTQLPKLSDAWIREDKTNVDRVLAVTSAVSNQMFADLYIQCK 317 +MRS A +LD WHLA ++T LP L+ ++I E+ VDRV+AV S + D Y K Sbjct 445 KMRSNATGTLDAWHLAQDFTALPALNASFIEENPP-VDRVIAVPS--EPEFIWDWYFDLK 501 Query 318 ATRPMPMYSIPGLIDH 333 TRPMP+YS+PGLIDH Sbjct 502 TTRPMPVYSVPGLIDH 517 >gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus] Length=539 Score = 303 bits (775), Expect = 1e-94, Method: Compositional matrix adjust. 
Identities = 148/257 (58%), Positives = 181/257 (70%), Gaps = 4/257 (2%) Query 80 TAATINQLRMAFQIQKLYEKDARGGTRYIEILKSHFGVTSPDARLQRPEYLGGNRIPVNI 139 TAATIN +R +FQIQ+L E+DARGGTRY EI++SHFGV SPDAR+QRPEYLGG P+ + Sbjct 283 TAATINAIRQSFQIQRLLERDARGGTRYTEIVRSHFGVISPDARMQRPEYLGGGSAPIIV 342 Query 140 NQVVQSSATQSSGT--PLGDTAAFSVTTDVHGDFIKSFVEHGFVIGIMVARYDHTYQQGL 197 N V Q SA+ +SGT PLG A F SF EHG V+G+ R D TYQQGL Sbjct 343 NPVAQQSASGASGTDTPLGTLGAVGTGLASGHGFASSFTEHGVVVGLCSVRADLTYQQGL 402 Query 198 ERFWSRRDRLDYYFPVFANIGEQPILNKEIYAQGTVQDNEVFGYQEAWADYRYKPSRVAG 257 R +SR R D++FPVF+++GEQPILNKE+YA GT D++VFGYQEAWA+YRYKPS+V G Sbjct 403 HRMFSRSTRYDFFFPVFSHLGEQPILNKELYATGTSTDDDVFGYQEAWAEYRYKPSQVTG 462 Query 258 EMRSKAPTSLDVWHLADEYTQLPKLSDAWIREDKTNVDRVLAVTSAVSNQMFA-DLYIQC 316 MRS A +LD WHLA + LP L+ +I ED VDRV+AV S + Q F D + Sbjct 463 LMRSTAAGTLDAWHLAQNFGSLPTLNSTFI-EDTPPVDRVVAVGSEANGQQFIFDAFFDI 521 Query 317 KATRPMPMYSIPGLIDH 333 RPMPMYS+PGL+DH Sbjct 522 NMARPMPMYSVPGLVDH 538 >gi|444297964|dbj|GAC77857.1| major capsid protein, partial [uncultured marine virus] Length=285 Score = 290 bits (743), Expect = 4e-93, Method: Compositional matrix adjust. 
Identities = 151/275 (55%), Positives = 194/275 (71%), Gaps = 15/275 (5%) Query 60 TSRDG-SMYLDNLWAIQSGNVTAATINQLRMAFQIQKLYEKDARGGTRYIEILKSHFGVT 118 TS +G S+Y D + TA+TINQLR AFQIQKL E+DARGGTRYIEI+KSHFGVT Sbjct 21 TSTEGQSLYAD------LSDATASTINQLRQAFQIQKLLERDARGGTRYIEIVKSHFGVT 74 Query 119 SPDARLQRPEYLGGNRIPVNINQVVQSSATQSSGTP--LGDTAAFSVTTDVHGDFIKSFV 176 SPD R RPEYLGG PVNIN VV ++A S+G+P LGD AA++ T + F KSF Sbjct 75 SPDLRATRPEYLGGGSNPVNINPVV-NTAGFSAGSPQNLGDLAAYATTVIQNNGFTKSFT 133 Query 177 EHGFVIGIMVARYDHTYQQGLERFWSRRDRLDYYFPVFANIGEQPILNKEIYAQGTV--Q 234 EH +IG++ R D TYQQG+ R WSR DR D+YFP A+IGEQ +LNKEIYA G Q Sbjct 134 EHCIIIGLVSVRADLTYQQGMNRMWSRSDRYDFYFPALAHIGEQAVLNKEIYAIGNQADQ 193 Query 235 DNEVFGYQEAWADYRYKPSRVAGEMRSKAPTSLDVWHLADEYTQLPKLSDAWIREDKTNV 294 D +VFGYQE +A+YRYKPS++ G+ RS A ++LD WHL+ ++T LP+L+ A+I+E+ + Sbjct 194 DEDVFGYQERFAEYRYKPSQITGKFRSTAASTLDAWHLSQKFTSLPELNSAFIQENPP-M 252 Query 295 DRVLAVTSAVSNQMFADLYIQCKATRPMPMYSIPG 329 DRV+AV S + D Y + + RPMP+Y +P Sbjct 253 DRVVAVDS--EPEFIWDSYFKMRCVRPMPLYGVPA 285

Lambda      K        H        a         alpha
   0.318    0.133    0.404    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6

Effective search space used: 1917593351550
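
The bit scores in this report can be cross-checked against the raw scores (the numbers in parentheses after "bits") using the gapped Lambda and K values from the statistics footer. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, and it assumes the report prints only the integer part of each bit score. The names `bit_score` and `REPORTED` are illustrative.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the report footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, reported bit score) pairs copied from the ten hit headers.
REPORTED = [
    (1116, 434), (1089, 424), (1087, 423), (1030, 401), (1014, 395),
    (853, 333), (800, 312), (783, 306), (775, 303), (743, 290),
]

for raw, reported_bits in REPORTED:
    computed = bit_score(raw)
    # Assumption: the displayed value is the bit score truncated to an integer.
    print(raw, round(computed, 2), math.floor(computed), reported_bits)
```

Under these assumptions the truncated values agree with the bit scores listed for all ten hits. Lambda and K are themselves rounded in the footer, so small discrepancies could appear in other reports.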