bitscore colors: <40, 40-50, 50-80, 80-200, >200
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-18_CDS_annotation_glimmer3.pl_2_7

Length=346

                                                                     Score     E
Sequences producing significant alignments:                         (Bits)  Value

gi|575094431|emb|CDL65804.1|  unnamed protein product                 305    4e-95
gi|575096056|emb|CDL66947.1|  unnamed protein product                 297    4e-92
gi|575094544|emb|CDL65904.1|  unnamed protein product                 285    2e-87
gi|575094572|emb|CDL65928.1|  unnamed protein product                 285    2e-87
gi|575094492|emb|CDL65859.1|  unnamed protein product                 283    1e-86
gi|575094496|emb|CDL65862.1|  unnamed protein product                 272    2e-82
gi|557745632|ref|YP_008798242.1|  major capsid protein                249    8e-74
gi|313766927|gb|ADR80653.1|  putative major coat protein              247    2e-73
gi|575094415|emb|CDL65790.1|  unnamed protein product                 244    9e-72
gi|530695351|gb|AGT39907.1|  major capsid protein                     242    3e-71

>gi|575094431|emb|CDL65804.1| unnamed protein product [uncultured bacterium]
Length=560

Score = 305 bits (781), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 143/249 (57%), Positives = 177/249 (71%), Gaps = 2/249 (1%) Query 95 AATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMN 154 AAT+NQLRQAF VQ E ARGG+RYRE ++ FGV+ SD +QIPEYLGG + +N++ Sbjct 310 AATVNQLRQAFQVQKLLEKDARGGTRYREILKNHFGVTTSDARMQIPEYLGGCKVPINVS 369 Query 155 QIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLER 214 Q+VQTS S +P G T A+SVTP ++S FTKSF+EHGF+IGV R SYQQG+ER Sbjct 370 QVVQTSA--STDASPQGNTAAISVTPFSKSMFTKSFDEHGFIIGVATARTAQSYQQGIER 427 Query 215 FWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKM 274 WSR DRLDYYFP AN+GEQ + KEI G + D+E FGYQEAWADYR KPN + G+ Sbjct 428 MWSRKDRLDYYFPVLANIGEQAILNKEIYAQGNAKDDEAFGYQEAWADYRYKPNTICGRF 487 Query 275 RSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVENEPQFFGAIRVMNKTTRC 334 RSNA+ +LD WHY +Y +PTLS +WM++ E+ RTL V+ EP F R KT R Sbjct 488 RSNAQQSLDAWHYGQDYDKLPTLSTDWMEQSDIEMKRTLAVQTEPDFIANFRFNCKTVRV 547 Query 335 MPLYSVPGL 343 MPLYS+PGL Sbjct 548 MPLYSIPGL 556 >gi|575096056|emb|CDL66947.1| unnamed protein product [uncultured bacterium] Length=570 Score = 297 bits (761), Expect = 4e-92, Method: Compositional matrix adjust. 
Identities = 138/249 (55%), Positives = 176/249 (71%), Gaps = 2/249 (1%) Query 97 TINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQI 156 TINQLR AF +Q +YE ARGGSRY E +R+ FGV+ D +Q EYLGG R +N+NQ+ Sbjct 318 TINQLRMAFQIQKFYEKQARGGSRYTEVIRSFFGVTSPDARLQRSEYLGGNRIPININQV 377 Query 157 VQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFW 216 +Q SG S TP G MS T S FTKSF EHGF+IGVMC R+DH+YQQG++R W Sbjct 378 IQQSGTGSASTTPQGTVVGMSQTTDTHSDFTKSFTEHGFIIGVMCARYDHTYQQGIDRMW 437 Query 217 SRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRS 276 SR D+ DYY+P F+N+GEQ +K KEI G +TD+E FGYQEAWA+YR KP+RV+G+MRS Sbjct 438 SRKDKFDYYWPVFSNIGEQAIKNKEIYAQGNATDDEVFGYQEAWAEYRYKPSRVTGEMRS 497 Query 277 NAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIV--ENEPQFFGAIRVMNKTTRC 334 + +LD WH AD+Y+ +P+LS EW++E + R L V +N QFF I V N TR Sbjct 498 SYAQSLDVWHLADDYSKLPSLSDEWIREDAKTLNRVLAVSDQNSNQFFADIYVKNLCTRP 557 Query 335 MPLYSVPGL 343 MP+YS+PGL Sbjct 558 MPMYSIPGL 566 >gi|575094544|emb|CDL65904.1| unnamed protein product [uncultured bacterium] Length=551 Score = 285 bits (729), Expect = 2e-87, Method: Compositional matrix adjust. 
Identities = 137/260 (53%), Positives = 181/260 (70%), Gaps = 4/260 (2%) Query 86 LGTDLSNIEAATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLG 145 L +L N AA+INQLR AF +Q YE ARGG+RY E +++ FGV+ D +Q PEYLG Sbjct 290 LVANLQNATAASINQLRLAFQIQRLYERDARGGTRYIEILKSHFGVTSPDARLQRPEYLG 349 Query 146 GGRYHVNMNQIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHD 205 G R +N+NQ++Q S E+ +P G S+T + F KSF EHGFVIG+M R+D Sbjct 350 GNRIPININQVLQQS--ETTSTSPQGNPVGQSLTTDTNADFVKSFVEHGFVIGLMVARYD 407 Query 206 HSYQQGLERFWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRM 265 H+YQQGLERFWSR DR DYY+P FA++GEQ V KEI +G + D+E FGYQEA+ADYR Sbjct 408 HTYQQGLERFWSRKDRFDYYWPVFAHIGEQAVLNKEIYTSGTAVDDEVFGYQEAYADYRY 467 Query 266 KPNRVSGKMRSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVEN--EPQFFG 323 KP+RV+G+MRS A +LD WH AD+YA++P+LS W++E + + R L V + Q F Sbjct 468 KPSRVTGEMRSAAPQSLDVWHLADDYASLPSLSDSWIRESASTVDRVLAVSSNVSAQLFC 527 Query 324 AIRVMNKTTRCMPLYSVPGL 343 I + N++TR MP+YSVPGL Sbjct 528 DIYIQNRSTRPMPMYSVPGL 547 >gi|575094572|emb|CDL65928.1| unnamed protein product [uncultured bacterium] Length=556 Score = 285 bits (729), Expect = 2e-87, Method: Compositional matrix adjust. 
Identities = 137/266 (52%), Positives = 179/266 (67%), Gaps = 9/266 (3%) Query 85 WLGTDLSNIEA-----ATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQ 139 W T++ +E+ ATINQLR AF +Q YE ARGG+RY E +R+ FGV D +Q Sbjct 289 WTPTNMWAVESGDVGMATINQLRLAFQLQKLYEKDARGGTRYTEIIRSHFGVVSPDSRLQ 348 Query 140 IPEYLGGGRYHVNMNQIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGV 199 PEYLGG R +N+NQI+Q S +S +P+G MSVT S F KSF EHG++IG+ Sbjct 349 RPEYLGGNRIPINVNQIIQQS--QSTEQSPLGALAGMSVTTDKNSDFIKSFVEHGYIIGL 406 Query 200 MCVRHDHSYQQGLERFWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEA 259 + R+DH+YQQGL+R WSR DR D+Y+P AN+GEQ V KEI + G TD+E FGYQEA Sbjct 407 VVARYDHTYQQGLDRMWSRKDRFDFYWPVLANIGEQAVLNKEIYIDGSDTDDEVFGYQEA 466 Query 260 WADYRMKPNRVSGKMRSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVEN-- 317 WA+YR KPNRV G+MRS+A +LD WH D+Y+++P LS W++E K + R L V + Sbjct 467 WAEYRYKPNRVCGEMRSSAPQSLDVWHLGDDYSSLPYLSDSWIREDKTNVDRVLAVTSSV 526 Query 318 EPQFFGAIRVMNKTTRCMPLYSVPGL 343 Q F I + NK TR MP+YS+PGL Sbjct 527 SDQLFADIYICNKATRPMPMYSIPGL 552 >gi|575094492|emb|CDL65859.1| unnamed protein product [uncultured bacterium] Length=551 Score = 283 bits (723), Expect = 1e-86, Method: Compositional matrix adjust. 
Identities = 139/263 (53%), Positives = 176/263 (67%), Gaps = 8/263 (3%) Query 86 LGTDLS---NIEAATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPE 142 L DLS ++ ATINQLR AF +Q YE ARGG+RY E +++ FGV+ D +Q PE Sbjct 288 LWADLSTATDLPVATINQLRTAFQIQKLYERDARGGTRYIEILKSHFGVTSPDARLQRPE 347 Query 143 YLGGGRYHVNMNQIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCV 202 YLGG R +N+NQ++Q+S + TP G A S+T + S FTKSF EHGF+IG+M Sbjct 348 YLGGSRVPININQVIQSSETGA---TPQGNAAAYSLTTDSHSEFTKSFVEHGFIIGLMVA 404 Query 203 RHDHSYQQGLERFWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWAD 262 R+DHSYQQGL+RFWSR DR DYY+P FANLGE VK KEI G D+E FGYQEAWAD Sbjct 405 RYDHSYQQGLQRFWSRKDRFDYYWPVFANLGEMAVKNKEIFAQGTDVDDEVFGYQEAWAD 464 Query 263 YRMKPNRVSGKMRSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVEN--EPQ 320 YR KP+ V+G+MRS +LD WH AD+Y +P+LS W++E + + R L V + Q Sbjct 465 YRYKPSVVTGEMRSQYAQSLDIWHLADDYENLPSLSDSWIREDSSTVNRVLAVSDSVSAQ 524 Query 321 FFGAIRVMNKTTRCMPLYSVPGL 343 F I + TR MPLYS+PGL Sbjct 525 LFCDIYIRCLATRPMPLYSIPGL 547 >gi|575094496|emb|CDL65862.1| unnamed protein product [uncultured bacterium] Length=568 Score = 272 bits (696), Expect = 2e-82, Method: Compositional matrix adjust. 
Identities = 131/251 (52%), Positives = 170/251 (68%), Gaps = 4/251 (2%) Query 95 AATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMN 154 A TINQLR AF +Q YE AR GSRYRE +R+ F V+ D +Q+PEYLGG R +N+N Sbjct 316 ATTINQLRMAFQIQKLYEKDARAGSRYRELIRSHFSVTPLDARMQVPEYLGGNRIPININ 375 Query 155 QIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLER 214 Q+VQTS Q S+ +P G S+T + F KSF EHG +IGV R+DH+YQQG+ + Sbjct 376 QVVQTS-QTSDV-SPQGNVAGQSLTSDSHGDFIKSFTEHGMLIGVAVARYDHTYQQGVSK 433 Query 215 FWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKM 274 WSR R DYY+P AN+GEQ V KEI G + DEE FGYQEAWA+YR KP+ V+G+M Sbjct 434 LWSRKTRFDYYWPVLANIGEQAVLNKEIYAQGTAQDEEVFGYQEAWAEYRYKPSIVTGEM 493 Query 275 RSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVENEP--QFFGAIRVMNKTT 332 RS+A +LD WH+AD+Y ++P LS +W+KE K I R L V + Q+F + N+TT Sbjct 494 RSSARTSLDSWHFADDYNSLPKLSADWIKEDKTNIDRVLAVSSSVSNQYFADFYIENETT 553 Query 333 RCMPLYSVPGL 343 R +P YS+PGL Sbjct 554 RALPFYSIPGL 564 >gi|557745632|ref|YP_008798242.1| major capsid protein [Marine gokushovirus] gi|530695345|gb|AGT39902.1| major capsid protein [Marine gokushovirus] Length=538 Score = 249 bits (635), Expect = 8e-74, Method: Compositional matrix adjust. 
Identities = 133/285 (47%), Positives = 171/285 (60%), Gaps = 2/285 (1%) Query 58 NNGNTAPLVNGQYIQTMSQDDANFFDAWLGTDLSNIEAATINQLRQAFAVQHYYEALARG 117 N GNT +N D+ L DLS +ATINQLR AFA Q + E ARG Sbjct 252 NIGNTHRFLNSASTNVYPGDENTDEARRLYADLSEATSATINQLRLAFATQKFLEIQARG 311 Query 118 GSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQESNYGTPIGETGAMS 177 GSRY E ++ F V+ D +Q PEYLGGG VN++ + QTS ++ TP G A+ Sbjct 312 GSRYIEVIKNHFNVTSPDARLQRPEYLGGGSSPVNISPVAQTSSTDAT--TPQGNLSAIG 369 Query 178 VTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYYFPQFANLGEQPV 237 T ++ SFTKSF EH VIG++ VR D +YQQGL R +SR DYY+P + +GEQ V Sbjct 370 TTVLSGHSFTKSFTEHTIVIGMVSVRTDLTYQQGLNRMFSRETIYDYYWPTLSTIGEQAV 429 Query 238 KKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFWHYADNYATVPTL 297 K KEI G + DE TFGYQE +A+YR KP+ V+GK RSNA GTL+ WHYA YA++P L Sbjct 430 KNKEIYAQGSAADETTFGYQERYAEYRYKPSSVTGKFRSNATGTLESWHYAQEYASLPLL 489 Query 298 SQEWMKEGKNEIARTLIVENEPQFFGAIRVMNKTTRCMPLYSVPG 342 W++ + RTL V +EPQF + TR MP+ S+PG Sbjct 490 GDSWIQVTDTNVQRTLAVASEPQFIFDSLFKLRCTRPMPVNSIPG 534 >gi|313766927|gb|ADR80653.1| putative major coat protein [Uncultured Microviridae] Length=533 Score = 247 bits (631), Expect = 2e-73, Method: Compositional matrix adjust. 
Identities = 126/255 (49%), Positives = 168/255 (66%), Gaps = 4/255 (2%) Query 89 DLSNIEAATINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGR 148 DLSN AATINQLR+AF +Q YE ARGG+RY E +++ FGV+ D +Q PEYLGG + Sbjct 264 DLSNATAATINQLREAFQIQRLYEKDARGGTRYTEILQSHFGVTSPDARLQRPEYLGGQK 323 Query 149 YHVNMNQIVQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSY 208 V M + QTS +S +P G A+ T + F+KSF EHG +IG+ CV D +Y Sbjct 324 TEVMMQTVPQTSSTDST--SPQGNLAALG-TATSRGGFSKSFVEHGVLIGLACVFADLTY 380 Query 209 QQGLERFWSRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPN 268 QQG+ R WSR DR D+Y+P A+LGEQ V +EI G S D +TFGYQE +A+YR KP+ Sbjct 381 QQGMNRMWSRRDRWDFYWPSLAHLGEQAVLNQEIYTQGTSADTQTFGYQERFAEYRYKPS 440 Query 269 RVSGKMRSNAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIVENEPQFFGAIRVM 328 +++GKMRSNA GTLD WH A ++ +P L+ +++E + R + V +EP+F Sbjct 441 QITGKMRSNATGTLDAWHLAQDFTALPALNASFIEENP-PVDRVIAVPSEPEFIWDWYFD 499 Query 329 NKTTRCMPLYSVPGL 343 KTTR MP+YSVPGL Sbjct 500 LKTTRPMPVYSVPGL 514 >gi|575094415|emb|CDL65790.1| unnamed protein product [uncultured bacterium] Length=569 Score = 244 bits (623), Expect = 9e-72, Method: Compositional matrix adjust. 
Identities = 121/252 (48%), Positives = 159/252 (63%), Gaps = 4/252 (2%) Query 97 TINQLRQAFAVQHYYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQI 156 +IN LRQA A+QH EA ARGG+RY E ++ FGVS D +Q EY+GG R +N++Q+ Sbjct 320 SINDLRQAIALQHILEADARGGTRYVEILKNEFGVSSPDARLQRSEYIGGERIPINVSQV 379 Query 157 VQTSGQESNYGTPIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFW 216 +Q+S ++ +P G A S+T + S EHG+++G+ +R DHSYQQGL R W Sbjct 380 IQSSASDTT--SPQGNAAAYSLTTSANTIRAYSAVEHGYILGLAAIRVDHSYQQGLSRMW 437 Query 217 SRSDRLDYYFPQFANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRS 276 +RSDR YY P ANLGEQ V +EI G + D E FGYQEAWADYR + N ++G+MRS Sbjct 438 TRSDRFSYYHPMLANLGEQAVLNQEIYAQGTTADTEVFGYQEAWADYRYRTNMITGEMRS 497 Query 277 NAEGTLDFWHYADNYATVPTLSQEWMKEGKNEIARTLIV--ENEPQFFGAIRVMNKTTRC 334 +LD WHY D Y +P LS +W+KEG+ I RTL V EN QF + R Sbjct 498 TYAQSLDAWHYGDKYTDLPRLSNDWIKEGQENIDRTLAVQSENSHQFICNLYFDQTWVRP 557 Query 335 MPLYSVPGLEKL 346 MP+YSVPGL + Sbjct 558 MPIYSVPGLSMI 569 >gi|530695351|gb|AGT39907.1| major capsid protein [Marine gokushovirus] Length=539 Score = 242 bits (617), Expect = 3e-71, Method: Compositional matrix adjust. 
Identities = 141/299 (47%), Positives = 182/299 (61%), Gaps = 20/299 (7%) Query 53 SVNKNNNGN-TAPL--VNGQYIQTMSQDDANFFDAWLGTDLSNIEAATINQLRQAFAVQH 109 SV+K NGN + P VN QY D N L DLS AATIN +RQ+F +Q Sbjct 249 SVSKEANGNMSVPTGSVNAQY-------DPN---GSLVADLSTATAATINAIRQSFQIQR 298 Query 110 YYEALARGGSRYREQVRALFGVSISDKTVQIPEYLGGGRYHVNMNQIVQTSGQ-ESNYGT 168 E ARGG+RY E VR+ FGV D +Q PEYLGGG + +N + Q S S T Sbjct 299 LLERDARGGTRYTEIVRSHFGVISPDARMQRPEYLGGGSAPIIVNPVAQQSASGASGTDT 358 Query 169 PIGETGAMSVTPINESSFTKSFEEHGFVIGVMCVRHDHSYQQGLERFWSRSDRLDYYFPQ 228 P+G GA+ + F SF EHG V+G+ VR D +YQQGL R +SRS R D++FP Sbjct 359 PLGTLGAVGTGLASGHGFASSFTEHGVVVGLCSVRADLTYQQGLHRMFSRSTRYDFFFPV 418 Query 229 FANLGEQPVKKKEIMLTGKSTDEETFGYQEAWADYRMKPNRVSGKMRSNAEGTLDFWHYA 288 F++LGEQP+ KE+ TG STD++ FGYQEAWA+YR KP++V+G MRS A GTLD WH A Sbjct 419 FSHLGEQPILNKELYATGTSTDDDVFGYQEAWAEYRYKPSQVTGLMRSTAAGTLDAWHLA 478 Query 289 DNYATVPTLSQEWMKEGKNEIARTLIVENEP---QF-FGAIRVMNKTTRCMPLYSVPGL 343 N+ ++PTL+ ++ E + R + V +E QF F A +N R MP+YSVPGL Sbjct 479 QNFGSLPTLNSTFI-EDTPPVDRVVAVGSEANGQQFIFDAFFDINM-ARPMPMYSVPGL 535

Lambda      K        H        a         alpha
   0.316    0.132    0.392    0.792     4.96

Gapped
Lambda      K        H        a         alpha    sigma
   0.267   0.0410    0.140     1.90     42.6     43.6

Effective search space used: 2041309051650