bitscore colors: <40, 40-50, 50-80, 80-200, >200
BLASTP 2.2.30+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           49,011,213 sequences; 17,563,301,199 total letters

Query= Contig-29_CDS_annotation_glimmer3.pl_2_2

Length=335

                                                                  Score  E
Sequences producing significant alignments:                       (Bits) Value

gi|575094486|emb|CDL65860.1| unnamed protein product              141    3e-35
gi|575096057|emb|CDL66940.1| unnamed protein product              139    3e-35
gi|575094568|emb|CDL65929.1| unnamed protein product              127    2e-30
gi|575094545|emb|CDL65905.1| unnamed protein product              125    2e-29
gi|393707865|ref|YP_004732987.1| structural protein VP2           68.6   4e-10
gi|575094495|emb|CDL65861.1| unnamed protein product              66.6   2e-09
gi|547839281|ref|WP_022246923.1| putative minor capsid protein    55.8   8e-06
gi|568290031|gb|ETN78178.1| hypothetical protein NECAME_18237     53.5   1e-05
gi|575094416|emb|CDL65791.1| unnamed protein product              54.3   4e-05
gi|12085140|ref|NP_073542.1| minor capsid protein                 52.4   8e-05

>gi|575094486|emb|CDL65860.1| unnamed protein product [uncultured bacterium]
Length=344

 Score = 141 bits (356), Expect = 3e-35, Method: Compositional matrix adjust.
 Identities = 118/280 (42%), Positives = 168/280 (60%), Gaps = 30/280 (11%)

Query  57   TSANNAWSAQQAEELRKWQEAQTDKAMKYNSQEAEKNRSWQEYMSSTAHQREIADLKAAG  116
            T++NNAWSA QA++   +Q +Q     ++N  EAE +R WQE MS+TAHQREI DL+AAG
Sbjct  72   TASNNAWSAAQAQKQMDFQASQGALVRQFNHDEAELSRLWQERMSNTAHQREIKDLQAAG  131

Query  117  LNPVLSAMggngasvtsgatass-sapsgamgstDTSGSSALVNLLGAMLTSTTELSKMS  175
            LNPVLSAMGG+GA VTSG+TAS  S PSG+ G TDTS + ALV+LLG+ + +   ++  +
Sbjct  132  LNPVLSAMGGSGAPVTSGSTASGYSPPSGSKGDTDTSLAGALVSLLGSSMMAQASMANTA  191

Query  176  TSALTNLAVADKYNSVNKYLGELSSATQLKGYQISAQTALSTANISAAAQRYVSDNNLKG  235
             SA T  +VADKY +++K + E           I  +T LS + ISA A RY +D +
Sbjct  192  MSARTQESVADKYTAMSKLVAE-----------IQQETTLSASTISAMASRYAADRSADA  240

Query  236  SLanaaatkiaatihaeaSKYAADKGYLSSENVANINASVNKQLKEMGIKADFDFAQMYP  295
            S       K+AA+IHA A +Y  D   ++  ++A+ NA VNK L +MG + DFD  + YP
Sbjct  241  S-------KVAASIHAAAQRYGYDVQAMTQRDIASFNAQVNKDLAQMGYQHDFDIKEAYP  293

Query  296  NNLYQMTGATVNNLKGILGDLMSQSAIDSVSSSKGLIKPW  335
                       +++ G++  L  +S + +     GL   W
Sbjct  294  -----------SSMAGLMASLFGESILGNDKGLSGLSDLW  322

>gi|575096057|emb|CDL66940.1| unnamed protein product [uncultured bacterium]
Length=275

 Score = 139 bits (351), Expect = 3e-35, Method: Compositional matrix adjust.
 Identities = 91/172 (53%), Positives = 118/172 (69%), Gaps = 11/172 (6%)

Query  59   ANNAWSAQQAEELRKWQEAQTDKAMKYNSQEAEKNRSWQEYMSSTAHQREIADLKAAGLN  118
            AN+AW+A+QAE  R WQEAQ  KAM++NS EA KNR WQE MS+TAHQRE+ DL AAGLN
Sbjct  34   ANSAWNAEQAEIQRDWQEAQNAKAMQFNSMEAAKNRKWQEMMSNTAHQREVKDLMAAGLN  93

Query  119  PVLSAMggngasvtsgatasssapsgamgstDTSGSSALVNLLGAMLTSTTELSKMSTSA  178
            PVLSAM GNGA+V SGATAS    +GA G  DTS S A+ NLLG++L+++T +   + +A
Sbjct  94   PVLSAMNGNGAAVGSGATASGVTSAGAKGEADTSTSGAIANLLGSILSASTAIQAANVNA  153

Query  179  LTNLAVADKYNSVNKYLGELSSATQLKGYQISAQTALSTANISAAAQRYVSD  230
             T  AVADKY ++++ + E++ A             L +A I A A RY +D
Sbjct  154  RTQEAVADKYTAMSQIVAEINKA-----------ATLGSAGIHAGATRYAAD  194

>gi|575094568|emb|CDL65929.1| unnamed protein product [uncultured bacterium]
Length=310

 Score = 127 bits (319), Expect = 2e-30, Method: Compositional matrix adjust.
 Identities = 94/182 (52%), Positives = 121/182 (66%), Gaps = 11/182 (6%)

Query  60   NNAWSAQQAEELRKWQEAQTDKAMKYNSQEAEKNRSWQEYMSSTAHQREIADLKAAGLNP  119
            N A S Q+AE LR WQE Q   AM++N+ EAEKNR+WQE MS+TAHQRE+ DL AAGLNP
Sbjct  53   NTARSVQEAESLRTWQEEQNRIAMQFNAAEAEKNRNWQEIMSNTAHQREVNDLMAAGLNP  112

Query  120  VLSAMggngasvtsgatasssapsgamgstDTSGSSALVNLLGAMLTSTTELSKMSTSAL  179
            VLSA GGNGA+VTSGATAS    SGA G  DTS SSA+V +LG+ML+S T ++  +TSA+
Sbjct  113  VLSAGGGNGAAVTSGATASGVTSSGAKGDVDTSASSAVVGILGSMLSSLTNIANANTSAI  172

Query  180  TNLAVADKYNSVNKYLGELSSATQLK-----GYQISAQTA------LSTANISAAAQRYV  228
            T++A  +K   +N+ +   ++   LK     G    AQ A      L+ A + A A RY
Sbjct  173  TSMANTEKLGQINQLIAHANNENALKVAETYGKYGVAQAATAGRYSLNAAQVHADATRYS  232

Query  229  SD  230
            +D
Sbjct  233  AD  234

>gi|575094545|emb|CDL65905.1| unnamed protein product [uncultured bacterium]
Length=325

 Score = 125 bits (313), Expect = 2e-29, Method: Compositional matrix adjust.
 Identities = 116/256 (45%), Positives = 157/256 (61%), Gaps = 40/256 (16%)

Query  58   SANNAWSAQQAEELRKWQEAQTDKAMKYNSQEAEKNRSWQEYMSSTAHQREIADLKAAGL  117
            S N+A++A QA   R WQ+ Q + AM+++S EA KNR WQ YMS+TAHQRE+ADLKAAGL
Sbjct  32   SDNSAFNASQAAANRNWQQQQNNIAMQFSSAEAAKNRDWQSYMSNTAHQREVADLKAAGL  91

Query  118  NPVLSAMggngasvtsgatasssapsgamgstDTSGSSALVNLLGAMLTSTTELSKMSTS  177
            NPVLSAMGGNGA+VTSGATA     SG   S DTS ++ALV LLG++L + T ++  +T+
Sbjct  92   NPVLSAMGGNGAAVTSGATAQGYTSSGGQASADTSATAALVGLLGSLLNAQTSIANTATN  151

Query  178  ALTNLAVADKYNSVNKYLGELSSATQLKGYQISAQTALSTANISAAAQRYVSDNNLKGSL  237
            A+ NL+VADKY S  +Y  ++       GY   A T+ S AN++A A R+ S+N L  S
Sbjct  152  AVANLSVADKYTSATRYAADV-------GY---AGTSYS-ANVAAYASRFASNNALAAS-  199

Query  238  anaaatkiaatihaeaSKYAADKGYLSSE----------------------NVANINASV  275
                  K A+     ASKYA+D+ YL+S+                      ++A  NA+V
Sbjct  200  ------KYASDNSRAASKYASDQSYLASKFASILQSNTAKYNIDTRTATDRDLAEFNAAV  253

Query  276  NKQLKEMGIKADFDFA  291
            N+ L++  I A F  A
Sbjct  254  NRDLQKNEIDAKFSLA  269

>gi|393707865|ref|YP_004732987.1| structural protein VP2 [Microviridae phi-CA82]
 gi|311336637|gb|ADP89808.1| structural protein VP2 [Microviridae phi-CA82]
Length=234

 Score = 68.6 bits (166), Expect = 4e-10, Method: Compositional matrix adjust.
 Identities = 30/60 (50%), Positives = 44/60 (73%), Gaps = 0/60 (0%)

Query  63   WSAQQAEELRKWQEAQTDKAMKYNSQEAEKNRSWQEYMSSTAHQREIADLKAAGLNPVLS  122
            W   QA++  KW   QT+K+ ++N+QEA+KNR WQE MS+TA QR++ D + AGLNP+ +
Sbjct  7    WMTAQADKQNKWNAEQTEKSNQFNAQEAQKNRDWQEQMSNTALQRKMQDAEKAGLNPIFA  66

>gi|575094495|emb|CDL65861.1| unnamed protein product [uncultured bacterium]
Length=266

 Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
 Identities = 62/152 (41%), Positives = 84/152 (55%), Gaps = 14/152 (9%)

Query  79   TDKAMKYNSQEAEKNRSWQEYMSSTAHQREIADLKAAGLNPVLSAMggngasvtsgatas  138
            TDK + YN+Q A +  ++QE MSSTAHQRE+ DL AAGLNPVLSA             +
Sbjct  60   TDKLLNYNTQSAREQMAFQERMSSTAHQREVKDLIAAGLNPVLSA-----------GGSG  108

Query  139  ssapsgamgstDTSGSSALVNLLGAMLTSTTELSKMSTSALTNLAVADKYNSVNKYLGEL  198
            +SAPSGAM + D+S  SA  N   A L      +++  +   N A  D   ++NKY  ++
Sbjct  109  ASAPSGAMATADSSMMSAKAN---AALQKRIVNAQLKNAKDINKAQLDAQKAMNKYSVDV  165

Query  199  SSATQLKGYQISAQTALSTANISAAAQRYVSD  230
             + T L   QISA  +   A  +AAA  Y S+
Sbjct  166  GAQTSLANAQISASASKFGAMQAAAASMYGSN  197

>gi|547839281|ref|WP_022246923.1| putative minor capsid protein [Clostridium sp. CAG:306]
 gi|524476581|emb|CDC18646.1| putative minor capsid protein [Clostridium sp. CAG:306]
Length=236

 Score = 55.8 bits (133), Expect = 8e-06, Method: Compositional matrix adjust.
 Identities = 23/29 (79%), Positives = 27/29 (93%), Gaps = 0/29 (0%)

Query  96   WQEYMSSTAHQREIADLKAAGLNPVLSAM  124
            +QE MSSTAHQRE+ DL+AAGLNP+LSAM
Sbjct  41   FQERMSSTAHQREVKDLRAAGLNPILSAM  69

>gi|568290031|gb|ETN78178.1| hypothetical protein NECAME_18237 [Necator americanus]
Length=112

 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
 Identities = 25/41 (61%), Positives = 30/41 (73%), Gaps = 0/41 (0%)

Query  80   DKAMKYNSQEAEKNRSWQEYMSSTAHQREIADLKAAGLNPV  120
            D A K N +   +  +WQE MS+TAHQRE ADLKAAGLNP+
Sbjct  28   DAANKANRKMMREQMAWQERMSNTAHQREQADLKAAGLNPI  68

>gi|575094416|emb|CDL65791.1| unnamed protein product [uncultured bacterium]
Length=311

 Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
 Identities = 23/42 (55%), Positives = 31/42 (74%), Gaps = 0/42 (0%)

Query  82   AMKYNSQEAEKNRSWQEYMSSTAHQREIADLKAAGLNPVLSA  123
            A +YN +EA+  RSW + M  TA+Q  + DLKAAGLNP+L+A
Sbjct  144  AQRYNREEAQAERSWAQSMRQTAYQDTVKDLKAAGLNPILAA  185

>gi|12085140|ref|NP_073542.1| minor capsid protein [Bdellovibrio phage phiMH2K]
 gi|75089169|sp|Q9G055.1|H_BPPHM RecName: Full=Minor spike protein H; AltName: Full=H protein; AltName: Full=Pilot protein; AltName: Full=Protein VP2; Short=VP2 [Bdellovibrio phage phiMH2K]
 gi|12017988|gb|AAG45344.1|AF306496_5 Vp2 [Bdellovibrio phage phiMH2K]
Length=199

 Score = 52.4 bits (124), Expect = 8e-05, Method: Compositional matrix adjust.
 Identities = 23/40 (58%), Positives = 30/40 (75%), Gaps = 0/40 (0%)

Query  84   KYNSQEAEKNRSWQEYMSSTAHQREIADLKAAGLNPVLSA  123
            + N  EA +NR WQE MS++AHQRE  DL+ AGLN +L+A
Sbjct  40   RENQAEAARNRKWQEQMSNSAHQREANDLQTAGLNRLLTA  79

Lambda    K        H        a        alpha
0.306     0.119    0.324    0.792    4.96

Gapped
Lambda    K        H        a        alpha    sigma
0.267     0.0410   0.140    1.90     42.6     43.6

Effective search space used: 1927902993225