BLASTP 2.2.21+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Stephen F. Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. RID: 3H99D5WA013 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 9,090,450 sequences; 3,112,979,771 total letters Query= gi|168481420|gb|ACA24900.1| WfgL [Escherichia coli] Length=257 Score E Sequences producing significant alignments: (Bits) Value gb|ACA24900.1| WfgL [Escherichia coli] 526 9e-150 ref|YP_001744314.1| glycosyl transferase family protein [Esch... 122 3e-28 ALIGNMENTS >gb|ACA24900.1| WfgL [Escherichia coli] Length=257 Score = 526 bits (1355), Expect = 9e-150, Method: Compositional matrix adjust. Identities = 257/257 (100%), Positives = 257/257 (100%), Gaps = 0/257 (0%) Query 1 MRLMANIYIATHKNYPFPPGYIPLHVGKRLSSVYVPNAIGDDSKNNISDLNPFFCELTGL 60 MRLMANIYIATHKNYPFPPGYIPLHVGKRLSSVYVPNAIGDDSKNNISDLNPFFCELTGL Sbjct 1 MRLMANIYIATHKNYPFPPGYIPLHVGKRLSSVYVPNAIGDDSKNNISDLNPFFCELTGL 60 Query 61 YWIWQNDADDVIGLVHYRRYFKHKNDYITIKNKKIASCNDLIKEFDSYDLILPKPSYLFK 120 YWIWQNDADDVIGLVHYRRYFKHKNDYITIKNKKIASCNDLIKEFDSYDLILPKPSYLFK Sbjct 61 YWIWQNDADDVIGLVHYRRYFKHKNDYITIKNKKIASCNDLIKEFDSYDLILPKPSYLFK 120 Query 121 KTLKEQYIKYHHEDDLIKLRQIIEKKYPDYISTFDTVLNGNKGYYCNMFIAKKNIIEPYF 180 KTLKEQYIKYHHEDDLIKLRQIIEKKYPDYISTFDTVLNGNKGYYCNMFIAKKNIIEPYF Sbjct 121 KTLKEQYIKYHHEDDLIKLRQIIEKKYPDYISTFDTVLNGNKGYYCNMFIAKKNIIEPYF 180 Query 181 QWVFDILFELKSSLDISGYDDYQKRVFGFLSERLFAVWIEYNKNRIQITHRSVVEIESNK 240 QWVFDILFELKSSLDISGYDDYQKRVFGFLSERLFAVWIEYNKNRIQITHRSVVEIESNK Sbjct 181 QWVFDILFELKSSLDISGYDDYQKRVFGFLSERLFAVWIEYNKNRIQITHRSVVEIESNK 240 Query 241 VISVKRYIRNFLAKLTG 257 VISVKRYIRNFLAKLTG Sbjct 241 VISVKRYIRNFLAKLTG 257 >ref|YP_001744314.1| glycosyl transferase family protein [Escherichia coli SMS-3-5] gb|ACB15494.1| glycosyl transferase family 8 [Escherichia coli SMS-3-5] Length=630 Score = 122 bits (307), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 86/281 (30%), Positives = 133/281 (47%), Gaps = 53/281 (18%) Query 7 IYIATHKNYPFPPGYI--PLHVGKRLSSVYVP-NAIGDDSKNNISDLNPFFCELTGLYWI 63 IY HK F I PLHVGK ++ Y GDDS +NIS NPF+CELT YW+ Sbjct 6 IYTCHHKPSAFLNASIIKPLHVGK--ANTYNDIGCEGDDSGDNISFKNPFYCELTAHYWV 63 Query 64 WQNDA-DDVIGLVHYRRYFK------HKND----------------YITIKNKKIASCND 100 W+N++ D +G +HYRR+ H D + ++ I++C Sbjct 64 WKNESLADYVGFMHYRRHLNFAEQQNHPEDNWGVVNYPLINAEYESQFGLSDESISTC-- 121 Query 101 LIKEFDSYDLILPKPSYLFKKTLK---EQYIK--YHHEDDLIKLRQIIEKKYPDYISTFD 155 D YDL+LPK + K + Y K + H D ++E+ YP Y + Sbjct 122 ----VDGYDLLLPKKWSVTSAGSKNNLDHYAKGEFLHIKDYQSALDVVEELYPQYKAAIQ 177 Query 156 TVLNGNKGYYCNMFIAKKNIIEPYFQWVFDILFELKSSLDISGYDDYQKRVFGFLSERLF 215 N GYY NMF+ +K++ Y +W+F IL L+ + ++ Y+ +KRV G ++ERLF Sbjct 178 QFNNATDGYYTNMFVMRKDMFLDYSEWLFAILSNLEDRISMNNYNAQEKRVIGHIAERLF 237 Query 216 AVWIEYNKNRIQITHRSVVEIESNKVISVKRYIRNFLAKLT 256 ++I ++ + +K + +K R F+ T Sbjct 238 NIYI--------------IKCQQDKQLKIKELQRTFVTAET 264 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jun 16, 2009 5:41 PM Number of letters in database: 26,573,871 Number of sequences in database: 84,272 Lambda K H 0.324 0.142 0.438 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 84272 Number of Hits to DB: 3574057 Number of extensions: 151138 Number of successful extensions: 249 Number of sequences better than 0.1: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's gapped: 250 Number of HSP's successfully gapped: 0 Length of query: 257 Length of database: 26573871 Length adjustment: 102 Effective length of query: 155 Effective length of database: 17978127 Effective search space: 2786609685 Effective search space used: 2786609685 T: 11 A: 40 X1: 15 (7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 79 (35.0 bits)