BLASTP 2.2.21+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Stephen F. Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

RID: 3H9FS32F013

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           9,079,606 sequences; 3,109,523,384 total letters

Query= gi|56159885|gb|AAV80749.1| predicted glycosyl transferase [Escherichia coli O127:H6 str. E2348/69]
>gi|37528724|gb|AAO37709.1| putative glycosyltransferase [Escherichia coli]
>gi|40794691|gb|AAR90884.1| putative glycosyltransferase [Escherichia coli]
>gi|56159885|gb|AAV80749.1| putative glycosyltransferase [Escherichia coli]
>gi|56384973|gb|AAV85953.1| WcmA [Escherichia coli]
>gi|215265334|emb|CAS09729.1| predicted glycosyl transferase [Escherichia coli O127:H6 str. E2348/69]
Length=338

                                                                  Score   E
Sequences producing significant alignments:                       (Bits)  Value

ref|YP_002329693.1| predicted glycosyl transferase [Escherich...  689     0.0
ref|YP_002387516.1| putative glycosyltransferase [Escherichia...  296     2e-80
gb|ACA24904.1| WfgO [Escherichia coli]                            111     1e-24
gb|ABI34559.1| putative glycosyl transferase [Escherichia coli]   105     7e-23
gb|AAO37690.1| putative galactosyltransferase [Escherichia coli]  88.2    1e-17
dbj|BAG11904.1| putative galactosyltransferase WbgM [Escheric...  87.0    3e-17
gb|AAL67552.1|AF461121_3 putative galactosyltransferase WbgM ...  85.5    8e-17
ref|YP_541302.1| putative galactosyltransferase WbgM [Escheri...
77.4 2e-14 gb|ACH97149.1| WclR [Escherichia coli] 75.1 1e-13 gb|AAD50490.1|AF172324_8 WbnE [Escherichia coli] 68.2 1e-11 gb|ACD37115.1| WffO [Escherichia coli] 63.9 2e-10 ref|YP_002293579.1| putative glycosyl transferase [Escherichi... 55.1 1e-07 gb|AAC45847.1| putative GlcNAc transferase [Escherichia coli] 53.9 3e-07 gb|AAZ20762.1| glycosyltransferase [Escherichia coli] 52.8 5e-07 gb|ACA24903.1| WfgN [Escherichia coli] 52.4 7e-07 gb|ACA24823.1| WfgE [Escherichia coli] 50.8 2e-06 ref|ZP_03033404.1| Cps2D [Escherichia coli F11] >gb|EDV67486.... 50.1 4e-06 gb|AAD50487.1|AF172324_5 WbnB [Escherichia coli] 49.3 6e-06 ref|NP_755569.1| hypothetical protein c3694 [Escherichia coli... 49.3 6e-06 gb|ABB29908.1| WfaO [Escherichia coli] 48.9 8e-06 gb|ACA24850.1| WffZ [Escherichia coli] 45.1 1e-04 gb|ACA24849.1| WffY [Escherichia coli] 44.3 2e-04 ref|YP_002403325.1| WbwB [Escherichia coli 55989] >emb|CAU981... 43.5 3e-04 gb|AAK64373.1|AF361371_8 WbwB [Escherichia coli] 43.5 4e-04 ref|ZP_03061297.1| glycosyl transferase, group 1 family prote... 42.7 5e-04 ref|ZP_03029511.1| WbbG [Escherichia coli B7A] >gb|ABA42235.1... 42.7 6e-04 ref|ZP_03052481.1| hypothetical protein EcE110019_3652 [Esche... 42.7 6e-04 gb|ABB29914.1| WfaQ [Escherichia coli] 42.4 8e-04 ref|YP_001726129.1| glycosyl transferase group 1 [Escherichia... 40.8 0.002 gb|ACD37088.1| WfeG [Escherichia coli] 39.3 0.007 gb|AAD21569.1| glycosyltransferase WcaO [Escherichia coli] 38.9 0.008 ALIGNMENTS >ref|YP_002329693.1| predicted glycosyl transferase [Escherichia coli O127:H6 str. E2348/69] gb|AAO37709.1| putative glycosyltransferase [Escherichia coli] gb|AAR90884.1| putative glycosyltransferase [Escherichia coli] gb|AAV80749.1| putative glycosyltransferase [Escherichia coli] gb|AAV85953.1| WcmA [Escherichia coli] emb|CAS09729.1| predicted glycosyl transferase [Escherichia coli O127:H6 str. E2348/69] Length=338 Score = 689 bits (1778), Expect = 0.0, Method: Compositional matrix adjust. 
Identities = 338/338 (100%), Positives = 338/338 (100%), Gaps = 0/338 (0%) Query 1 MKNVGFIVTKSEIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGVFVIPGIKK 60 MKNVGFIVTKSEIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGVFVIPGIKK Sbjct 1 MKNVGFIVTKSEIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGVFVIPGIKK 60 Query 61 YFDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCIYVSHGWSCLYNGGRL 120 YFDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCIYVSHGWSCLYNGGRL Sbjct 61 YFDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCIYVSHGWSCLYNGGRL 120 Query 121 KSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCNNKQLQY 180 KSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCNNKQLQY Sbjct 121 KSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCNNKQLQY 180 Query 181 KVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGGERLESLKKQFSECENIHFLGEVNNF 240 KVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGGERLESLKKQFSECENIHFLGEVNNF Sbjct 181 KVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGGERLESLKKQFSECENIHFLGEVNNF 240 Query 241 YNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLVENTEDDIGY 300 YNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLVENTEDDIGY Sbjct 241 YNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLVENTEDDIGY 300 Query 301 KLDKIFDDYENYREQAIRASGKFVIENYASAYKSIILG 338 KLDKIFDDYENYREQAIRASGKFVIENYASAYKSIILG Sbjct 301 KLDKIFDDYENYREQAIRASGKFVIENYASAYKSIILG 338 >ref|YP_002387516.1| putative glycosyltransferase [Escherichia coli IAI1] emb|CAQ98958.1| putative glycosyltransferase [Escherichia coli IAI1] Length=352 Score = 296 bits (759), Expect = 2e-80, Method: Compositional matrix adjust. 
Identities = 154/342 (45%), Positives = 221/342 (64%), Gaps = 13/342 (3%) Query 2 KNVGFIVTKSEIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGVFVIPGIKKY 61 K + FI+TKSE+GGAQ WV+E L++++ + FLITS GWLT VF +P + Sbjct 7 KRLVFIITKSEVGGAQKWVSEQKLLLEDKYDTFLITSCTGWLTDNFSPDKVFFVPALTNI 66 Query 62 FDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCIYVSHGWSCLYNGGRLK 121 LF + KIL+ ++++SANAG+YARL +++ + IYVSHGWSC+YNGGR K Sbjct 67 KKISNLFSIAKILRMLKADIVVSNSANAGLYARLAKIIWKHRSIYVSHGWSCIYNGGRAK 126 Query 122 SIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCNNKQLQY- 180 I C +E++LS +D I CVS++D+ A+ IGIKE K+ + N+ P K+ + Sbjct 127 KILCFIERFLSFFSDAILCVSENDKDNALNIIGIKESKLKLIKNAT--FPTNKEKKFWHI 184 Query 181 -----KVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGGERLESLKKQFSECENIHFLG 235 +++FVGR+THPKRP+LLA +S+K L +VGGGE LE LK + +NIHF+G Sbjct 185 MPKVLRLVFVGRMTHPKRPDLLAETLSRKKDVELFLVGGGEYLERLKNIYKNYDNIHFVG 244 Query 236 EVNNFYNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELI---EG--NGLL 290 E+ +F NY +YD F L+S+SEGLPMS +EA +PLLLSDVGGC ELI EG NG+L Sbjct 245 EIKDFNNYDDYDAFILVSESEGLPMSAIEAGVTGLPLLLSDVGGCHELIGEYEGKYNGVL 304 Query 291 VENTEDDIGYKLDKIFDDYENYREQAIRASGKFVIENYASAY 332 N +DI +D++ ++YE Y + A + S +F + ++ Y Sbjct 305 FNNNINDISRAIDEVRNNYEQYCKVANKISCQFNLNSFKEDY 346 >gb|ACA24904.1| WfgO [Escherichia coli] Length=368 Score = 111 bits (277), Expect = 1e-24, Method: Compositional matrix adjust. 
Identities = 105/336 (31%), Positives = 172/336 (51%), Gaps = 35/336 (10%) Query 4 VGFIVTKS-EIGGAQTWVNEISNLIKEEC-NIFLITSEEGWLTHKDVFAGV--FVIPG-- 57 V +I+TK+ EIGGAQ + ++S+ +KE+ ++ +I E G L + + GV ++P Sbjct 3 VLYIITKADEIGGAQIHIRDLSSRLKEDGHDVVVIVGEHGALVDELIKRGVAYHIVPSLV 62 Query 58 -----IKKYFDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCIYVSHGWS 112 IK + + KL IL + IS S+ AG+ RL L I+ +HGW+ Sbjct 63 REINPIKDLRAVIEISKLISILDPDIISL---HSSKAGIIGRLAALRKKKPVIFTAHGWA 119 Query 113 CLYNG--GRLKSIFCIVEKYLSLLTDVIWCVSKSDEKKAIE-NIGIKEPKII----TVSN 165 NG + ++CI+EK + L I VS+ D++ A+E N+ E +++ + + Sbjct 120 -FANGVSKNRQKLYCIIEKIIEPLASKIITVSEQDKQLALELNVSSHEKQVVIHNGMMQS 178 Query 166 SVPQ--MPRCNNKQLQYKVLFVGRLTHPKRPELLANVISK--KPQYSLHIVGGGERLE-- 219 S+P + R +NK ++ ++ V R + K L +S+ + L +VG G LE Sbjct 179 SLPPRFVNRTSNKTVE--LISVARFSEQKDHRTLFVALSQINNLNWRLTLVGKGPLLEYY 236 Query 220 -SLKKQFSECENIHFLGEVNNFYNYH-EYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDV 277 +L ++ + E I FLGE ++ D+F LIS EG P S LEA A +P++ S+V Sbjct 237 KTLARKLNIHERIQFLGERHDVAELMVRSDVFLLISKWEGFPRSILEAMRAGLPVIASNV 296 Query 278 GGCFELIEG--NGLLVENTE-DDIGYKLDKIFDDYE 310 GG E I G LVE + D + +KL K+ + E Sbjct 297 GGTSEAINDGITGFLVEREDVDGLKHKLCKLLSEPE 332 >gb|ABI34559.1| putative glycosyl transferase [Escherichia coli] Length=363 Score = 105 bits (262), Expect = 7e-23, Method: Compositional matrix adjust. 
Identities = 94/314 (29%), Positives = 158/314 (50%), Gaps = 29/314 (9%) Query 1 MKNVGFIVTKSEIGGAQTWVNEISNLIKEECNI--FLITSEEGWLTHKDVFA----GVFV 54 MK + I E+GGAQT V ++ + + NI +L +G T D+ V Sbjct 1 MKVLFLITRGDELGGAQTHVKDVILGLINKYNIECYLACGTKGIFT--DIMEENNINVIH 58 Query 55 IPGIKK---YFDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCIYVSHGW 111 I +K+ + D + L KL I+K+ N + S+ AGV RL L K ++ +HGW Sbjct 59 IDSMKREICFGDIIALKKLNDIIKDINPDIISCHSSKAGVLGRLASLGTRTKKVFTAHGW 118 Query 112 SCLYNGG---RLKSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVP 168 + + G + +I+ +E LS +TD VS D+K A++ G+K + + N +P Sbjct 119 A--FTEGISPKKAAIYKKIELLLSYITDATINVSYYDKKLALK-AGLKSQHYV-IHNCIP 174 Query 169 QMPRCNNKQLQYKV----LFVGRLTHPKRPELLANVISK--KPQYSLHIVGGGER--LES 220 + N + K + V R K E L S K ++ L ++GGG+ ++ Sbjct 175 DVHYEKNNGIANKTVLEFIMVARFCAQKDHETLLKAFSNIDKEKWRLTLIGGGDSKSIKE 234 Query 221 LKKQFSECENIHFLGEVNNFYNY-HEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGG 279 L K+ + NI+F+G+ N ++ + D+F LIS+ EG P+S LEA +++P++ +DVGG Sbjct 235 LAKKLNIDNNINFVGQTKNVVDFLNHSDVFLLISNWEGFPISILEAMRSSLPIIATDVGG 294 Query 280 CFELIEG--NGLLV 291 E ++ NG L+ Sbjct 295 VSEAVKHGYNGFLI 308 >gb|AAO37690.1| putative galactosyltransferase [Escherichia coli] Length=373 Score = 88.2 bits (217), Expect = 1e-17, Method: Compositional matrix adjust. 
Identities = 86/333 (25%), Positives = 155/333 (46%), Gaps = 27/333 (8%) Query 11 SEIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGV--FVIPGIKKYF----DF 64 S+I GAQ + + + +++ S+EG T + GV +VI + + DF Sbjct 11 SKISGAQRVSLDEMKTLSNHYSQYMVCSKEGDFTQEADRIGVKTYVIETLVREISPLKDF 70 Query 65 LTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFK-CIYVSHGWSCLYNGGRL-KS 122 +L KL K +K+ + S+ +G RL L K I+ HG++ R+ K Sbjct 71 YSLIKLYKFIKQEKFDIIHTHSSKSGFLGRLAAKLAGTKQIIHTVHGFAFPSTSNRVVKL 130 Query 123 IFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCN-------- 174 I+ ++E + SL + VI ++++DEK A + +T+ N+ + + Sbjct 131 IYFLMEYFASLCSSVIIVMNENDEKIARKYFSSAPWTKVTLLNNAVDIKKFQKRYIGIES 190 Query 175 ----NKQLQYKVLFVGRLTHPKRPELLANVIS-KKPQYSLHIVGGGERLESLKKQFSEC- 228 N+Q ++K++ +GRL K P L+ N + Y + VG G L+ Q ++ Sbjct 191 KSEINEQKKFKMVMIGRLCEQKNPLLIINALKILGDHYYVDFVGDGPLRSDLESQIAKRG 250 Query 229 --ENIHFLGEVNNFYNY-HEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIE 285 + + LG ++ +YDLF L S EG+P++ LEA + +P+L S++ LI Sbjct 251 LEKRVRLLGWCSSVEEIIFKYDLFLLPSKWEGMPLAILEAMASKVPVLCSNIDANAYLIN 310 Query 286 GNGLLVENTED--DIGYKLDKIFDDYENYREQA 316 + N +D D+ + IFD+ + R+ A Sbjct 311 KTSGFLFNNDDAKDLAKNIKYIFDNVDVRRKVA 343 >dbj|BAG11904.1| putative galactosyltransferase WbgM [Escherichia coli O55:H7] Length=366 Score = 87.0 bits (214), Expect = 3e-17, Method: Compositional matrix adjust. 
Identities = 93/350 (26%), Positives = 176/350 (50%), Gaps = 26/350 (7%) Query 12 EIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGV--FVIPGIKK----YFDFL 65 +I GAQ + + + +E ++ S+EG L + GV +IP + + + D Sbjct 12 DISGAQRVSLDEMHTLSQEFQQSMVCSKEGRLAEQARCFGVCTHIIPTLTREISLFKDCA 71 Query 66 TLFKLRKILKENNISTLIASSANAGVYARLV-RLLVDFKCIYVSHGWSCLYNGGRL-KSI 123 +LF+L KI+K+ + S+ G R+ +L K ++ HG++ +L K I Sbjct 72 SLFQLYKIIKKEKFDIVHTHSSKTGFLGRVAAKLAGTKKIVHTVHGFAFPSTENKLIKFI 131 Query 124 FCIVEKYLSLLTDVIWCVSKSDEKKAIEN-IGIKEPKIITVSNSVPQMPRCNNKQLQYK- 181 + ++E S +++I +++SDE+ A + + K+ K++ ++N++ +K Sbjct 132 YFLMELIASYCSNIIIVMNESDERIARKYFVKNKKSKLLLINNAIDVDKYNKDKDKDKDK 191 Query 182 ------VLFVGRLTHPKRPELLANVISKKPQYSLH--IVGGGE-RLESLKK--QFSECEN 230 ++ VGRL K P LL I K + ++H I+G G +++ L+K Q++ + Sbjct 192 DKDIFKIVMVGRLCDQKNPLLLIEAI-KDLESNIHVDIIGDGPLKVKLLEKINQYNIADK 250 Query 231 IHFLGEVNNFYNY-HEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGL 289 + FLG ++ + ++YDLF L S EG+P++ LEA A +P+L SD+ LIE Sbjct 251 VSFLGWIDAVEEHLYKYDLFVLPSRWEGMPLAMLEAMAAKVPVLSSDIEANKYLIEKTAG 310 Query 290 LVENTED--DIGYKLDKIFDDYENYREQAIRASGKFVIENYASAYKSIIL 337 +V ED D+ K+D + + E R + + +IE++ ++ IL Sbjct 311 VVFKDEDSKDLKRKIDVLHANPE-LRNNLAHKAYQALIEDFDLTKRTKIL 359 >gb|AAL67552.1|AF461121_3 putative galactosyltransferase WbgM [Escherichia coli] dbj|BAG11846.1| putative galactosyltransferase [Escherichia coli O55:H7] dbj|BAG11960.1| putative galactosyltransferase WbgM [Escherichia coli O55:H6] Length=364 Score = 85.5 bits (210), Expect = 8e-17, Method: Compositional matrix adjust. 
Identities = 92/348 (26%), Positives = 176/348 (50%), Gaps = 24/348 (6%) Query 12 EIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGV--FVIPGIKK----YFDFL 65 +I GAQ + + + +E ++ S+EG L + GV +IP + + + D Sbjct 12 DISGAQRVSLDEMHTLSQEFQQSMVCSKEGRLAEQARCFGVCTHIIPTLTREISLFKDCA 71 Query 66 TLFKLRKILKENNISTLIASSANAGVYARLV-RLLVDFKCIYVSHGWSCLYNGGRL-KSI 123 +LF+L KI+K+ + S+ G R+ +L K ++ HG++ +L K I Sbjct 72 SLFQLYKIIKKEKFDIVHTHSSKTGFLGRVAAKLAGTKKIVHTVHGFAFPSTENKLIKFI 131 Query 124 FCIVEKYLSLLTDVIWCVSKSDEKKAIEN-IGIKEPKIITVSNSVPQMPRCNNKQLQYK- 181 + ++E S +++I +++SDE+ A + + K+ K++ ++N++ +K Sbjct 132 YFLMELIASYCSNIIIVMNESDERIARKYFVKNKKSKLLLINNAIDVDKYNKDKDKDKDK 191 Query 182 ----VLFVGRLTHPKRPELLANVISKKPQYSLH--IVGGGE-RLESLKK--QFSECENIH 232 ++ VGRL K P LL I K + ++H I+G G +++ L+K Q++ + + Sbjct 192 DIFKIVMVGRLCDQKNPLLLIEAI-KDLESNIHVDIIGDGPLKVKLLEKINQYNIADKVS 250 Query 233 FLGEVNNFYNY-HEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLV 291 FLG ++ + ++YDLF L S EG+P++ LEA A +P+L SD+ LIE +V Sbjct 251 FLGWIDAVEEHLYKYDLFVLPSRWEGMPLAMLEAMAAKVPVLSSDIEANKYLIEKTAGVV 310 Query 292 ENTED--DIGYKLDKIFDDYENYREQAIRASGKFVIENYASAYKSIIL 337 ED D+ K++ + + E R + + +IE++ ++ IL Sbjct 311 FKDEDSKDLKRKINVLHANPE-LRNNLAHKAYQALIEDFDLTKRTKIL 357 >ref|YP_541302.1| putative galactosyltransferase WbgM [Escherichia coli UTI89] gb|ABE07771.1| putative galactosyltransferase WbgM [Escherichia coli UTI89] Length=367 Score = 77.4 bits (189), Expect = 2e-14, Method: Compositional matrix adjust. 
Identities = 86/300 (28%), Positives = 138/300 (46%), Gaps = 23/300 (7%) Query 14 GGAQTWVNEISNLIKEECNIFLITSEEGWLTHK--DVFAGVFVIPGIKKYF----DFLTL 67 G + +NEIS L + + L+ S++G LT + IP + + DF L Sbjct 19 GVQRVTLNEISALY-TDYDYTLVCSKKGPLTKALLEYDVDCHCIPELTREITVKNDFKAL 77 Query 68 FKLRKILKENNISTLIASSANAGVYARLVRLLVDF-KCIYVSHGWSCLYNGGRLKS--IF 124 FKL K +K+ + S+ G+ R+ L K I+ HG+S + KS ++ Sbjct 78 FKLYKFIKKEKFDIVHTHSSKTGILGRVAAKLARVGKVIHTVHGFSFPAASSK-KSYYLY 136 Query 125 CIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVP--QMPRCNNK--QLQY 180 +E TD + ++ DE AI + K K+ + N V + NK Sbjct 137 FFMEWIAKFFTDKLIVLNVDDEYIAINKLKFKRDKVFLIPNGVDTDKFSPLENKIYSSTL 196 Query 181 KVLFVGRLTHPKRPELL----ANVISKKPQYSLHIVGGGERLESLKKQFSECE-NIHFLG 235 ++ VGRL+ K PE L ++++ L +VG GE E L+ +F + I F G Sbjct 197 NLVMVGRLSKQKDPETLLLAVEKLLNENVNVKLTLVGDGELKEQLESRFKRQDGRIIFHG 256 Query 236 EVNNFYNYHEY-DLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEG--NGLLVE 292 +N N + DLF L S EG+P++ LEA + +P +++++ G LIE NG L E Sbjct 257 WSDNIVNILKVNDLFILPSLWEGMPLAILEALSCGLPCIVTNIPGNNSLIEDGYNGCLFE 316 >gb|ACH97149.1| WclR [Escherichia coli] Length=387 Score = 75.1 bits (183), Expect = 1e-13, Method: Compositional matrix adjust. 
Identities = 88/334 (26%), Positives = 143/334 (42%), Gaps = 44/334 (13%) Query 14 GGAQTWVNEISNLIKEECNIFLITSEEGWLT-HKDVFAGVFVIPGIKKYF----DFLTLF 68 G + + E L E+ +I LI E G LT + D F +P + + D +L Sbjct 21 GVQRVSLQEFELLPNEQFDINLICKESGPLTDYLDDSVRAFFVPTLCRNISLIKDMKSLI 80 Query 69 KLRKILKENNISTLIASSANAGVYARLVRLLVDFKCI-YVSHGWSCLYNGGRLKSIFCI- 126 L K+LK+ + S+ G+ R+ L C+ + HG++ + + KS+ + Sbjct 81 SLYKLLKKEKYDIVHTHSSKTGILGRIAARLAGVPCVVHTVHGFA--FESTKRKSVKLVY 138 Query 127 --VEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQ---MPRCNNKQLQYK 181 +E + + T + C+ D++ I+ + + KI + N V P N L+ K Sbjct 139 KWLEIFAAKCTTRLICLHNEDKEICIKELYVDPMKISVIPNGVDLEKFAPAINKGDLKEK 198 Query 182 VL----------FVGRLTHPKRPELLA---------NVISKKPQYSLHIVGGGERLESLK 222 +L VGRL K P A N+I P IVG GE + LK Sbjct 199 ILGLKRNSFVFTMVGRLWPQKNPLYFAEAAKYIIENNLI---PDSVFVIVGDGELMNDLK 255 Query 223 KQFSECENIH----FLGEVNNFYN-YHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDV 277 + N+ LG N+ N D+F L S EG+P++ LEA + +P ++S++ Sbjct 256 YNYQTDMNLKKRLLLLGWRNDIPNILKASDVFVLPSLWEGMPLAILEAQSTGLPCIVSNI 315 Query 278 GG--CFELIEGNGLLVE-NTEDDIGYKLDKIFDD 308 G C E +G L+E N D L ++ DD Sbjct 316 NGNNCLVKNEFDGFLIELNDIDSFINALVRVTDD 349 >gb|AAD50490.1|AF172324_8 WbnE [Escherichia coli] Length=392 Score = 68.2 bits (165), Expect = 1e-11, Method: Compositional matrix adjust. 
Identities = 81/328 (24%), Positives = 144/328 (43%), Gaps = 35/328 (10%) Query 1 MKNVGFIVTKSEIGGAQ-TWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGV--FVIPG 57 MK + I + G Q + E+ L E +LI EEG LT + G+ V+ Sbjct 1 MKKIAHIQLLPMLSGVQKVTLQELMILNDNEYTKYLICKEEGELTEECKRLGIKTHVVKD 60 Query 58 IKKYF----DFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCI-YVSHGWS 112 + + D + L+K+ K LK N+I + A G R+ L I + HG+ Sbjct 61 LTREINAVKDIIALYKIYKFLKANDIDIVHTHFAKTGFLGRVAAKLAGIPLIVHTVHGFP 120 Query 113 CLYNGGRLKSIFCIVEKYLSL-LTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVP--- 168 + + F V +++S D+I C+ D++ + + + E K++ + N + Sbjct 121 FDSAKNKYIAFFYKVLEFISARFADIIICLHDGDKETCKKLLHVPESKVLVLPNGIAFTE 180 Query 169 --QMPRCNNKQLQYKV---------LFVGRLTHPKRPELLAN----VISKKPQYSL--HI 211 ++ C+ K+ + + VGRL K P LL N VI++ P + + Sbjct 181 FFRLSECDKKKARTILGIPESSLVFTMVGRLWEQKNPLLLINAAKEVINEYPSDDIIFLL 240 Query 212 VGGG---ERLESLKKQFSECENIHFLGEVNNFYNYHE-YDLFSLISDSEGLPMSGLEAHT 267 +G G + +E + ++ I LG + + D+F L S EG+P++ LEA Sbjct 241 IGDGFLRKEIERIAEREIYHNKIVLLGWRKDIPDILSCSDVFVLPSRWEGMPLAILEAQA 300 Query 268 AAIPLLLSDVGGCFELIEG--NGLLVEN 293 +P ++SD+ G L++ NG L E+ Sbjct 301 TGLPCIVSDIPGNNNLVKDGVNGYLFES 328 >gb|ACD37115.1| WffO [Escherichia coli] Length=389 Score = 63.9 bits (154), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 80/314 (25%), Positives = 137/314 (43%), Gaps = 37/314 (11%) Query 14 GGAQTWVNEISNLIKEECNIFLITSEEGWLT---HKDVFAGVFVIP----GIKKYFDFLT 66 G + + EI NL E I LI E G L +K V F IP I D + Sbjct 21 GVQRVSLQEIENLPPEYFEIDLICKEGGPLVDALNKKVRK--FFIPTLCRNISPVEDLKS 78 Query 67 LFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCI-YVSHGWSCLYNGGR-LKSIF 124 L L KI K + S+ G+ R+ + C+ + HG++ + +K+++ Sbjct 79 LISLYKIFKRERYDIVHTHSSKTGILGRIAARMARVPCVVHTVHGFAFESTKKQAIKNLY 138 Query 125 CIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQ---MPRCNNKQLQYK 181 +E + + I C+ + D+ + + IK KI+ + N V P N +++ + Sbjct 139 KWLEMIGAKCSTKIICLHEEDKNICLNILKIKADKIVVIPNGVDINKFTPATNKGKIKEE 198 Query 182 VL----------FVGRLTHPKRP----ELLANVISKK--PQYSLHIVGGGERLESLKKQF 225 +L VGRL K P E+ +I + P +VG GE + +K+ + Sbjct 199 ILSLRESNFVFTMVGRLWPQKNPLYFVEVAKQIIKNELIPGSIFVLVGDGELMSVIKEHY 258 Query 226 SECENIHFLGEVNNFYN-----YHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGC 280 E E +H + + + N D+F L S EG+P++ LEA + +P ++S++ G Sbjct 259 LEDELLHNILLLLGWRNDISDSLKARDVFVLPSLWEGMPLAILEAQSTGLPWVVSNINGN 318 Query 281 FELIEG--NGLLVE 292 L+ +G LVE Sbjct 319 KSLVTNKFDGYLVE 332 >ref|YP_002293579.1| putative glycosyl transferase [Escherichia coli SE11] dbj|BAG77828.1| putative glycosyl transferase [Escherichia coli SE11] Length=357 Score = 55.1 bits (131), Expect = 1e-07, Method: Compositional matrix adjust. 
Identities = 52/158 (32%), Positives = 77/158 (48%), Gaps = 10/158 (6%) Query 173 CNNKQLQ-YKVLFVGRLTHPKRPELLANVI---SKKPQYSLHIVGGGERLESLKK--QFS 226 C L+ +K++ VGRL + K +LL SK +SL I G G + L++ QF+ Sbjct 177 CGESTLRNHKIIAVGRLEYQKGFDLLIQAFARASKDTDWSLDIYGDGTLRKELEEIIQFN 236 Query 227 ECENIHFLGEVNNFYN-YHEYDLFSLISDSEGLPMSGLEAHTAAIPLL-LSDVGGCFELI 284 E NI+ LG V+N Y +Y LF S EG M LEA A +P + + G E+ Sbjct 237 EISNINLLGNVSNIDEIYKDYSLFVFSSRFEGFGMVLLEAMRAGLPCISFNCPTGPAEIF 296 Query 285 EGN--GLLVENTEDDIGYKLDKIFDDYENYREQAIRAS 320 + G+LV+N D + K+F D R + + S Sbjct 297 DNGEYGILVDNGNIDELSNVMKMFMDSFELRSKFSKLS 334 >gb|AAC45847.1| putative GlcNAc transferase [Escherichia coli] Length=349 Score = 53.9 bits (128), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 65/237 (27%), Positives = 104/237 (43%), Gaps = 30/237 (12%) Query 109 HGWSCLYNGGR------LKSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIIT 162 HG Y GR +K++F + SLL I C S+ K E+ I K+I Sbjct 103 HGGDVKYLKGRSFIFHKIKNVFTVTLFKHSLL---ILCPSQQYAKYLCEHYNINISKVIV 159 Query 163 VSNSVPQMPRCNNKQLQY-------KVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGG 215 + + +C + + K+ F GRL K +L+ N I + L IVG G Sbjct 160 YPSG--GVKKCFKYETFFPVHDESVKIGFAGRLVKSKNVDLIINAIKQLKNVQLSIVGDG 217 Query 216 ERLESLKKQFSECENIHFLGEVNNFYN-----YHEYDLFSLISDSEGLPMSGLEAHTAAI 270 E+ + L K +C+ I FLG +N +N Y ++ S+SE L + LEA ++ + Sbjct 218 EQKDYLYKLAKDCD-IEFLGPMN--HNELARWYKTINVLVYPSESESLGLVPLEAMSSGV 274 Query 271 PLLLSDVGGCFELIEGNGLLVENTEDDIGYKLDKIFDDYENYREQAIRASGKFVIEN 327 +LS + +E I +GL E+ Y I + EN+++ I K + N Sbjct 275 YCILSKIPAFYE-IRQHGLTFSFIEN---YDSQSIAREIENFQKINITHLNKILKRN 327 >gb|AAZ20762.1| glycosyltransferase [Escherichia coli] Length=358 Score = 52.8 bits (125), Expect = 5e-07, Method: Compositional matrix adjust. 
Identities = 45/136 (33%), Positives = 71/136 (52%), Gaps = 11/136 (8%) Query 168 PQMPRCNNKQLQYKVLFVGRLTHPKRPELL----ANVISKKP-QYSLHIVGGGERLESLK 222 P++P +KQ +L VGRLT K+ LL N+I++ P ++LHIVG GE LK Sbjct 174 PELPYEMHKQNSKTILCVGRLTADKQHLLLLKMWKNIINEIPFGWTLHIVGDGELKPILK 233 Query 223 KQFSE---CENIHFLGEVNNFYNYH-EYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDV- 277 +E +++ N Y+ + F+L S SEG M LEA + +P++ D Sbjct 234 DFINENGLSQSVRLSDSTKNISKYYIDSSFFALTSKSEGFGMVILEALSFGLPIISFDCP 293 Query 278 GGCFELI-EGNGLLVE 292 G ++I + NG L++ Sbjct 294 SGPRDMINDNNGFLIQ 309 >gb|ACA24903.1| WfgN [Escherichia coli] Length=408 Score = 52.4 bits (124), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 35/132 (26%), Positives = 72/132 (54%), Gaps = 9/132 (6%) Query 192 KRPELLANVISKKPQYSLHIVGGGER--LESLKKQFSECENIHFLGEVNNFYNYHEY--- 246 K ++L+N ++K +LHI+G G++ ++L + + ENI+F+G +++ +Y Sbjct 239 KAVKILSNSTNEK--ITLHIIGPGDKKKYQNLASRLNLLENINFVGSLSDSKAVQKYLNE 296 Query 247 --DLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLVENTEDDIGYKLDK 304 D++ S EG+P + LEA + IP ++S GG E+I + + + + + + K Sbjct 297 YIDIYIQPSYQEGMPRAVLEAMSCGIPCIVSCAGGMPEIISSDYVHAKGDYKQLAHLIKK 356 Query 305 IFDDYENYREQA 316 I + YR+++ Sbjct 357 ISSSEKIYRQES 368 >gb|ACA24823.1| WfgE [Escherichia coli] Length=356 Score = 50.8 bits (120), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 40/123 (32%), Positives = 61/123 (49%), Gaps = 10/123 (8%) Query 181 KVLFVGRLTHPKRPELLANVISK----KPQYSLHIVGGG--ERLESLKKQFSECENIHFL 234 K++ VGRL H K +LL ++ +K P + LHI G G E+ + K N+ + Sbjct 186 KIMAVGRLEHQKGFDLLIDIFAKVNKSNPGWELHIYGVGTCEKFLTDKINKHGLNNVKLM 245 Query 235 GEVNNFYNYH-EYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGN---GLL 290 G V++ Y+ +Y +F+ S EG M LEA +P + D I G+ G+L Sbjct 246 GSVDHIQQYYPKYSIFAFSSRFEGFGMVLLEAMECGLPCISFDCPTGPSEILGDGEYGIL 305 Query 291 VEN 293 VEN Sbjct 306 VEN 308 >ref|ZP_03033404.1| Cps2D [Escherichia coli F11] gb|EDV67486.1| Cps2D [Escherichia coli F11] Length=1266 Score = 50.1 bits (118), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 41/127 (32%), Positives = 63/127 (49%), Gaps = 11/127 (8%) Query 181 KVLFV--GRLTHPKRPELLANVISK----KPQYSLHIVGGGERLESLKKQFSEC---ENI 231 K+ F+ GRL+ K + L N + P L I+G G L++Q +++ Sbjct 1102 KIYFITLGRLSVEKDQQKLINAFCRLQKLYPNIELLILGDGPLKIDLQRQIKTLGLEKSV 1161 Query 232 HFLGEVNN-FYNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEG-NGL 289 H LG ++N F D F L S+ EG PM EA P++ +D+ G +EG +G+ Sbjct 1162 HLLGRISNPFPLLKRADCFVLSSNHEGQPMVLFEAMILDKPIISTDITGSRSALEGRSGV 1221 Query 290 LVENTED 296 LVEN+ D Sbjct 1222 LVENSVD 1228 >gb|AAD50487.1|AF172324_5 WbnB [Escherichia coli] Length=350 Score = 49.3 bits (116), Expect = 6e-06, Method: Compositional matrix adjust. 
Identities = 55/204 (26%), Positives = 99/204 (48%), Gaps = 21/204 (10%) Query 88 NAGVYARLVRLLVDFKCIYVSHGWSCLYNGGRLKSIF-----CIVEKYLSLLTDVIWCVS 142 ++G+Y + ++ FK I + H S + R SIF I++ ++ L+D VS Sbjct 94 SSGIYLFISKIF--FKKINIVHSHSDRRSIDRRSSIFKKIYIFIMKFLINRLSDYKIAVS 151 Query 143 KSDEKKAIENIGIKE----PKIITVSNSVPQMPRCNNKQLQYKVLFVGRLTHPKRPELLA 198 + K I P I+ S+P + + ++ + +K+ +GR + K + Sbjct 152 ERAGKSLFYGSFITHYCGVPDIML---SLPDIKKVSSSE--FKIYHIGRNSDAKNYPFIF 206 Query 199 NVISKKPQY-SLHIVGGGERLESLKKQFSE--CENIHFLGEVNN--FYNYHEYDLFSLIS 253 ++ +Y ++HI G LE L+K+ E +N+HFLG + N + Y ++F + S Sbjct 207 SIAHSLREYENVHIYCMGAGLELLQKKSQEENLKNMHFLGFIENPLSHIYIHANVFIMPS 266 Query 254 DSEGLPMSGLEAHTAAIPLLLSDV 277 EGLP+S +EA +P L+SD Sbjct 267 LWEGLPLSVVEAQKCNVPCLVSDT 290 >ref|NP_755569.1| hypothetical protein c3694 [Escherichia coli CFT073] gb|AAN82142.1|AE016766_230 Hypothetical protein c3694 [Escherichia coli CFT073] Length=1266 Score = 49.3 bits (116), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 41/127 (32%), Positives = 63/127 (49%), Gaps = 11/127 (8%) Query 181 KVLFV--GRLTHPKRPELLANVISK----KPQYSLHIVGGGERLESLKKQFSEC---ENI 231 K+ F+ GRL+ K + L N + P L I+G G L++Q +++ Sbjct 1102 KIYFITLGRLSVEKDQQKLINAFCRLQKLYPNIELLILGDGPLKIDLQRQIITLGLEKSV 1161 Query 232 HFLGEVNN-FYNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEG-NGL 289 H LG ++N F D F L S+ EG PM EA P++ +D+ G +EG +G+ Sbjct 1162 HLLGRISNPFPLLKRADCFVLSSNHEGQPMVLFEAMILDKPIISTDITGSRSALEGRSGV 1221 Query 290 LVENTED 296 LVEN+ D Sbjct 1222 LVENSVD 1228 >gb|ABB29908.1| WfaO [Escherichia coli] Length=356 Score = 48.9 bits (115), Expect = 8e-06, Method: Compositional matrix adjust. 
Identities = 40/123 (32%), Positives = 62/123 (50%), Gaps = 10/123 (8%) Query 181 KVLFVGRLTHPKRPELLANVISK----KPQYSLHIVGGGERLESLKKQFSE--CENIHFL 234 K++ VGRL + K ++L ++ ++ P + LHI G G E L+ + ++ NI + Sbjct 186 KIIAVGRLEYQKGFDILIDIFARVNKEHPGWELHIYGVGTCEEFLRDKINQYKLNNIKLM 245 Query 235 GEVNNFYNYH-EYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGN---GLL 290 G V+N Y+ +Y +F S EG M LEA +P + D I GN G+L Sbjct 246 GCVDNIQLYYPKYSVFVFSSRFEGFGMVLLEAMECGLPCISFDCPTGPSEILGNGQYGIL 305 Query 291 VEN 293 VEN Sbjct 306 VEN 308 >gb|ACA24850.1| WffZ [Escherichia coli] Length=351 Score = 45.1 bits (105), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 45/145 (31%), Positives = 72/145 (49%), Gaps = 15/145 (10%) Query 181 KVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGGERLESLKKQFSE--CENIHFLGEVN 238 + ++VGR++ K + + V P Y L ++G G LKKQF + NI FLG ++ Sbjct 193 RFIYVGRISSEKNIDFMVKVFKTLP-YELILIGDG----PLKKQFDDKTYSNIRFLGYID 247 Query 239 NFYNYHEY---DLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELI--EGNGLL--V 291 N E D F L S SE + EA T +P+++S+ GC + + NG++ V Sbjct 248 NKKLSKELLKSDCFILPSLSEPWGLVVEEALTLGLPVIVSNHVGCHSDLVNDRNGIIFDV 307 Query 292 ENTEDDIGYKLDKIFDDYENYREQA 316 +T+ I L K+ +YE + A Sbjct 308 NDTQSFID-ALSKMEKNYERFARGA 331 >gb|ACA24849.1| WffY [Escherichia coli] Length=366 Score = 44.3 bits (103), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 49/144 (34%), Positives = 67/144 (46%), Gaps = 17/144 (11%) Query 158 PKIITV-SNSVPQMPRCNNKQLQYKVLFVGRLTHPKRPELL--ANVISKKPQYSLHIVGG 214 P II+ + +PQ NK Q VL VGRLTH K +LL A + + L I+G Sbjct 177 PNIISFEATDIPQ-----NKIEQKNVLAVGRLTHQKGFDLLLQAWADANTHDWRLKIIGD 231 Query 215 GERLESLKKQFSE-----CENIHFLGEVNNFYNYHEYDLFSLISDSEGLPMSGLEAHTAA 269 GE L L +E E I F ++ +Y +F L S EGL M LEA ++ Sbjct 232 GEELNHLNSLITELNISNAEIIPFQKDIQR--HYSSAGIFVLSSRFEGLGMVLLEALSSG 289 Query 270 IPLLLSDV-GGCFELIEG-NGLLV 291 + + D G +I NG+LV Sbjct 290 LACISFDCPAGPKSIISSDNGVLV 313 >ref|YP_002403325.1| WbwB [Escherichia coli 55989] emb|CAU98164.1| WbwB [Escherichia coli 55989] Length=407 Score = 43.5 bits (101), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 39/140 (27%), Positives = 68/140 (48%), Gaps = 17/140 (12%) Query 154 GIKEPKIITVSNSVPQMPRCNNKQLQYK---------VLFVGRLTHPKRPE----LLANV 200 G + +IIT+ N N ++Q + ++ V LT KR + + + Sbjct 196 GFNDKEIITIYNPFNFTELEGNSRIQCEGNIPLPKEFIVTVSTLTDRKRVDRTIKAMPKI 255 Query 201 ISKKPQYSLHIVGGGE---RLESLKKQFSECENIHFLG-EVNNFYNYHEYDLFSLISDSE 256 I + + L I+G G+ L++L K+ + + +HFLG + N +Y ++ L L SDSE Sbjct 256 IREYGEIDLLIIGEGQLRNDLQNLVKELNIEKYVHFLGFQTNPYYFINKAQLLILSSDSE 315 Query 257 GLPMSGLEAHTAAIPLLLSD 276 GLP +E+ P+L +D Sbjct 316 GLPTVIIESLILGTPVLSTD 335 >gb|AAK64373.1|AF361371_8 WbwB [Escherichia coli] Length=407 Score = 43.5 bits (101), Expect = 4e-04, Method: Compositional matrix adjust. 
Identities = 39/140 (27%), Positives = 68/140 (48%), Gaps = 17/140 (12%) Query 154 GIKEPKIITVSNSVPQMPRCNNKQLQYK---------VLFVGRLTHPKRPE----LLANV 200 G + +IIT+ N N ++Q + ++ V LT KR + + + Sbjct 196 GFNDKEIITIYNPFNFTELEGNSRIQCEGNIPLPKEFIVTVSTLTDRKRVDRTIKAMPKI 255 Query 201 ISKKPQYSLHIVGGGE---RLESLKKQFSECENIHFLG-EVNNFYNYHEYDLFSLISDSE 256 I + + L I+G G+ L++L K+ + + +HFLG + N +Y ++ L L SDSE Sbjct 256 IREYGEIDLLIIGEGQLRNDLQNLVKELNIEKYVHFLGFQTNPYYFINKAQLLILSSDSE 315 Query 257 GLPMSGLEAHTAAIPLLLSD 276 GLP +E+ P+L +D Sbjct 316 GLPTVIIESLILGTPVLSTD 335 >ref|ZP_03061297.1| glycosyl transferase, group 1 family protein [Escherichia coli B171] gb|AAD46732.1|AF078736_12 putative glycosyl transferase [Escherichia coli] gb|EDX29494.1| glycosyl transferase, group 1 family protein [Escherichia coli B171] Length=374 Score = 42.7 bits (99), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 61/247 (24%), Positives = 101/247 (40%), Gaps = 32/247 (12%) Query 64 FLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVD-FKCIYVSHGWSCLYNGGRLKS 122 F LF+++KI+ + + +A +++R +R+L+ I +H N G Sbjct 65 FRALFQVKKIIVALKPDIIHSHMFHANIFSRFIRMLIPAVPLICTAHN----KNEGGNAR 120 Query 123 IFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSN-----------SVPQMP 171 +FC + L + VSK ++ I + KI+ + N +V + Sbjct 121 MFCY--RLSDFLASITTNVSKEAVQEFIARKATPKNKIVEIPNFINTNKFDFDINVRKKT 178 Query 172 R--CNNKQLQYKVLFVGRLTHPKRPELLANVI--------SKKPQYSLHIVGGG---ERL 218 R N K +L VGRL K L N I S + L I G G +L Sbjct 179 RDAFNLKDSTAVLLAVGRLVEAKDYPNLLNAINHLILSKTSNCNDFILLIAGDGALRNKL 238 Query 219 ESLKKQFSECENIHFLGEVNNFYNYH-EYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDV 277 L Q + + + FLG+ ++ DLF L S+ EG + EA P++ +D Sbjct 239 LDLVCQLNLVDKVFFLGQRSDIKELMCAADLFVLSSEWEGFGLVVAEAMACERPVVATDS 298 Query 278 GGCFELI 284 GG E++ Sbjct 299 GGVKEVV 305 >ref|ZP_03029511.1| WbbG [Escherichia coli B7A] gb|ABA42235.1| WbbG [Escherichia coli] gb|EDV61957.1| WbbG [Escherichia coli B7A] Length=363 Score = 42.7 bits (99), Expect = 6e-04, Method: Compositional matrix adjust. 
Identities = 80/318 (25%), Positives = 126/318 (39%), Gaps = 43/318 (13%) Query 47 DVFAGVFVIPGIKK---YFDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFK 103 D+ V IP +K+ + DF L K+ + +S G+ AR+ L K Sbjct 55 DIGVRVITIPTLKRNIGWHDFRCFIDLYNFFKKEKFDIVHTNSTKPGIIARIAARLAGTK 114 Query 104 C-IYVSHGWSCLYNGGRLKSIF--CIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEP-- 158 I+ HG + ++ IF C+ E + +L + V+ EN P Sbjct 115 LIIHTVHGIAFHRKENTVRKIFYYCL-ENFATLFGSINVTVN--------ENYLKYYPFV 165 Query 159 KIITVSNSVPQMPRCNNKQLQ--YKVLFVGRLTHPKRP-ELL--ANVISKK---PQYSLH 210 K + N V C NK+ + F+ RL K P E + N+I KK + Sbjct 166 KSHIIYNGVDFNVLCCNKKDHDFLHIAFMARLDKQKNPLEFIRAVNIIKKKLPNERLKFT 225 Query 211 IVGGGERLESLKK---QFSECENIHFLGEV---NNFYNYHEYDLFSLISDSEGLPMSGLE 264 + G GE KK F + I G + N FYN D+ S+ E + +E Sbjct 226 LAGCGELENECKKLIEHFHLTDVIDMPGWIVDKNTFYN--SVDIICQPSNWEAFGLVFVE 283 Query 265 AHTAAIPLLLSDVGGCFELIEGN--GLLVENTEDDIGYKLDKIFDDYE-------NYREQ 315 A IP + ++ G E+I N GLL E E ++ KL + D + N +E Sbjct 284 AAFFEIPSVSRNIEGIPEVILDNETGLLYEGGEAELSEKLISLIHDKKKISWLGLNAKEY 343 Query 316 AIRASGK-FVIENYASAY 332 ++ K ++E Y+ Y Sbjct 344 VLKHFTKDIMVEKYSKLY 361 >ref|ZP_03052481.1| hypothetical protein EcE110019_3652 [Escherichia coli E110019] gb|EDV85605.1| hypothetical protein EcE110019_3652 [Escherichia coli E110019] Length=374 Score = 42.7 bits (99), Expect = 6e-04, Method: Compositional matrix adjust. 
Identities = 61/247 (24%), Positives = 101/247 (40%), Gaps = 32/247 (12%) Query 64 FLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVD-FKCIYVSHGWSCLYNGGRLKS 122 F LF+++KI+ + + +A +++R +R+L+ I +H N G Sbjct 65 FRALFQVKKIIVALKPDIIHSHMFHANIFSRFIRMLIPAVPLICTAHN----KNEGGNAR 120 Query 123 IFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSN-----------SVPQMP 171 +FC + L + VSK ++ I + KI+ + N +V + Sbjct 121 MFCY--RLSDFLASITTNVSKEAVQEFIARKATPKNKIVEIPNFINTNKFNFDINVRKKT 178 Query 172 R--CNNKQLQYKVLFVGRLTHPKRPELLANVI--------SKKPQYSLHIVGGG---ERL 218 R N K +L VGRL K L N I S + L I G G +L Sbjct 179 RDAFNLKDSTAVLLAVGRLVEAKDYPNLLNAINHLILSKTSNCNDFILLIAGDGALRNKL 238 Query 219 ESLKKQFSECENIHFLGEVNNFYNYH-EYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDV 277 L Q + + + FLG+ ++ DLF L S+ EG + EA P++ +D Sbjct 239 LDLVCQLNLVDKVFFLGQRSDIKELMCAADLFVLSSEWEGFGLVVAEAMACERPVVATDS 298 Query 278 GGCFELI 284 GG E++ Sbjct 299 GGVKEVV 305 >gb|ABB29914.1| WfaQ [Escherichia coli] Length=362 Score = 42.4 bits (98), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 35/123 (28%), Positives = 61/123 (49%), Gaps = 10/123 (8%) Query 181 KVLFVGRLTHPKRPELLANVIS----KKPQYSLHIVGGGERLESLKKQFSE--CENIHFL 234 +++ VGRL K ++L S K P++ L I G G E+L+K S+ +N++ + Sbjct 186 RIISVGRLEKQKGFDMLLKAFSYISYKYPEWQLDIFGKGNEEENLRKLISKLNLKNVNLM 245 Query 235 GEVNNFYNYHEYDLFSLISDS-EGLPMSGLEAHTAAIPLLL--SDVGGCFELIEG-NGLL 290 N + + F ++S EG PM LEA + +P + + G +I+ NG L Sbjct 246 RTSKNIHQEYLSSAFYVMSSRYEGFPMVLLEAMASGLPCISFNCETGPADIIIDNENGFL 305 Query 291 VEN 293 +E+ Sbjct 306 IEH 308 >ref|YP_001726129.1| glycosyl transferase group 1 [Escherichia coli ATCC 8739] gb|ACA78802.1| glycosyl transferase group 1 [Escherichia coli ATCC 8739] Length=403 Score = 40.8 bits (94), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 59/225 (26%), Positives = 93/225 (41%), Gaps = 32/225 (14%) Query 134 LTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCNNKQLQYKVLFVGRLTHPKR 193 + DVI +S S+ K I + +I + N V P + + +L+VGRL+ K Sbjct 186 MLDVI--ISPSEFLKGILRRKLPHSRIDVIVNGVDDDPATDKTADKGYLLYVGRLSREKG 243 Query 194 PELLANVISK-KPQYSLHIVGGGERLESLKKQFSECENIHFLGEVNNFYNYHEYDLFSLI 252 L K + + L +VG G + L + + E FLG Y L +LI Sbjct 244 VATLPLAHQKMRNRAPLKVVGHGPLYDELVANYPDVE---FLG-----YVQQGEALNTLI 295 Query 253 SDSEG--LP--------MSGLEAHTAAIPLLLSDVGGCFELIEG--NGLLVE-NTEDDIG 299 ++ LP MS LEA + A P++ S +GG E I +G+L E D+ Sbjct 296 KEARAVILPSECYENCSMSVLEAMSFAKPVIGSRIGGIPEQIRDGIDGVLFEPGNVQDLA 355 Query 300 YKLDKIFDDYENYREQAIRASGKFV--------IENYASAYKSII 336 +D + D E R + A + +E + YK I+ Sbjct 356 NAMDYMIDSPEKARVMGLSARERLREKYTLQKHMETLTALYKEIL 400 >gb|ACD37088.1| WfeG [Escherichia coli] Length=393 Score = 39.3 bits (90), Expect = 0.007, Method: Compositional matrix adjust. Identities = 59/259 (22%), Positives = 105/259 (40%), Gaps = 30/259 (11%) Query 64 FLTLFKLRKILKENNISTLIASSANAGV-----YARLVRLLVDFKCIYVSHGWSCLYNGG 118 F L RKI++ENNI I+ + + +++ + + + LY G Sbjct 69 FYRLHLARKIIRENNIDICISFGERCNIINILSMGKTKKIITIHSQLSIENKTKGLY--G 126 Query 119 RLKSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCNNKQL 178 ++ ++F K L D VS+ +K A + + + + N + +K Sbjct 127 KVTTLF---SKLLYKNADATVAVSEIVKKDACGLLNLDANNVEIIYNG-HDIGYIKDKST 182 Query 179 QYK--------VLFVGRLTHPKRPELL----ANVISKKPQYSLHIVGGGER------LES 220 YK + VGR+T+ K L A V P L+IVG E+ ++ Sbjct 183 DYKEFDTPVIDFVSVGRITYAKGHYHLLRSPAIVKETYPNVILYIVGTYEKDNLKSIIDH 242 Query 221 LKKQFSECENIHFLGEVNNFYNY-HEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGG 279 L +++ +N+ F G +N Y Y L S EG P +E+ P++ +D GG Sbjct 243 LIEKYDLYDNVIFTGFSDNPYPYIKSAKALILSSIFEGFPGVVIESIALGTPVIATDCGG 302 Query 280 CFELIEGNGLLVENTEDDI 298 E++ ++N D+ Sbjct 303 ASEVLRSPDAKIKNNTGDV 321 >gb|AAD21569.1| glycosyltransferase WcaO [Escherichia coli] Length=393 Score = 38.9 bits (89), Expect = 0.008, Method: Compositional matrix adjust. 
Identities = 54/194 (27%), Positives = 86/194 (44%), Gaps = 26/194 (13%) Query 147 KKAIENIGIKEP-----KIITVSNSVPQMPRCNNKQLQYKVLFVGRLTHPKRP--ELLAN 199 K + N GI +P ++++N+ Q+ N+ + +++VGRLT P++ E + Sbjct 176 KSTLSNYGIIKPISVIRNPVSIANTERQI-LLNDGSIH--IVYVGRLT-PEKGIVEFIKK 231 Query 200 VISKKPQ-YSLHIVGGGERLESLKK-QFSECENIHFLGEVNN---FYNYHEYDLFSLISD 254 V + Q LHI G GE E +K + E I F G ++ +Y +F L S Sbjct 232 VNHETTQAIHLHIYGAGESAEEIKSIKCREGFKILFHGFIDRDLLITEISKYHIFVLPSI 291 Query 255 -SEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLVENTEDDIGYKLDKIFDDYENYR 313 E P+S +EA A +P+++ + GG E+ E E YK D D Sbjct 292 WLENAPVSIVEAAEAGLPVVVPNYGGLAEMAE---------ETLYNYKFDYEDSDLSEVI 342 Query 314 EQAIRASGKFVIEN 327 QA GK + N Sbjct 343 TQAADKKGKNKLNN 356 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jun 16, 2009 5:41 PM Number of letters in database: 26,573,871 Number of sequences in database: 84,272 Lambda K H 0.320 0.139 0.412 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 84272 Number of Hits to DB: 4318030 Number of extensions: 179018 Number of successful extensions: 406 Number of sequences better than 0.1: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's gapped: 410 Number of HSP's successfully gapped: 0 Length of query: 338 Length of database: 26573871 Length adjustment: 104 Effective length of query: 234 Effective length of database: 17809583 Effective search space: 4167442422 Effective search space used: 4167442422 T: 11 A: 40 X1: 16 (7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 80 (35.4 bits)
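Note on the statistics above: the bit scores and Expect values in the alignment headers can be approximately recomputed from the trailer parameters. The sketch below uses the standard Karlin-Altschul relations, bit score S' = (Lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). It assumes that the gapped Lambda and K printed in the trailer are the ones that apply to these hits. The function and constant names are illustrative and are not part of BLAST. Because the report prints Lambda and K to only a few digits, and compositional matrix adjustment is in use, the results should be expected to match the reported values only to within rounding.

```python
import math

# Values copied from the report trailer (gapped statistics).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 4167442422


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE,
                 lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Approximate the E-value as search_space * 2^(-bit score)."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    # Raw scores taken from the "Score = ... bits (raw)" headers above.
    for raw in (99, 94, 89):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
```

For the last hit (raw score 89, reported as 38.9 bits with Expect = 0.008), the sketch gives a bit score of about 38.9 and an E-value of roughly 0.008. Small differences for other hits are consistent with the rounding of the printed Lambda.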