Reformat these Results Edit and Resubmit [Sign in above to save your search strategy]

Job Title: gb|AAV80749| (338 letters)

BLASTP 2.2.21+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro
A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Stephen
F. Altschul, John C. Wootton, E. Michael Gertz, Richa
Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and
Yi-Kuo Yu (2005) "Protein database searches using
compositionally adjusted substitution matrices", FEBS J.
272:5101-5109.


RID: 3H9FS32F013


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           9,079,606 sequences; 3,109,523,384 total letters
Query= gi|56159885|gb|AAV80749.1| predicted glycosyl transferase [Escherichia
coli O127:H6 str. E2348/69] >gi|37528724|gb|AAO37709.1| putative
glycosyltransferase [Escherichia coli] >gi|40794691|gb|AAR90884.1|
putative glycosyltransferase [Escherichia coli]
>gi|56159885|gb|AAV80749.1| putative glycosyltransferase [Escherichia
coli] >gi|56384973|gb|AAV85953.1| WcmA [Escherichia coli]
>gi|215265334|emb|CAS09729.1| predicted glycosyl transferase
[Escherichia coli O127:H6 str. E2348/69]
Length=338


                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

ref|YP_002329693.1|  predicted glycosyl transferase [Escherich...   689    0.0  
ref|YP_002387516.1|  putative glycosyltransferase [Escherichia...   296    2e-80
gb|ACA24904.1|  WfgO [Escherichia coli]                             111    1e-24
gb|ABI34559.1|  putative glycosyl transferase [Escherichia coli]    105    7e-23
gb|AAO37690.1|  putative galactosyltransferase [Escherichia coli]  88.2    1e-17
dbj|BAG11904.1|  putative galactosyltransferase WbgM [Escheric...  87.0    3e-17
gb|AAL67552.1|AF461121_3  putative galactosyltransferase WbgM ...  85.5    8e-17
ref|YP_541302.1|  putative galactosyltransferase WbgM [Escheri...  77.4    2e-14
gb|ACH97149.1|  WclR [Escherichia coli]                            75.1    1e-13
gb|AAD50490.1|AF172324_8  WbnE [Escherichia coli]                  68.2    1e-11
gb|ACD37115.1|  WffO [Escherichia coli]                            63.9    2e-10
ref|YP_002293579.1|  putative glycosyl transferase [Escherichi...  55.1    1e-07
gb|AAC45847.1|  putative GlcNAc transferase [Escherichia coli]     53.9    3e-07
gb|AAZ20762.1|  glycosyltransferase [Escherichia coli]             52.8    5e-07
gb|ACA24903.1|  WfgN [Escherichia coli]                            52.4    7e-07
gb|ACA24823.1|  WfgE [Escherichia coli]                            50.8    2e-06
ref|ZP_03033404.1|  Cps2D [Escherichia coli F11] >gb|EDV67486....  50.1    4e-06
gb|AAD50487.1|AF172324_5  WbnB [Escherichia coli]                  49.3    6e-06
ref|NP_755569.1|  hypothetical protein c3694 [Escherichia coli...  49.3    6e-06
gb|ABB29908.1|  WfaO [Escherichia coli]                            48.9    8e-06
gb|ACA24850.1|  WffZ [Escherichia coli]                            45.1    1e-04
gb|ACA24849.1|  WffY [Escherichia coli]                            44.3    2e-04
ref|YP_002403325.1|  WbwB [Escherichia coli 55989] >emb|CAU981...  43.5    3e-04
gb|AAK64373.1|AF361371_8  WbwB [Escherichia coli]                  43.5    4e-04
ref|ZP_03061297.1|  glycosyl transferase, group 1 family prote...  42.7    5e-04
ref|ZP_03029511.1|  WbbG [Escherichia coli B7A] >gb|ABA42235.1...  42.7    6e-04
ref|ZP_03052481.1|  hypothetical protein EcE110019_3652 [Esche...  42.7    6e-04
gb|ABB29914.1|  WfaQ [Escherichia coli]                            42.4    8e-04
ref|YP_001726129.1|  glycosyl transferase group 1 [Escherichia...  40.8    0.002
gb|ACD37088.1|  WfeG [Escherichia coli]                            39.3    0.007
gb|AAD21569.1|  glycosyltransferase WcaO [Escherichia coli]        38.9    0.008

ALIGNMENTS
>ref|YP_002329693.1| predicted glycosyl transferase [Escherichia coli O127:H6 str. 
E2348/69]
 gb|AAO37709.1| putative glycosyltransferase [Escherichia coli]
 gb|AAR90884.1| putative glycosyltransferase [Escherichia coli]
 gb|AAV80749.1| putative glycosyltransferase [Escherichia coli]
 gb|AAV85953.1| WcmA [Escherichia coli]
 emb|CAS09729.1| predicted glycosyl transferase [Escherichia coli O127:H6 str. 
E2348/69]
Length=338

 Score =  689 bits (1778),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 338/338 (100%), Positives = 338/338 (100%), Gaps = 0/338 (0%)

Query  1    MKNVGFIVTKSEIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGVFVIPGIKK  60
            MKNVGFIVTKSEIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGVFVIPGIKK
Sbjct  1    MKNVGFIVTKSEIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGVFVIPGIKK  60

Query  61   YFDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCIYVSHGWSCLYNGGRL  120
            YFDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCIYVSHGWSCLYNGGRL
Sbjct  61   YFDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCIYVSHGWSCLYNGGRL  120

Query  121  KSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCNNKQLQY  180
            KSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCNNKQLQY
Sbjct  121  KSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCNNKQLQY  180

Query  181  KVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGGERLESLKKQFSECENIHFLGEVNNF  240
            KVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGGERLESLKKQFSECENIHFLGEVNNF
Sbjct  181  KVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGGERLESLKKQFSECENIHFLGEVNNF  240

Query  241  YNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLVENTEDDIGY  300
            YNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLVENTEDDIGY
Sbjct  241  YNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLVENTEDDIGY  300

Query  301  KLDKIFDDYENYREQAIRASGKFVIENYASAYKSIILG  338
            KLDKIFDDYENYREQAIRASGKFVIENYASAYKSIILG
Sbjct  301  KLDKIFDDYENYREQAIRASGKFVIENYASAYKSIILG  338


>ref|YP_002387516.1| putative glycosyltransferase [Escherichia coli IAI1]
 emb|CAQ98958.1| putative glycosyltransferase [Escherichia coli IAI1]
Length=352

 Score =  296 bits (759),  Expect = 2e-80, Method: Compositional matrix adjust.
 Identities = 154/342 (45%), Positives = 221/342 (64%), Gaps = 13/342 (3%)

Query  2    KNVGFIVTKSEIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGVFVIPGIKKY  61
            K + FI+TKSE+GGAQ WV+E   L++++ + FLITS  GWLT       VF +P +   
Sbjct  7    KRLVFIITKSEVGGAQKWVSEQKLLLEDKYDTFLITSCTGWLTDNFSPDKVFFVPALTNI  66

Query  62   FDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCIYVSHGWSCLYNGGRLK  121
                 LF + KIL+      ++++SANAG+YARL +++   + IYVSHGWSC+YNGGR K
Sbjct  67   KKISNLFSIAKILRMLKADIVVSNSANAGLYARLAKIIWKHRSIYVSHGWSCIYNGGRAK  126

Query  122  SIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCNNKQLQY-  180
             I C +E++LS  +D I CVS++D+  A+  IGIKE K+  + N+    P    K+  + 
Sbjct  127  KILCFIERFLSFFSDAILCVSENDKDNALNIIGIKESKLKLIKNAT--FPTNKEKKFWHI  184

Query  181  -----KVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGGERLESLKKQFSECENIHFLG  235
                 +++FVGR+THPKRP+LLA  +S+K    L +VGGGE LE LK  +   +NIHF+G
Sbjct  185  MPKVLRLVFVGRMTHPKRPDLLAETLSRKKDVELFLVGGGEYLERLKNIYKNYDNIHFVG  244

Query  236  EVNNFYNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELI---EG--NGLL  290
            E+ +F NY +YD F L+S+SEGLPMS +EA    +PLLLSDVGGC ELI   EG  NG+L
Sbjct  245  EIKDFNNYDDYDAFILVSESEGLPMSAIEAGVTGLPLLLSDVGGCHELIGEYEGKYNGVL  304

Query  291  VENTEDDIGYKLDKIFDDYENYREQAIRASGKFVIENYASAY  332
              N  +DI   +D++ ++YE Y + A + S +F + ++   Y
Sbjct  305  FNNNINDISRAIDEVRNNYEQYCKVANKISCQFNLNSFKEDY  346


>gb|ACA24904.1| WfgO [Escherichia coli]
Length=368

 Score =  111 bits (277),  Expect = 1e-24, Method: Compositional matrix adjust.
 Identities = 105/336 (31%), Positives = 172/336 (51%), Gaps = 35/336 (10%)

Query  4    VGFIVTKS-EIGGAQTWVNEISNLIKEEC-NIFLITSEEGWLTHKDVFAGV--FVIPG--  57
            V +I+TK+ EIGGAQ  + ++S+ +KE+  ++ +I  E G L  + +  GV   ++P   
Sbjct  3    VLYIITKADEIGGAQIHIRDLSSRLKEDGHDVVVIVGEHGALVDELIKRGVAYHIVPSLV  62

Query  58   -----IKKYFDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCIYVSHGWS  112
                 IK     + + KL  IL  + IS     S+ AG+  RL  L      I+ +HGW+
Sbjct  63   REINPIKDLRAVIEISKLISILDPDIISL---HSSKAGIIGRLAALRKKKPVIFTAHGWA  119

Query  113  CLYNG--GRLKSIFCIVEKYLSLLTDVIWCVSKSDEKKAIE-NIGIKEPKII----TVSN  165
               NG     + ++CI+EK +  L   I  VS+ D++ A+E N+   E +++     + +
Sbjct  120  -FANGVSKNRQKLYCIIEKIIEPLASKIITVSEQDKQLALELNVSSHEKQVVIHNGMMQS  178

Query  166  SVPQ--MPRCNNKQLQYKVLFVGRLTHPKRPELLANVISK--KPQYSLHIVGGGERLE--  219
            S+P   + R +NK ++  ++ V R +  K    L   +S+     + L +VG G  LE  
Sbjct  179  SLPPRFVNRTSNKTVE--LISVARFSEQKDHRTLFVALSQINNLNWRLTLVGKGPLLEYY  236

Query  220  -SLKKQFSECENIHFLGEVNNFYNYH-EYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDV  277
             +L ++ +  E I FLGE ++        D+F LIS  EG P S LEA  A +P++ S+V
Sbjct  237  KTLARKLNIHERIQFLGERHDVAELMVRSDVFLLISKWEGFPRSILEAMRAGLPVIASNV  296

Query  278  GGCFELIEG--NGLLVENTE-DDIGYKLDKIFDDYE  310
            GG  E I     G LVE  + D + +KL K+  + E
Sbjct  297  GGTSEAINDGITGFLVEREDVDGLKHKLCKLLSEPE  332


>gb|ABI34559.1| putative glycosyl transferase [Escherichia coli]
Length=363

 Score =  105 bits (262),  Expect = 7e-23, Method: Compositional matrix adjust.
 Identities = 94/314 (29%), Positives = 158/314 (50%), Gaps = 29/314 (9%)

Query  1    MKNVGFIVTKSEIGGAQTWVNEISNLIKEECNI--FLITSEEGWLTHKDVFA----GVFV  54
            MK +  I    E+GGAQT V ++   +  + NI  +L    +G  T  D+       V  
Sbjct  1    MKVLFLITRGDELGGAQTHVKDVILGLINKYNIECYLACGTKGIFT--DIMEENNINVIH  58

Query  55   IPGIKK---YFDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCIYVSHGW  111
            I  +K+   + D + L KL  I+K+ N   +   S+ AGV  RL  L    K ++ +HGW
Sbjct  59   IDSMKREICFGDIIALKKLNDIIKDINPDIISCHSSKAGVLGRLASLGTRTKKVFTAHGW  118

Query  112  SCLYNGG---RLKSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVP  168
            +  +  G   +  +I+  +E  LS +TD    VS  D+K A++  G+K    + + N +P
Sbjct  119  A--FTEGISPKKAAIYKKIELLLSYITDATINVSYYDKKLALK-AGLKSQHYV-IHNCIP  174

Query  169  QMPRCNNKQLQYKV----LFVGRLTHPKRPELLANVISK--KPQYSLHIVGGGER--LES  220
             +    N  +  K     + V R    K  E L    S   K ++ L ++GGG+   ++ 
Sbjct  175  DVHYEKNNGIANKTVLEFIMVARFCAQKDHETLLKAFSNIDKEKWRLTLIGGGDSKSIKE  234

Query  221  LKKQFSECENIHFLGEVNNFYNY-HEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGG  279
            L K+ +   NI+F+G+  N  ++ +  D+F LIS+ EG P+S LEA  +++P++ +DVGG
Sbjct  235  LAKKLNIDNNINFVGQTKNVVDFLNHSDVFLLISNWEGFPISILEAMRSSLPIIATDVGG  294

Query  280  CFELIEG--NGLLV  291
              E ++   NG L+
Sbjct  295  VSEAVKHGYNGFLI  308


>gb|AAO37690.1| putative galactosyltransferase [Escherichia coli]
Length=373

 Score = 88.2 bits (217),  Expect = 1e-17, Method: Compositional matrix adjust.
 Identities = 86/333 (25%), Positives = 155/333 (46%), Gaps = 27/333 (8%)

Query  11   SEIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGV--FVIPGIKKYF----DF  64
            S+I GAQ    +    +    + +++ S+EG  T +    GV  +VI  + +      DF
Sbjct  11   SKISGAQRVSLDEMKTLSNHYSQYMVCSKEGDFTQEADRIGVKTYVIETLVREISPLKDF  70

Query  65   LTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFK-CIYVSHGWSCLYNGGRL-KS  122
             +L KL K +K+     +   S+ +G   RL   L   K  I+  HG++      R+ K 
Sbjct  71   YSLIKLYKFIKQEKFDIIHTHSSKSGFLGRLAAKLAGTKQIIHTVHGFAFPSTSNRVVKL  130

Query  123  IFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCN--------  174
            I+ ++E + SL + VI  ++++DEK A +         +T+ N+   + +          
Sbjct  131  IYFLMEYFASLCSSVIIVMNENDEKIARKYFSSAPWTKVTLLNNAVDIKKFQKRYIGIES  190

Query  175  ----NKQLQYKVLFVGRLTHPKRPELLANVIS-KKPQYSLHIVGGGERLESLKKQFSEC-  228
                N+Q ++K++ +GRL   K P L+ N +      Y +  VG G     L+ Q ++  
Sbjct  191  KSEINEQKKFKMVMIGRLCEQKNPLLIINALKILGDHYYVDFVGDGPLRSDLESQIAKRG  250

Query  229  --ENIHFLGEVNNFYNY-HEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIE  285
              + +  LG  ++      +YDLF L S  EG+P++ LEA  + +P+L S++     LI 
Sbjct  251  LEKRVRLLGWCSSVEEIIFKYDLFLLPSKWEGMPLAILEAMASKVPVLCSNIDANAYLIN  310

Query  286  GNGLLVENTED--DIGYKLDKIFDDYENYREQA  316
                 + N +D  D+   +  IFD+ +  R+ A
Sbjct  311  KTSGFLFNNDDAKDLAKNIKYIFDNVDVRRKVA  343


>dbj|BAG11904.1| putative galactosyltransferase WbgM [Escherichia coli O55:H7]
Length=366

 Score = 87.0 bits (214),  Expect = 3e-17, Method: Compositional matrix adjust.
 Identities = 93/350 (26%), Positives = 176/350 (50%), Gaps = 26/350 (7%)

Query  12   EIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGV--FVIPGIKK----YFDFL  65
            +I GAQ    +  + + +E    ++ S+EG L  +    GV   +IP + +    + D  
Sbjct  12   DISGAQRVSLDEMHTLSQEFQQSMVCSKEGRLAEQARCFGVCTHIIPTLTREISLFKDCA  71

Query  66   TLFKLRKILKENNISTLIASSANAGVYARLV-RLLVDFKCIYVSHGWSCLYNGGRL-KSI  123
            +LF+L KI+K+     +   S+  G   R+  +L    K ++  HG++      +L K I
Sbjct  72   SLFQLYKIIKKEKFDIVHTHSSKTGFLGRVAAKLAGTKKIVHTVHGFAFPSTENKLIKFI  131

Query  124  FCIVEKYLSLLTDVIWCVSKSDEKKAIEN-IGIKEPKIITVSNSVPQMPRCNNKQLQYK-  181
            + ++E   S  +++I  +++SDE+ A +  +  K+ K++ ++N++       +K      
Sbjct  132  YFLMELIASYCSNIIIVMNESDERIARKYFVKNKKSKLLLINNAIDVDKYNKDKDKDKDK  191

Query  182  ------VLFVGRLTHPKRPELLANVISKKPQYSLH--IVGGGE-RLESLKK--QFSECEN  230
                  ++ VGRL   K P LL   I K  + ++H  I+G G  +++ L+K  Q++  + 
Sbjct  192  DKDIFKIVMVGRLCDQKNPLLLIEAI-KDLESNIHVDIIGDGPLKVKLLEKINQYNIADK  250

Query  231  IHFLGEVNNFYNY-HEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGL  289
            + FLG ++    + ++YDLF L S  EG+P++ LEA  A +P+L SD+     LIE    
Sbjct  251  VSFLGWIDAVEEHLYKYDLFVLPSRWEGMPLAMLEAMAAKVPVLSSDIEANKYLIEKTAG  310

Query  290  LVENTED--DIGYKLDKIFDDYENYREQAIRASGKFVIENYASAYKSIIL  337
            +V   ED  D+  K+D +  + E  R      + + +IE++    ++ IL
Sbjct  311  VVFKDEDSKDLKRKIDVLHANPE-LRNNLAHKAYQALIEDFDLTKRTKIL  359


>gb|AAL67552.1|AF461121_3 putative galactosyltransferase WbgM [Escherichia coli]
 dbj|BAG11846.1| putative galactosyltransferase [Escherichia coli O55:H7]
 dbj|BAG11960.1| putative galactosyltransferase WbgM [Escherichia coli O55:H6]
Length=364

 Score = 85.5 bits (210),  Expect = 8e-17, Method: Compositional matrix adjust.
 Identities = 92/348 (26%), Positives = 176/348 (50%), Gaps = 24/348 (6%)

Query  12   EIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGV--FVIPGIKK----YFDFL  65
            +I GAQ    +  + + +E    ++ S+EG L  +    GV   +IP + +    + D  
Sbjct  12   DISGAQRVSLDEMHTLSQEFQQSMVCSKEGRLAEQARCFGVCTHIIPTLTREISLFKDCA  71

Query  66   TLFKLRKILKENNISTLIASSANAGVYARLV-RLLVDFKCIYVSHGWSCLYNGGRL-KSI  123
            +LF+L KI+K+     +   S+  G   R+  +L    K ++  HG++      +L K I
Sbjct  72   SLFQLYKIIKKEKFDIVHTHSSKTGFLGRVAAKLAGTKKIVHTVHGFAFPSTENKLIKFI  131

Query  124  FCIVEKYLSLLTDVIWCVSKSDEKKAIEN-IGIKEPKIITVSNSVPQMPRCNNKQLQYK-  181
            + ++E   S  +++I  +++SDE+ A +  +  K+ K++ ++N++       +K      
Sbjct  132  YFLMELIASYCSNIIIVMNESDERIARKYFVKNKKSKLLLINNAIDVDKYNKDKDKDKDK  191

Query  182  ----VLFVGRLTHPKRPELLANVISKKPQYSLH--IVGGGE-RLESLKK--QFSECENIH  232
                ++ VGRL   K P LL   I K  + ++H  I+G G  +++ L+K  Q++  + + 
Sbjct  192  DIFKIVMVGRLCDQKNPLLLIEAI-KDLESNIHVDIIGDGPLKVKLLEKINQYNIADKVS  250

Query  233  FLGEVNNFYNY-HEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLV  291
            FLG ++    + ++YDLF L S  EG+P++ LEA  A +P+L SD+     LIE    +V
Sbjct  251  FLGWIDAVEEHLYKYDLFVLPSRWEGMPLAMLEAMAAKVPVLSSDIEANKYLIEKTAGVV  310

Query  292  ENTED--DIGYKLDKIFDDYENYREQAIRASGKFVIENYASAYKSIIL  337
               ED  D+  K++ +  + E  R      + + +IE++    ++ IL
Sbjct  311  FKDEDSKDLKRKINVLHANPE-LRNNLAHKAYQALIEDFDLTKRTKIL  357


>ref|YP_541302.1| putative galactosyltransferase WbgM [Escherichia coli UTI89]
 gb|ABE07771.1| putative galactosyltransferase WbgM [Escherichia coli UTI89]
Length=367

 Score = 77.4 bits (189),  Expect = 2e-14, Method: Compositional matrix adjust.
 Identities = 86/300 (28%), Positives = 138/300 (46%), Gaps = 23/300 (7%)

Query  14   GGAQTWVNEISNLIKEECNIFLITSEEGWLTHK--DVFAGVFVIPGIKKYF----DFLTL  67
            G  +  +NEIS L   + +  L+ S++G LT    +       IP + +      DF  L
Sbjct  19   GVQRVTLNEISALY-TDYDYTLVCSKKGPLTKALLEYDVDCHCIPELTREITVKNDFKAL  77

Query  68   FKLRKILKENNISTLIASSANAGVYARLVRLLVDF-KCIYVSHGWSCLYNGGRLKS--IF  124
            FKL K +K+     +   S+  G+  R+   L    K I+  HG+S      + KS  ++
Sbjct  78   FKLYKFIKKEKFDIVHTHSSKTGILGRVAAKLARVGKVIHTVHGFSFPAASSK-KSYYLY  136

Query  125  CIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVP--QMPRCNNK--QLQY  180
              +E      TD +  ++  DE  AI  +  K  K+  + N V   +     NK      
Sbjct  137  FFMEWIAKFFTDKLIVLNVDDEYIAINKLKFKRDKVFLIPNGVDTDKFSPLENKIYSSTL  196

Query  181  KVLFVGRLTHPKRPELL----ANVISKKPQYSLHIVGGGERLESLKKQFSECE-NIHFLG  235
             ++ VGRL+  K PE L      ++++     L +VG GE  E L+ +F   +  I F G
Sbjct  197  NLVMVGRLSKQKDPETLLLAVEKLLNENVNVKLTLVGDGELKEQLESRFKRQDGRIIFHG  256

Query  236  EVNNFYNYHEY-DLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEG--NGLLVE  292
              +N  N  +  DLF L S  EG+P++ LEA +  +P +++++ G   LIE   NG L E
Sbjct  257  WSDNIVNILKVNDLFILPSLWEGMPLAILEALSCGLPCIVTNIPGNNSLIEDGYNGCLFE  316


>gb|ACH97149.1| WclR [Escherichia coli]
Length=387

 Score = 75.1 bits (183),  Expect = 1e-13, Method: Compositional matrix adjust.
 Identities = 88/334 (26%), Positives = 143/334 (42%), Gaps = 44/334 (13%)

Query  14   GGAQTWVNEISNLIKEECNIFLITSEEGWLT-HKDVFAGVFVIPGIKKYF----DFLTLF  68
            G  +  + E   L  E+ +I LI  E G LT + D     F +P + +      D  +L 
Sbjct  21   GVQRVSLQEFELLPNEQFDINLICKESGPLTDYLDDSVRAFFVPTLCRNISLIKDMKSLI  80

Query  69   KLRKILKENNISTLIASSANAGVYARLVRLLVDFKCI-YVSHGWSCLYNGGRLKSIFCI-  126
             L K+LK+     +   S+  G+  R+   L    C+ +  HG++  +   + KS+  + 
Sbjct  81   SLYKLLKKEKYDIVHTHSSKTGILGRIAARLAGVPCVVHTVHGFA--FESTKRKSVKLVY  138

Query  127  --VEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQ---MPRCNNKQLQYK  181
              +E + +  T  + C+   D++  I+ + +   KI  + N V      P  N   L+ K
Sbjct  139  KWLEIFAAKCTTRLICLHNEDKEICIKELYVDPMKISVIPNGVDLEKFAPAINKGDLKEK  198

Query  182  VL----------FVGRLTHPKRPELLA---------NVISKKPQYSLHIVGGGERLESLK  222
            +L           VGRL   K P   A         N+I   P     IVG GE +  LK
Sbjct  199  ILGLKRNSFVFTMVGRLWPQKNPLYFAEAAKYIIENNLI---PDSVFVIVGDGELMNDLK  255

Query  223  KQFSECENIH----FLGEVNNFYN-YHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDV  277
              +    N+      LG  N+  N     D+F L S  EG+P++ LEA +  +P ++S++
Sbjct  256  YNYQTDMNLKKRLLLLGWRNDIPNILKASDVFVLPSLWEGMPLAILEAQSTGLPCIVSNI  315

Query  278  GG--CFELIEGNGLLVE-NTEDDIGYKLDKIFDD  308
             G  C    E +G L+E N  D     L ++ DD
Sbjct  316  NGNNCLVKNEFDGFLIELNDIDSFINALVRVTDD  349


>gb|AAD50490.1|AF172324_8 WbnE [Escherichia coli]
Length=392

 Score = 68.2 bits (165),  Expect = 1e-11, Method: Compositional matrix adjust.
 Identities = 81/328 (24%), Positives = 144/328 (43%), Gaps = 35/328 (10%)

Query  1    MKNVGFIVTKSEIGGAQ-TWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGV--FVIPG  57
            MK +  I     + G Q   + E+  L   E   +LI  EEG LT +    G+   V+  
Sbjct  1    MKKIAHIQLLPMLSGVQKVTLQELMILNDNEYTKYLICKEEGELTEECKRLGIKTHVVKD  60

Query  58   IKKYF----DFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCI-YVSHGWS  112
            + +      D + L+K+ K LK N+I  +    A  G   R+   L     I +  HG+ 
Sbjct  61   LTREINAVKDIIALYKIYKFLKANDIDIVHTHFAKTGFLGRVAAKLAGIPLIVHTVHGFP  120

Query  113  CLYNGGRLKSIFCIVEKYLSL-LTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVP---  168
                  +  + F  V +++S    D+I C+   D++   + + + E K++ + N +    
Sbjct  121  FDSAKNKYIAFFYKVLEFISARFADIIICLHDGDKETCKKLLHVPESKVLVLPNGIAFTE  180

Query  169  --QMPRCNNKQLQYKV---------LFVGRLTHPKRPELLAN----VISKKPQYSL--HI  211
              ++  C+ K+ +  +           VGRL   K P LL N    VI++ P   +   +
Sbjct  181  FFRLSECDKKKARTILGIPESSLVFTMVGRLWEQKNPLLLINAAKEVINEYPSDDIIFLL  240

Query  212  VGGG---ERLESLKKQFSECENIHFLGEVNNFYNYHE-YDLFSLISDSEGLPMSGLEAHT  267
            +G G   + +E + ++      I  LG   +  +     D+F L S  EG+P++ LEA  
Sbjct  241  IGDGFLRKEIERIAEREIYHNKIVLLGWRKDIPDILSCSDVFVLPSRWEGMPLAILEAQA  300

Query  268  AAIPLLLSDVGGCFELIEG--NGLLVEN  293
              +P ++SD+ G   L++   NG L E+
Sbjct  301  TGLPCIVSDIPGNNNLVKDGVNGYLFES  328


>gb|ACD37115.1| WffO [Escherichia coli]
Length=389

 Score = 63.9 bits (154),  Expect = 2e-10, Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 137/314 (43%), Gaps = 37/314 (11%)

Query  14   GGAQTWVNEISNLIKEECNIFLITSEEGWLT---HKDVFAGVFVIP----GIKKYFDFLT  66
            G  +  + EI NL  E   I LI  E G L    +K V    F IP     I    D  +
Sbjct  21   GVQRVSLQEIENLPPEYFEIDLICKEGGPLVDALNKKVRK--FFIPTLCRNISPVEDLKS  78

Query  67   LFKLRKILKENNISTLIASSANAGVYARLVRLLVDFKCI-YVSHGWSCLYNGGR-LKSIF  124
            L  L KI K      +   S+  G+  R+   +    C+ +  HG++      + +K+++
Sbjct  79   LISLYKIFKRERYDIVHTHSSKTGILGRIAARMARVPCVVHTVHGFAFESTKKQAIKNLY  138

Query  125  CIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQ---MPRCNNKQLQYK  181
              +E   +  +  I C+ + D+   +  + IK  KI+ + N V      P  N  +++ +
Sbjct  139  KWLEMIGAKCSTKIICLHEEDKNICLNILKIKADKIVVIPNGVDINKFTPATNKGKIKEE  198

Query  182  VL----------FVGRLTHPKRP----ELLANVISKK--PQYSLHIVGGGERLESLKKQF  225
            +L           VGRL   K P    E+   +I  +  P     +VG GE +  +K+ +
Sbjct  199  ILSLRESNFVFTMVGRLWPQKNPLYFVEVAKQIIKNELIPGSIFVLVGDGELMSVIKEHY  258

Query  226  SECENIHFLGEVNNFYN-----YHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGC  280
             E E +H +  +  + N         D+F L S  EG+P++ LEA +  +P ++S++ G 
Sbjct  259  LEDELLHNILLLLGWRNDISDSLKARDVFVLPSLWEGMPLAILEAQSTGLPWVVSNINGN  318

Query  281  FELIEG--NGLLVE  292
              L+    +G LVE
Sbjct  319  KSLVTNKFDGYLVE  332


>ref|YP_002293579.1| putative glycosyl transferase [Escherichia coli SE11]
 dbj|BAG77828.1| putative glycosyl transferase [Escherichia coli SE11]
Length=357

 Score = 55.1 bits (131),  Expect = 1e-07, Method: Compositional matrix adjust.
 Identities = 52/158 (32%), Positives = 77/158 (48%), Gaps = 10/158 (6%)

Query  173  CNNKQLQ-YKVLFVGRLTHPKRPELLANVI---SKKPQYSLHIVGGGERLESLKK--QFS  226
            C    L+ +K++ VGRL + K  +LL       SK   +SL I G G   + L++  QF+
Sbjct  177  CGESTLRNHKIIAVGRLEYQKGFDLLIQAFARASKDTDWSLDIYGDGTLRKELEEIIQFN  236

Query  227  ECENIHFLGEVNNFYN-YHEYDLFSLISDSEGLPMSGLEAHTAAIPLL-LSDVGGCFELI  284
            E  NI+ LG V+N    Y +Y LF   S  EG  M  LEA  A +P +  +   G  E+ 
Sbjct  237  EISNINLLGNVSNIDEIYKDYSLFVFSSRFEGFGMVLLEAMRAGLPCISFNCPTGPAEIF  296

Query  285  EGN--GLLVENTEDDIGYKLDKIFDDYENYREQAIRAS  320
            +    G+LV+N   D    + K+F D    R +  + S
Sbjct  297  DNGEYGILVDNGNIDELSNVMKMFMDSFELRSKFSKLS  334


>gb|AAC45847.1| putative GlcNAc transferase [Escherichia coli]
Length=349

 Score = 53.9 bits (128),  Expect = 3e-07, Method: Compositional matrix adjust.
 Identities = 65/237 (27%), Positives = 104/237 (43%), Gaps = 30/237 (12%)

Query  109  HGWSCLYNGGR------LKSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIIT  162
            HG    Y  GR      +K++F +     SLL   I C S+   K   E+  I   K+I 
Sbjct  103  HGGDVKYLKGRSFIFHKIKNVFTVTLFKHSLL---ILCPSQQYAKYLCEHYNINISKVIV  159

Query  163  VSNSVPQMPRCNNKQLQY-------KVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGG  215
              +    + +C   +  +       K+ F GRL   K  +L+ N I +     L IVG G
Sbjct  160  YPSG--GVKKCFKYETFFPVHDESVKIGFAGRLVKSKNVDLIINAIKQLKNVQLSIVGDG  217

Query  216  ERLESLKKQFSECENIHFLGEVNNFYN-----YHEYDLFSLISDSEGLPMSGLEAHTAAI  270
            E+ + L K   +C+ I FLG +N  +N     Y   ++    S+SE L +  LEA ++ +
Sbjct  218  EQKDYLYKLAKDCD-IEFLGPMN--HNELARWYKTINVLVYPSESESLGLVPLEAMSSGV  274

Query  271  PLLLSDVGGCFELIEGNGLLVENTEDDIGYKLDKIFDDYENYREQAIRASGKFVIEN  327
              +LS +   +E I  +GL     E+   Y    I  + EN+++  I    K +  N
Sbjct  275  YCILSKIPAFYE-IRQHGLTFSFIEN---YDSQSIAREIENFQKINITHLNKILKRN  327


>gb|AAZ20762.1| glycosyltransferase [Escherichia coli]
Length=358

 Score = 52.8 bits (125),  Expect = 5e-07, Method: Compositional matrix adjust.
 Identities = 45/136 (33%), Positives = 71/136 (52%), Gaps = 11/136 (8%)

Query  168  PQMPRCNNKQLQYKVLFVGRLTHPKRPELL----ANVISKKP-QYSLHIVGGGERLESLK  222
            P++P   +KQ    +L VGRLT  K+  LL     N+I++ P  ++LHIVG GE    LK
Sbjct  174  PELPYEMHKQNSKTILCVGRLTADKQHLLLLKMWKNIINEIPFGWTLHIVGDGELKPILK  233

Query  223  KQFSE---CENIHFLGEVNNFYNYH-EYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDV-  277
               +E    +++       N   Y+ +   F+L S SEG  M  LEA +  +P++  D  
Sbjct  234  DFINENGLSQSVRLSDSTKNISKYYIDSSFFALTSKSEGFGMVILEALSFGLPIISFDCP  293

Query  278  GGCFELI-EGNGLLVE  292
             G  ++I + NG L++
Sbjct  294  SGPRDMINDNNGFLIQ  309


>gb|ACA24903.1| WfgN [Escherichia coli]
Length=408

 Score = 52.4 bits (124),  Expect = 7e-07, Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 72/132 (54%), Gaps = 9/132 (6%)

Query  192  KRPELLANVISKKPQYSLHIVGGGER--LESLKKQFSECENIHFLGEVNNFYNYHEY---  246
            K  ++L+N  ++K   +LHI+G G++   ++L  + +  ENI+F+G +++     +Y   
Sbjct  239  KAVKILSNSTNEK--ITLHIIGPGDKKKYQNLASRLNLLENINFVGSLSDSKAVQKYLNE  296

Query  247  --DLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLVENTEDDIGYKLDK  304
              D++   S  EG+P + LEA +  IP ++S  GG  E+I  + +  +     + + + K
Sbjct  297  YIDIYIQPSYQEGMPRAVLEAMSCGIPCIVSCAGGMPEIISSDYVHAKGDYKQLAHLIKK  356

Query  305  IFDDYENYREQA  316
            I    + YR+++
Sbjct  357  ISSSEKIYRQES  368


>gb|ACA24823.1| WfgE [Escherichia coli]
Length=356

 Score = 50.8 bits (120),  Expect = 2e-06, Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 61/123 (49%), Gaps = 10/123 (8%)

Query  181  KVLFVGRLTHPKRPELLANVISK----KPQYSLHIVGGG--ERLESLKKQFSECENIHFL  234
            K++ VGRL H K  +LL ++ +K     P + LHI G G  E+  + K       N+  +
Sbjct  186  KIMAVGRLEHQKGFDLLIDIFAKVNKSNPGWELHIYGVGTCEKFLTDKINKHGLNNVKLM  245

Query  235  GEVNNFYNYH-EYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGN---GLL  290
            G V++   Y+ +Y +F+  S  EG  M  LEA    +P +  D       I G+   G+L
Sbjct  246  GSVDHIQQYYPKYSIFAFSSRFEGFGMVLLEAMECGLPCISFDCPTGPSEILGDGEYGIL  305

Query  291  VEN  293
            VEN
Sbjct  306  VEN  308


>ref|ZP_03033404.1| Cps2D [Escherichia coli F11]
 gb|EDV67486.1| Cps2D [Escherichia coli F11]
Length=1266

 Score = 50.1 bits (118),  Expect = 4e-06, Method: Compositional matrix adjust.
 Identities = 41/127 (32%), Positives = 63/127 (49%), Gaps = 11/127 (8%)

Query  181   KVLFV--GRLTHPKRPELLANVISK----KPQYSLHIVGGGERLESLKKQFSEC---ENI  231
             K+ F+  GRL+  K  + L N   +     P   L I+G G     L++Q       +++
Sbjct  1102  KIYFITLGRLSVEKDQQKLINAFCRLQKLYPNIELLILGDGPLKIDLQRQIKTLGLEKSV  1161

Query  232   HFLGEVNN-FYNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEG-NGL  289
             H LG ++N F      D F L S+ EG PM   EA     P++ +D+ G    +EG +G+
Sbjct  1162  HLLGRISNPFPLLKRADCFVLSSNHEGQPMVLFEAMILDKPIISTDITGSRSALEGRSGV  1221

Query  290   LVENTED  296
             LVEN+ D
Sbjct  1222  LVENSVD  1228


>gb|AAD50487.1|AF172324_5 WbnB [Escherichia coli]
Length=350

 Score = 49.3 bits (116),  Expect = 6e-06, Method: Compositional matrix adjust.
 Identities = 55/204 (26%), Positives = 99/204 (48%), Gaps = 21/204 (10%)

Query  88   NAGVYARLVRLLVDFKCIYVSHGWSCLYNGGRLKSIF-----CIVEKYLSLLTDVIWCVS  142
            ++G+Y  + ++   FK I + H  S   +  R  SIF      I++  ++ L+D    VS
Sbjct  94   SSGIYLFISKIF--FKKINIVHSHSDRRSIDRRSSIFKKIYIFIMKFLINRLSDYKIAVS  151

Query  143  KSDEKKAIENIGIKE----PKIITVSNSVPQMPRCNNKQLQYKVLFVGRLTHPKRPELLA  198
            +   K       I      P I+    S+P + + ++ +  +K+  +GR +  K    + 
Sbjct  152  ERAGKSLFYGSFITHYCGVPDIML---SLPDIKKVSSSE--FKIYHIGRNSDAKNYPFIF  206

Query  199  NVISKKPQY-SLHIVGGGERLESLKKQFSE--CENIHFLGEVNN--FYNYHEYDLFSLIS  253
            ++     +Y ++HI   G  LE L+K+  E   +N+HFLG + N   + Y   ++F + S
Sbjct  207  SIAHSLREYENVHIYCMGAGLELLQKKSQEENLKNMHFLGFIENPLSHIYIHANVFIMPS  266

Query  254  DSEGLPMSGLEAHTAAIPLLLSDV  277
              EGLP+S +EA    +P L+SD 
Sbjct  267  LWEGLPLSVVEAQKCNVPCLVSDT  290


>ref|NP_755569.1| hypothetical protein c3694 [Escherichia coli CFT073]
 gb|AAN82142.1|AE016766_230 Hypothetical protein c3694 [Escherichia coli CFT073]
Length=1266

 Score = 49.3 bits (116),  Expect = 6e-06, Method: Compositional matrix adjust.
 Identities = 41/127 (32%), Positives = 63/127 (49%), Gaps = 11/127 (8%)

Query  181   KVLFV--GRLTHPKRPELLANVISK----KPQYSLHIVGGGERLESLKKQFSEC---ENI  231
             K+ F+  GRL+  K  + L N   +     P   L I+G G     L++Q       +++
Sbjct  1102  KIYFITLGRLSVEKDQQKLINAFCRLQKLYPNIELLILGDGPLKIDLQRQIITLGLEKSV  1161

Query  232   HFLGEVNN-FYNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEG-NGL  289
             H LG ++N F      D F L S+ EG PM   EA     P++ +D+ G    +EG +G+
Sbjct  1162  HLLGRISNPFPLLKRADCFVLSSNHEGQPMVLFEAMILDKPIISTDITGSRSALEGRSGV  1221

Query  290   LVENTED  296
             LVEN+ D
Sbjct  1222  LVENSVD  1228


>gb|ABB29908.1| WfaO [Escherichia coli]
Length=356

 Score = 48.9 bits (115),  Expect = 8e-06, Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 62/123 (50%), Gaps = 10/123 (8%)

Query  181  KVLFVGRLTHPKRPELLANVISK----KPQYSLHIVGGGERLESLKKQFSE--CENIHFL  234
            K++ VGRL + K  ++L ++ ++     P + LHI G G   E L+ + ++    NI  +
Sbjct  186  KIIAVGRLEYQKGFDILIDIFARVNKEHPGWELHIYGVGTCEEFLRDKINQYKLNNIKLM  245

Query  235  GEVNNFYNYH-EYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGN---GLL  290
            G V+N   Y+ +Y +F   S  EG  M  LEA    +P +  D       I GN   G+L
Sbjct  246  GCVDNIQLYYPKYSVFVFSSRFEGFGMVLLEAMECGLPCISFDCPTGPSEILGNGQYGIL  305

Query  291  VEN  293
            VEN
Sbjct  306  VEN  308


>gb|ACA24850.1| WffZ [Escherichia coli]
Length=351

 Score = 45.1 bits (105),  Expect = 1e-04, Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 72/145 (49%), Gaps = 15/145 (10%)

Query  181  KVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGGERLESLKKQFSE--CENIHFLGEVN  238
            + ++VGR++  K  + +  V    P Y L ++G G     LKKQF +    NI FLG ++
Sbjct  193  RFIYVGRISSEKNIDFMVKVFKTLP-YELILIGDG----PLKKQFDDKTYSNIRFLGYID  247

Query  239  NFYNYHEY---DLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELI--EGNGLL--V  291
            N     E    D F L S SE   +   EA T  +P+++S+  GC   +  + NG++  V
Sbjct  248  NKKLSKELLKSDCFILPSLSEPWGLVVEEALTLGLPVIVSNHVGCHSDLVNDRNGIIFDV  307

Query  292  ENTEDDIGYKLDKIFDDYENYREQA  316
             +T+  I   L K+  +YE +   A
Sbjct  308  NDTQSFID-ALSKMEKNYERFARGA  331


>gb|ACA24849.1| WffY [Escherichia coli]
Length=366

 Score = 44.3 bits (103),  Expect = 2e-04, Method: Compositional matrix adjust.
 Identities = 49/144 (34%), Positives = 67/144 (46%), Gaps = 17/144 (11%)

Query  158  PKIITV-SNSVPQMPRCNNKQLQYKVLFVGRLTHPKRPELL--ANVISKKPQYSLHIVGG  214
            P II+  +  +PQ     NK  Q  VL VGRLTH K  +LL  A   +    + L I+G 
Sbjct  177  PNIISFEATDIPQ-----NKIEQKNVLAVGRLTHQKGFDLLLQAWADANTHDWRLKIIGD  231

Query  215  GERLESLKKQFSE-----CENIHFLGEVNNFYNYHEYDLFSLISDSEGLPMSGLEAHTAA  269
            GE L  L    +E      E I F  ++    +Y    +F L S  EGL M  LEA ++ 
Sbjct  232  GEELNHLNSLITELNISNAEIIPFQKDIQR--HYSSAGIFVLSSRFEGLGMVLLEALSSG  289

Query  270  IPLLLSDV-GGCFELIEG-NGLLV  291
            +  +  D   G   +I   NG+LV
Sbjct  290  LACISFDCPAGPKSIISSDNGVLV  313


>ref|YP_002403325.1| WbwB [Escherichia coli 55989]
 emb|CAU98164.1| WbwB [Escherichia coli 55989]
Length=407

 Score = 43.5 bits (101),  Expect = 3e-04, Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 68/140 (48%), Gaps = 17/140 (12%)

Query  154  GIKEPKIITVSNSVPQMPRCNNKQLQYK---------VLFVGRLTHPKRPE----LLANV  200
            G  + +IIT+ N         N ++Q +         ++ V  LT  KR +     +  +
Sbjct  196  GFNDKEIITIYNPFNFTELEGNSRIQCEGNIPLPKEFIVTVSTLTDRKRVDRTIKAMPKI  255

Query  201  ISKKPQYSLHIVGGGE---RLESLKKQFSECENIHFLG-EVNNFYNYHEYDLFSLISDSE  256
            I +  +  L I+G G+    L++L K+ +  + +HFLG + N +Y  ++  L  L SDSE
Sbjct  256  IREYGEIDLLIIGEGQLRNDLQNLVKELNIEKYVHFLGFQTNPYYFINKAQLLILSSDSE  315

Query  257  GLPMSGLEAHTAAIPLLLSD  276
            GLP   +E+     P+L +D
Sbjct  316  GLPTVIIESLILGTPVLSTD  335


>gb|AAK64373.1|AF361371_8 WbwB [Escherichia coli]
Length=407

 Score = 43.5 bits (101),  Expect = 4e-04, Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 68/140 (48%), Gaps = 17/140 (12%)

Query  154  GIKEPKIITVSNSVPQMPRCNNKQLQYK---------VLFVGRLTHPKRPE----LLANV  200
            G  + +IIT+ N         N ++Q +         ++ V  LT  KR +     +  +
Sbjct  196  GFNDKEIITIYNPFNFTELEGNSRIQCEGNIPLPKEFIVTVSTLTDRKRVDRTIKAMPKI  255

Query  201  ISKKPQYSLHIVGGGE---RLESLKKQFSECENIHFLG-EVNNFYNYHEYDLFSLISDSE  256
            I +  +  L I+G G+    L++L K+ +  + +HFLG + N +Y  ++  L  L SDSE
Sbjct  256  IREYGEIDLLIIGEGQLRNDLQNLVKELNIEKYVHFLGFQTNPYYFINKAQLLILSSDSE  315

Query  257  GLPMSGLEAHTAAIPLLLSD  276
            GLP   +E+     P+L +D
Sbjct  316  GLPTVIIESLILGTPVLSTD  335


>ref|ZP_03061297.1| glycosyl transferase, group 1 family protein [Escherichia coli 
B171]
 gb|AAD46732.1|AF078736_12 putative glycosyl transferase [Escherichia coli]
 gb|EDX29494.1| glycosyl transferase, group 1 family protein [Escherichia coli 
B171]
Length=374

 Score = 42.7 bits (99),  Expect = 5e-04, Method: Compositional matrix adjust.
 Identities = 61/247 (24%), Positives = 101/247 (40%), Gaps = 32/247 (12%)

Query  64   FLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVD-FKCIYVSHGWSCLYNGGRLKS  122
            F  LF+++KI+       + +   +A +++R +R+L+     I  +H      N G    
Sbjct  65   FRALFQVKKIIVALKPDIIHSHMFHANIFSRFIRMLIPAVPLICTAHN----KNEGGNAR  120

Query  123  IFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSN-----------SVPQMP  171
            +FC   +    L  +   VSK   ++ I      + KI+ + N           +V +  
Sbjct  121  MFCY--RLSDFLASITTNVSKEAVQEFIARKATPKNKIVEIPNFINTNKFDFDINVRKKT  178

Query  172  R--CNNKQLQYKVLFVGRLTHPKRPELLANVI--------SKKPQYSLHIVGGG---ERL  218
            R   N K     +L VGRL   K    L N I        S    + L I G G    +L
Sbjct  179  RDAFNLKDSTAVLLAVGRLVEAKDYPNLLNAINHLILSKTSNCNDFILLIAGDGALRNKL  238

Query  219  ESLKKQFSECENIHFLGEVNNFYNYH-EYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDV  277
              L  Q +  + + FLG+ ++        DLF L S+ EG  +   EA     P++ +D 
Sbjct  239  LDLVCQLNLVDKVFFLGQRSDIKELMCAADLFVLSSEWEGFGLVVAEAMACERPVVATDS  298

Query  278  GGCFELI  284
            GG  E++
Sbjct  299  GGVKEVV  305


>ref|ZP_03029511.1| WbbG [Escherichia coli B7A]
 gb|ABA42235.1| WbbG [Escherichia coli]
 gb|EDV61957.1| WbbG [Escherichia coli B7A]
Length=363

 Score = 42.7 bits (99),  Expect = 6e-04, Method: Compositional matrix adjust.
 Identities = 80/318 (25%), Positives = 126/318 (39%), Gaps = 43/318 (13%)

Query  47   DVFAGVFVIPGIKK---YFDFLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVDFK  103
            D+   V  IP +K+   + DF     L    K+     +  +S   G+ AR+   L   K
Sbjct  55   DIGVRVITIPTLKRNIGWHDFRCFIDLYNFFKKEKFDIVHTNSTKPGIIARIAARLAGTK  114

Query  104  C-IYVSHGWSCLYNGGRLKSIF--CIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEP--  158
              I+  HG +       ++ IF  C+ E + +L   +   V+        EN     P  
Sbjct  115  LIIHTVHGIAFHRKENTVRKIFYYCL-ENFATLFGSINVTVN--------ENYLKYYPFV  165

Query  159  KIITVSNSVPQMPRCNNKQLQ--YKVLFVGRLTHPKRP-ELL--ANVISKK---PQYSLH  210
            K   + N V     C NK+      + F+ RL   K P E +   N+I KK    +    
Sbjct  166  KSHIIYNGVDFNVLCCNKKDHDFLHIAFMARLDKQKNPLEFIRAVNIIKKKLPNERLKFT  225

Query  211  IVGGGERLESLKK---QFSECENIHFLGEV---NNFYNYHEYDLFSLISDSEGLPMSGLE  264
            + G GE     KK    F   + I   G +   N FYN    D+    S+ E   +  +E
Sbjct  226  LAGCGELENECKKLIEHFHLTDVIDMPGWIVDKNTFYN--SVDIICQPSNWEAFGLVFVE  283

Query  265  AHTAAIPLLLSDVGGCFELIEGN--GLLVENTEDDIGYKLDKIFDDYE-------NYREQ  315
            A    IP +  ++ G  E+I  N  GLL E  E ++  KL  +  D +       N +E 
Sbjct  284  AAFFEIPSVSRNIEGIPEVILDNETGLLYEGGEAELSEKLISLIHDKKKISWLGLNAKEY  343

Query  316  AIRASGK-FVIENYASAY  332
             ++   K  ++E Y+  Y
Sbjct  344  VLKHFTKDIMVEKYSKLY  361


>ref|ZP_03052481.1| hypothetical protein EcE110019_3652 [Escherichia coli E110019]
 gb|EDV85605.1| hypothetical protein EcE110019_3652 [Escherichia coli E110019]
Length=374

 Score = 42.7 bits (99),  Expect = 6e-04, Method: Compositional matrix adjust.
 Identities = 61/247 (24%), Positives = 101/247 (40%), Gaps = 32/247 (12%)

Query  64   FLTLFKLRKILKENNISTLIASSANAGVYARLVRLLVD-FKCIYVSHGWSCLYNGGRLKS  122
            F  LF+++KI+       + +   +A +++R +R+L+     I  +H      N G    
Sbjct  65   FRALFQVKKIIVALKPDIIHSHMFHANIFSRFIRMLIPAVPLICTAHN----KNEGGNAR  120

Query  123  IFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSN-----------SVPQMP  171
            +FC   +    L  +   VSK   ++ I      + KI+ + N           +V +  
Sbjct  121  MFCY--RLSDFLASITTNVSKEAVQEFIARKATPKNKIVEIPNFINTNKFNFDINVRKKT  178

Query  172  R--CNNKQLQYKVLFVGRLTHPKRPELLANVI--------SKKPQYSLHIVGGG---ERL  218
            R   N K     +L VGRL   K    L N I        S    + L I G G    +L
Sbjct  179  RDAFNLKDSTAVLLAVGRLVEAKDYPNLLNAINHLILSKTSNCNDFILLIAGDGALRNKL  238

Query  219  ESLKKQFSECENIHFLGEVNNFYNYH-EYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDV  277
              L  Q +  + + FLG+ ++        DLF L S+ EG  +   EA     P++ +D 
Sbjct  239  LDLVCQLNLVDKVFFLGQRSDIKELMCAADLFVLSSEWEGFGLVVAEAMACERPVVATDS  298

Query  278  GGCFELI  284
            GG  E++
Sbjct  299  GGVKEVV  305


>gb|ABB29914.1| WfaQ [Escherichia coli]
Length=362

 Score = 42.4 bits (98),  Expect = 8e-04, Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 61/123 (49%), Gaps = 10/123 (8%)

Query  181  KVLFVGRLTHPKRPELLANVIS----KKPQYSLHIVGGGERLESLKKQFSE--CENIHFL  234
            +++ VGRL   K  ++L    S    K P++ L I G G   E+L+K  S+   +N++ +
Sbjct  186  RIISVGRLEKQKGFDMLLKAFSYISYKYPEWQLDIFGKGNEEENLRKLISKLNLKNVNLM  245

Query  235  GEVNNFYNYHEYDLFSLISDS-EGLPMSGLEAHTAAIPLLL--SDVGGCFELIEG-NGLL  290
                N +  +    F ++S   EG PM  LEA  + +P +    + G    +I+  NG L
Sbjct  246  RTSKNIHQEYLSSAFYVMSSRYEGFPMVLLEAMASGLPCISFNCETGPADIIIDNENGFL  305

Query  291  VEN  293
            +E+
Sbjct  306  IEH  308


>ref|YP_001726129.1| glycosyl transferase group 1 [Escherichia coli ATCC 8739]
 gb|ACA78802.1| glycosyl transferase group 1 [Escherichia coli ATCC 8739]
Length=403

 Score = 40.8 bits (94),  Expect = 0.002, Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 93/225 (41%), Gaps = 32/225 (14%)

Query  134  LTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCNNKQLQYKVLFVGRLTHPKR  193
            + DVI  +S S+  K I    +   +I  + N V   P  +    +  +L+VGRL+  K 
Sbjct  186  MLDVI--ISPSEFLKGILRRKLPHSRIDVIVNGVDDDPATDKTADKGYLLYVGRLSREKG  243

Query  194  PELLANVISK-KPQYSLHIVGGGERLESLKKQFSECENIHFLGEVNNFYNYHEYDLFSLI  252
               L     K + +  L +VG G   + L   + + E   FLG     Y      L +LI
Sbjct  244  VATLPLAHQKMRNRAPLKVVGHGPLYDELVANYPDVE---FLG-----YVQQGEALNTLI  295

Query  253  SDSEG--LP--------MSGLEAHTAAIPLLLSDVGGCFELIEG--NGLLVE-NTEDDIG  299
             ++    LP        MS LEA + A P++ S +GG  E I    +G+L E     D+ 
Sbjct  296  KEARAVILPSECYENCSMSVLEAMSFAKPVIGSRIGGIPEQIRDGIDGVLFEPGNVQDLA  355

Query  300  YKLDKIFDDYENYREQAIRASGKFV--------IENYASAYKSII  336
              +D + D  E  R   + A  +          +E   + YK I+
Sbjct  356  NAMDYMIDSPEKARVMGLSARERLREKYTLQKHMETLTALYKEIL  400


>gb|ACD37088.1| WfeG [Escherichia coli]
Length=393

 Score = 39.3 bits (90),  Expect = 0.007, Method: Compositional matrix adjust.
 Identities = 59/259 (22%), Positives = 105/259 (40%), Gaps = 30/259 (11%)

Query  64   FLTLFKLRKILKENNISTLIASSANAGV-----YARLVRLLVDFKCIYVSHGWSCLYNGG  118
            F  L   RKI++ENNI   I+      +       +  +++     + + +    LY  G
Sbjct  69   FYRLHLARKIIRENNIDICISFGERCNIINILSMGKTKKIITIHSQLSIENKTKGLY--G  126

Query  119  RLKSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKIITVSNSVPQMPRCNNKQL  178
            ++ ++F    K L    D    VS+  +K A   + +    +  + N    +    +K  
Sbjct  127  KVTTLF---SKLLYKNADATVAVSEIVKKDACGLLNLDANNVEIIYNG-HDIGYIKDKST  182

Query  179  QYK--------VLFVGRLTHPKRPELL----ANVISKKPQYSLHIVGGGER------LES  220
             YK         + VGR+T+ K    L    A V    P   L+IVG  E+      ++ 
Sbjct  183  DYKEFDTPVIDFVSVGRITYAKGHYHLLRSPAIVKETYPNVILYIVGTYEKDNLKSIIDH  242

Query  221  LKKQFSECENIHFLGEVNNFYNY-HEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGG  279
            L +++   +N+ F G  +N Y Y        L S  EG P   +E+     P++ +D GG
Sbjct  243  LIEKYDLYDNVIFTGFSDNPYPYIKSAKALILSSIFEGFPGVVIESIALGTPVIATDCGG  302

Query  280  CFELIEGNGLLVENTEDDI  298
              E++      ++N   D+
Sbjct  303  ASEVLRSPDAKIKNNTGDV  321


>gb|AAD21569.1| glycosyltransferase WcaO [Escherichia coli]
Length=393

 Score = 38.9 bits (89),  Expect = 0.008, Method: Compositional matrix adjust.
 Identities = 54/194 (27%), Positives = 86/194 (44%), Gaps = 26/194 (13%)

Query  147  KKAIENIGIKEP-----KIITVSNSVPQMPRCNNKQLQYKVLFVGRLTHPKRP--ELLAN  199
            K  + N GI +P       ++++N+  Q+   N+  +   +++VGRLT P++   E +  
Sbjct  176  KSTLSNYGIIKPISVIRNPVSIANTERQI-LLNDGSIH--IVYVGRLT-PEKGIVEFIKK  231

Query  200  VISKKPQ-YSLHIVGGGERLESLKK-QFSECENIHFLGEVNN---FYNYHEYDLFSLISD  254
            V  +  Q   LHI G GE  E +K  +  E   I F G ++         +Y +F L S 
Sbjct  232  VNHETTQAIHLHIYGAGESAEEIKSIKCREGFKILFHGFIDRDLLITEISKYHIFVLPSI  291

Query  255  -SEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLVENTEDDIGYKLDKIFDDYENYR  313
              E  P+S +EA  A +P+++ + GG  E+ E         E    YK D    D     
Sbjct  292  WLENAPVSIVEAAEAGLPVVVPNYGGLAEMAE---------ETLYNYKFDYEDSDLSEVI  342

Query  314  EQAIRASGKFVIEN  327
             QA    GK  + N
Sbjct  343  TQAADKKGKNKLNN  356



  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
    Posted date:  Jun 16, 2009  5:41 PM
  Number of letters in database: 26,573,871
  Number of sequences in database:  84,272

Lambda     K      H
   0.320    0.139    0.412 
Gapped
Lambda     K      H
   0.267   0.0410    0.140 
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 84272
Number of Hits to DB: 4318030
Number of extensions: 179018
Number of successful extensions: 406
Number of sequences better than 0.1: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's gapped: 410
Number of HSP's successfully gapped: 0
Length of query: 338
Length of database: 26573871
Length adjustment: 104
Effective length of query: 234
Effective length of database: 17809583
Effective search space: 4167442422
Effective search space used: 4167442422
T: 11
A: 40
X1: 16 (7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 80 (35.4 bits)