Reformat these Results Edit and Resubmit [Sign in above to save your search strategy]

Job Title: gb|AAV74378| (347 letters)

BLASTP 2.2.21+
Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro
A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and
David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs", Nucleic
Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Stephen
F. Altschul, John C. Wootton, E. Michael Gertz, Richa
Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and
Yi-Kuo Yu (2005) "Protein database searches using
compositionally adjusted substitution matrices", FEBS J.
272:5101-5109.


RID: 3H9DWTF901S


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           9,079,606 sequences; 3,109,523,384 total letters
Query= gi|56122506|gb|AAV74378.1| glycosyltransferase [Escherichia coli]
Length=347


                                                                   Score     E
Sequences producing significant alignments:                       (Bits)  Value

gb|AAV74378.1|  glycosyltransferase [Escherichia coli]              718    0.0  
ref|NP_754447.1|  hypothetical protein c2559 [Escherichia coli...   266    3e-71
ref|YP_669975.1|  putative glycosyltransferase [Escherichia co...   266    4e-71
emb|CAD19797.1|  putative glycosyltransferase [Escherichia coli]    265    5e-71
ref|YP_002413092.1|  Mannosyl transferase wbaD [Escherichia co...   261    7e-70

ALIGNMENTS
>gb|AAV74378.1| glycosyltransferase [Escherichia coli]
Length=347

 Score =  718 bits (1853),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 347/347 (100%), Positives = 347/347 (100%), Gaps = 0/347 (0%)

Query  1    MTYPIPDDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHD  60
            MTYPIPDDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHD
Sbjct  1    MTYPIPDDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHD  60

Query  61   LLLPHLIRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYE  120
            LLLPHLIRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYE
Sbjct  61   LLLPHLIRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYE  120

Query  121  KSIAKYFDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANFKNTLFKRSYKLIFIGN  180
            KSIAKYFDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANFKNTLFKRSYKLIFIGN
Sbjct  121  KSIAKYFDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANFKNTLFKRSYKLIFIGN  180

Query  181  MFSVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMD  240
            MFSVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMD
Sbjct  181  MFSVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMD  240

Query  241  YANNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFV  300
            YANNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFV
Sbjct  241  YANNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFV  300

Query  301  STILKLFNDPSFGKTISKNGLGYVQQNHSWSEKLQPLIQVINNLIEE  347
            STILKLFNDPSFGKTISKNGLGYVQQNHSWSEKLQPLIQVINNLIEE
Sbjct  301  STILKLFNDPSFGKTISKNGLGYVQQNHSWSEKLQPLIQVINNLIEE  347


>ref|NP_754447.1| hypothetical protein c2559 [Escherichia coli CFT073]
 gb|AAN81014.1|AE016762_267 Hypothetical protein c2559 [Escherichia coli CFT073]
Length=371

 Score =  266 bits (679),  Expect = 3e-71, Method: Compositional matrix adjust.
 Identities = 145/347 (41%), Positives = 208/347 (59%), Gaps = 11/347 (3%)

Query  7    DDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHDLLLPHL  66
            +D V+  ++R + P+  SY++ L AL T  PLQ+AYY S  FK   NKL+ + D +  HL
Sbjct  29   NDSVFKEIHRVYLPKYKSYYNVLKALVTQKPLQIAYYQSDTFKNKYNKLIKQCDAVFCHL  88

Query  67   IRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYEKSIAKY  126
            IRVA YVK     KIL+MTDAIS+NY RV KL +   ++ +IY +E+ RL  YE+S+A  
Sbjct  89   IRVADYVKDTDKFKILDMTDAISLNYSRVKKLASKKSLRAIIYSLEQKRLESYERSVANL  148

Query  127  FDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANFKNTLFKRSYK------LIFIGN  180
            FD T F+S  D++YL+ N     H   +  NGVD +  +    KR  K      LIFIGN
Sbjct  149  FDLTTFISSVDRDYLYPNPGSNIH---IVNNGVDTSALR--YIKREIKIDKPVELIFIGN  203

Query  181  MFSVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMD  240
            M+S+QN DAA  F +++LP L       F VIGKIS  N   L++++     G VD++  
Sbjct  204  MYSLQNMDAAKHFAKNILPCLYDEFNIIFKVIGKISETNKNILNSFKNTIALGTVDDINS  263

Query  241  YANNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFV  300
             A+    GIC VRL AGVQNKILEYMA+G+P IT+SIG EG+ A  G  I VA+T  ++ 
Sbjct  264  SASTGHIGICPVRLGAGVQNKILEYMALGLPCITSSIGYEGINAKSGSEIFVADTVEQYK  323

Query  301  STILKLFNDPSFGKTISKNGLGYVQQNHSWSEKLQPLIQVINNLIEE  347
            + + ++  D +    +++N   +V+ N SW  K+  L+  ++  + E
Sbjct  324  NVLREIIYDYNRYTEVAENARSFVENNFSWESKVANLMNTLDEKLYE  370


>ref|YP_669975.1| putative glycosyltransferase [Escherichia coli 536]
 ref|ZP_03031752.1| mannosyl transferase [Escherichia coli F11]
 gb|ABG70074.1| putative glycosyltransferase [Escherichia coli 536]
 gb|EDV69299.1| mannosyl transferase [Escherichia coli F11]
Length=401

 Score =  266 bits (679),  Expect = 4e-71, Method: Compositional matrix adjust.
 Identities = 145/347 (41%), Positives = 208/347 (59%), Gaps = 11/347 (3%)

Query  7    DDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHDLLLPHL  66
            +D V+  ++R + P+  SY++ L AL T  PLQ+AYY S  FK   NKL+ + D +  HL
Sbjct  59   NDSVFKEIHRVYLPKYKSYYNVLKALVTQKPLQIAYYQSDTFKNKYNKLIKQCDAVFCHL  118

Query  67   IRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYEKSIAKY  126
            IRVA YVK     KIL+MTDAIS+NY RV KL +   ++ +IY +E+ RL  YE+S+A  
Sbjct  119  IRVADYVKDTDKFKILDMTDAISLNYSRVKKLASKKSLRAIIYSLEQKRLESYERSVANL  178

Query  127  FDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANFKNTLFKRSYK------LIFIGN  180
            FD T F+S  D++YL+ N     H   +  NGVD +  +    KR  K      LIFIGN
Sbjct  179  FDLTTFISSVDRDYLYPNPGSNIH---IVNNGVDTSALR--YIKREIKIDKPVELIFIGN  233

Query  181  MFSVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMD  240
            M+S+QN DAA  F +++LP L       F VIGKIS  N   L++++     G VD++  
Sbjct  234  MYSLQNMDAAKHFAKNILPCLYDEFNIIFKVIGKISETNKNILNSFKNTIALGTVDDINS  293

Query  241  YANNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFV  300
             A+    GIC VRL AGVQNKILEYMA+G+P IT+SIG EG+ A  G  I VA+T  ++ 
Sbjct  294  SASTGHIGICPVRLGAGVQNKILEYMALGLPCITSSIGYEGINAKSGSEIFVADTVEQYK  353

Query  301  STILKLFNDPSFGKTISKNGLGYVQQNHSWSEKLQPLIQVINNLIEE  347
            + + ++  D +    +++N   +V+ N SW  K+  L+  ++  + E
Sbjct  354  NVLREIIYDYNRYTEVAENARSFVENNFSWESKVANLMNTLDEKLYE  400


>emb|CAD19797.1| putative glycosyltransferase [Escherichia coli]
Length=401

 Score =  265 bits (678),  Expect = 5e-71, Method: Compositional matrix adjust.
 Identities = 145/347 (41%), Positives = 208/347 (59%), Gaps = 11/347 (3%)

Query  7    DDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHDLLLPHL  66
            +D V+  ++R + P+  SY++ L AL T  PLQ+AYY S  FK   NKL+ + D +  HL
Sbjct  59   NDSVFKEIHRVYLPKYKSYYNVLKALVTQKPLQIAYYQSDTFKNKYNKLIKQCDAVFCHL  118

Query  67   IRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYEKSIAKY  126
            IRVA YVK     KIL+MTDAIS+NY RV KL +   ++ +IY +E+ RL  YE+S+A  
Sbjct  119  IRVADYVKDTDKFKILDMTDAISLNYSRVKKLASKKSLRAIIYSLEQKRLESYERSVANL  178

Query  127  FDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANFKNTLFKRSYK------LIFIGN  180
            FD T F+S  D++YL+ N     H   +  NGVD +  +    KR  K      LIFIGN
Sbjct  179  FDLTTFISSVDRDYLYPNPGSNIH---IVNNGVDTSALR--YIKREIKIDKPVELIFIGN  233

Query  181  MFSVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMD  240
            M+S+QN DAA  F +++LP L       F VIGKIS  N   L++++     G VD++  
Sbjct  234  MYSLQNMDAAKHFAKNILPCLYDEFNIIFKVIGKISETNKNILNSFKNTIALGTVDDINS  293

Query  241  YANNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFV  300
             A+    GIC VRL AGVQNKILEYMA+G+P IT+SIG EG+ A  G  I VA+T  ++ 
Sbjct  294  SASTGHIGICLVRLGAGVQNKILEYMALGLPCITSSIGYEGINAKSGSEIFVADTVEQYK  353

Query  301  STILKLFNDPSFGKTISKNGLGYVQQNHSWSEKLQPLIQVINNLIEE  347
            + + ++  D +    +++N   +V+ N SW  K+  L+  ++  + E
Sbjct  354  NVLREIIYDYNRYTEVAENARSFVENNFSWESKVANLMNTLDEKLYE  400


>ref|YP_002413092.1| Mannosyl transferase wbaD [Escherichia coli UMN026]
 gb|AAY23730.1| WbaD [Escherichia coli]
 gb|AAY23736.1| WbaD [Escherichia coli]
 gb|AAY23742.1| WbaD [Escherichia coli]
 emb|CAR13564.1| Mannosyl transferase wbaD [Escherichia coli UMN026]
Length=400

 Score =  261 bits (668),  Expect = 7e-70, Method: Compositional matrix adjust.
 Identities = 141/348 (40%), Positives = 211/348 (60%), Gaps = 15/348 (4%)

Query  7    DDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHDLLLPHL  66
            D  V++SV+R +  +  S  + + +L +N PLQ+ YY S +F+  + +L+PEH   L HL
Sbjct  60   DREVFSSVHRVYLSKKKSILNVIFSLFSNTPLQIGYYKSKEFEDKLKQLLPEHSATLSHL  119

Query  67   IRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYEKSIAKY  126
            IRV  YVK+N     LEMTDAIS+NY+RV +  +   +K  +Y  E+ RL +YE++I   
Sbjct  120  IRVGDYVKENKDINFLEMTDAISLNYKRVKEKASLLSLKTFVYSFEQKRLERYERTINNK  179

Query  127  FDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANF----KNTLFKRSYKLIFIGNMF  182
            F  T  VSQ D +YL+   PD  +  LVC NGVD  +     +     +   L+FIGN++
Sbjct  180  FSLTTLVSQVDSDYLY---PDRPNNVLVCGNGVDAVSLPFSERKIAKDKKITLVFIGNLY  236

Query  183  SVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMDYA  242
            S+QN D   WF + VLP L ++G F F VIG+I+ ++   L +  GV VTG VD++   A
Sbjct  237  SLQNMDGVRWFTKEVLPFLNKHGNFEFKVIGRITDKDKSWLESQPGVVVTGEVDSITYAA  296

Query  243  NNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFVST  302
             +   G+C +RL AG+QNK+LEYMA+G+P I++++G EGL A +G+ I VANT  E++  
Sbjct  297  ADGHIGVCPIRLGAGIQNKVLEYMALGLPCISSTVGFEGLGAEEGKEIYVANTKEEYLR-  355

Query  303  ILKLF--NDPSFGKT--ISKNGLGYVQQNHSWSEKLQPLIQVINNLIE  346
            +L  F  N   + +T  ++K  +G   +N SW  KL P IQ I   ++
Sbjct  356  VLNYFITNLDKYTETALVAKKFIG---ENFSWEAKLSPYIQKIKESVK  400



  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
    Posted date:  Jun 15, 2009  5:41 PM
  Number of letters in database: 26,570,333
  Number of sequences in database:  84,251

Lambda     K      H
   0.322    0.138    0.412 
Gapped
Lambda     K      H
   0.267   0.0410    0.140 
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 84251
Number of Hits to DB: 1683107
Number of extensions: 66219
Number of successful extensions: 168
Number of sequences better than 0.1: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's gapped: 168
Number of HSP's successfully gapped: 0
Length of query: 347
Length of database: 26570333
Length adjustment: 105
Effective length of query: 242
Effective length of database: 17723978
Effective search space: 4289202676
Effective search space used: 4289202676
T: 11
A: 40
X1: 16 (7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 80 (35.4 bits)