BLASTP 2.2.21+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Stephen F.
Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005)
"Protein database searches using compositionally adjusted
substitution matrices", FEBS J. 272:5101-5109.

RID: 3H9DWTF901S

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           9,079,606 sequences; 3,109,523,384 total letters

Query= gi|56122506|gb|AAV74378.1| glycosyltransferase [Escherichia coli]
Length=347

                                                                   Score  E
Sequences producing significant alignments:                       (Bits)  Value

gb|AAV74378.1| glycosyltransferase [Escherichia coli]                718  0.0
ref|NP_754447.1| hypothetical protein c2559 [Escherichia coli...     266  3e-71
ref|YP_669975.1| putative glycosyltransferase [Escherichia co...     266  4e-71
emb|CAD19797.1| putative glycosyltransferase [Escherichia coli]      265  5e-71
ref|YP_002413092.1| Mannosyl transferase wbaD [Escherichia co...     261  7e-70

ALIGNMENTS

>gb|AAV74378.1| glycosyltransferase [Escherichia coli]
Length=347

Score = 718 bits (1853), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/347 (100%), Positives = 347/347 (100%), Gaps = 0/347 (0%)

Query  1    MTYPIPDDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHD  60
            MTYPIPDDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHD
Sbjct  1    MTYPIPDDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHD  60

Query  61   LLLPHLIRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYE  120
            LLLPHLIRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYE
Sbjct  61   LLLPHLIRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYE  120

Query  121  KSIAKYFDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANFKNTLFKRSYKLIFIGN  180
            KSIAKYFDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANFKNTLFKRSYKLIFIGN
Sbjct  121  KSIAKYFDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANFKNTLFKRSYKLIFIGN  180

Query  181  MFSVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMD  240
            MFSVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMD
Sbjct  181  MFSVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMD  240

Query  241  YANNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFV  300
            YANNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFV
Sbjct  241  YANNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFV  300

Query  301  STILKLFNDPSFGKTISKNGLGYVQQNHSWSEKLQPLIQVINNLIEE  347
            STILKLFNDPSFGKTISKNGLGYVQQNHSWSEKLQPLIQVINNLIEE
Sbjct  301  STILKLFNDPSFGKTISKNGLGYVQQNHSWSEKLQPLIQVINNLIEE  347

>ref|NP_754447.1| hypothetical protein c2559 [Escherichia coli CFT073]
 gb|AAN81014.1|AE016762_267 Hypothetical protein c2559 [Escherichia coli CFT073]
Length=371

Score = 266 bits (679), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 145/347 (41%), Positives = 208/347 (59%), Gaps = 11/347 (3%)

Query  7    DDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHDLLLPHL  66
            +D V+  ++R + P+  SY++ L AL T  PLQ+AYY S  FK   NKL+ + D +  HL
Sbjct  29   NDSVFKEIHRVYLPKYKSYYNVLKALVTQKPLQIAYYQSDTFKNKYNKLIKQCDAVFCHL  88

Query  67   IRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYEKSIAKY  126
            IRVA YVK     KIL+MTDAIS+NY RV KL +   ++ +IY +E+ RL  YE+S+A
Sbjct  89   IRVADYVKDTDKFKILDMTDAISLNYSRVKKLASKKSLRAIIYSLEQKRLESYERSVANL  148

Query  127  FDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANFKNTLFKRSYK------LIFIGN  180
            FD T F+S  D++YL+ N     H   +  NGVD +  +    KR  K      LIFIGN
Sbjct  149  FDLTTFISSVDRDYLYPNPGSNIH---IVNNGVDTSALR--YIKREIKIDKPVELIFIGN  203

Query  181  MFSVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMD  240
            M+S+QN DAA  F +++LP L       F VIGKIS  N   L++++     G VD++
Sbjct  204  MYSLQNMDAAKHFAKNILPCLYDEFNIIFKVIGKISETNKNILNSFKNTIALGTVDDINS  263

Query  241  YANNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFV  300
             A+    GIC VRL AGVQNKILEYMA+G+P IT+SIG EG+ A  G  I VA+T  ++
Sbjct  264  SASTGHIGICPVRLGAGVQNKILEYMALGLPCITSSIGYEGINAKSGSEIFVADTVEQYK  323

Query  301  STILKLFNDPSFGKTISKNGLGYVQQNHSWSEKLQPLIQVINNLIEE  347
            + + ++  D +    +++N   +V+ N SW  K+  L+  ++  + E
Sbjct  324  NVLREIIYDYNRYTEVAENARSFVENNFSWESKVANLMNTLDEKLYE  370

>ref|YP_669975.1| putative glycosyltransferase [Escherichia coli 536]
 ref|ZP_03031752.1| mannosyl transferase [Escherichia coli F11]
 gb|ABG70074.1| putative glycosyltransferase [Escherichia coli 536]
 gb|EDV69299.1| mannosyl transferase [Escherichia coli F11]
Length=401

Score = 266 bits (679), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 145/347 (41%), Positives = 208/347 (59%), Gaps = 11/347 (3%)

Query  7    DDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHDLLLPHL  66
            +D V+  ++R + P+  SY++ L AL T  PLQ+AYY S  FK   NKL+ + D +  HL
Sbjct  59   NDSVFKEIHRVYLPKYKSYYNVLKALVTQKPLQIAYYQSDTFKNKYNKLIKQCDAVFCHL  118

Query  67   IRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYEKSIAKY  126
            IRVA YVK     KIL+MTDAIS+NY RV KL +   ++ +IY +E+ RL  YE+S+A
Sbjct  119  IRVADYVKDTDKFKILDMTDAISLNYSRVKKLASKKSLRAIIYSLEQKRLESYERSVANL  178

Query  127  FDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANFKNTLFKRSYK------LIFIGN  180
            FD T F+S  D++YL+ N     H   +  NGVD +  +    KR  K      LIFIGN
Sbjct  179  FDLTTFISSVDRDYLYPNPGSNIH---IVNNGVDTSALR--YIKREIKIDKPVELIFIGN  233

Query  181  MFSVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMD  240
            M+S+QN DAA  F +++LP L       F VIGKIS  N   L++++     G VD++
Sbjct  234  MYSLQNMDAAKHFAKNILPCLYDEFNIIFKVIGKISETNKNILNSFKNTIALGTVDDINS  293

Query  241  YANNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFV  300
             A+    GIC VRL AGVQNKILEYMA+G+P IT+SIG EG+ A  G  I VA+T  ++
Sbjct  294  SASTGHIGICPVRLGAGVQNKILEYMALGLPCITSSIGYEGINAKSGSEIFVADTVEQYK  353

Query  301  STILKLFNDPSFGKTISKNGLGYVQQNHSWSEKLQPLIQVINNLIEE  347
            + + ++  D +    +++N   +V+ N SW  K+  L+  ++  + E
Sbjct  354  NVLREIIYDYNRYTEVAENARSFVENNFSWESKVANLMNTLDEKLYE  400

>emb|CAD19797.1| putative glycosyltransferase [Escherichia coli]
Length=401

Score = 265 bits (678), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 145/347 (41%), Positives = 208/347 (59%), Gaps = 11/347 (3%)

Query  7    DDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHDLLLPHL  66
            +D V+  ++R + P+  SY++ L AL T  PLQ+AYY S  FK   NKL+ + D +  HL
Sbjct  59   NDSVFKEIHRVYLPKYKSYYNVLKALVTQKPLQIAYYQSDTFKNKYNKLIKQCDAVFCHL  118

Query  67   IRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYEKSIAKY  126
            IRVA YVK     KIL+MTDAIS+NY RV KL +   ++ +IY +E+ RL  YE+S+A
Sbjct  119  IRVADYVKDTDKFKILDMTDAISLNYSRVKKLASKKSLRAIIYSLEQKRLESYERSVANL  178

Query  127  FDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANFKNTLFKRSYK------LIFIGN  180
            FD T F+S  D++YL+ N     H   +  NGVD +  +    KR  K      LIFIGN
Sbjct  179  FDLTTFISSVDRDYLYPNPGSNIH---IVNNGVDTSALR--YIKREIKIDKPVELIFIGN  233

Query  181  MFSVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMD  240
            M+S+QN DAA  F +++LP L       F VIGKIS  N   L++++     G VD++
Sbjct  234  MYSLQNMDAAKHFAKNILPCLYDEFNIIFKVIGKISETNKNILNSFKNTIALGTVDDINS  293

Query  241  YANNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFV  300
             A+    GIC VRL AGVQNKILEYMA+G+P IT+SIG EG+ A  G  I VA+T  ++
Sbjct  294  SASTGHIGICLVRLGAGVQNKILEYMALGLPCITSSIGYEGINAKSGSEIFVADTVEQYK  353

Query  301  STILKLFNDPSFGKTISKNGLGYVQQNHSWSEKLQPLIQVINNLIEE  347
            + + ++  D +    +++N   +V+ N SW  K+  L+  ++  + E
Sbjct  354  NVLREIIYDYNRYTEVAENARSFVENNFSWESKVANLMNTLDEKLYE  400

>ref|YP_002413092.1| Mannosyl transferase wbaD [Escherichia coli UMN026]
 gb|AAY23730.1| WbaD [Escherichia coli]
 gb|AAY23736.1| WbaD [Escherichia coli]
 gb|AAY23742.1| WbaD [Escherichia coli]
 emb|CAR13564.1| Mannosyl transferase wbaD [Escherichia coli UMN026]
Length=400

Score = 261 bits (668), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 141/348 (40%), Positives = 211/348 (60%), Gaps = 15/348 (4%)

Query  7    DDGVYTSVYRCHHPRIISYFSCLLALPTNIPLQVAYYYSPKFKKIINKLVPEHDLLLPHL  66
            D  V++SV+R +  +  S  + + +L +N PLQ+ YY S +F+  + +L+PEH   L HL
Sbjct  60   DREVFSSVHRVYLSKKKSILNVIFSLFSNTPLQIGYYKSKEFEDKLKQLLPEHSATLSHL  119

Query  67   IRVAGYVKKNSTPKILEMTDAISMNYERVCKLKNSTGIKGLIYKIERNRLNQYEKSIAKY  126
            IRV  YVK+N     LEMTDAIS+NY+RV +  +   +K  +Y  E+ RL +YE++I
Sbjct  120  IRVGDYVKENKDINFLEMTDAISLNYKRVKEKASLLSLKTFVYSFEQKRLERYERTINNK  179

Query  127  FDQTIFVSQHDKNYLFRNLPDLYHKSLVCTNGVDVANF----KNTLFKRSYKLIFIGNMF  182
            F  T  VSQ D +YL+   PD  +  LVC NGVD  +     +     +   L+FIGN++
Sbjct  180  FSLTTLVSQVDSDYLY---PDRPNNVLVCGNGVDAVSLPFSERKIAKDKKITLVFIGNLY  236

Query  183  SVQNFDAAFWFCESVLPILRQYGPFTFHVIGKISLENSKKLSAYEGVFVTGAVDNVMDYA  242
            S+QN D   WF + VLP L ++G F F VIG+I+ ++   L +  GV VTG VD++   A
Sbjct  237  SLQNMDGVRWFTKEVLPFLNKHGNFEFKVIGRITDKDKSWLESQPGVVVTGEVDSITYAA  296

Query  243  NNSLAGICSVRLAAGVQNKILEYMAMGIPAITTSIGLEGLFAVDGESIVVANTPHEFVST  302
             +   G+C +RL AG+QNK+LEYMA+G+P I++++G EGL A +G+ I VANT  E++
Sbjct  297  ADGHIGVCPIRLGAGIQNKVLEYMALGLPCISSTVGFEGLGAEEGKEIYVANTKEEYLR-  355

Query  303  ILKLF--NDPSFGKT--ISKNGLGYVQQNHSWSEKLQPLIQVINNLIE  346
            +L  F  N   + +T  ++K  +G   +N SW  KL P IQ I   ++
Sbjct  356  VLNYFITNLDKYTETALVAKKFIG---ENFSWEAKLSPYIQKIKESVK  400

  Database: All non-redundant GenBank CDS
  translations+PDB+SwissProt+PIR+PRF excluding environmental samples
  from WGS projects
    Posted date:  Jun 15, 2009 5:41 PM
  Number of letters in database: 26,570,333
  Number of sequences in database:  84,251

Lambda      K        H
   0.322    0.138    0.412
Gapped
Lambda      K        H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 84251
Number of Hits to DB: 1683107
Number of extensions: 66219
Number of successful extensions: 168
Number of sequences better than 0.1: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's gapped: 168
Number of HSP's successfully gapped: 0
Length of query: 347
Length of database: 26570333
Length adjustment: 105
Effective length of query: 242
Effective length of database: 17723978
Effective search space: 4289202676
Effective search space used: 4289202676
T: 11
A: 40
X1: 16 (7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 80 (35.4 bits)
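
Note on reading the scores: the bit scores and Expect values in the report can be cross-checked against the gapped Lambda and K and the effective search space printed in the statistics footer. The following is a minimal sketch, assuming the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S'). The function names bit_score and expect_value are hypothetical and are not part of BLAST. Because these hits were scored with compositional matrix adjustment, the Expect values printed in the report may differ somewhat from what this sketch computes.

```python
import math

# Constants copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 4289202676


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Normalise a raw score: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE,
                 lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Approximate E-value: E = search_space * 2 ** (-S')."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    # Raw scores are the parenthesised values on the "Score =" lines.
    for raw in (1853, 679, 678, 668):
        print(raw, int(bit_score(raw)), f"{expect_value(raw):.1e}")
```

Truncating the computed bit scores to integers gives 718, 266, 265 and 261 for raw scores 1853, 679, 678 and 668, which agrees with the bit scores listed in the report. The computed Expect values land in the same order of magnitude as the reported ones.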