BLASTP 2.2.21+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Stephen F. Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. RID: 3H9HKFT0013 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 9,090,450 sequences; 3,112,979,771 total letters Query= gi|56159895|gb|AAV80759.1| putative fucosyltransferase [Escherichia coli] >gi|56384983|gb|AAV85963.1| WcmD [Escherichia coli] Length=302 Score E Sequences producing significant alignments: (Bits) Value gb|AAV80759.1| putative fucosyltransferase [Escherichia coli]... 629 1e-180 gb|AAO37719.1| putative fucosyltransferase [Escherichia coli] 628 3e-180 ref|YP_002329683.1| predicted fucosyltransferase [Escherichia... 293 2e-79 gb|AAO37698.1| putative fucosyltransferase [Escherichia coli] 145 4e-35 gb|AAL67556.1|AF461121_7 putative fucosyltransferase WbgN [Es... 119 3e-27 gb|ABE98421.1| glycosyltransferase [Escherichia coli] 112 6e-25 ALIGNMENTS >gb|AAV80759.1| putative fucosyltransferase [Escherichia coli] gb|AAV85963.1| WcmD [Escherichia coli] Length=302 Score = 629 bits (1622), Expect = 1e-180, Method: Compositional matrix adjust. Identities = 302/302 (100%), Positives = 302/302 (100%), Gaps = 0/302 (0%) Query 1 MYSCLSGGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTRRRFELNQFNICYD 60 MYSCLSGGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTRRRFELNQFNICYD Sbjct 1 MYSCLSGGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTRRRFELNQFNICYD 60 Query 61 RLTTSKEKKEISIIRHVNRYRLPLFVTNSIFGVLLKKNYLPEAKFYEFLNNCKLQVKNGY 120 RLTTSKEKKEISIIRHVNRYRLPLFVTNSIFGVLLKKNYLPEAKFYEFLNNCKLQVKNGY Sbjct 61 RLTTSKEKKEISIIRHVNRYRLPLFVTNSIFGVLLKKNYLPEAKFYEFLNNCKLQVKNGY 120 Query 121 CLFSYFQDATLIDSHRDMILPLFQINEDLLNLCNDLHIYKKVICENANTTSLHIRRGDYI 180 CLFSYFQDATLIDSHRDMILPLFQINEDLLNLCNDLHIYKKVICENANTTSLHIRRGDYI Sbjct 121 CLFSYFQDATLIDSHRDMILPLFQINEDLLNLCNDLHIYKKVICENANTTSLHIRRGDYI 180 Query 181 TNPHASKFHGVLPMDYYEKAIRYIEDVQGEQVIIVFSDDVKWAENTFANQPNYYVVNNSE 240 TNPHASKFHGVLPMDYYEKAIRYIEDVQGEQVIIVFSDDVKWAENTFANQPNYYVVNNSE Sbjct 181 TNPHASKFHGVLPMDYYEKAIRYIEDVQGEQVIIVFSDDVKWAENTFANQPNYYVVNNSE 240 Query 241 CEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWFAGNNKSKLTMDSWI 300 CEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWFAGNNKSKLTMDSWI Sbjct 241 CEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWFAGNNKSKLTMDSWI 300 Query 301 NL 302 NL Sbjct 301 NL 302 >gb|AAO37719.1| putative fucosyltransferase [Escherichia coli] Length=302 Score = 628 bits (1619), Expect = 3e-180, Method: Compositional matrix adjust. Identities = 301/302 (99%), Positives = 302/302 (100%), Gaps = 0/302 (0%) Query 1 MYSCLSGGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTRRRFELNQFNICYD 60 MYSCLSGGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTRRRFELNQFNICYD Sbjct 1 MYSCLSGGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTRRRFELNQFNICYD 60 Query 61 RLTTSKEKKEISIIRHVNRYRLPLFVTNSIFGVLLKKNYLPEAKFYEFLNNCKLQVKNGY 120 RLTTSKEKKEISIIRHVNRYRLPLFVTNSIFGVLLKKNYLPEAKFYEFLNNCKLQVKNGY Sbjct 61 RLTTSKEKKEISIIRHVNRYRLPLFVTNSIFGVLLKKNYLPEAKFYEFLNNCKLQVKNGY 120 Query 121 CLFSYFQDATLIDSHRDMILPLFQINEDLLNLCNDLHIYKKVICENANTTSLHIRRGDYI 180 CLFSYFQDATLIDSHRDMILPLFQINEDLL+LCNDLHIYKKVICENANTTSLHIRRGDYI Sbjct 121 CLFSYFQDATLIDSHRDMILPLFQINEDLLHLCNDLHIYKKVICENANTTSLHIRRGDYI 180 Query 181 TNPHASKFHGVLPMDYYEKAIRYIEDVQGEQVIIVFSDDVKWAENTFANQPNYYVVNNSE 240 TNPHASKFHGVLPMDYYEKAIRYIEDVQGEQVIIVFSDDVKWAENTFANQPNYYVVNNSE Sbjct 181 TNPHASKFHGVLPMDYYEKAIRYIEDVQGEQVIIVFSDDVKWAENTFANQPNYYVVNNSE 240 Query 241 CEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWFAGNNKSKLTMDSWI 300 CEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWFAGNNKSKLTMDSWI Sbjct 241 CEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWFAGNNKSKLTMDSWI 300 Query 301 NL 302 NL Sbjct 301 NL 302 >ref|YP_002329683.1| predicted fucosyltransferase [Escherichia coli O127:H6 str. E2348/69] gb|AAR90894.1| putative fucosyltransferase [Escherichia coli] emb|CAS09719.1| predicted fucosyltransferase [Escherichia coli O127:H6 str. E2348/69] Length=299 Score = 293 bits (750), Expect = 2e-79, Method: Compositional matrix adjust. Identities = 148/304 (48%), Positives = 204/304 (67%), Gaps = 8/304 (2%) Query 1 MYSCLSGGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTRRRFELNQFNICYD 60 MY CLSGGLGNQMFQYAAAYIL++ LVLDDSY+ + +DT R EL+QF I +D Sbjct 2 MYCCLSGGLGNQMFQYAAAYILKQHFPDTILVLDDSYYFNQPQKDTIRHLELDQFKIIFD 61 Query 61 RLTTSKEKKEISIIRHVNRYRLPLFVTNSIFGV--LLKKNYLPEAKFYEFLNNCKLQVKN 118 R ++ EK +I+ +R ++PL + F L K L +A +Y N ++ + Sbjct 62 RFSSKDEKVKINRLR--KHKKIPLLNSFLQFTAIKLCNKYSLNDASYY---NPESIKNID 116 Query 119 GYCLFSYFQDATLIDSHRDMILPLFQINEDLLNLCNDLHIYKKVICENANTTSLHIRRGD 178 CLFS++QD+ L++ HRD+ILPLF+I +DL LC++L IY +I ++ N TS+H+RRGD Sbjct 117 VACLFSFYQDSKLLNEHRDLILPLFEIRDDLRVLCHNLQIYS-LITDSKNITSIHVRRGD 175 Query 179 YITNPHASKFHGVLPMDYYEKAIRYIEDVQGEQVIIVFSDDVKWAENTFANQPNYYVVNN 238 Y+ N HA+KFHG L MDYY A+ YIE G Q I+F+DDV WA+ F+ N V + Sbjct 176 YVNNKHAAKFHGTLSMDYYISAMEYIESECGSQTFIIFTDDVIWAKEKFSKYSNCLVADA 235 Query 239 SECEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWFAGNNKSKLTMDS 298 E ++S IDM+LMS C NNIIANSTYSWWGAWLN EDK+V++P++W+ N+ L ++ Sbjct 236 DENKFSVIDMYLMSLCNNNIIANSTYSWWGAWLNRSEDKLVIAPKQWYISGNECSLKNEN 295 Query 299 WINL 302 WI + Sbjct 296 WIAM 299 >gb|AAO37698.1| putative fucosyltransferase [Escherichia coli] Length=283 Score = 145 bits (367), Expect = 4e-35, Method: Compositional matrix adjust. Identities = 103/302 (34%), Positives = 161/302 (53%), Gaps = 31/302 (10%) Query 7 GGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTR--RRFELNQFNICYDRLTT 64 GGLGNQMFQYA A+ + ++ Q +L +D S D T R EL+ C Sbjct 7 GGLGNQMFQYATAFAIAKRTHQ-NLTVDIS---DAVKYKTHPLRLVELS----CSSEFV- 57 Query 65 SKEKKEISIIRHVNRYRLPLFVTNSIFGVLLKKNYLPEAKFYEFLNNCKLQVKNGYCLFS 124 KK +++ ++P F+ +F +K+Y+ ++ Y+ + K K + Sbjct 58 ---KKAWPFEKYLFSEKIPHFMKKGMF----RKHYVEKSLEYDPDIDTKSINKK---IVG 107 Query 125 YFQDATLIDSHRDMILPLFQINEDLLNLCNDLHIYKKVICENANTTSLHIRRGDYITNPH 184 YFQ R ++ FQ + N+L +I EN +T SLHIRRGDY+++ Sbjct 108 YFQTEKYFKEFRHELIKEFQPKTKFNSYQNELL---NLIKEN-DTCSLHIRRGDYVSSKI 163 Query 185 ASKFHGVLPMDYYEKAIRYIED---VQGEQVIIVFSDDVKWA-ENTFANQPNYYVVNNSE 240 A++ HG Y+E+AI Y+ + + + ++ +FSDD+KW EN F N +V + Sbjct 164 ANETHGTCSEKYFERAIDYLMNKGVINKKTLLFIFSDDIKWCRENIFFNNQICFV--QGD 221 Query 241 CEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWFAGNNKSKLTMDSWI 300 + +DM LMSKCKNNII+NS++SWW AWLN ++K V++P KWF + K + +SW+ Sbjct 222 AYHVELDMLLMSKCKNNIISNSSFSWWAAWLNENKNKTVIAPSKWFKKDIKHDIIPESWV 281 Query 301 NL 302 L Sbjct 282 KL 283 >gb|AAL67556.1|AF461121_7 putative fucosyltransferase WbgN [Escherichia coli] dbj|BAG11841.1| putative fucosyltransferase [Escherichia coli O55:H7] dbj|BAG11899.1| putative fucosyltransferase WbgN [Escherichia coli O55:H7] dbj|BAG11955.1| putative fucosyltransferase WbgN [Escherichia coli O55:H6] Length=290 Score = 119 bits (299), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 89/291 (30%), Positives = 138/291 (47%), Gaps = 37/291 (12%) Query 5 LSGGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTRRRFELNQFNICYDRLTT 64 L+GGLGNQMFQYA Y + + SL LD N F L++ NI + + Sbjct 8 LAGGLGNQMFQYAKGYA-ESVERNSSLKLD---LRGYKNYTLHGGFRLDKLNIDNTFVMS 63 Query 65 SKEKKEIS--IIRHVNRYRLPLFVTNSIFGVLLKKNYLPEAKFYEFLNNCKLQVKNGYCL 122 KE I+R +N++ P L K + E ++ +K Sbjct 64 KKEMCIFPNFIVRAINKF--PKLS-------LCSKRFESE----QYSKKINGSMKGSVEF 110 Query 123 FSYFQDATLIDSHRDMILPLF---QINEDLLNLCNDLHIYKKVICENANTTSLHIRRGDY 179 ++Q+ H++ + +F IN D L + + N+ S+HIRRGDY Sbjct 111 IGFWQNERYFLEHKEKLREIFTPININLDAKELSD--------VIRCTNSVSVHIRRGDY 162 Query 180 ITNPHASKFHGVLPMDYYEKAIRYIEDVQGEQVIIVFSDDVKWAE----NTFANQPNYYV 235 ++N A K HG+ YY +IRY+++ V VFSDD++W + F+ + Sbjct 163 VSNVEALKIHGLCTERYYIDSIRYLKERFNNLVFFVFSDDIEWCKKYKNEIFSRSDDVKF 222 Query 236 VNNSECEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWF 286 + + E +DM+LMS K +IIANS++SWWGAWL ++ I ++P WF Sbjct 223 IEGNTQE---VDMWLMSNAKYHIIANSSFSWWGAWLKNYDLGITIAPTPWF 270 >gb|ABE98421.1| glycosyltransferase [Escherichia coli] Length=297 Score = 112 bits (279), Expect = 6e-25, Method: Compositional matrix adjust. Identities = 92/317 (29%), Positives = 144/317 (45%), Gaps = 50/317 (15%) Query 5 LSGGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTRRRFELNQFNICYDRLTT 64 L GGLGNQ+FQ++ Y L K+ L D S++ + D + LN I + L Sbjct 6 LQGGLGNQLFQFSFGYALS-KINGTPLYFDISHY---AENDDHGGYRLNNLQIPEEYL-- 59 Query 65 SKEKKEISIIRHVNRYRLPLFVTNSIFGVLLKKNYLPEAKFYEFLNNCKLQVKNGYCL-- 122 +Y P N+I+ +L++ + L F FL C GY Sbjct 60 --------------QYYTP--KINNIYKLLVRGSRLYPDIFL-FLGFCNEFHAYGYDFEY 102 Query 123 ----------FSYFQDATLIDSH----RDMILPLFQINEDLLNLCNDLHIYKKVICENAN 168 Y+Q H ++ +P N+ ++ I E+ + Sbjct 103 IAQKWKSKKYIGYWQSEHFFHKHILDLKEFFIPK--------NVSEQANLLAAKILESQS 154 Query 169 TTSLHIRRGDYITNPHASKFHGVLPMDYYEKAIRYIEDVQGEQVIIVFSDDVKW-AENTF 227 + S+HIRRGDYI N A+ HGV ++YY+KA+ I D+ + + +FSDD+ W EN Sbjct 155 SLSIHIRRGDYIKNKTATLTHGVCSLEYYKKALNKIRDLAMIRDVFIFSDDIFWCKENIE 214 Query 228 ANQPNYYVVNNSECEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWF- 286 Y + SE D++LMS ++IIANS++SWWGA+L + +IV+ P W+ Sbjct 215 TLLSKKYNIYYSEDLSQEEDLWLMSLANHHIIANSSFSWWGAYLGSSASQIVIYPTPWYD 274 Query 287 -AGNNKSKLTMDSWINL 302 N ++ WIN+ Sbjct 275 ITPKNTYIPIVNHWINV 291 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jun 16, 2009 5:41 PM Number of letters in database: 26,573,871 Number of sequences in database: 84,272 Lambda K H 0.323 0.137 0.428 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 84272 Number of Hits to DB: 418252 Number of extensions: 17014 Number of successful extensions: 46 Number of sequences better than 0.1: 1 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's gapped: 45 Number of HSP's successfully gapped: 1 Length of query: 302 Length of database: 26573871 Length adjustment: 103 Effective length of query: 199 Effective length of database: 17893855 Effective search space: 3560877145 Effective search space used: 3560877145 T: 11 A: 40 X1: 16 (7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 80 (35.4 bits)