BLASTP 2.2.21+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Stephen F. Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. RID: 3H9TDMS201R Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 9,079,606 sequences; 3,109,523,384 total letters Query= gi|18266402|gb|AAL67556.1|AF461121_7 putative fucosyltransferase WbgN [Escherichia coli] >gi|168986301|dbj|BAG11841.1| putative fucosyltransferase [Escherichia coli O55:H7] >gi|168986360|dbj|BAG11899.1| putative fucosyltransferase WbgN [Escherichia coli O55:H7] >gi|168986417|dbj|BAG11955.1| putative fucosyltransferase WbgN [Escherichia coli O55:H6] Length=290 Score E Sequences producing significant alignments: (Bits) Value gb|AAL67556.1|AF461121_7 putative fucosyltransferase WbgN [Es... 600 8e-172 gb|ABE98421.1| glycosyltransferase [Escherichia coli] 196 3e-50 gb|AAO37698.1| putative fucosyltransferase [Escherichia coli] 157 2e-38 ref|YP_002329683.1| predicted fucosyltransferase [Escherichia... 123 2e-28 gb|AAO37719.1| putative fucosyltransferase [Escherichia coli] 120 2e-27 gb|AAV80759.1| putative fucosyltransferase [Escherichia coli]... 119 3e-27 ALIGNMENTS >gb|AAL67556.1|AF461121_7 putative fucosyltransferase WbgN [Escherichia coli] dbj|BAG11841.1| putative fucosyltransferase [Escherichia coli O55:H7] dbj|BAG11899.1| putative fucosyltransferase WbgN [Escherichia coli O55:H7] dbj|BAG11955.1| putative fucosyltransferase WbgN [Escherichia coli O55:H6] Length=290 Score = 600 bits (1546), Expect = 8e-172, Method: Compositional matrix adjust. Identities = 290/290 (100%), Positives = 290/290 (100%), Gaps = 0/290 (0%) Query 1 MSIVVARLAGGLGNQMFQYAKGYAESVERNSSLKLDLRGYKNYTLHGGFRLDKLNIDNTF 60 MSIVVARLAGGLGNQMFQYAKGYAESVERNSSLKLDLRGYKNYTLHGGFRLDKLNIDNTF Sbjct 1 MSIVVARLAGGLGNQMFQYAKGYAESVERNSSLKLDLRGYKNYTLHGGFRLDKLNIDNTF 60 Query 61 VMSKKEMCIFPNFIVRAINKFPKLSLCSKRFESEQYSKKINGSMKGSVEFIGFWQNERYF 120 VMSKKEMCIFPNFIVRAINKFPKLSLCSKRFESEQYSKKINGSMKGSVEFIGFWQNERYF Sbjct 61 VMSKKEMCIFPNFIVRAINKFPKLSLCSKRFESEQYSKKINGSMKGSVEFIGFWQNERYF 120 Query 121 LEHKEKLREIFTPININLDAKELSDVIRCTNSVSVHIRRGDYVSNVEALKIHGLCTERYY 180 LEHKEKLREIFTPININLDAKELSDVIRCTNSVSVHIRRGDYVSNVEALKIHGLCTERYY Sbjct 121 LEHKEKLREIFTPININLDAKELSDVIRCTNSVSVHIRRGDYVSNVEALKIHGLCTERYY 180 Query 181 IDSIRYLKERFNNLVFFVFSDDIEWCKKYKNEIFSRSDDVKFIEGNTQEVDMWLMSNAKY 240 IDSIRYLKERFNNLVFFVFSDDIEWCKKYKNEIFSRSDDVKFIEGNTQEVDMWLMSNAKY Sbjct 181 IDSIRYLKERFNNLVFFVFSDDIEWCKKYKNEIFSRSDDVKFIEGNTQEVDMWLMSNAKY 240 Query 241 HIIANSSFSWWGAWLKNYDLGITIAPTPWFEREELNSFDPCPEKWVRIEK 290 HIIANSSFSWWGAWLKNYDLGITIAPTPWFEREELNSFDPCPEKWVRIEK Sbjct 241 HIIANSSFSWWGAWLKNYDLGITIAPTPWFEREELNSFDPCPEKWVRIEK 290 >gb|ABE98421.1| glycosyltransferase [Escherichia coli] Length=297 Score = 196 bits (497), Expect = 3e-50, Method: Compositional matrix adjust. Identities = 110/297 (37%), Positives = 170/297 (57%), Gaps = 11/297 (3%) Query 1 MSIVVARLAGGLGNQMFQYAKGYAESVERNSSLKLDLRGYKNYTLHGGFRLDKLNIDNTF 60 MSI+ RL GGLGNQ+FQ++ GYA S + L D+ Y HGG+RL+ L I + Sbjct 1 MSII--RLQGGLGNQLFQFSFGYALSKINGTPLYFDISHYAENDDHGGYRLNNLQIPEEY 58 Query 61 VM--SKKEMCIFPNFIVRAINKFPK----LSLCSKRFESEQYSKKINGSMKGSVEFIGFW 114 + + K I+ +VR +P L C++ F + Y + S ++IG+W Sbjct 59 LQYYTPKINNIY-KLLVRGSRLYPDIFLFLGFCNE-FHAYGYDFEYIAQKWKSKKYIGYW 116 Query 115 QNERYFLEHKEKLREIFTPININLDAKELS-DVIRCTNSVSVHIRRGDYVSNVEALKIHG 173 Q+E +F +H L+E F P N++ A L+ ++ +S+S+HIRRGDY+ N A HG Sbjct 117 QSEHFFHKHILDLKEFFIPKNVSEQANLLAAKILESQSSLSIHIRRGDYIKNKTATLTHG 176 Query 174 LCTERYYIDSIRYLKERFNNLVFFVFSDDIEWCKKYKNEIFSRSDDVKFIEGNTQEVDMW 233 +C+ YY ++ +++ F+FSDDI WCK+ + S+ ++ + E +QE D+W Sbjct 177 VCSLEYYKKALNKIRDLAMIRDVFIFSDDIFWCKENIETLLSKKYNIYYSEDLSQEEDLW 236 Query 234 LMSNAKYHIIANSSFSWWGAWLKNYDLGITIAPTPWFEREELNSFDPCPEKWVRIEK 290 LMS A +HIIANSSFSWWGA+L + I I PTPW++ N++ P W+ ++K Sbjct 237 LMSLANHHIIANSSFSWWGAYLGSSASQIVIYPTPWYDITPKNTYIPIVNHWINVDK 293 >gb|AAO37698.1| putative fucosyltransferase [Escherichia coli] Length=283 Score = 157 bits (396), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 103/291 (35%), Positives = 162/291 (55%), Gaps = 18/291 (6%) Query 6 ARLAGGLGNQMFQYAKGYAESVERNSSLKLDLRGYKNYTLHGGFRLDKLNIDNTFVMSKK 65 ++ GGLGNQMFQYA +A + + +L +D+ Y H RL +L+ + FV KK Sbjct 3 VKIIGGLGNQMFQYATAFAIAKRTHQNLTVDISDAVKYKTHP-LRLVELSCSSEFV--KK 59 Query 66 EMCIFPNFIVRAINKFPKLSLCSKRF--ESEQYSKKINGSMKGSVEFIGFWQNERYFLEH 123 I F K + K + +S +Y I+ + + + +G++Q E+YF E Sbjct 60 AWPFEKYLFSEKIPHFMKKGMFRKHYVEKSLEYDPDID-TKSINKKIVGYFQTEKYFKEF 118 Query 124 KEKLREIFTP-ININLDAKELSDVIRCTNSVSVHIRRGDYVSNVEALKIHGLCTERYYID 182 + +L + F P N EL ++I+ ++ S+HIRRGDYVS+ A + HG C+E+Y+ Sbjct 119 RHELIKEFQPKTKFNSYQNELLNLIKENDTCSLHIRRGDYVSSKIANETHGTCSEKYFER 178 Query 183 SIRYLKERF---NNLVFFVFSDDIEWCKKYKNEIFSRSDDVKFIEGNTQ--EVDMWLMSN 237 +I YL + + F+FSDDI+WC++ N F+ + + F++G+ E+DM LMS Sbjct 179 AIDYLMNKGVINKKTLLFIFSDDIKWCRE--NIFFN--NQICFVQGDAYHVELDMLLMSK 234 Query 238 AKYHIIANSSFSWWGAWLKNYDLGITIAPTPWFEREELNSFDPCPEKWVRI 288 K +II+NSSFSWW AWL IAP+ WF+++ D PE WV++ Sbjct 235 CKNNIISNSSFSWWAAWLNENKNKTVIAPSKWFKKDI--KHDIIPESWVKL 283 >ref|YP_002329683.1| predicted fucosyltransferase [Escherichia coli O127:H6 str. E2348/69] gb|AAR90894.1| putative fucosyltransferase [Escherichia coli] emb|CAS09719.1| predicted fucosyltransferase [Escherichia coli O127:H6 str. E2348/69] Length=299 Score = 123 bits (309), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 90/286 (31%), Positives = 141/286 (49%), Gaps = 31/286 (10%) Query 8 LAGGLGNQMFQYAKGYAESVE-RNSSLKLDLRGYKNYTLHGGFR---LDKLNIDNTFVMS 63 L+GGLGNQMFQYA Y ++ L LD Y N R LD+ I S Sbjct 6 LSGGLGNQMFQYAAAYILKQHFPDTILVLDDSYYFNQPQKDTIRHLELDQFKIIFDRFSS 65 Query 64 KKEMCIFPNF-------IVRAINKFPKLSLCSKRF--ESEQYSKKINGSMKGSVEFIGFW 114 K E ++ + +F + LC+K ++ Y+ + ++ + F F+ Sbjct 66 KDEKVKINRLRKHKKIPLLNSFLQFTAIKLCNKYSLNDASYYNPESIKNIDVACLF-SFY 124 Query 115 QNERYFLEHKE------KLREIFTPININLDAKELSDVIRCTNSVSVHIRRGDYVSNVEA 168 Q+ + EH++ ++R+ + NL L + N S+H+RRGDYV+N A Sbjct 125 QDSKLLNEHRDLILPLFEIRDDLRVLCHNLQIYSL--ITDSKNITSIHVRRGDYVNNKHA 182 Query 169 LKIHGLCTERYYIDSIRYLKERFNNLVFFVFSDDIEWCK----KYKNEIFSRSDDVKFIE 224 K HG + YYI ++ Y++ + F +F+DD+ W K KY N + + +D+ KF Sbjct 183 AKFHGTLSMDYYISAMEYIESECGSQTFIIFTDDVIWAKEKFSKYSNCLVADADENKF-- 240 Query 225 GNTQEVDMWLMSNAKYHIIANSSFSWWGAWLKNYDLGITIAPTPWF 270 +DM+LMS +IIANS++SWWGAWL + + IAP W+ Sbjct 241 ---SVIDMYLMSLCNNNIIANSTYSWWGAWLNRSEDKLVIAPKQWY 283 >gb|AAO37719.1| putative fucosyltransferase [Escherichia coli] Length=302 Score = 120 bits (300), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 89/291 (30%), Positives = 138/291 (47%), Gaps = 37/291 (12%) Query 8 LAGGLGNQMFQYAKGYA-ESVERNSSLKLD---LRGYKNYTLHGGFRLDKLNIDNTFVMS 63 L+GGLGNQMFQYA Y + + SL LD N F L++ NI + + Sbjct 5 LSGGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTRRRFELNQFNICYDRLTT 64 Query 64 KKEMCIFPNFIVRAINKF--PKLS-------LCSKRFESE----QYSKKINGSMKGSVEF 110 KE I+R +N++ P L K + E ++ +K Sbjct 65 SKEKKEIS--IIRHVNRYRLPLFVTNSIFGVLLKKNYLPEAKFYEFLNNCKLQVKNGYCL 122 Query 111 IGFWQNERYFLEHKEKLREIFTPININLDAKELSD--------VIRCTNSVSVHIRRGDY 162 ++Q+ H++ + +F IN D L + + N+ S+HIRRGDY Sbjct 123 FSYFQDATLIDSHRDMILPLF---QINEDLLHLCNDLHIYKKVICENANTTSLHIRRGDY 179 Query 163 VSNVEALKIHGLCTERYYIDSIRYLKERFNNLVFFVFSDDIEWCKKYKNEIFSRSDDVKF 222 ++N A K HG+ YY +IRY+++ V VFSDD++W + F+ + Sbjct 180 ITNPHASKFHGVLPMDYYEKAIRYIEDVQGEQVIIVFSDDVKWAE----NTFANQPNYYV 235 Query 223 IEGNTQE---VDMWLMSNAKYHIIANSSFSWWGAWLKNYDLGITIAPTPWF 270 + + E +DM+LMS K +IIANS++SWWGAWL ++ I ++P WF Sbjct 236 VNNSECEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWF 286 >gb|AAV80759.1| putative fucosyltransferase [Escherichia coli] gb|AAV85963.1| WcmD [Escherichia coli] Length=302 Score = 119 bits (299), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 89/291 (30%), Positives = 138/291 (47%), Gaps = 37/291 (12%) Query 8 LAGGLGNQMFQYAKGYA-ESVERNSSLKLD---LRGYKNYTLHGGFRLDKLNIDNTFVMS 63 L+GGLGNQMFQYA Y + + SL LD N F L++ NI + + Sbjct 5 LSGGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTRRRFELNQFNICYDRLTT 64 Query 64 KKEMCIFPNFIVRAINKF--PKLS-------LCSKRFESE----QYSKKINGSMKGSVEF 110 KE I+R +N++ P L K + E ++ +K Sbjct 65 SKEKKEIS--IIRHVNRYRLPLFVTNSIFGVLLKKNYLPEAKFYEFLNNCKLQVKNGYCL 122 Query 111 IGFWQNERYFLEHKEKLREIFTPININLDAKELSD--------VIRCTNSVSVHIRRGDY 162 ++Q+ H++ + +F IN D L + + N+ S+HIRRGDY Sbjct 123 FSYFQDATLIDSHRDMILPLF---QINEDLLNLCNDLHIYKKVICENANTTSLHIRRGDY 179 Query 163 VSNVEALKIHGLCTERYYIDSIRYLKERFNNLVFFVFSDDIEWCKKYKNEIFSRSDDVKF 222 ++N A K HG+ YY +IRY+++ V VFSDD++W + F+ + Sbjct 180 ITNPHASKFHGVLPMDYYEKAIRYIEDVQGEQVIIVFSDDVKWAE----NTFANQPNYYV 235 Query 223 IEGNTQE---VDMWLMSNAKYHIIANSSFSWWGAWLKNYDLGITIAPTPWF 270 + + E +DM+LMS K +IIANS++SWWGAWL ++ I ++P WF Sbjct 236 VNNSECEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWF 286 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Jun 15, 2009 5:41 PM Number of letters in database: 26,570,333 Number of sequences in database: 84,251 Lambda K H 0.322 0.138 0.427 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 84251 Number of Hits to DB: 397777 Number of extensions: 16519 Number of successful extensions: 63 Number of sequences better than 0.1: 1 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's gapped: 61 Number of HSP's successfully gapped: 1 Length of query: 290 Length of database: 26570333 Length adjustment: 103 Effective length of query: 187 Effective length of database: 17892480 Effective search space: 3345893760 Effective search space used: 3345893760 T: 11 A: 40 X1: 16 (7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.4 bits) S2: 79 (35.0 bits)