Amino acid dipeptide frequency for Enterobacteria phage T6 (Bacteriophage T6)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
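
As an illustration of what such a calculation might look like, here is a minimal Python sketch. It assumes overlapping two-residue windows counted within each protein (never across protein boundaries), one-letter sequence codes, and normalisation by the total number of dipeptides. The page does not state the exact procedure, so these choices and the names `dipeptide_counts` and `dipeptide_frequencies` are assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes 'X' (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein sequence."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard or unknown residues are pooled as 'X'.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_frequencies(proteins):
    """Return each dipeptide's share of all dipeptides (as a fraction)."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input, not real phage T6 sequences.
toy_proteins = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy_proteins)
```

In the toy input, the six dipeptides are MK, KK, KL, LA, AK and KK, so `freqs["KK"]` is one third.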

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
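
To make the units and the uniform baseline concrete, the following Python sketch converts a per mille value to a fraction and compares it with the 1/400 expectation (2.5‰). The function names are hypothetical, and the exact rule the site uses to colour cells is not stated on the page, so treating 2.5‰ as the threshold is an assumption.

```python
# Uniform expectation over the 400 standard dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value_per_mille):
    """Convert a table value (per mille) to a plain fraction."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the assumed uniform baseline."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Two values taken from the table below.
ala_ala = classify(4.7)    # AlaAla: 4.7 per mille
cys_cys = classify(0.152)  # CysCys: 0.152 per mille
```

With these inputs, AlaAla (4.7‰, a fraction of about 0.0047) falls above the baseline and CysCys (0.152‰) falls below it.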

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.7 ± 0.377
AlaCys: 0.495 ± 0.102
AlaAsp: 3.273 ± 0.25
AlaGlu: 5.062 ± 0.353
AlaPhe: 2.207 ± 0.181
AlaGly: 3.768 ± 0.317
AlaHis: 1.332 ± 0.177
AlaIle: 4.833 ± 0.284
AlaLys: 5.062 ± 0.308
AlaLeu: 5.537 ± 0.365
AlaMet: 1.218 ± 0.152
AlaAsn: 3.273 ± 0.259
AlaPro: 2.055 ± 0.218
AlaGln: 2.569 ± 0.254
AlaArg: 2.797 ± 0.197
AlaSer: 4.167 ± 0.322
AlaThr: 2.835 ± 0.39
AlaVal: 4.148 ± 0.27
AlaTrp: 0.97 ± 0.129
AlaTyr: 2.264 ± 0.204
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.704 ± 0.121
CysCys: 0.152 ± 0.059
CysAsp: 0.799 ± 0.111
CysGlu: 0.799 ± 0.14
CysPhe: 0.381 ± 0.099
CysGly: 0.989 ± 0.135
CysHis: 0.343 ± 0.081
CysIle: 0.609 ± 0.114
CysLys: 0.818 ± 0.144
CysLeu: 0.723 ± 0.114
CysMet: 0.343 ± 0.085
CysAsn: 0.609 ± 0.117
CysPro: 0.609 ± 0.114
CysGln: 0.343 ± 0.083
CysArg: 0.514 ± 0.088
CysSer: 0.875 ± 0.15
CysThr: 0.495 ± 0.091
CysVal: 0.666 ± 0.127
CysTrp: 0.152 ± 0.051
CysTyr: 0.495 ± 0.104
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.692 ± 0.231
AspCys: 0.666 ± 0.122
AspAsp: 4.129 ± 0.286
AspGlu: 4.586 ± 0.413
AspPhe: 3.33 ± 0.265
AspGly: 4.148 ± 0.349
AspHis: 0.837 ± 0.114
AspIle: 4.928 ± 0.294
AspLys: 4.852 ± 0.31
AspLeu: 4.662 ± 0.314
AspMet: 1.865 ± 0.213
AspAsn: 2.664 ± 0.198
AspPro: 2.055 ± 0.233
AspGln: 1.294 ± 0.17
AspArg: 2.112 ± 0.217
AspSer: 3.806 ± 0.256
AspThr: 2.93 ± 0.229
AspVal: 4.32 ± 0.293
AspTrp: 1.199 ± 0.157
AspTyr: 3.33 ± 0.265
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.852 ± 0.339
GluCys: 0.894 ± 0.13
GluAsp: 3.711 ± 0.288
GluGlu: 5.157 ± 0.351
GluPhe: 3.14 ± 0.237
GluGly: 3.692 ± 0.256
GluHis: 1.332 ± 0.167
GluIle: 6.279 ± 0.357
GluLys: 5.29 ± 0.354
GluLeu: 6.984 ± 0.381
GluMet: 2.226 ± 0.232
GluAsn: 4.072 ± 0.313
GluPro: 1.541 ± 0.164
GluGln: 2.36 ± 0.221
GluArg: 2.759 ± 0.228
GluSer: 4.586 ± 0.323
GluThr: 4.377 ± 0.357
GluVal: 4.795 ± 0.323
GluTrp: 1.123 ± 0.164
GluTyr: 3.673 ± 0.289
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.093 ± 0.18
PheCys: 0.495 ± 0.107
PheAsp: 3.292 ± 0.216
PheGlu: 3.634 ± 0.244
PhePhe: 1.503 ± 0.19
PheGly: 2.854 ± 0.22
PheHis: 0.647 ± 0.119
PheIle: 3.444 ± 0.26
PheLys: 4.034 ± 0.281
PheLeu: 2.264 ± 0.195
PheMet: 1.427 ± 0.158
PheAsn: 3.197 ± 0.229
PhePro: 1.104 ± 0.138
PheGln: 1.313 ± 0.179
PheArg: 1.732 ± 0.169
PheSer: 3.463 ± 0.23
PheThr: 2.607 ± 0.207
PheVal: 2.645 ± 0.214
PheTrp: 0.571 ± 0.101
PheTyr: 1.865 ± 0.195
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.854 ± 0.302
GlyCys: 0.609 ± 0.118
GlyAsp: 3.996 ± 0.333
GlyGlu: 3.73 ± 0.24
GlyPhe: 2.607 ± 0.193
GlyGly: 3.596 ± 0.554
GlyHis: 0.799 ± 0.126
GlyIle: 4.453 ± 0.315
GlyLys: 4.548 ± 0.328
GlyLeu: 4.529 ± 0.32
GlyMet: 2.055 ± 0.221
GlyAsn: 3.102 ± 0.331
GlyPro: 1.77 ± 0.208
GlyGln: 2.112 ± 0.247
GlyArg: 2.455 ± 0.241
GlySer: 4.243 ± 0.369
GlyThr: 3.92 ± 0.386
GlyVal: 4.11 ± 0.307
GlyTrp: 0.875 ± 0.122
GlyTyr: 2.873 ± 0.213
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.913 ± 0.127
HisCys: 0.343 ± 0.085
HisAsp: 0.894 ± 0.133
HisGlu: 1.066 ± 0.158
HisPhe: 1.085 ± 0.143
HisGly: 0.875 ± 0.14
HisHis: 0.419 ± 0.094
HisIle: 1.37 ± 0.197
HisLys: 1.446 ± 0.182
HisLeu: 1.37 ± 0.14
HisMet: 0.381 ± 0.083
HisAsn: 0.704 ± 0.113
HisPro: 0.875 ± 0.143
HisGln: 0.571 ± 0.09
HisArg: 0.837 ± 0.155
HisSer: 1.256 ± 0.164
HisThr: 0.932 ± 0.139
HisVal: 0.913 ± 0.129
HisTrp: 0.323 ± 0.085
HisTyr: 0.647 ± 0.087
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.271 ± 0.374
IleCys: 0.761 ± 0.133
IleAsp: 5.214 ± 0.338
IleGlu: 5.499 ± 0.387
IlePhe: 2.683 ± 0.217
IleGly: 3.634 ± 0.237
IleHis: 1.351 ± 0.134
IleIle: 5.823 ± 0.333
IleLys: 7.478 ± 0.395
IleLeu: 4.776 ± 0.313
IleMet: 1.789 ± 0.168
IleAsn: 5.442 ± 0.262
IlePro: 2.873 ± 0.264
IleGln: 2.626 ± 0.208
IleArg: 3.615 ± 0.259
IleSer: 5.005 ± 0.345
IleThr: 4.453 ± 0.296
IleVal: 4.243 ± 0.29
IleTrp: 0.59 ± 0.108
IleTyr: 2.949 ± 0.261
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.842 ± 0.338
LysCys: 1.009 ± 0.159
LysAsp: 5.29 ± 0.366
LysGlu: 6.051 ± 0.421
LysPhe: 4.301 ± 0.289
LysGly: 3.939 ± 0.24
LysHis: 1.846 ± 0.211
LysIle: 6.318 ± 0.391
LysLys: 5.652 ± 0.469
LysLeu: 6.413 ± 0.354
LysMet: 2.664 ± 0.223
LysAsn: 4.719 ± 0.329
LysPro: 2.436 ± 0.213
LysGln: 2.55 ± 0.237
LysArg: 3.711 ± 0.26
LysSer: 5.328 ± 0.371
LysThr: 4.358 ± 0.255
LysVal: 5.119 ± 0.375
LysTrp: 1.18 ± 0.129
LysTyr: 3.577 ± 0.286
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.795 ± 0.288
LeuCys: 0.875 ± 0.131
LeuAsp: 4.776 ± 0.319
LeuGlu: 5.575 ± 0.364
LeuPhe: 3.349 ± 0.245
LeuGly: 4.034 ± 0.248
LeuHis: 1.085 ± 0.166
LeuIle: 5.423 ± 0.396
LeuLys: 6.413 ± 0.412
LeuLeu: 4.795 ± 0.296
LeuMet: 2.169 ± 0.222
LeuAsn: 4.738 ± 0.257
LeuPro: 2.816 ± 0.225
LeuGln: 2.379 ± 0.228
LeuArg: 3.159 ± 0.22
LeuSer: 5.29 ± 0.316
LeuThr: 4.129 ± 0.305
LeuVal: 4.548 ± 0.311
LeuTrp: 0.894 ± 0.136
LeuTyr: 2.778 ± 0.233
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.074 ± 0.194
MetCys: 0.304 ± 0.064
MetAsp: 1.332 ± 0.16
MetGlu: 1.656 ± 0.181
MetPhe: 1.332 ± 0.13
MetGly: 1.484 ± 0.176
MetHis: 0.323 ± 0.086
MetIle: 1.96 ± 0.208
MetLys: 3.33 ± 0.269
MetLeu: 1.979 ± 0.19
MetMet: 0.932 ± 0.135
MetAsn: 1.732 ± 0.203
MetPro: 0.78 ± 0.115
MetGln: 0.856 ± 0.114
MetArg: 1.085 ± 0.138
MetSer: 1.865 ± 0.183
MetThr: 1.656 ± 0.158
MetVal: 1.332 ± 0.151
MetTrp: 0.209 ± 0.065
MetTyr: 1.161 ± 0.159
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.673 ± 0.285
AsnCys: 0.571 ± 0.114
AsnAsp: 3.482 ± 0.247
AsnGlu: 3.996 ± 0.286
AsnPhe: 2.55 ± 0.169
AsnGly: 3.977 ± 0.334
AsnHis: 0.818 ± 0.131
AsnIle: 4.605 ± 0.316
AsnLys: 4.814 ± 0.334
AsnLeu: 4.301 ± 0.266
AsnMet: 1.503 ± 0.188
AsnAsn: 3.463 ± 0.276
AsnPro: 2.531 ± 0.219
AsnGln: 1.732 ± 0.176
AsnArg: 2.645 ± 0.23
AsnSer: 4.11 ± 0.29
AsnThr: 2.797 ± 0.248
AsnVal: 3.064 ± 0.27
AsnTrp: 0.818 ± 0.113
AsnTyr: 2.417 ± 0.181
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.96 ± 0.208
ProCys: 0.495 ± 0.087
ProAsp: 2.436 ± 0.231
ProGlu: 3.197 ± 0.277
ProPhe: 1.579 ± 0.159
ProGly: 2.493 ± 0.244
ProHis: 0.628 ± 0.109
ProIle: 2.131 ± 0.229
ProLys: 2.302 ± 0.232
ProLeu: 2.264 ± 0.193
ProMet: 0.799 ± 0.133
ProAsn: 1.846 ± 0.195
ProPro: 0.989 ± 0.158
ProGln: 1.028 ± 0.136
ProArg: 1.161 ± 0.163
ProSer: 2.188 ± 0.201
ProThr: 2.207 ± 0.213
ProVal: 2.283 ± 0.214
ProTrp: 0.685 ± 0.109
ProTyr: 1.484 ± 0.17
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.283 ± 0.251
GlnCys: 0.285 ± 0.066
GlnAsp: 1.408 ± 0.177
GlnGlu: 2.36 ± 0.255
GlnPhe: 1.636 ± 0.2
GlnGly: 1.694 ± 0.163
GlnHis: 0.419 ± 0.1
GlnIle: 2.512 ± 0.228
GlnLys: 2.36 ± 0.243
GlnLeu: 2.911 ± 0.259
GlnMet: 0.913 ± 0.146
GlnAsn: 1.56 ± 0.192
GlnPro: 1.199 ± 0.135
GlnGln: 0.951 ± 0.188
GlnArg: 1.751 ± 0.166
GlnSer: 2.055 ± 0.199
GlnThr: 1.884 ± 0.184
GlnVal: 1.998 ± 0.196
GlnTrp: 0.571 ± 0.104
GlnTyr: 1.656 ± 0.162
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.645 ± 0.238
ArgCys: 0.666 ± 0.107
ArgAsp: 2.55 ± 0.262
ArgGlu: 3.539 ± 0.277
ArgPhe: 1.751 ± 0.15
ArgGly: 2.74 ± 0.201
ArgHis: 0.837 ± 0.123
ArgIle: 3.235 ± 0.238
ArgLys: 3.482 ± 0.283
ArgLeu: 3.482 ± 0.287
ArgMet: 0.97 ± 0.139
ArgAsn: 2.226 ± 0.202
ArgPro: 1.275 ± 0.142
ArgGln: 1.541 ± 0.156
ArgArg: 1.941 ± 0.233
ArgSer: 2.759 ± 0.252
ArgThr: 2.188 ± 0.187
ArgVal: 2.474 ± 0.241
ArgTrp: 0.799 ± 0.131
ArgTyr: 1.675 ± 0.199
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.482 ± 0.265
SerCys: 0.647 ± 0.116
SerAsp: 3.958 ± 0.283
SerGlu: 4.243 ± 0.274
SerPhe: 2.816 ± 0.229
SerGly: 4.377 ± 0.364
SerHis: 1.161 ± 0.15
SerIle: 5.176 ± 0.288
SerLys: 6.108 ± 0.331
SerLeu: 5.081 ± 0.328
SerMet: 1.636 ± 0.177
SerAsn: 3.711 ± 0.296
SerPro: 2.531 ± 0.197
SerGln: 2.093 ± 0.208
SerArg: 2.968 ± 0.233
SerSer: 5.157 ± 0.349
SerThr: 4.415 ± 0.43
SerVal: 4.301 ± 0.294
SerTrp: 0.742 ± 0.124
SerTyr: 2.873 ± 0.252
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.863 ± 0.392
ThrCys: 0.495 ± 0.11
ThrAsp: 3.33 ± 0.263
ThrGlu: 3.806 ± 0.329
ThrPhe: 2.379 ± 0.206
ThrGly: 4.129 ± 0.285
ThrHis: 1.028 ± 0.138
ThrIle: 4.148 ± 0.292
ThrLys: 3.863 ± 0.225
ThrLeu: 3.825 ± 0.297
ThrMet: 1.028 ± 0.146
ThrAsn: 2.988 ± 0.23
ThrPro: 2.721 ± 0.27
ThrGln: 1.713 ± 0.263
ThrArg: 2.531 ± 0.285
ThrSer: 3.216 ± 0.308
ThrThr: 2.911 ± 0.273
ThrVal: 4.567 ± 0.344
ThrTrp: 0.837 ± 0.114
ThrTyr: 2.474 ± 0.202
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.216 ± 0.272
ValCys: 1.028 ± 0.133
ValAsp: 3.692 ± 0.28
ValGlu: 5.328 ± 0.338
ValPhe: 2.683 ± 0.219
ValGly: 3.711 ± 0.361
ValHis: 1.047 ± 0.121
ValIle: 4.415 ± 0.345
ValLys: 5.633 ± 0.316
ValLeu: 4.453 ± 0.224
ValMet: 1.865 ± 0.186
ValAsn: 4.205 ± 0.28
ValPro: 2.055 ± 0.244
ValGln: 1.979 ± 0.231
ValArg: 2.493 ± 0.231
ValSer: 4.32 ± 0.276
ValThr: 3.73 ± 0.308
ValVal: 4.32 ± 0.307
ValTrp: 0.761 ± 0.128
ValTyr: 2.55 ± 0.228
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.704 ± 0.113
TrpCys: 0.133 ± 0.048
TrpAsp: 0.856 ± 0.119
TrpGlu: 0.875 ± 0.146
TrpPhe: 0.951 ± 0.138
TrpGly: 0.59 ± 0.127
TrpHis: 0.152 ± 0.058
TrpIle: 0.989 ± 0.146
TrpLys: 1.446 ± 0.168
TrpLeu: 0.97 ± 0.121
TrpMet: 0.495 ± 0.1
TrpAsn: 1.085 ± 0.145
TrpPro: 0.381 ± 0.093
TrpGln: 0.647 ± 0.095
TrpArg: 0.533 ± 0.106
TrpSer: 0.894 ± 0.141
TrpThr: 0.685 ± 0.105
TrpVal: 0.818 ± 0.129
TrpTrp: 0.171 ± 0.058
TrpTyr: 0.704 ± 0.1
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.949 ± 0.239
TyrCys: 0.495 ± 0.111
TyrAsp: 2.988 ± 0.234
TyrGlu: 2.683 ± 0.245
TyrPhe: 1.922 ± 0.19
TyrGly: 2.512 ± 0.235
TyrHis: 0.818 ± 0.113
TyrIle: 3.425 ± 0.272
TyrLys: 3.368 ± 0.264
TyrLeu: 2.721 ± 0.23
TyrMet: 1.085 ± 0.162
TyrAsn: 2.531 ± 0.25
TyrPro: 1.56 ± 0.169
TyrGln: 1.751 ± 0.197
TyrArg: 1.979 ± 0.209
TyrSer: 2.892 ± 0.217
TyrThr: 2.341 ± 0.205
TyrVal: 2.816 ± 0.233
TyrTrp: 0.609 ± 0.093
TyrTyr: 1.808 ± 0.193
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 268 proteins (52553 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
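
For readers unfamiliar with protein-level bootstrapping, the Python sketch below shows one plausible reading of the note: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. Using the sample standard deviation as the error measure, the fixed seed, and the function names are assumptions; the page only states that bootstrapping (x100) was done at the protein level.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille, counted within proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over resampled proteomes (assumed: std. dev.)."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real phage T6 sequences.
toy_proteins = ["MKKLA", "AKK", "GAVLK", "KKKK"]
error = bootstrap_error(toy_proteins, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins instead of treating neighbouring residues as independent.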

The dipeptide statistics above (among other statistics for this proteome) are available for download as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021. doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski