Amino acid dipeptide frequency for Staphylococcus phage vB_SauH_DELF3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
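A minimal sketch of how such dipeptide frequencies could be computed from a set of protein sequences. The function names, the one-letter input format, and the choice to normalise by the total number of dipeptides counted are assumptions made for illustration; the page does not state which denominator the database uses.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(proteins):
    """Count overlapping dipeptides within each protein, never across proteins."""
    counts = Counter()
    for seq in proteins:
        # Map any non-standard or unknown residue to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille of all counted dipeptides (assumed normalisation)."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the phage proteome.
toy_proteins = ["MKKLA", "MKAU"]
for pair, value in sorted(dipeptide_per_mille(toy_proteins).items()):
    print(f"{pair}: {value:.3f}")
```

Counting pairs per protein avoids creating artificial dipeptides at the junction between two unrelated sequences.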

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
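The expectation above can be written out as a short sketch. It assumes that the red/blue marking simply compares each value with the uniform expectation and uses no error margin; the function names and labels are hypothetical and not taken from the database.

```python
def expected_per_mille(include_xaa=False):
    """Uniform expectation for one dipeptide, in per mille."""
    n_residues = 21 if include_xaa else 20
    return 1000.0 / (n_residues ** 2)


def classify(observed_per_mille, expected=None):
    """Label a value as over- or underrepresented relative to the expectation."""
    if expected is None:
        expected = expected_per_mille()
    if observed_per_mille > expected:
        return "over"   # would be marked red under the stated assumption
    if observed_per_mille < expected:
        return "under"  # would be marked blue under the stated assumption
    return "as expected"


print(expected_per_mille())                  # 2.5 (i.e. 0.25%)
print(expected_per_mille(include_xaa=True))  # 1000 / 441, roughly 2.27
print(classify(7.738))                       # LysGlu value from the table
print(classify(0.082))                       # TrpMet value from the table
```

Under this assumption LysGlu (7.738‰) counts as overrepresented and TrpMet (0.082‰) as underrepresented.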

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
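As a small worked example of this unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 0.796  # AlaAla frequency in per mille, from the table
print(f"{per_mille_to_fraction(ala_ala):.6f}")  # fraction of all dipeptides
print(f"{per_mille_to_percent(ala_ala):.4f}")   # the same value in percent
```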

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.796 ± 0.16
AlaCys: 0.521 ± 0.11
AlaAsp: 2.744 ± 0.303
AlaGlu: 3.32 ± 0.308
AlaPhe: 1.619 ± 0.211
AlaGly: 2.113 ± 0.293
AlaHis: 0.988 ± 0.155
AlaIle: 3.183 ± 0.272
AlaLys: 3.896 ± 0.4
AlaLeu: 3.759 ± 0.356
AlaMet: 0.933 ± 0.169
AlaAsn: 2.36 ± 0.299
AlaPro: 1.674 ± 0.255
AlaGln: 1.838 ± 0.201
AlaArg: 2.085 ± 0.232
AlaSer: 3.348 ± 0.336
AlaThr: 3.32 ± 0.323
AlaVal: 3.21 ± 0.317
AlaTrp: 0.302 ± 0.11
AlaTyr: 2.003 ± 0.231
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.384 ± 0.109
CysCys: 0.247 ± 0.08
CysAsp: 0.686 ± 0.169
CysGlu: 0.741 ± 0.157
CysPhe: 0.439 ± 0.114
CysGly: 0.631 ± 0.121
CysHis: 0.357 ± 0.088
CysIle: 0.741 ± 0.141
CysLys: 1.125 ± 0.239
CysLeu: 0.933 ± 0.189
CysMet: 0.302 ± 0.09
CysAsn: 0.933 ± 0.155
CysPro: 0.412 ± 0.114
CysGln: 0.274 ± 0.078
CysArg: 0.768 ± 0.15
CysSer: 0.933 ± 0.173
CysThr: 0.823 ± 0.164
CysVal: 0.741 ± 0.144
CysTrp: 0.137 ± 0.06
CysTyr: 0.851 ± 0.145
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.662 ± 0.279
AspCys: 0.686 ± 0.14
AspAsp: 4.116 ± 0.331
AspGlu: 4.061 ± 0.329
AspPhe: 2.799 ± 0.256
AspGly: 3.759 ± 0.338
AspHis: 1.015 ± 0.149
AspIle: 5.351 ± 0.391
AspLys: 6.009 ± 0.446
AspLeu: 5.625 ± 0.43
AspMet: 1.729 ± 0.191
AspAsn: 4.829 ± 0.308
AspPro: 2.195 ± 0.244
AspGln: 1.838 ± 0.288
AspArg: 3.43 ± 0.308
AspSer: 4.857 ± 0.411
AspThr: 4.253 ± 0.462
AspVal: 4.582 ± 0.365
AspTrp: 0.659 ± 0.133
AspTyr: 3.21 ± 0.292
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.991 ± 0.367
GluCys: 0.823 ± 0.142
GluAsp: 5.46 ± 0.396
GluGlu: 4.116 ± 0.444
GluPhe: 2.223 ± 0.244
GluGly: 3.183 ± 0.274
GluHis: 1.564 ± 0.231
GluIle: 4.006 ± 0.393
GluLys: 5.323 ± 0.523
GluLeu: 6.174 ± 0.428
GluMet: 1.564 ± 0.193
GluAsn: 2.936 ± 0.276
GluPro: 2.36 ± 0.429
GluGln: 2.634 ± 0.262
GluArg: 3.046 ± 0.26
GluSer: 4.527 ± 0.34
GluThr: 3.348 ± 0.318
GluVal: 5.186 ± 0.38
GluTrp: 0.549 ± 0.104
GluTyr: 3.046 ± 0.29
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.372 ± 0.159
PheCys: 0.851 ± 0.157
PheAsp: 2.497 ± 0.212
PheGlu: 2.277 ± 0.249
PhePhe: 2.552 ± 0.898
PheGly: 1.921 ± 0.189
PheHis: 0.549 ± 0.117
PheIle: 3.238 ± 0.353
PheLys: 3.018 ± 0.308
PheLeu: 2.497 ± 0.288
PheMet: 0.686 ± 0.141
PheAsn: 2.689 ± 0.318
PhePro: 1.152 ± 0.175
PheGln: 0.768 ± 0.154
PheArg: 1.345 ± 0.187
PheSer: 2.607 ± 0.268
PheThr: 2.195 ± 0.223
PheVal: 2.25 ± 0.27
PheTrp: 0.274 ± 0.082
PheTyr: 2.168 ± 0.224
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.223 ± 0.29
GlyCys: 0.713 ± 0.137
GlyAsp: 3.567 ± 0.373
GlyGlu: 3.101 ± 0.294
GlyPhe: 1.893 ± 0.189
GlyGly: 3.128 ± 0.502
GlyHis: 1.427 ± 0.206
GlyIle: 4.281 ± 0.285
GlyLys: 5.213 ± 0.477
GlyLeu: 4.143 ± 0.338
GlyMet: 1.235 ± 0.203
GlyAsn: 3.265 ± 0.272
GlyPro: 1.043 ± 0.211
GlyGln: 1.838 ± 0.292
GlyArg: 2.442 ± 0.23
GlySer: 4.116 ± 0.406
GlyThr: 3.595 ± 0.361
GlyVal: 3.842 ± 0.348
GlyTrp: 0.686 ± 0.144
GlyTyr: 3.156 ± 0.329
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.015 ± 0.133
HisCys: 0.329 ± 0.084
HisAsp: 1.509 ± 0.216
HisGlu: 1.372 ± 0.17
HisPhe: 0.768 ± 0.141
HisGly: 1.18 ± 0.231
HisHis: 0.302 ± 0.102
HisIle: 1.646 ± 0.218
HisLys: 1.893 ± 0.222
HisLeu: 1.701 ± 0.25
HisMet: 0.384 ± 0.117
HisAsn: 1.427 ± 0.203
HisPro: 0.576 ± 0.132
HisGln: 0.576 ± 0.115
HisArg: 1.07 ± 0.149
HisSer: 1.235 ± 0.198
HisThr: 1.098 ± 0.169
HisVal: 1.591 ± 0.247
HisTrp: 0.082 ± 0.051
HisTyr: 1.098 ± 0.185
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.018 ± 0.311
IleCys: 0.631 ± 0.121
IleAsp: 5.131 ± 0.411
IleGlu: 4.967 ± 0.41
IlePhe: 2.14 ± 0.273
IleGly: 3.622 ± 0.297
IleHis: 1.125 ± 0.2
IleIle: 4.226 ± 0.378
IleLys: 6.311 ± 0.41
IleLeu: 5.186 ± 0.4
IleMet: 1.317 ± 0.182
IleAsn: 4.802 ± 0.377
IlePro: 2.826 ± 0.279
IleGln: 2.085 ± 0.233
IleArg: 3.348 ± 0.284
IleSer: 4.802 ± 0.366
IleThr: 5.406 ± 0.408
IleVal: 3.951 ± 0.332
IleTrp: 0.384 ± 0.105
IleTyr: 2.771 ± 0.288
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.979 ± 0.312
LysCys: 1.07 ± 0.223
LysAsp: 6.421 ± 0.461
LysGlu: 7.738 ± 0.614
LysPhe: 2.744 ± 0.269
LysGly: 5.735 ± 0.502
LysHis: 1.948 ± 0.267
LysIle: 4.473 ± 0.336
LysLys: 7.052 ± 0.458
LysLeu: 5.899 ± 0.537
LysMet: 1.838 ± 0.258
LysAsn: 5.817 ± 0.479
LysPro: 3.21 ± 0.467
LysGln: 3.183 ± 0.326
LysArg: 4.281 ± 0.363
LysSer: 5.213 ± 0.363
LysThr: 4.665 ± 0.411
LysVal: 5.543 ± 0.397
LysTrp: 0.713 ± 0.15
LysTyr: 4.418 ± 0.337
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.39 ± 0.361
LeuCys: 0.823 ± 0.141
LeuAsp: 5.845 ± 0.387
LeuGlu: 5.488 ± 0.379
LeuPhe: 2.963 ± 0.348
LeuGly: 4.308 ± 0.439
LeuHis: 1.537 ± 0.186
LeuIle: 5.186 ± 0.39
LeuLys: 6.997 ± 0.527
LeuLeu: 6.997 ± 0.478
LeuMet: 1.976 ± 0.274
LeuAsn: 5.515 ± 0.407
LeuPro: 3.485 ± 0.264
LeuGln: 3.21 ± 0.287
LeuArg: 3.814 ± 0.385
LeuSer: 6.695 ± 0.437
LeuThr: 4.939 ± 0.349
LeuVal: 4.912 ± 0.319
LeuTrp: 0.823 ± 0.122
LeuTyr: 3.787 ± 0.345
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.015 ± 0.163
MetCys: 0.192 ± 0.091
MetAsp: 1.29 ± 0.191
MetGlu: 1.207 ± 0.186
MetPhe: 0.631 ± 0.135
MetGly: 1.07 ± 0.2
MetHis: 0.466 ± 0.12
MetIle: 1.509 ± 0.209
MetLys: 1.921 ± 0.216
MetLeu: 1.674 ± 0.206
MetMet: 0.576 ± 0.14
MetAsn: 0.823 ± 0.148
MetPro: 1.015 ± 0.182
MetGln: 0.851 ± 0.153
MetArg: 1.152 ± 0.173
MetSer: 2.332 ± 0.227
MetThr: 1.18 ± 0.179
MetVal: 1.152 ± 0.189
MetTrp: 0.247 ± 0.074
MetTyr: 1.207 ± 0.18
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.387 ± 0.237
AsnCys: 0.631 ± 0.149
AsnAsp: 3.732 ± 0.373
AsnGlu: 3.842 ± 0.332
AsnPhe: 1.948 ± 0.289
AsnGly: 3.595 ± 0.275
AsnHis: 1.235 ± 0.183
AsnIle: 4.582 ± 0.37
AsnLys: 6.585 ± 0.514
AsnLeu: 5.872 ± 0.467
AsnMet: 1.482 ± 0.175
AsnAsn: 5.323 ± 0.429
AsnPro: 2.579 ± 0.308
AsnGln: 1.893 ± 0.269
AsnArg: 3.32 ± 0.245
AsnSer: 3.951 ± 0.321
AsnThr: 5.406 ± 0.465
AsnVal: 3.677 ± 0.302
AsnTrp: 0.604 ± 0.113
AsnTyr: 2.771 ± 0.305
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.152 ± 0.176
ProCys: 0.357 ± 0.127
ProAsp: 1.893 ± 0.264
ProGlu: 2.305 ± 0.336
ProPhe: 1.345 ± 0.178
ProGly: 1.646 ± 0.242
ProHis: 0.796 ± 0.172
ProIle: 3.156 ± 0.574
ProLys: 3.156 ± 0.301
ProLeu: 3.512 ± 0.348
ProMet: 0.741 ± 0.13
ProAsn: 2.47 ± 0.3
ProPro: 1.125 ± 0.23
ProGln: 1.372 ± 0.224
ProArg: 1.564 ± 0.209
ProSer: 2.716 ± 0.311
ProThr: 2.36 ± 0.278
ProVal: 2.085 ± 0.264
ProTrp: 0.082 ± 0.051
ProTyr: 1.811 ± 0.221
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.756 ± 0.271
GlnCys: 0.384 ± 0.105
GlnAsp: 2.854 ± 0.299
GlnGlu: 2.607 ± 0.292
GlnPhe: 0.878 ± 0.142
GlnGly: 1.976 ± 0.252
GlnHis: 0.659 ± 0.134
GlnIle: 1.838 ± 0.215
GlnLys: 2.415 ± 0.242
GlnLeu: 3.457 ± 0.306
GlnMet: 0.659 ± 0.129
GlnAsn: 1.866 ± 0.22
GlnPro: 1.098 ± 0.178
GlnGln: 1.619 ± 0.245
GlnArg: 1.454 ± 0.23
GlnSer: 2.47 ± 0.298
GlnThr: 1.619 ± 0.205
GlnVal: 2.113 ± 0.275
GlnTrp: 0.247 ± 0.078
GlnTyr: 0.96 ± 0.166
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.223 ± 0.252
ArgCys: 0.549 ± 0.129
ArgAsp: 3.485 ± 0.281
ArgGlu: 2.662 ± 0.23
ArgPhe: 2.744 ± 0.317
ArgGly: 2.662 ± 0.302
ArgHis: 0.823 ± 0.146
ArgIle: 3.046 ± 0.344
ArgLys: 4.143 ± 0.359
ArgLeu: 4.335 ± 0.362
ArgMet: 0.933 ± 0.168
ArgAsn: 2.909 ± 0.241
ArgPro: 1.729 ± 0.194
ArgGln: 1.399 ± 0.19
ArgArg: 2.387 ± 0.304
ArgSer: 2.662 ± 0.292
ArgThr: 3.128 ± 0.296
ArgVal: 3.265 ± 0.283
ArgTrp: 0.302 ± 0.087
ArgTyr: 1.811 ± 0.226
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.375 ± 0.318
SerCys: 1.043 ± 0.176
SerAsp: 4.665 ± 0.43
SerGlu: 3.896 ± 0.351
SerPhe: 2.936 ± 0.244
SerGly: 3.787 ± 0.411
SerHis: 1.454 ± 0.204
SerIle: 4.967 ± 0.45
SerLys: 5.954 ± 0.409
SerLeu: 5.982 ± 0.37
SerMet: 1.646 ± 0.203
SerAsn: 5.296 ± 0.383
SerPro: 2.387 ± 0.28
SerGln: 1.729 ± 0.25
SerArg: 3.21 ± 0.288
SerSer: 5.378 ± 0.462
SerThr: 4.665 ± 0.418
SerVal: 4.61 ± 0.35
SerTrp: 0.686 ± 0.118
SerTyr: 3.54 ± 0.305
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.43 ± 0.279
ThrCys: 0.741 ± 0.15
ThrAsp: 3.896 ± 0.364
ThrGlu: 4.39 ± 0.366
ThrPhe: 2.305 ± 0.251
ThrGly: 3.814 ± 0.292
ThrHis: 1.811 ± 0.279
ThrIle: 4.171 ± 0.412
ThrLys: 4.884 ± 0.434
ThrLeu: 5.598 ± 0.37
ThrMet: 1.152 ± 0.162
ThrAsn: 3.951 ± 0.346
ThrPro: 2.47 ± 0.338
ThrGln: 2.277 ± 0.251
ThrArg: 3.128 ± 0.309
ThrSer: 4.198 ± 0.346
ThrThr: 4.363 ± 0.406
ThrVal: 4.582 ± 0.425
ThrTrp: 0.521 ± 0.119
ThrTyr: 3.046 ± 0.341
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.622 ± 0.34
ValCys: 0.933 ± 0.166
ValAsp: 4.281 ± 0.336
ValGlu: 4.006 ± 0.383
ValPhe: 2.113 ± 0.234
ValGly: 3.704 ± 0.33
ValHis: 1.345 ± 0.202
ValIle: 4.088 ± 0.328
ValLys: 5.104 ± 0.413
ValLeu: 5.707 ± 0.501
ValMet: 1.29 ± 0.23
ValAsn: 3.704 ± 0.309
ValPro: 2.387 ± 0.266
ValGln: 2.031 ± 0.257
ValArg: 2.936 ± 0.274
ValSer: 5.268 ± 0.377
ValThr: 4.473 ± 0.309
ValVal: 4.39 ± 0.405
ValTrp: 0.604 ± 0.12
ValTyr: 3.238 ± 0.264
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.466 ± 0.107
TrpCys: 0.11 ± 0.063
TrpAsp: 0.549 ± 0.114
TrpGlu: 0.521 ± 0.116
TrpPhe: 0.192 ± 0.068
TrpGly: 0.439 ± 0.108
TrpHis: 0.22 ± 0.088
TrpIle: 0.713 ± 0.162
TrpLys: 0.851 ± 0.153
TrpLeu: 0.851 ± 0.158
TrpMet: 0.082 ± 0.042
TrpAsn: 0.549 ± 0.118
TrpPro: 0.165 ± 0.067
TrpGln: 0.247 ± 0.085
TrpArg: 0.22 ± 0.065
TrpSer: 0.494 ± 0.107
TrpThr: 0.384 ± 0.102
TrpVal: 0.631 ± 0.107
TrpTrp: 0.22 ± 0.081
TrpTyr: 0.713 ± 0.154
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.756 ± 0.2
TyrCys: 0.823 ± 0.163
TyrAsp: 3.046 ± 0.275
TyrGlu: 2.442 ± 0.276
TyrPhe: 1.674 ± 0.238
TyrGly: 2.634 ± 0.287
TyrHis: 1.235 ± 0.178
TyrIle: 3.649 ± 0.299
TyrLys: 4.088 ± 0.349
TyrLeu: 3.787 ± 0.322
TyrMet: 0.96 ± 0.167
TyrAsn: 3.951 ± 0.37
TyrPro: 1.756 ± 0.234
TyrGln: 1.29 ± 0.164
TyrArg: 2.195 ± 0.194
TyrSer: 3.402 ± 0.321
TyrThr: 3.512 ± 0.453
TyrVal: 2.909 ± 0.29
TyrTrp: 0.494 ± 0.117
TyrTyr: 2.387 ± 0.294
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 177 proteins (36445 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
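A minimal sketch of what protein-level bootstrapping could look like: whole proteins are resampled with replacement 100 times, the frequency of one dipeptide is recomputed for each resample, and the spread of those estimates is reported. Using the standard deviation as the error, the function names, and the normalisation by the number of dipeptides counted are all assumptions; the page does not specify these details.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille of all dipeptides counted."""
    count = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples (assumed error measure)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not taken from the phage proteome.
toy_proteins = ["MKKLA", "MKAW", "AAKK", "MLLK"]
print(f"KK: {pair_per_mille(toy_proteins, 'KK'):.3f} ± {bootstrap_error(toy_proteins, 'KK'):.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a single protein contribute to the estimated error.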

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski