Amino acid dipepetide frequency for Staphylococcus virus SA11

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.051AlaAla: 0.051 ± 0.031
0.434AlaCys: 0.434 ± 0.1
2.398AlaAsp: 2.398 ± 0.332
2.705AlaGlu: 2.705 ± 0.295
1.582AlaPhe: 1.582 ± 0.188
1.633AlaGly: 1.633 ± 0.279
1.021AlaHis: 1.021 ± 0.152
3.419AlaIle: 3.419 ± 0.335
4.491AlaLys: 4.491 ± 0.346
3.291AlaLeu: 3.291 ± 0.333
0.97AlaMet: 0.97 ± 0.167
2.424AlaAsn: 2.424 ± 0.359
1.25AlaPro: 1.25 ± 0.193
1.48AlaGln: 1.48 ± 0.228
1.378AlaArg: 1.378 ± 0.178
3.393AlaSer: 3.393 ± 0.377
2.985AlaThr: 2.985 ± 0.273
2.577AlaVal: 2.577 ± 0.265
0.383AlaTrp: 0.383 ± 0.099
2.118AlaTyr: 2.118 ± 0.213
0.0AlaXaa: 0.0 ± 0.0
Cys
0.255CysAla: 0.255 ± 0.083
0.077CysCys: 0.077 ± 0.04
0.332CysAsp: 0.332 ± 0.093
0.434CysGlu: 0.434 ± 0.117
0.332CysPhe: 0.332 ± 0.093
0.408CysGly: 0.408 ± 0.122
0.102CysHis: 0.102 ± 0.056
0.485CysIle: 0.485 ± 0.108
0.791CysLys: 0.791 ± 0.161
0.74CysLeu: 0.74 ± 0.154
0.128CysMet: 0.128 ± 0.061
0.332CysAsn: 0.332 ± 0.092
0.255CysPro: 0.255 ± 0.094
0.153CysGln: 0.153 ± 0.066
0.332CysArg: 0.332 ± 0.082
0.434CysSer: 0.434 ± 0.11
0.23CysThr: 0.23 ± 0.074
0.357CysVal: 0.357 ± 0.091
0.128CysTrp: 0.128 ± 0.051
0.51CysTyr: 0.51 ± 0.13
0.0CysXaa: 0.0 ± 0.0
Asp
2.475AspAla: 2.475 ± 0.306
0.357AspCys: 0.357 ± 0.107
4.133AspAsp: 4.133 ± 0.342
4.312AspGlu: 4.312 ± 0.321
3.062AspPhe: 3.062 ± 0.266
3.725AspGly: 3.725 ± 0.341
0.51AspHis: 0.51 ± 0.124
6.94AspIle: 6.94 ± 0.465
7.323AspLys: 7.323 ± 0.471
6.43AspLeu: 6.43 ± 0.489
1.888AspMet: 1.888 ± 0.222
5.486AspAsn: 5.486 ± 0.297
1.327AspPro: 1.327 ± 0.212
0.791AspGln: 0.791 ± 0.177
2.526AspArg: 2.526 ± 0.261
3.853AspSer: 3.853 ± 0.352
4.567AspThr: 4.567 ± 0.331
4.644AspVal: 4.644 ± 0.315
0.765AspTrp: 0.765 ± 0.141
4.159AspTyr: 4.159 ± 0.338
0.0AspXaa: 0.0 ± 0.0
Glu
3.317GluAla: 3.317 ± 0.356
0.281GluCys: 0.281 ± 0.086
5.511GluAsp: 5.511 ± 0.348
6.583GluGlu: 6.583 ± 0.698
2.756GluPhe: 2.756 ± 0.277
3.521GluGly: 3.521 ± 0.272
1.505GluHis: 1.505 ± 0.186
4.389GluIle: 4.389 ± 0.373
6.685GluLys: 6.685 ± 0.594
8.012GluLeu: 8.012 ± 0.577
1.99GluMet: 1.99 ± 0.231
4.363GluAsn: 4.363 ± 0.372
1.607GluPro: 1.607 ± 0.367
3.827GluGln: 3.827 ± 0.339
2.679GluArg: 2.679 ± 0.264
4.338GluSer: 4.338 ± 0.331
3.393GluThr: 3.393 ± 0.296
6.251GluVal: 6.251 ± 0.446
0.561GluTrp: 0.561 ± 0.115
4.031GluTyr: 4.031 ± 0.426
0.0GluXaa: 0.0 ± 0.0
Phe
1.174PheAla: 1.174 ± 0.168
0.332PheCys: 0.332 ± 0.092
2.271PheAsp: 2.271 ± 0.198
2.807PheGlu: 2.807 ± 0.246
1.199PhePhe: 1.199 ± 0.155
2.143PheGly: 2.143 ± 0.259
0.459PheHis: 0.459 ± 0.1
3.598PheIle: 3.598 ± 0.369
3.904PheLys: 3.904 ± 0.275
2.934PheLeu: 2.934 ± 0.292
0.944PheMet: 0.944 ± 0.142
3.419PheAsn: 3.419 ± 0.296
0.97PhePro: 0.97 ± 0.143
0.842PheGln: 0.842 ± 0.109
0.919PheArg: 0.919 ± 0.171
2.781PheSer: 2.781 ± 0.288
2.373PheThr: 2.373 ± 0.279
1.965PheVal: 1.965 ± 0.228
0.204PheTrp: 0.204 ± 0.07
2.118PheTyr: 2.118 ± 0.243
0.0PheXaa: 0.0 ± 0.0
Gly
2.347GlyAla: 2.347 ± 0.419
0.383GlyCys: 0.383 ± 0.095
3.572GlyAsp: 3.572 ± 0.36
3.368GlyGlu: 3.368 ± 0.284
2.067GlyPhe: 2.067 ± 0.238
3.802GlyGly: 3.802 ± 0.534
0.893GlyHis: 0.893 ± 0.145
4.286GlyIle: 4.286 ± 0.324
5.333GlyLys: 5.333 ± 0.494
3.904GlyLeu: 3.904 ± 0.353
1.148GlyMet: 1.148 ± 0.179
3.751GlyAsn: 3.751 ± 0.42
0.0GlyPro: 0.0 ± 0.0
1.684GlyGln: 1.684 ± 0.209
2.067GlyArg: 2.067 ± 0.219
3.853GlySer: 3.853 ± 0.442
3.98GlyThr: 3.98 ± 0.364
3.368GlyVal: 3.368 ± 0.361
0.663GlyTrp: 0.663 ± 0.134
3.725GlyTyr: 3.725 ± 0.302
0.0GlyXaa: 0.0 ± 0.0
His
0.638HisAla: 0.638 ± 0.106
0.204HisCys: 0.204 ± 0.066
0.919HisAsp: 0.919 ± 0.151
0.944HisGlu: 0.944 ± 0.177
0.561HisPhe: 0.561 ± 0.106
1.072HisGly: 1.072 ± 0.169
0.179HisHis: 0.179 ± 0.072
1.454HisIle: 1.454 ± 0.215
1.684HisLys: 1.684 ± 0.249
1.25HisLeu: 1.25 ± 0.19
0.536HisMet: 0.536 ± 0.113
1.021HisAsn: 1.021 ± 0.153
0.408HisPro: 0.408 ± 0.098
0.485HisGln: 0.485 ± 0.097
0.536HisArg: 0.536 ± 0.123
0.944HisSer: 0.944 ± 0.163
0.97HisThr: 0.97 ± 0.146
1.429HisVal: 1.429 ± 0.197
0.102HisTrp: 0.102 ± 0.061
0.868HisTyr: 0.868 ± 0.149
0.0HisXaa: 0.0 ± 0.0
Ile
2.551IleAla: 2.551 ± 0.261
0.561IleCys: 0.561 ± 0.124
6.43IleAsp: 6.43 ± 0.396
5.996IleGlu: 5.996 ± 0.538
2.067IlePhe: 2.067 ± 0.243
3.955IleGly: 3.955 ± 0.312
1.072IleHis: 1.072 ± 0.143
5.511IleIle: 5.511 ± 0.472
7.603IleLys: 7.603 ± 0.514
5.613IleLeu: 5.613 ± 0.328
1.582IleMet: 1.582 ± 0.196
5.69IleAsn: 5.69 ± 0.5
2.398IlePro: 2.398 ± 0.281
2.73IleGln: 2.73 ± 0.24
2.373IleArg: 2.373 ± 0.243
4.72IleSer: 4.72 ± 0.313
5.69IleThr: 5.69 ± 0.407
4.516IleVal: 4.516 ± 0.383
0.434IleTrp: 0.434 ± 0.091
3.189IleTyr: 3.189 ± 0.273
0.0IleXaa: 0.0 ± 0.0
Lys
3.98LysAla: 3.98 ± 0.386
0.689LysCys: 0.689 ± 0.165
7.986LysAsp: 7.986 ± 0.447
10.206LysGlu: 10.206 ± 0.668
3.036LysPhe: 3.036 ± 0.259
6.532LysGly: 6.532 ± 0.574
2.041LysHis: 2.041 ± 0.29
4.363LysIle: 4.363 ± 0.364
8.037LysLys: 8.037 ± 0.684
7.501LysLeu: 7.501 ± 0.507
1.812LysMet: 1.812 ± 0.202
6.021LysAsn: 6.021 ± 0.419
2.858LysPro: 2.858 ± 0.395
3.929LysGln: 3.929 ± 0.425
3.751LysArg: 3.751 ± 0.359
5.715LysSer: 5.715 ± 0.433
5.128LysThr: 5.128 ± 0.374
6.812LysVal: 6.812 ± 0.484
0.663LysTrp: 0.663 ± 0.115
5.843LysTyr: 5.843 ± 0.427
0.0LysXaa: 0.0 ± 0.0
Leu
3.751LeuAla: 3.751 ± 0.318
0.51LeuCys: 0.51 ± 0.111
6.302LeuAsp: 6.302 ± 0.408
6.889LeuGlu: 6.889 ± 0.423
3.113LeuPhe: 3.113 ± 0.282
4.286LeuGly: 4.286 ± 0.395
1.378LeuHis: 1.378 ± 0.198
5.282LeuIle: 5.282 ± 0.404
8.369LeuLys: 8.369 ± 0.522
6.557LeuLeu: 6.557 ± 0.512
2.092LeuMet: 2.092 ± 0.239
5.154LeuAsn: 5.154 ± 0.386
2.781LeuPro: 2.781 ± 0.293
3.24LeuGln: 3.24 ± 0.311
3.444LeuArg: 3.444 ± 0.333
6.302LeuSer: 6.302 ± 0.471
5.537LeuThr: 5.537 ± 0.376
4.567LeuVal: 4.567 ± 0.312
0.587LeuTrp: 0.587 ± 0.111
4.031LeuTyr: 4.031 ± 0.289
0.0LeuXaa: 0.0 ± 0.0
Met
1.225MetAla: 1.225 ± 0.17
0.179MetCys: 0.179 ± 0.072
1.378MetAsp: 1.378 ± 0.204
1.709MetGlu: 1.709 ± 0.221
0.97MetPhe: 0.97 ± 0.162
0.842MetGly: 0.842 ± 0.17
0.332MetHis: 0.332 ± 0.096
1.607MetIle: 1.607 ± 0.193
2.194MetLys: 2.194 ± 0.207
1.556MetLeu: 1.556 ± 0.182
0.485MetMet: 0.485 ± 0.144
1.327MetAsn: 1.327 ± 0.193
0.51MetPro: 0.51 ± 0.124
0.842MetGln: 0.842 ± 0.128
0.97MetArg: 0.97 ± 0.18
2.194MetSer: 2.194 ± 0.221
1.378MetThr: 1.378 ± 0.175
1.378MetVal: 1.378 ± 0.175
0.179MetTrp: 0.179 ± 0.074
1.352MetTyr: 1.352 ± 0.193
0.0MetXaa: 0.0 ± 0.0
Asn
2.577AsnAla: 2.577 ± 0.254
0.434AsnCys: 0.434 ± 0.129
3.904AsnAsp: 3.904 ± 0.314
4.567AsnGlu: 4.567 ± 0.39
2.424AsnPhe: 2.424 ± 0.237
3.776AsnGly: 3.776 ± 0.314
1.225AsnHis: 1.225 ± 0.206
5.919AsnIle: 5.919 ± 0.492
8.165AsnLys: 8.165 ± 0.523
6.073AsnLeu: 6.073 ± 0.473
1.454AsnMet: 1.454 ± 0.204
6.098AsnAsn: 6.098 ± 0.499
2.347AsnPro: 2.347 ± 0.253
2.424AsnGln: 2.424 ± 0.296
2.398AsnArg: 2.398 ± 0.267
3.547AsnSer: 3.547 ± 0.251
5.333AsnThr: 5.333 ± 0.457
4.031AsnVal: 4.031 ± 0.348
0.561AsnTrp: 0.561 ± 0.126
3.24AsnTyr: 3.24 ± 0.351
0.0AsnXaa: 0.0 ± 0.0
Pro
1.021ProAla: 1.021 ± 0.148
0.153ProCys: 0.153 ± 0.066
1.378ProAsp: 1.378 ± 0.202
1.965ProGlu: 1.965 ± 0.314
0.944ProPhe: 0.944 ± 0.154
0.714ProGly: 0.714 ± 0.179
0.408ProHis: 0.408 ± 0.098
2.194ProIle: 2.194 ± 0.352
2.654ProLys: 2.654 ± 0.218
2.22ProLeu: 2.22 ± 0.234
0.587ProMet: 0.587 ± 0.111
1.99ProAsn: 1.99 ± 0.234
0.536ProPro: 0.536 ± 0.142
1.072ProGln: 1.072 ± 0.218
0.893ProArg: 0.893 ± 0.163
1.658ProSer: 1.658 ± 0.251
2.296ProThr: 2.296 ± 0.261
1.276ProVal: 1.276 ± 0.172
0.051ProTrp: 0.051 ± 0.034
1.684ProTyr: 1.684 ± 0.21
0.0ProXaa: 0.0 ± 0.0
Gln
1.99GlnAla: 1.99 ± 0.28
0.23GlnCys: 0.23 ± 0.079
2.475GlnAsp: 2.475 ± 0.277
3.138GlnGlu: 3.138 ± 0.383
1.148GlnPhe: 1.148 ± 0.208
2.398GlnGly: 2.398 ± 0.274
0.459GlnHis: 0.459 ± 0.107
2.194GlnIle: 2.194 ± 0.28
2.398GlnLys: 2.398 ± 0.282
3.164GlnLeu: 3.164 ± 0.31
0.868GlnMet: 0.868 ± 0.144
1.761GlnAsn: 1.761 ± 0.231
0.842GlnPro: 0.842 ± 0.184
1.965GlnGln: 1.965 ± 0.377
1.301GlnArg: 1.301 ± 0.181
2.449GlnSer: 2.449 ± 0.239
1.582GlnThr: 1.582 ± 0.201
2.424GlnVal: 2.424 ± 0.254
0.281GlnTrp: 0.281 ± 0.087
1.556GlnTyr: 1.556 ± 0.203
0.0GlnXaa: 0.0 ± 0.0
Arg
1.684ArgAla: 1.684 ± 0.241
0.306ArgCys: 0.306 ± 0.097
2.551ArgAsp: 2.551 ± 0.247
2.322ArgGlu: 2.322 ± 0.247
1.965ArgPhe: 1.965 ± 0.192
2.067ArgGly: 2.067 ± 0.302
0.612ArgHis: 0.612 ± 0.104
2.143ArgIle: 2.143 ± 0.218
3.291ArgLys: 3.291 ± 0.308
3.087ArgLeu: 3.087 ± 0.269
0.995ArgMet: 0.995 ± 0.151
2.041ArgAsn: 2.041 ± 0.227
0.791ArgPro: 0.791 ± 0.138
1.174ArgGln: 1.174 ± 0.149
1.505ArgArg: 1.505 ± 0.174
1.709ArgSer: 1.709 ± 0.206
2.22ArgThr: 2.22 ± 0.26
2.628ArgVal: 2.628 ± 0.285
0.306ArgTrp: 0.306 ± 0.096
1.48ArgTyr: 1.48 ± 0.179
0.0ArgXaa: 0.0 ± 0.0
Ser
2.603SerAla: 2.603 ± 0.273
0.408SerCys: 0.408 ± 0.097
4.338SerAsp: 4.338 ± 0.365
4.235SerGlu: 4.235 ± 0.342
3.011SerPhe: 3.011 ± 0.244
3.368SerGly: 3.368 ± 0.403
0.842SerHis: 0.842 ± 0.133
5.817SerIle: 5.817 ± 0.441
6.889SerLys: 6.889 ± 0.458
5.333SerLeu: 5.333 ± 0.37
1.148SerMet: 1.148 ± 0.142
5.817SerAsn: 5.817 ± 0.465
1.403SerPro: 1.403 ± 0.199
1.786SerGln: 1.786 ± 0.195
1.939SerArg: 1.939 ± 0.191
5.026SerSer: 5.026 ± 0.49
4.414SerThr: 4.414 ± 0.411
3.317SerVal: 3.317 ± 0.28
0.689SerTrp: 0.689 ± 0.14
3.572SerTyr: 3.572 ± 0.275
0.0SerXaa: 0.0 ± 0.0
Thr
2.934ThrAla: 2.934 ± 0.296
0.128ThrCys: 0.128 ± 0.062
4.21ThrAsp: 4.21 ± 0.346
4.414ThrGlu: 4.414 ± 0.435
2.832ThrPhe: 2.832 ± 0.287
3.291ThrGly: 3.291 ± 0.334
1.327ThrHis: 1.327 ± 0.146
5.231ThrIle: 5.231 ± 0.404
5.537ThrLys: 5.537 ± 0.365
5.664ThrLeu: 5.664 ± 0.462
1.25ThrMet: 1.25 ± 0.174
4.593ThrAsn: 4.593 ± 0.423
2.373ThrPro: 2.373 ± 0.301
2.373ThrGln: 2.373 ± 0.316
1.99ThrArg: 1.99 ± 0.279
4.082ThrSer: 4.082 ± 0.346
3.802ThrThr: 3.802 ± 0.386
4.465ThrVal: 4.465 ± 0.384
0.663ThrTrp: 0.663 ± 0.143
3.291ThrTyr: 3.291 ± 0.36
0.0ThrXaa: 0.0 ± 0.0
Val
2.934ValAla: 2.934 ± 0.296
0.663ValCys: 0.663 ± 0.132
4.873ValAsp: 4.873 ± 0.352
5.026ValGlu: 5.026 ± 0.394
2.271ValPhe: 2.271 ± 0.21
3.164ValGly: 3.164 ± 0.278
0.944ValHis: 0.944 ± 0.165
4.644ValIle: 4.644 ± 0.316
5.792ValLys: 5.792 ± 0.377
5.562ValLeu: 5.562 ± 0.4
1.352ValMet: 1.352 ± 0.205
4.21ValAsn: 4.21 ± 0.381
1.684ValPro: 1.684 ± 0.212
1.939ValGln: 1.939 ± 0.238
1.863ValArg: 1.863 ± 0.219
5.282ValSer: 5.282 ± 0.427
4.082ValThr: 4.082 ± 0.323
3.802ValVal: 3.802 ± 0.349
0.485ValTrp: 0.485 ± 0.105
3.317ValTyr: 3.317 ± 0.29
0.0ValXaa: 0.0 ± 0.0
Trp
0.434TrpAla: 0.434 ± 0.087
0.026TrpCys: 0.026 ± 0.022
0.638TrpAsp: 0.638 ± 0.119
0.842TrpGlu: 0.842 ± 0.141
0.357TrpPhe: 0.357 ± 0.085
0.612TrpGly: 0.612 ± 0.141
0.051TrpHis: 0.051 ± 0.035
0.587TrpIle: 0.587 ± 0.106
0.816TrpLys: 0.816 ± 0.144
0.663TrpLeu: 0.663 ± 0.134
0.153TrpMet: 0.153 ± 0.071
0.485TrpAsn: 0.485 ± 0.102
0.0TrpPro: 0.0 ± 0.0
0.204TrpGln: 0.204 ± 0.078
0.153TrpArg: 0.153 ± 0.068
0.459TrpSer: 0.459 ± 0.117
0.434TrpThr: 0.434 ± 0.102
0.689TrpVal: 0.689 ± 0.129
0.204TrpTrp: 0.204 ± 0.076
0.612TrpTyr: 0.612 ± 0.138
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.041TyrAla: 2.041 ± 0.227
0.459TyrCys: 0.459 ± 0.115
3.674TyrAsp: 3.674 ± 0.307
3.138TyrGlu: 3.138 ± 0.307
1.658TyrPhe: 1.658 ± 0.163
2.883TyrGly: 2.883 ± 0.288
0.842TyrHis: 0.842 ± 0.146
4.618TyrIle: 4.618 ± 0.415
4.95TyrLys: 4.95 ± 0.383
4.644TyrLeu: 4.644 ± 0.356
1.199TyrMet: 1.199 ± 0.157
4.516TyrAsn: 4.516 ± 0.36
1.352TyrPro: 1.352 ± 0.226
1.684TyrGln: 1.684 ± 0.217
1.863TyrArg: 1.863 ± 0.186
3.215TyrSer: 3.215 ± 0.288
4.057TyrThr: 4.057 ± 0.397
3.393TyrVal: 3.393 ± 0.294
0.51TyrTrp: 0.51 ± 0.121
3.113TyrTyr: 3.113 ± 0.315
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 186 proteins (39194 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski