Amino acid dipeptide frequency for Staphylococcus phage Terranova

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
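As a rough illustration of what such a calculation could look like, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is not the database's own code: the function name, the one-letter sequence format, and the choice not to count pairs across protein boundaries are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs overlap (positions i and i+1) and never span
    two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome in one-letter code (not real phage proteins).
toy_proteome = ["MKKLE", "MKE"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The table on this page uses three-letter codes (for example LysGlu), so a real pipeline would also need a mapping from one-letter to three-letter codes.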

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
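The arithmetic behind the expected value and the unit conversion can be sketched as follows. The snippet assumes the 20-letter alphabet (400 dipeptides) and a simple comparison against the uniform expectation; whether the page's red/blue colouring uses exactly this threshold is an assumption, and the helper names are hypothetical.

```python
STANDARD_AMINO_ACIDS = 20

# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
expected_per_mille = 1000.0 / STANDARD_AMINO_ACIDS ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(freq_per_mille, expected=expected_per_mille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# A few values taken from the table on this page.
examples = {"LysGlu": 10.611, "GlyGly": 2.795, "AlaAla": 0.025}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), classify(value))
```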

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell is frequency ± error, in ‰.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 0.025 ± 0.023 | 0.198 ± 0.072 | 1.954 ± 0.268 | 2.325 ± 0.315 | 1.385 ± 0.189 | 1.632 ± 0.207 | 0.767 ± 0.107 | 2.523 ± 0.218 | 4.229 ± 0.283 | 3.166 ± 0.286 | 0.816 ± 0.145 | 1.929 ± 0.246 | 1.014 ± 0.2 | 1.336 ± 0.245 | 1.286 ± 0.17 | 2.597 ± 0.289 | 2.622 ± 0.263 | 2.053 ± 0.242 | 0.371 ± 0.089 | 1.731 ± 0.226 | 0.0 ± 0.0 |
| Cys | 0.173 ± 0.074 | 0.099 ± 0.048 | 0.322 ± 0.085 | 0.396 ± 0.09 | 0.322 ± 0.082 | 0.569 ± 0.138 | 0.148 ± 0.064 | 0.495 ± 0.13 | 0.643 ± 0.129 | 0.544 ± 0.116 | 0.148 ± 0.064 | 0.322 ± 0.087 | 0.297 ± 0.091 | 0.198 ± 0.078 | 0.223 ± 0.066 | 0.445 ± 0.11 | 0.322 ± 0.097 | 0.371 ± 0.097 | 0.099 ± 0.046 | 0.445 ± 0.093 | 0.0 ± 0.0 |
| Asp | 2.251 ± 0.243 | 0.322 ± 0.082 | 4.13 ± 0.337 | 4.872 ± 0.346 | 3.463 ± 0.294 | 2.572 ± 0.335 | 0.643 ± 0.132 | 6.901 ± 0.44 | 7.272 ± 0.565 | 6.134 ± 0.5 | 1.781 ± 0.205 | 5.689 ± 0.392 | 1.212 ± 0.181 | 0.717 ± 0.155 | 2.251 ± 0.217 | 4.205 ± 0.373 | 4.353 ± 0.332 | 4.328 ± 0.303 | 0.618 ± 0.121 | 3.933 ± 0.306 | 0.0 ± 0.0 |
| Glu | 3.092 ± 0.297 | 0.445 ± 0.096 | 6.233 ± 0.452 | 8.731 ± 0.57 | 3.586 ± 0.277 | 4.081 ± 0.419 | 1.336 ± 0.194 | 5.862 ± 0.444 | 7.148 ± 0.439 | 7.519 ± 0.517 | 1.954 ± 0.262 | 5.54 ± 0.4 | 1.83 ± 0.239 | 3.661 ± 0.353 | 3.116 ± 0.272 | 4.848 ± 0.373 | 3.933 ± 0.35 | 5.491 ± 0.341 | 0.668 ± 0.13 | 4.18 ± 0.395 | 0.0 ± 0.0 |
| Phe | 1.187 ± 0.15 | 0.346 ± 0.088 | 2.696 ± 0.33 | 3.116 ± 0.265 | 1.583 ± 0.259 | 1.806 ± 0.209 | 0.396 ± 0.111 | 4.007 ± 0.357 | 4.625 ± 0.359 | 3.092 ± 0.295 | 0.791 ± 0.146 | 4.625 ± 0.38 | 0.816 ± 0.112 | 0.89 ± 0.146 | 1.113 ± 0.168 | 2.869 ± 0.253 | 2.449 ± 0.269 | 2.77 ± 0.3 | 0.247 ± 0.085 | 2.646 ± 0.313 | 0.0 ± 0.0 |
| Gly | 1.781 ± 0.224 | 0.346 ± 0.089 | 3.29 ± 0.316 | 3.388 ± 0.326 | 1.855 ± 0.209 | 2.795 ± 0.299 | 0.816 ± 0.141 | 4.106 ± 0.358 | 4.872 ± 0.404 | 4.576 ± 0.317 | 1.36 ± 0.167 | 3.512 ± 0.313 | 0.0 ± 0.0 | 1.509 ± 0.205 | 1.781 ± 0.215 | 2.82 ± 0.31 | 3.586 ± 0.354 | 3.636 ± 0.31 | 0.42 ± 0.082 | 3.562 ± 0.25 | 0.0 ± 0.0 |
| His | 0.495 ± 0.132 | 0.247 ± 0.08 | 0.841 ± 0.13 | 0.841 ± 0.152 | 0.767 ± 0.154 | 1.064 ± 0.164 | 0.346 ± 0.109 | 1.707 ± 0.226 | 1.385 ± 0.222 | 1.632 ± 0.245 | 0.297 ± 0.082 | 1.286 ± 0.236 | 0.643 ± 0.12 | 0.445 ± 0.101 | 0.618 ± 0.107 | 0.693 ± 0.137 | 0.767 ± 0.123 | 0.989 ± 0.131 | 0.124 ± 0.074 | 0.989 ± 0.17 | 0.0 ± 0.0 |
| Ile | 2.844 ± 0.327 | 0.643 ± 0.127 | 6.604 ± 0.428 | 7.024 ± 0.415 | 2.745 ± 0.274 | 3.512 ± 0.279 | 1.385 ± 0.223 | 6.208 ± 0.479 | 8.583 ± 0.461 | 6.208 ± 0.402 | 1.435 ± 0.18 | 6.01 ± 0.51 | 2.028 ± 0.239 | 2.424 ± 0.246 | 2.597 ± 0.262 | 5.441 ± 0.386 | 4.774 ± 0.343 | 4.848 ± 0.385 | 0.445 ± 0.11 | 3.487 ± 0.324 | 0.0 ± 0.0 |
| Lys | 3.586 ± 0.305 | 0.668 ± 0.141 | 7.494 ± 0.498 | 10.611 ± 0.735 | 3.24 ± 0.291 | 5.491 ± 0.383 | 1.88 ± 0.222 | 5.565 ± 0.396 | 9.374 ± 0.507 | 7.568 ± 0.439 | 2.177 ± 0.206 | 6.975 ± 0.359 | 2.622 ± 0.278 | 4.13 ± 0.413 | 3.982 ± 0.318 | 5.986 ± 0.414 | 4.675 ± 0.347 | 6.752 ± 0.394 | 0.915 ± 0.147 | 5.268 ± 0.374 | 0.0 ± 0.0 |
| Leu | 2.671 ± 0.296 | 0.346 ± 0.114 | 6.233 ± 0.451 | 7.643 ± 0.551 | 3.116 ± 0.315 | 4.477 ± 0.301 | 1.459 ± 0.176 | 6.183 ± 0.432 | 7.766 ± 0.501 | 6.48 ± 0.529 | 2.053 ± 0.202 | 5.887 ± 0.436 | 2.449 ± 0.275 | 2.646 ± 0.266 | 2.894 ± 0.272 | 6.01 ± 0.391 | 5.12 ± 0.382 | 5.021 ± 0.355 | 0.495 ± 0.113 | 3.858 ± 0.304 | 0.0 ± 0.0 |
| Met | 1.286 ± 0.166 | 0.099 ± 0.057 | 1.039 ± 0.188 | 1.781 ± 0.21 | 0.915 ± 0.171 | 0.693 ± 0.116 | 0.099 ± 0.051 | 1.608 ± 0.266 | 2.226 ± 0.221 | 1.781 ± 0.224 | 0.495 ± 0.118 | 1.583 ± 0.19 | 0.594 ± 0.094 | 0.791 ± 0.149 | 1.385 ± 0.212 | 1.657 ± 0.185 | 1.336 ± 0.171 | 1.088 ± 0.155 | 0.074 ± 0.043 | 0.965 ± 0.156 | 0.0 ± 0.0 |
| Asn | 2.424 ± 0.313 | 0.445 ± 0.125 | 4.403 ± 0.286 | 5.021 ± 0.352 | 3.487 ± 0.295 | 4.032 ± 0.348 | 1.36 ± 0.253 | 6.233 ± 0.465 | 8.137 ± 0.446 | 5.318 ± 0.452 | 1.459 ± 0.236 | 6.604 ± 0.673 | 2.671 ± 0.294 | 2.35 ± 0.277 | 2.449 ± 0.251 | 4.774 ± 0.407 | 4.452 ± 0.384 | 4.13 ± 0.35 | 0.742 ± 0.129 | 3.883 ± 0.273 | 0.0 ± 0.0 |
| Pro | 0.89 ± 0.15 | 0.099 ± 0.066 | 1.583 ± 0.213 | 2.399 ± 0.265 | 1.261 ± 0.169 | 1.064 ± 0.179 | 0.322 ± 0.094 | 2.152 ± 0.207 | 2.424 ± 0.275 | 2.177 ± 0.239 | 0.47 ± 0.104 | 1.904 ± 0.232 | 0.668 ± 0.125 | 1.162 ± 0.318 | 0.693 ± 0.158 | 1.583 ± 0.22 | 1.756 ± 0.226 | 1.459 ± 0.191 | 0.124 ± 0.055 | 1.509 ± 0.264 | 0.0 ± 0.0 |
| Gln | 1.707 ± 0.207 | 0.124 ± 0.064 | 1.929 ± 0.189 | 3.067 ± 0.343 | 1.36 ± 0.161 | 1.979 ± 0.234 | 0.643 ± 0.133 | 2.201 ± 0.262 | 2.721 ± 0.295 | 2.745 ± 0.29 | 0.643 ± 0.116 | 1.632 ± 0.249 | 1.014 ± 0.371 | 2.275 ± 0.701 | 1.286 ± 0.183 | 2.028 ± 0.229 | 1.682 ± 0.216 | 2.424 ± 0.275 | 0.272 ± 0.078 | 1.682 ± 0.173 | 0.0 ± 0.0 |
| Arg | 1.484 ± 0.183 | 0.297 ± 0.101 | 2.622 ± 0.263 | 3.116 ± 0.273 | 1.657 ± 0.192 | 1.632 ± 0.23 | 0.544 ± 0.108 | 2.548 ± 0.238 | 3.29 ± 0.336 | 3.116 ± 0.292 | 0.841 ± 0.142 | 2.078 ± 0.199 | 1.162 ± 0.177 | 1.261 ± 0.209 | 1.707 ± 0.199 | 1.41 ± 0.166 | 2.053 ± 0.232 | 2.795 ± 0.355 | 0.346 ± 0.105 | 2.053 ± 0.221 | 0.0 ± 0.0 |
| Ser | 1.954 ± 0.23 | 0.297 ± 0.077 | 4.18 ± 0.317 | 4.6 ± 0.434 | 3.042 ± 0.29 | 3.438 ± 0.279 | 0.915 ± 0.14 | 5.664 ± 0.386 | 6.579 ± 0.483 | 5.169 ± 0.37 | 1.162 ± 0.185 | 4.699 ± 0.404 | 1.484 ± 0.223 | 1.756 ± 0.26 | 2.152 ± 0.165 | 4.526 ± 0.379 | 3.834 ± 0.394 | 3.784 ± 0.297 | 0.544 ± 0.11 | 3.784 ± 0.319 | 0.0 ± 0.0 |
| Thr | 1.929 ± 0.257 | 0.346 ± 0.079 | 3.463 ± 0.361 | 4.625 ± 0.445 | 2.572 ± 0.298 | 3.116 ± 0.248 | 1.088 ± 0.159 | 4.947 ± 0.371 | 5.441 ± 0.408 | 5.342 ± 0.398 | 1.039 ± 0.171 | 4.032 ± 0.29 | 2.028 ± 0.247 | 2.424 ± 0.266 | 2.028 ± 0.261 | 3.388 ± 0.348 | 3.883 ± 0.365 | 4.452 ± 0.422 | 0.495 ± 0.113 | 2.869 ± 0.266 | 0.0 ± 0.0 |
| Val | 2.201 ± 0.273 | 0.519 ± 0.121 | 4.155 ± 0.274 | 5.639 ± 0.391 | 2.646 ± 0.289 | 2.82 ± 0.341 | 0.89 ± 0.152 | 5.293 ± 0.376 | 6.505 ± 0.458 | 5.244 ± 0.398 | 1.237 ± 0.158 | 4.477 ± 0.322 | 1.781 ± 0.204 | 1.583 ± 0.243 | 2.597 ± 0.242 | 4.798 ± 0.369 | 4.106 ± 0.316 | 3.388 ± 0.352 | 0.594 ± 0.167 | 3.092 ± 0.292 | 0.0 ± 0.0 |
| Trp | 0.346 ± 0.087 | 0.099 ± 0.049 | 0.544 ± 0.118 | 0.742 ± 0.135 | 0.445 ± 0.108 | 0.445 ± 0.122 | 0.173 ± 0.068 | 0.618 ± 0.153 | 0.816 ± 0.158 | 0.396 ± 0.095 | 0.124 ± 0.066 | 0.742 ± 0.157 | 0.0 ± 0.0 | 0.223 ± 0.078 | 0.099 ± 0.048 | 0.495 ± 0.102 | 0.42 ± 0.104 | 0.618 ± 0.133 | 0.074 ± 0.055 | 0.618 ± 0.136 | 0.0 ± 0.0 |
| Tyr | 1.632 ± 0.206 | 0.594 ± 0.142 | 3.933 ± 0.358 | 3.314 ± 0.354 | 2.622 ± 0.218 | 2.795 ± 0.307 | 0.965 ± 0.176 | 4.452 ± 0.333 | 4.922 ± 0.36 | 4.6 ± 0.418 | 1.187 ± 0.164 | 4.848 ± 0.313 | 1.385 ± 0.209 | 1.632 ± 0.192 | 1.756 ± 0.188 | 2.993 ± 0.262 | 3.388 ± 0.348 | 3.166 ± 0.308 | 0.371 ± 0.105 | 3.067 ± 0.302 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 205 proteins (40432 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
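To make the note above more concrete, here is a minimal sketch of a protein-level bootstrap. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each resample, and that the error is the standard deviation over 100 resamples. The exact procedure used by the database is not described here, so treat the function name, the seed, and the choice of standard deviation as assumptions.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    # Count dipeptides once per protein so resampling stays cheap.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy_proteome = ["MKKLE", "MKE", "AKKA", "MLEKK"]
error = bootstrap_error(toy_proteome, "KK")
print(f"KK error: {error:.1f} per mille")
```

Resampling whole proteins keeps the dependence between neighbouring residues inside each protein intact, which is one plausible reason to bootstrap at this level rather than over individual dipeptides.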

You can download the dipeptide statistics above (along with other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski