Amino acid dipeptide frequency for Arthrobacter phage Franzy

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
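
The counting described above can be sketched in Python. This is only an illustration, not the Proteome-pI implementation: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), and it is an assumption that pairs are counted as overlapping windows that never span two proteins.

```python
from collections import Counter

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: overlapping windows of length 2, counted within each
    protein separately (hypothetical helper, not the database's code).
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille baseline."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Toy input: two short made-up sequences, not real phage proteins.
toy = ["AAAG", "AG"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, value, classify(value))
```

With real proteomes the baseline could also be derived from the observed single amino acid frequencies rather than from the uniform 1/400; the text above only states the uniform case.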

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
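
As a small worked example of the unit conversion, the snippet below takes the AlaAla value from the table (19.23‰) and converts it to a fraction and a percentage, then compares it with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala_per_mille = 19.23          # AlaAla value taken from the table
fraction = per_mille_to_fraction(ala_ala_per_mille)
percent = fraction * 100           # fraction expressed in percent
uniform = 1 / 400                  # uniform expectation, 0.25%
ratio = fraction / uniform         # enrichment relative to uniform

print(f"{fraction:.5f} {percent:.3f}% {ratio:.2f}x")
# prints: 0.01923 1.923% 7.69x
```

So AlaAla occurs roughly 7.7 times more often than a uniform distribution over 400 dipeptides would predict.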

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 19.23 ± 2.155
AlaCys: 0.828 ± 0.293
AlaAsp: 6.877 ± 0.748
AlaGlu: 9.36 ± 1.17
AlaPhe: 3.247 ± 0.663
AlaGly: 10.188 ± 0.811
AlaHis: 2.101 ± 0.411
AlaIle: 4.648 ± 0.449
AlaLys: 5.158 ± 0.526
AlaLeu: 10.315 ± 0.968
AlaMet: 2.993 ± 0.353
AlaAsn: 3.375 ± 0.561
AlaPro: 6.049 ± 0.771
AlaGln: 5.794 ± 0.516
AlaArg: 7.768 ± 0.871
AlaSer: 6.049 ± 0.614
AlaThr: 7.323 ± 1.111
AlaVal: 8.532 ± 0.749
AlaTrp: 1.337 ± 0.305
AlaTyr: 2.611 ± 0.333
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.573 ± 0.195
CysCys: 0.0 ± 0.0
CysAsp: 0.446 ± 0.178
CysGlu: 0.127 ± 0.089
CysPhe: 0.191 ± 0.091
CysGly: 0.446 ± 0.183
CysHis: 0.191 ± 0.108
CysIle: 0.382 ± 0.238
CysLys: 0.191 ± 0.111
CysLeu: 0.637 ± 0.229
CysMet: 0.064 ± 0.08
CysAsn: 0.191 ± 0.107
CysPro: 0.509 ± 0.223
CysGln: 0.191 ± 0.089
CysArg: 0.318 ± 0.158
CysSer: 0.382 ± 0.178
CysThr: 0.382 ± 0.162
CysVal: 0.573 ± 0.187
CysTrp: 0.127 ± 0.073
CysTyr: 0.191 ± 0.104
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.787 ± 0.806
AspCys: 0.318 ± 0.133
AspAsp: 4.648 ± 0.504
AspGlu: 5.731 ± 0.805
AspPhe: 1.719 ± 0.359
AspGly: 5.985 ± 0.625
AspHis: 0.891 ± 0.248
AspIle: 2.802 ± 0.427
AspLys: 2.802 ± 0.396
AspLeu: 4.075 ± 0.443
AspMet: 0.891 ± 0.217
AspAsn: 1.847 ± 0.305
AspPro: 3.375 ± 0.614
AspGln: 1.592 ± 0.258
AspArg: 3.184 ± 0.472
AspSer: 2.802 ± 0.375
AspThr: 3.184 ± 0.503
AspVal: 4.903 ± 0.536
AspTrp: 1.401 ± 0.281
AspTyr: 1.401 ± 0.288
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.304 ± 0.694
GluCys: 0.7 ± 0.237
GluAsp: 3.438 ± 0.474
GluGlu: 3.438 ± 0.497
GluPhe: 1.019 ± 0.271
GluGly: 4.648 ± 0.649
GluHis: 1.465 ± 0.311
GluIle: 3.247 ± 0.466
GluLys: 3.438 ± 0.521
GluLeu: 7.259 ± 0.77
GluMet: 2.229 ± 0.388
GluAsn: 2.292 ± 0.337
GluPro: 4.712 ± 0.485
GluGln: 2.165 ± 0.345
GluArg: 4.839 ± 0.812
GluSer: 3.056 ± 0.411
GluThr: 3.693 ± 0.503
GluVal: 3.629 ± 0.502
GluTrp: 0.637 ± 0.188
GluTyr: 2.038 ± 0.349
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.993 ± 0.419
PheCys: 0.064 ± 0.064
PheAsp: 2.101 ± 0.341
PheGlu: 2.547 ± 0.466
PhePhe: 0.446 ± 0.206
PheGly: 2.356 ± 0.411
PheHis: 0.509 ± 0.187
PheIle: 1.337 ± 0.322
PheLys: 1.273 ± 0.335
PheLeu: 1.273 ± 0.342
PheMet: 0.891 ± 0.224
PheAsn: 0.828 ± 0.238
PhePro: 1.592 ± 0.31
PheGln: 0.446 ± 0.162
PheArg: 1.719 ± 0.46
PheSer: 1.656 ± 0.442
PheThr: 2.229 ± 0.375
PheVal: 2.42 ± 0.329
PheTrp: 0.7 ± 0.229
PheTyr: 0.446 ± 0.157
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.405 ± 0.836
GlyCys: 0.255 ± 0.162
GlyAsp: 5.412 ± 0.65
GlyGlu: 4.521 ± 0.624
GlyPhe: 1.847 ± 0.327
GlyGly: 7.068 ± 1.315
GlyHis: 1.401 ± 0.256
GlyIle: 3.566 ± 0.512
GlyLys: 3.247 ± 0.453
GlyLeu: 5.794 ± 0.753
GlyMet: 1.91 ± 0.359
GlyAsn: 2.611 ± 0.486
GlyPro: 2.993 ± 0.592
GlyGln: 2.101 ± 0.402
GlyArg: 4.776 ± 0.571
GlySer: 5.412 ± 0.648
GlyThr: 5.285 ± 0.685
GlyVal: 6.431 ± 0.652
GlyTrp: 1.656 ± 0.423
GlyTyr: 2.229 ± 0.356
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.101 ± 0.385
HisCys: 0.064 ± 0.067
HisAsp: 0.764 ± 0.224
HisGlu: 1.146 ± 0.288
HisPhe: 0.446 ± 0.166
HisGly: 0.573 ± 0.177
HisHis: 0.382 ± 0.182
HisIle: 0.955 ± 0.221
HisLys: 0.509 ± 0.168
HisLeu: 1.21 ± 0.26
HisMet: 0.318 ± 0.139
HisAsn: 0.509 ± 0.178
HisPro: 0.637 ± 0.209
HisGln: 0.382 ± 0.149
HisArg: 0.891 ± 0.198
HisSer: 0.509 ± 0.163
HisThr: 1.719 ± 0.357
HisVal: 1.719 ± 0.377
HisTrp: 0.446 ± 0.152
HisTyr: 0.509 ± 0.15
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.285 ± 0.633
IleCys: 0.191 ± 0.103
IleAsp: 3.184 ± 0.51
IleGlu: 4.011 ± 0.532
IlePhe: 1.337 ± 0.306
IleGly: 3.82 ± 0.612
IleHis: 0.573 ± 0.148
IleIle: 2.165 ± 0.408
IleLys: 2.292 ± 0.429
IleLeu: 2.292 ± 0.392
IleMet: 1.273 ± 0.294
IleAsn: 1.783 ± 0.268
IlePro: 2.229 ± 0.412
IleGln: 1.082 ± 0.302
IleArg: 2.356 ± 0.336
IleSer: 2.356 ± 0.365
IleThr: 3.629 ± 0.484
IleVal: 4.776 ± 0.395
IleTrp: 0.318 ± 0.152
IleTyr: 1.465 ± 0.285
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.113 ± 0.6
LysCys: 0.318 ± 0.145
LysAsp: 1.974 ± 0.367
LysGlu: 1.847 ± 0.308
LysPhe: 1.783 ± 0.313
LysGly: 2.929 ± 0.45
LysHis: 0.955 ± 0.286
LysIle: 1.974 ± 0.403
LysLys: 1.21 ± 0.319
LysLeu: 3.948 ± 0.415
LysMet: 1.401 ± 0.271
LysAsn: 0.891 ± 0.242
LysPro: 2.547 ± 0.59
LysGln: 2.547 ± 0.563
LysArg: 3.502 ± 0.544
LysSer: 2.483 ± 0.535
LysThr: 3.184 ± 0.417
LysVal: 2.356 ± 0.387
LysTrp: 0.891 ± 0.201
LysTyr: 0.955 ± 0.213
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.042 ± 0.793
LeuCys: 0.318 ± 0.135
LeuAsp: 4.712 ± 0.536
LeuGlu: 4.075 ± 0.457
LeuPhe: 1.719 ± 0.293
LeuGly: 5.221 ± 0.924
LeuHis: 1.21 ± 0.306
LeuIle: 4.011 ± 0.752
LeuLys: 4.33 ± 0.471
LeuLeu: 7.131 ± 0.997
LeuMet: 1.91 ± 0.451
LeuAsn: 2.929 ± 0.403
LeuPro: 3.502 ± 0.508
LeuGln: 3.438 ± 0.439
LeuArg: 5.794 ± 0.698
LeuSer: 5.794 ± 0.767
LeuThr: 6.749 ± 0.559
LeuVal: 6.24 ± 0.58
LeuTrp: 0.955 ± 0.253
LeuTyr: 1.847 ± 0.281
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.865 ± 0.413
MetCys: 0.064 ± 0.054
MetAsp: 0.637 ± 0.226
MetGlu: 0.7 ± 0.26
MetPhe: 0.573 ± 0.17
MetGly: 1.719 ± 0.34
MetHis: 0.127 ± 0.077
MetIle: 1.592 ± 0.372
MetLys: 1.019 ± 0.277
MetLeu: 1.465 ± 0.307
MetMet: 0.446 ± 0.168
MetAsn: 1.019 ± 0.206
MetPro: 1.656 ± 0.435
MetGln: 1.273 ± 0.262
MetArg: 2.483 ± 0.379
MetSer: 1.337 ± 0.205
MetThr: 2.738 ± 0.46
MetVal: 1.146 ± 0.29
MetTrp: 0.255 ± 0.114
MetTyr: 0.382 ± 0.154
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.33 ± 0.403
AsnCys: 0.509 ± 0.184
AsnAsp: 2.101 ± 0.306
AsnGlu: 2.165 ± 0.418
AsnPhe: 0.637 ± 0.174
AsnGly: 2.865 ± 0.379
AsnHis: 0.318 ± 0.111
AsnIle: 1.974 ± 0.382
AsnLys: 1.656 ± 0.375
AsnLeu: 2.038 ± 0.358
AsnMet: 0.382 ± 0.132
AsnAsn: 0.955 ± 0.265
AsnPro: 2.101 ± 0.452
AsnGln: 0.891 ± 0.246
AsnArg: 1.974 ± 0.296
AsnSer: 1.337 ± 0.281
AsnThr: 2.229 ± 0.4
AsnVal: 2.165 ± 0.369
AsnTrp: 0.828 ± 0.227
AsnTyr: 0.891 ± 0.278
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.832 ± 0.873
ProCys: 0.446 ± 0.172
ProAsp: 3.693 ± 0.53
ProGlu: 5.03 ± 0.622
ProPhe: 1.21 ± 0.222
ProGly: 4.139 ± 0.594
ProHis: 0.891 ± 0.224
ProIle: 3.12 ± 0.664
ProLys: 2.165 ± 0.34
ProLeu: 3.502 ± 0.639
ProMet: 0.7 ± 0.232
ProAsn: 2.101 ± 0.386
ProPro: 2.611 ± 0.761
ProGln: 1.719 ± 0.297
ProArg: 2.993 ± 0.508
ProSer: 2.865 ± 0.519
ProThr: 3.12 ± 0.526
ProVal: 3.184 ± 0.489
ProTrp: 0.637 ± 0.195
ProTyr: 1.082 ± 0.302
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.03 ± 0.523
GlnCys: 0.255 ± 0.112
GlnAsp: 1.465 ± 0.218
GlnGlu: 1.273 ± 0.31
GlnPhe: 1.273 ± 0.407
GlnGly: 2.483 ± 0.448
GlnHis: 0.7 ± 0.199
GlnIle: 1.719 ± 0.327
GlnLys: 1.592 ± 0.36
GlnLeu: 4.967 ± 0.584
GlnMet: 0.764 ± 0.214
GlnAsn: 0.7 ± 0.178
GlnPro: 2.165 ± 0.399
GlnGln: 2.356 ± 0.354
GlnArg: 2.738 ± 0.484
GlnSer: 1.592 ± 0.377
GlnThr: 1.783 ± 0.292
GlnVal: 2.611 ± 0.409
GlnTrp: 0.509 ± 0.183
GlnTyr: 0.891 ± 0.227
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.386 ± 0.574
ArgCys: 0.446 ± 0.14
ArgAsp: 4.011 ± 0.496
ArgGlu: 4.075 ± 0.498
ArgPhe: 2.356 ± 0.363
ArgGly: 3.884 ± 0.506
ArgHis: 0.955 ± 0.317
ArgIle: 2.42 ± 0.321
ArgLys: 3.184 ± 0.408
ArgLeu: 5.285 ± 0.726
ArgMet: 1.528 ± 0.283
ArgAsn: 1.783 ± 0.43
ArgPro: 3.502 ± 0.622
ArgGln: 2.292 ± 0.319
ArgArg: 5.221 ± 0.762
ArgSer: 3.566 ± 0.453
ArgThr: 4.521 ± 0.648
ArgVal: 5.158 ± 0.611
ArgTrp: 1.401 ± 0.295
ArgTyr: 1.91 ± 0.409
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.922 ± 0.681
SerCys: 0.382 ± 0.178
SerAsp: 3.693 ± 0.489
SerGlu: 2.865 ± 0.479
SerPhe: 1.783 ± 0.406
SerGly: 4.776 ± 0.741
SerHis: 0.7 ± 0.258
SerIle: 2.292 ± 0.355
SerLys: 2.292 ± 0.313
SerLeu: 4.839 ± 0.586
SerMet: 1.273 ± 0.283
SerAsn: 1.592 ± 0.344
SerPro: 2.42 ± 0.366
SerGln: 1.082 ± 0.217
SerArg: 2.738 ± 0.387
SerSer: 2.738 ± 0.388
SerThr: 4.967 ± 0.618
SerVal: 5.221 ± 0.598
SerTrp: 0.891 ± 0.302
SerTyr: 1.847 ± 0.34
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.36 ± 1.075
ThrCys: 0.446 ± 0.173
ThrAsp: 4.011 ± 0.501
ThrGlu: 3.438 ± 0.586
ThrPhe: 2.738 ± 0.451
ThrGly: 4.967 ± 0.553
ThrHis: 1.146 ± 0.287
ThrIle: 2.865 ± 0.348
ThrLys: 2.547 ± 0.513
ThrLeu: 5.349 ± 0.582
ThrMet: 1.592 ± 0.315
ThrAsn: 1.91 ± 0.379
ThrPro: 4.903 ± 0.63
ThrGln: 2.229 ± 0.364
ThrArg: 3.502 ± 0.364
ThrSer: 3.629 ± 0.69
ThrThr: 5.03 ± 0.549
ThrVal: 6.113 ± 0.582
ThrTrp: 0.764 ± 0.244
ThrTyr: 1.91 ± 0.477
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.641 ± 0.67
ValCys: 0.255 ± 0.123
ValAsp: 6.495 ± 0.664
ValGlu: 5.922 ± 0.657
ValPhe: 2.356 ± 0.329
ValGly: 5.476 ± 0.812
ValHis: 0.509 ± 0.176
ValIle: 3.82 ± 0.436
ValLys: 2.929 ± 0.43
ValLeu: 6.558 ± 0.707
ValMet: 1.783 ± 0.302
ValAsn: 2.738 ± 0.347
ValPro: 3.948 ± 0.462
ValGln: 3.693 ± 0.453
ValArg: 4.648 ± 0.601
ValSer: 4.648 ± 0.575
ValThr: 5.158 ± 0.599
ValVal: 6.622 ± 0.896
ValTrp: 1.082 ± 0.251
ValTyr: 1.656 ± 0.319
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.91 ± 0.397
TrpCys: 0.127 ± 0.092
TrpAsp: 0.955 ± 0.23
TrpGlu: 0.637 ± 0.165
TrpPhe: 0.446 ± 0.204
TrpGly: 0.891 ± 0.254
TrpHis: 0.318 ± 0.138
TrpIle: 0.7 ± 0.213
TrpLys: 0.891 ± 0.241
TrpLeu: 1.91 ± 0.413
TrpMet: 0.382 ± 0.155
TrpAsn: 0.891 ± 0.224
TrpPro: 0.7 ± 0.221
TrpGln: 0.509 ± 0.172
TrpArg: 1.528 ± 0.367
TrpSer: 0.382 ± 0.135
TrpThr: 0.637 ± 0.215
TrpVal: 0.955 ± 0.227
TrpTrp: 0.255 ± 0.119
TrpTyr: 0.318 ± 0.158
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.674 ± 0.374
TyrCys: 0.127 ± 0.085
TyrAsp: 1.719 ± 0.327
TyrGlu: 1.337 ± 0.439
TyrPhe: 0.7 ± 0.208
TyrGly: 2.483 ± 0.422
TyrHis: 0.446 ± 0.169
TyrIle: 0.828 ± 0.196
TyrLys: 1.082 ± 0.3
TyrLeu: 1.146 ± 0.254
TyrMet: 0.7 ± 0.203
TyrAsn: 1.401 ± 0.291
TyrPro: 0.891 ± 0.316
TyrGln: 1.082 ± 0.264
TyrArg: 1.91 ± 0.397
TyrSer: 1.719 ± 0.394
TyrThr: 0.891 ± 0.224
TyrVal: 3.184 ± 0.554
TyrTrp: 0.255 ± 0.116
TyrTyr: 0.573 ± 0.191
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (15706 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
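
A protein-level bootstrap of this kind might look like the sketch below. It is an illustration under stated assumptions, not the database's actual procedure: the function names `pair_counts` and `bootstrap_error` are hypothetical, sequences are assumed to be in one-letter code, and it is an assumption that the reported ± value is the standard deviation over the 100 resampled estimates.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the spread of a dipeptide frequency (in per mille).

    Whole proteins are resampled with replacement, so the resampling
    unit is the protein, not the individual residue pair.
    Assumption: the error is the sample standard deviation of the
    resampled frequencies.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[pair] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy input: made-up sequences, not the 72 proteins of this proteome.
toy = ["AAAG", "AGAG", "GGAA", "AAAA"]
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than single dipeptides keeps pairs from the same protein together, which matters when composition differs strongly between proteins.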

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski