Amino acid dipepetide frequency for Rana grylio iridovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.4AlaAla: 10.4 ± 0.872
1.711AlaCys: 1.711 ± 0.216
4.328AlaAsp: 4.328 ± 0.383
5.536AlaGlu: 5.536 ± 0.48
2.751AlaPhe: 2.751 ± 0.338
6.911AlaGly: 6.911 ± 0.63
1.677AlaHis: 1.677 ± 0.212
2.483AlaIle: 2.483 ± 0.283
4.697AlaLys: 4.697 ± 0.394
7.347AlaLeu: 7.347 ± 0.603
3.053AlaMet: 3.053 ± 0.304
1.711AlaAsn: 1.711 ± 0.254
5.334AlaPro: 5.334 ± 1.007
2.818AlaGln: 2.818 ± 0.392
4.932AlaArg: 4.932 ± 0.534
6.374AlaSer: 6.374 ± 0.682
4.898AlaThr: 4.898 ± 0.922
8.689AlaVal: 8.689 ± 0.63
1.308AlaTrp: 1.308 ± 0.206
2.717AlaTyr: 2.717 ± 0.245
0.0AlaXaa: 0.0 ± 0.0
Cys
1.845CysAla: 1.845 ± 0.259
0.772CysCys: 0.772 ± 0.175
1.174CysAsp: 1.174 ± 0.208
1.04CysGlu: 1.04 ± 0.192
0.503CysPhe: 0.503 ± 0.13
1.644CysGly: 1.644 ± 0.225
0.57CysHis: 0.57 ± 0.149
0.637CysIle: 0.637 ± 0.155
1.342CysLys: 1.342 ± 0.233
1.51CysLeu: 1.51 ± 0.264
0.637CysMet: 0.637 ± 0.154
0.671CysAsn: 0.671 ± 0.165
1.711CysPro: 1.711 ± 0.283
0.57CysGln: 0.57 ± 0.139
1.51CysArg: 1.51 ± 0.264
1.51CysSer: 1.51 ± 0.197
0.772CysThr: 0.772 ± 0.139
1.644CysVal: 1.644 ± 0.258
0.503CysTrp: 0.503 ± 0.127
0.604CysTyr: 0.604 ± 0.153
0.0CysXaa: 0.0 ± 0.0
Asp
5.267AspAla: 5.267 ± 0.416
1.443AspCys: 1.443 ± 0.203
3.388AspAsp: 3.388 ± 0.409
3.087AspGlu: 3.087 ± 0.474
1.644AspPhe: 1.644 ± 0.295
4.932AspGly: 4.932 ± 0.492
1.006AspHis: 1.006 ± 0.163
2.449AspIle: 2.449 ± 0.273
2.55AspLys: 2.55 ± 0.27
5.334AspLeu: 5.334 ± 0.476
2.114AspMet: 2.114 ± 0.273
1.912AspAsn: 1.912 ± 0.321
4.596AspPro: 4.596 ± 0.516
1.543AspGln: 1.543 ± 0.261
3.959AspArg: 3.959 ± 0.369
4.73AspSer: 4.73 ± 0.456
2.516AspThr: 2.516 ± 0.339
4.999AspVal: 4.999 ± 0.354
0.839AspTrp: 0.839 ± 0.189
2.281AspTyr: 2.281 ± 0.269
0.0AspXaa: 0.0 ± 0.0
Glu
5.804GluAla: 5.804 ± 0.484
1.376GluCys: 1.376 ± 0.24
3.59GluAsp: 3.59 ± 0.455
3.791GluGlu: 3.791 ± 0.616
1.845GluPhe: 1.845 ± 0.235
3.657GluGly: 3.657 ± 0.409
0.872GluHis: 0.872 ± 0.188
1.711GluIle: 1.711 ± 0.213
3.087GluLys: 3.087 ± 0.392
3.456GluLeu: 3.456 ± 0.357
2.248GluMet: 2.248 ± 0.291
1.074GluAsn: 1.074 ± 0.187
3.053GluPro: 3.053 ± 0.352
1.778GluGln: 1.778 ± 0.359
3.825GluArg: 3.825 ± 0.391
3.825GluSer: 3.825 ± 0.392
3.556GluThr: 3.556 ± 0.392
3.456GluVal: 3.456 ± 0.371
1.241GluTrp: 1.241 ± 0.182
1.946GluTyr: 1.946 ± 0.301
0.0GluXaa: 0.0 ± 0.0
Phe
2.986PheAla: 2.986 ± 0.372
0.637PheCys: 0.637 ± 0.155
1.409PheAsp: 1.409 ± 0.23
1.879PheGlu: 1.879 ± 0.211
1.006PhePhe: 1.006 ± 0.212
2.55PheGly: 2.55 ± 0.307
0.637PheHis: 0.637 ± 0.155
0.973PheIle: 0.973 ± 0.166
1.376PheLys: 1.376 ± 0.187
2.818PheLeu: 2.818 ± 0.254
0.939PheMet: 0.939 ± 0.152
1.208PheAsn: 1.208 ± 0.205
1.879PhePro: 1.879 ± 0.283
0.671PheGln: 0.671 ± 0.141
2.281PheArg: 2.281 ± 0.312
2.583PheSer: 2.583 ± 0.296
1.979PheThr: 1.979 ± 0.318
2.785PheVal: 2.785 ± 0.305
0.302PheTrp: 0.302 ± 0.091
1.006PheTyr: 1.006 ± 0.182
0.0PheXaa: 0.0 ± 0.0
Gly
5.871GlyAla: 5.871 ± 0.498
1.711GlyCys: 1.711 ± 0.258
4.73GlyAsp: 4.73 ± 0.512
3.288GlyGlu: 3.288 ± 0.336
2.684GlyPhe: 2.684 ± 0.29
5.703GlyGly: 5.703 ± 0.527
2.046GlyHis: 2.046 ± 0.36
2.483GlyIle: 2.483 ± 0.322
4.328GlyLys: 4.328 ± 0.407
5.536GlyLeu: 5.536 ± 0.525
1.979GlyMet: 1.979 ± 0.249
1.376GlyAsn: 1.376 ± 0.244
5.636GlyPro: 5.636 ± 0.969
1.912GlyGln: 1.912 ± 0.288
5.636GlyArg: 5.636 ± 0.64
5.703GlySer: 5.703 ± 0.413
4.462GlyThr: 4.462 ± 0.474
5.737GlyVal: 5.737 ± 0.483
1.376GlyTrp: 1.376 ± 0.213
2.382GlyTyr: 2.382 ± 0.264
0.0GlyXaa: 0.0 ± 0.0
His
1.812HisAla: 1.812 ± 0.21
0.335HisCys: 0.335 ± 0.1
1.141HisAsp: 1.141 ± 0.182
0.637HisGlu: 0.637 ± 0.156
0.47HisPhe: 0.47 ± 0.13
1.946HisGly: 1.946 ± 0.246
0.604HisHis: 0.604 ± 0.186
0.872HisIle: 0.872 ± 0.155
0.772HisLys: 0.772 ± 0.16
2.248HisLeu: 2.248 ± 0.263
0.604HisMet: 0.604 ± 0.137
0.604HisAsn: 0.604 ± 0.163
1.51HisPro: 1.51 ± 0.276
0.738HisGln: 0.738 ± 0.165
1.677HisArg: 1.677 ± 0.32
1.308HisSer: 1.308 ± 0.233
1.308HisThr: 1.308 ± 0.224
1.979HisVal: 1.979 ± 0.269
0.235HisTrp: 0.235 ± 0.083
0.805HisTyr: 0.805 ± 0.171
0.0HisXaa: 0.0 ± 0.0
Ile
2.281IleAla: 2.281 ± 0.28
0.604IleCys: 0.604 ± 0.136
1.979IleAsp: 1.979 ± 0.282
1.577IleGlu: 1.577 ± 0.239
1.04IlePhe: 1.04 ± 0.173
1.677IleGly: 1.677 ± 0.232
0.906IleHis: 0.906 ± 0.173
0.973IleIle: 0.973 ± 0.177
2.449IleLys: 2.449 ± 0.271
3.355IleLeu: 3.355 ± 0.347
1.208IleMet: 1.208 ± 0.188
0.906IleAsn: 0.906 ± 0.194
2.147IlePro: 2.147 ± 0.234
0.839IleGln: 0.839 ± 0.179
2.65IleArg: 2.65 ± 0.282
2.281IleSer: 2.281 ± 0.262
1.543IleThr: 1.543 ± 0.264
2.919IleVal: 2.919 ± 0.313
0.268IleTrp: 0.268 ± 0.094
0.906IleTyr: 0.906 ± 0.214
0.0IleXaa: 0.0 ± 0.0
Lys
4.663LysAla: 4.663 ± 0.541
0.872LysCys: 0.872 ± 0.186
3.087LysAsp: 3.087 ± 0.402
3.087LysGlu: 3.087 ± 0.346
1.376LysPhe: 1.376 ± 0.219
4.261LysGly: 4.261 ± 0.448
0.805LysHis: 0.805 ± 0.153
2.281LysIle: 2.281 ± 0.286
3.623LysLys: 3.623 ± 0.516
3.992LysLeu: 3.992 ± 0.447
2.046LysMet: 2.046 ± 0.237
1.778LysAsn: 1.778 ± 0.232
3.288LysPro: 3.288 ± 0.493
1.51LysGln: 1.51 ± 0.26
5.133LysArg: 5.133 ± 0.709
3.959LysSer: 3.959 ± 0.766
3.925LysThr: 3.925 ± 0.387
3.489LysVal: 3.489 ± 0.307
0.537LysTrp: 0.537 ± 0.154
1.946LysTyr: 1.946 ± 0.225
0.0LysXaa: 0.0 ± 0.0
Leu
6.509LeuAla: 6.509 ± 0.514
2.114LeuCys: 2.114 ± 0.3
5.301LeuAsp: 5.301 ± 0.431
4.999LeuGlu: 4.999 ± 0.454
2.751LeuPhe: 2.751 ± 0.313
5.603LeuGly: 5.603 ± 0.483
1.778LeuHis: 1.778 ± 0.299
2.717LeuIle: 2.717 ± 0.327
4.73LeuLys: 4.73 ± 0.45
6.676LeuLeu: 6.676 ± 0.644
2.181LeuMet: 2.181 ± 0.273
2.483LeuAsn: 2.483 ± 0.266
4.328LeuPro: 4.328 ± 0.403
1.677LeuGln: 1.677 ± 0.224
6.139LeuArg: 6.139 ± 0.603
6.106LeuSer: 6.106 ± 0.468
4.898LeuThr: 4.898 ± 0.462
5.905LeuVal: 5.905 ± 0.355
1.04LeuTrp: 1.04 ± 0.18
2.248LeuTyr: 2.248 ± 0.245
0.0LeuXaa: 0.0 ± 0.0
Met
3.288MetAla: 3.288 ± 0.36
0.738MetCys: 0.738 ± 0.206
2.147MetAsp: 2.147 ± 0.23
2.013MetGlu: 2.013 ± 0.251
1.174MetPhe: 1.174 ± 0.194
2.617MetGly: 2.617 ± 0.291
0.705MetHis: 0.705 ± 0.179
0.57MetIle: 0.57 ± 0.133
0.839MetLys: 0.839 ± 0.168
2.013MetLeu: 2.013 ± 0.252
0.772MetMet: 0.772 ± 0.163
0.403MetAsn: 0.403 ± 0.128
1.443MetPro: 1.443 ± 0.222
0.805MetGln: 0.805 ± 0.188
2.114MetArg: 2.114 ± 0.263
3.12MetSer: 3.12 ± 0.412
2.114MetThr: 2.114 ± 0.23
2.281MetVal: 2.281 ± 0.329
0.637MetTrp: 0.637 ± 0.152
0.772MetTyr: 0.772 ± 0.217
0.0MetXaa: 0.0 ± 0.0
Asn
2.348AsnAla: 2.348 ± 0.297
0.503AsnCys: 0.503 ± 0.137
0.973AsnAsp: 0.973 ± 0.195
0.906AsnGlu: 0.906 ± 0.174
0.637AsnPhe: 0.637 ± 0.132
1.879AsnGly: 1.879 ± 0.203
0.369AsnHis: 0.369 ± 0.12
1.241AsnIle: 1.241 ± 0.245
0.939AsnLys: 0.939 ± 0.154
2.785AsnLeu: 2.785 ± 0.373
0.939AsnMet: 0.939 ± 0.229
0.839AsnAsn: 0.839 ± 0.222
2.382AsnPro: 2.382 ± 0.443
0.738AsnGln: 0.738 ± 0.152
1.476AsnArg: 1.476 ± 0.216
1.644AsnSer: 1.644 ± 0.195
1.376AsnThr: 1.376 ± 0.287
2.785AsnVal: 2.785 ± 0.353
0.47AsnTrp: 0.47 ± 0.113
0.939AsnTyr: 0.939 ± 0.171
0.0AsnXaa: 0.0 ± 0.0
Pro
7.515ProAla: 7.515 ± 1.506
1.074ProCys: 1.074 ± 0.201
3.858ProAsp: 3.858 ± 0.331
4.563ProGlu: 4.563 ± 0.51
2.046ProPhe: 2.046 ± 0.255
4.462ProGly: 4.462 ± 0.491
1.812ProHis: 1.812 ± 0.271
1.845ProIle: 1.845 ± 0.256
3.59ProLys: 3.59 ± 0.62
4.428ProLeu: 4.428 ± 0.503
1.308ProMet: 1.308 ± 0.166
1.543ProAsn: 1.543 ± 0.271
4.16ProPro: 4.16 ± 0.622
1.979ProGln: 1.979 ± 0.32
4.16ProArg: 4.16 ± 0.58
5.133ProSer: 5.133 ± 0.467
3.12ProThr: 3.12 ± 0.472
6.81ProVal: 6.81 ± 0.724
0.973ProTrp: 0.973 ± 0.191
1.476ProTyr: 1.476 ± 0.24
0.0ProXaa: 0.0 ± 0.0
Gln
2.416GlnAla: 2.416 ± 0.243
0.772GlnCys: 0.772 ± 0.195
1.979GlnAsp: 1.979 ± 0.284
1.979GlnGlu: 1.979 ± 0.355
0.637GlnPhe: 0.637 ± 0.14
2.013GlnGly: 2.013 ± 0.327
0.906GlnHis: 0.906 ± 0.268
1.04GlnIle: 1.04 ± 0.18
1.342GlnLys: 1.342 ± 0.238
1.778GlnLeu: 1.778 ± 0.3
0.772GlnMet: 0.772 ± 0.175
0.671GlnAsn: 0.671 ± 0.117
1.577GlnPro: 1.577 ± 0.329
1.778GlnGln: 1.778 ± 0.634
1.644GlnArg: 1.644 ± 0.211
1.946GlnSer: 1.946 ± 0.254
1.912GlnThr: 1.912 ± 0.276
2.114GlnVal: 2.114 ± 0.287
0.369GlnTrp: 0.369 ± 0.114
0.705GlnTyr: 0.705 ± 0.133
0.0GlnXaa: 0.0 ± 0.0
Arg
5.2ArgAla: 5.2 ± 0.479
1.107ArgCys: 1.107 ± 0.199
4.865ArgAsp: 4.865 ± 0.458
4.261ArgGlu: 4.261 ± 0.386
2.013ArgPhe: 2.013 ± 0.241
5.603ArgGly: 5.603 ± 0.63
1.745ArgHis: 1.745 ± 0.273
2.046ArgIle: 2.046 ± 0.286
4.73ArgLys: 4.73 ± 0.8
5.77ArgLeu: 5.77 ± 0.465
2.214ArgMet: 2.214 ± 0.284
2.181ArgAsn: 2.181 ± 0.351
4.663ArgPro: 4.663 ± 0.517
2.214ArgGln: 2.214 ± 0.319
6.039ArgArg: 6.039 ± 0.549
3.825ArgSer: 3.825 ± 0.397
3.69ArgThr: 3.69 ± 0.46
5.469ArgVal: 5.469 ± 0.452
1.006ArgTrp: 1.006 ± 0.168
2.013ArgTyr: 2.013 ± 0.215
0.0ArgXaa: 0.0 ± 0.0
Ser
6.374SerAla: 6.374 ± 0.623
1.51SerCys: 1.51 ± 0.247
4.965SerAsp: 4.965 ± 0.381
3.724SerGlu: 3.724 ± 0.376
2.852SerPhe: 2.852 ± 0.332
5.569SerGly: 5.569 ± 0.448
1.644SerHis: 1.644 ± 0.266
2.013SerIle: 2.013 ± 0.247
3.556SerLys: 3.556 ± 0.425
6.139SerLeu: 6.139 ± 0.594
1.912SerMet: 1.912 ± 0.304
1.745SerAsn: 1.745 ± 0.245
5.905SerPro: 5.905 ± 1.174
1.912SerGln: 1.912 ± 0.266
4.395SerArg: 4.395 ± 0.487
5.536SerSer: 5.536 ± 0.519
3.019SerThr: 3.019 ± 0.327
6.509SerVal: 6.509 ± 0.616
1.275SerTrp: 1.275 ± 0.208
1.745SerTyr: 1.745 ± 0.222
0.0SerXaa: 0.0 ± 0.0
Thr
5.77ThrAla: 5.77 ± 0.61
1.107ThrCys: 1.107 ± 0.167
3.489ThrAsp: 3.489 ± 0.354
2.416ThrGlu: 2.416 ± 0.305
2.214ThrPhe: 2.214 ± 0.251
5.032ThrGly: 5.032 ± 0.504
0.503ThrHis: 0.503 ± 0.118
2.013ThrIle: 2.013 ± 0.232
2.684ThrLys: 2.684 ± 0.343
4.596ThrLeu: 4.596 ± 0.499
1.912ThrMet: 1.912 ± 0.245
1.04ThrAsn: 1.04 ± 0.201
4.563ThrPro: 4.563 ± 0.655
1.51ThrGln: 1.51 ± 0.213
3.456ThrArg: 3.456 ± 0.324
2.986ThrSer: 2.986 ± 0.394
1.946ThrThr: 1.946 ± 0.401
5.972ThrVal: 5.972 ± 0.414
0.604ThrTrp: 0.604 ± 0.198
1.308ThrTyr: 1.308 ± 0.244
0.0ThrXaa: 0.0 ± 0.0
Val
6.139ValAla: 6.139 ± 0.527
1.778ValCys: 1.778 ± 0.262
4.865ValAsp: 4.865 ± 0.384
4.059ValGlu: 4.059 ± 0.438
2.919ValPhe: 2.919 ± 0.27
4.999ValGly: 4.999 ± 0.739
2.382ValHis: 2.382 ± 0.335
2.382ValIle: 2.382 ± 0.249
6.408ValLys: 6.408 ± 0.653
7.012ValLeu: 7.012 ± 0.465
2.55ValMet: 2.55 ± 0.284
2.617ValAsn: 2.617 ± 0.309
4.932ValPro: 4.932 ± 0.607
2.248ValGln: 2.248 ± 0.246
6.878ValArg: 6.878 ± 0.609
6.475ValSer: 6.475 ± 0.551
4.965ValThr: 4.965 ± 0.522
6.878ValVal: 6.878 ± 0.55
1.208ValTrp: 1.208 ± 0.174
2.281ValTyr: 2.281 ± 0.247
0.0ValXaa: 0.0 ± 0.0
Trp
0.872TrpAla: 0.872 ± 0.236
0.268TrpCys: 0.268 ± 0.083
1.074TrpAsp: 1.074 ± 0.183
0.906TrpGlu: 0.906 ± 0.19
0.537TrpPhe: 0.537 ± 0.132
0.906TrpGly: 0.906 ± 0.16
0.235TrpHis: 0.235 ± 0.093
0.503TrpIle: 0.503 ± 0.126
0.939TrpLys: 0.939 ± 0.163
1.376TrpLeu: 1.376 ± 0.196
0.436TrpMet: 0.436 ± 0.111
0.637TrpAsn: 0.637 ± 0.156
0.738TrpPro: 0.738 ± 0.145
0.268TrpGln: 0.268 ± 0.092
1.04TrpArg: 1.04 ± 0.18
0.872TrpSer: 0.872 ± 0.15
1.543TrpThr: 1.543 ± 0.261
0.872TrpVal: 0.872 ± 0.185
0.235TrpTrp: 0.235 ± 0.096
0.436TrpTyr: 0.436 ± 0.155
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.315TyrAla: 2.315 ± 0.274
0.738TyrCys: 0.738 ± 0.156
2.248TyrAsp: 2.248 ± 0.274
1.443TyrGlu: 1.443 ± 0.21
0.805TyrPhe: 0.805 ± 0.159
2.617TyrGly: 2.617 ± 0.266
0.403TyrHis: 0.403 ± 0.123
1.275TyrIle: 1.275 ± 0.165
1.677TyrLys: 1.677 ± 0.236
2.046TyrLeu: 2.046 ± 0.262
0.705TyrMet: 0.705 ± 0.164
0.772TyrAsn: 0.772 ± 0.179
2.013TyrPro: 2.013 ± 0.225
0.839TyrGln: 0.839 ± 0.135
1.711TyrArg: 1.711 ± 0.232
2.382TyrSer: 2.382 ± 0.316
1.577TyrThr: 1.577 ± 0.225
2.684TyrVal: 2.684 ± 0.31
0.268TyrTrp: 0.268 ± 0.102
0.738TyrTyr: 0.738 ± 0.177
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 106 proteins (29808 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski