Amino acid dipeptide frequency for Rana esculenta virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
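The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the procedure used to build the table: it assumes dipeptides are counted as overlapping residue pairs within each protein, that frequencies are normalized by the total number of counted pairs, and that "expected" means the uniform value of 2.5‰. The names `dipeptide_per_mille`, `classify`, and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400      # 0.25% = 2.5 per mille for 400 dipeptides


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: any residue outside the 20 standard ones is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    # Report all 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Toy input, not real proteome data.
toy = ["MAAAK", "MAXK"]
freqs = dipeptide_per_mille(toy)
print(len(freqs), round(freqs["AA"], 3), classify(freqs["AA"]))
```

Under this rule, a table value such as AlaAla at 10.502‰ would be labeled "over" and TrpTrp at 0.168‰ would be labeled "under".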

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
10.502AlaAla: 10.502 ± 1.106
1.548AlaCys: 1.548 ± 0.232
4.207AlaAsp: 4.207 ± 0.379
5.722AlaGlu: 5.722 ± 0.577
2.861AlaPhe: 2.861 ± 0.319
7.035AlaGly: 7.035 ± 0.625
2.02AlaHis: 2.02 ± 0.256
2.289AlaIle: 2.289 ± 0.25
4.443AlaLys: 4.443 ± 0.435
6.968AlaLeu: 6.968 ± 0.599
3.097AlaMet: 3.097 ± 0.358
1.717AlaAsn: 1.717 ± 0.243
5.419AlaPro: 5.419 ± 1.06
2.625AlaGln: 2.625 ± 0.37
5.083AlaArg: 5.083 ± 0.499
6.294AlaSer: 6.294 ± 0.614
4.443AlaThr: 4.443 ± 0.408
8.752AlaVal: 8.752 ± 0.617
1.346AlaTrp: 1.346 ± 0.26
2.491AlaTyr: 2.491 ± 0.264
0.0AlaXaa: 0.0 ± 0.0
Cys
1.851CysAla: 1.851 ± 0.284
0.64CysCys: 0.64 ± 0.149
1.245CysAsp: 1.245 ± 0.188
1.111CysGlu: 1.111 ± 0.17
0.505CysPhe: 0.505 ± 0.16
1.582CysGly: 1.582 ± 0.243
0.539CysHis: 0.539 ± 0.15
0.707CysIle: 0.707 ± 0.167
1.38CysLys: 1.38 ± 0.235
1.346CysLeu: 1.346 ± 0.295
0.673CysMet: 0.673 ± 0.199
0.606CysAsn: 0.606 ± 0.15
1.447CysPro: 1.447 ± 0.304
0.572CysGln: 0.572 ± 0.154
1.515CysArg: 1.515 ± 0.262
1.616CysSer: 1.616 ± 0.21
0.841CysThr: 0.841 ± 0.183
1.582CysVal: 1.582 ± 0.259
0.505CysTrp: 0.505 ± 0.122
0.539CysTyr: 0.539 ± 0.145
0.0CysXaa: 0.0 ± 0.0
Asp
5.386AspAla: 5.386 ± 0.474
1.346AspCys: 1.346 ± 0.204
3.164AspAsp: 3.164 ± 0.371
3.164AspGlu: 3.164 ± 0.384
1.75AspPhe: 1.75 ± 0.294
4.578AspGly: 4.578 ± 0.458
0.976AspHis: 0.976 ± 0.149
2.188AspIle: 2.188 ± 0.333
2.659AspLys: 2.659 ± 0.3
5.217AspLeu: 5.217 ± 0.474
1.919AspMet: 1.919 ± 0.253
2.02AspAsn: 2.02 ± 0.356
4.679AspPro: 4.679 ± 0.545
1.481AspGln: 1.481 ± 0.224
3.938AspArg: 3.938 ± 0.428
4.578AspSer: 4.578 ± 0.458
2.424AspThr: 2.424 ± 0.318
4.813AspVal: 4.813 ± 0.412
0.774AspTrp: 0.774 ± 0.171
2.356AspTyr: 2.356 ± 0.289
0.0AspXaa: 0.0 ± 0.0
Glu
5.991GluAla: 5.991 ± 0.433
1.447GluCys: 1.447 ± 0.277
3.837GluAsp: 3.837 ± 0.461
3.433GluGlu: 3.433 ± 0.483
1.851GluPhe: 1.851 ± 0.256
3.804GluGly: 3.804 ± 0.413
0.808GluHis: 0.808 ± 0.162
1.784GluIle: 1.784 ± 0.207
3.097GluLys: 3.097 ± 0.341
3.299GluLeu: 3.299 ± 0.301
2.154GluMet: 2.154 ± 0.261
1.144GluAsn: 1.144 ± 0.167
2.996GluPro: 2.996 ± 0.366
1.986GluGln: 1.986 ± 0.543
3.972GluArg: 3.972 ± 0.441
3.602GluSer: 3.602 ± 0.423
3.804GluThr: 3.804 ± 0.382
3.534GluVal: 3.534 ± 0.4
1.245GluTrp: 1.245 ± 0.22
2.188GluTyr: 2.188 ± 0.271
0.0GluXaa: 0.0 ± 0.0
Phe
2.659PheAla: 2.659 ± 0.344
0.673PheCys: 0.673 ± 0.169
1.481PheAsp: 1.481 ± 0.194
1.919PheGlu: 1.919 ± 0.23
1.279PhePhe: 1.279 ± 0.202
2.659PheGly: 2.659 ± 0.263
0.673PheHis: 0.673 ± 0.191
1.043PheIle: 1.043 ± 0.253
1.38PheLys: 1.38 ± 0.196
2.962PheLeu: 2.962 ± 0.305
0.942PheMet: 0.942 ± 0.161
1.212PheAsn: 1.212 ± 0.212
2.121PhePro: 2.121 ± 0.277
0.741PheGln: 0.741 ± 0.137
2.356PheArg: 2.356 ± 0.327
2.827PheSer: 2.827 ± 0.306
2.02PheThr: 2.02 ± 0.307
2.928PheVal: 2.928 ± 0.307
0.303PheTrp: 0.303 ± 0.097
1.077PheTyr: 1.077 ± 0.186
0.0PheXaa: 0.0 ± 0.0
Gly
5.857GlyAla: 5.857 ± 0.451
1.683GlyCys: 1.683 ± 0.289
4.376GlyAsp: 4.376 ± 0.392
3.299GlyGlu: 3.299 ± 0.263
2.895GlyPhe: 2.895 ± 0.339
5.756GlyGly: 5.756 ± 0.625
1.919GlyHis: 1.919 ± 0.326
2.255GlyIle: 2.255 ± 0.27
4.14GlyLys: 4.14 ± 0.436
5.588GlyLeu: 5.588 ± 0.492
1.952GlyMet: 1.952 ± 0.223
1.447GlyAsn: 1.447 ± 0.209
4.308GlyPro: 4.308 ± 0.461
1.885GlyGln: 1.885 ± 0.289
5.756GlyArg: 5.756 ± 0.646
5.756GlySer: 5.756 ± 0.422
4.544GlyThr: 4.544 ± 0.501
5.352GlyVal: 5.352 ± 0.531
1.38GlyTrp: 1.38 ± 0.235
2.39GlyTyr: 2.39 ± 0.277
0.0GlyXaa: 0.0 ± 0.0
His
1.616HisAla: 1.616 ± 0.212
0.37HisCys: 0.37 ± 0.119
1.144HisAsp: 1.144 ± 0.178
0.606HisGlu: 0.606 ± 0.177
0.539HisPhe: 0.539 ± 0.128
1.683HisGly: 1.683 ± 0.221
0.606HisHis: 0.606 ± 0.212
0.774HisIle: 0.774 ± 0.161
0.774HisLys: 0.774 ± 0.174
2.121HisLeu: 2.121 ± 0.29
0.606HisMet: 0.606 ± 0.159
0.673HisAsn: 0.673 ± 0.214
1.683HisPro: 1.683 ± 0.278
0.741HisGln: 0.741 ± 0.156
1.414HisArg: 1.414 ± 0.229
1.447HisSer: 1.447 ± 0.265
1.346HisThr: 1.346 ± 0.255
2.087HisVal: 2.087 ± 0.307
0.202HisTrp: 0.202 ± 0.08
0.841HisTyr: 0.841 ± 0.174
0.0HisXaa: 0.0 ± 0.0
Ile
2.222IleAla: 2.222 ± 0.314
0.572IleCys: 0.572 ± 0.14
1.919IleAsp: 1.919 ± 0.29
1.616IleGlu: 1.616 ± 0.246
1.178IlePhe: 1.178 ± 0.184
1.616IleGly: 1.616 ± 0.225
0.909IleHis: 0.909 ± 0.183
1.111IleIle: 1.111 ± 0.219
2.255IleLys: 2.255 ± 0.231
3.366IleLeu: 3.366 ± 0.333
1.178IleMet: 1.178 ± 0.177
0.942IleAsn: 0.942 ± 0.207
2.154IlePro: 2.154 ± 0.237
0.875IleGln: 0.875 ± 0.182
2.592IleArg: 2.592 ± 0.277
2.424IleSer: 2.424 ± 0.278
1.717IleThr: 1.717 ± 0.278
2.592IleVal: 2.592 ± 0.319
0.236IleTrp: 0.236 ± 0.078
1.01IleTyr: 1.01 ± 0.251
0.0IleXaa: 0.0 ± 0.0
Lys
4.443LysAla: 4.443 ± 0.579
0.942LysCys: 0.942 ± 0.203
2.794LysAsp: 2.794 ± 0.339
3.13LysGlu: 3.13 ± 0.389
1.481LysPhe: 1.481 ± 0.234
4.006LysGly: 4.006 ± 0.352
0.673LysHis: 0.673 ± 0.135
2.457LysIle: 2.457 ± 0.298
4.006LysLys: 4.006 ± 0.658
3.938LysLeu: 3.938 ± 0.428
1.885LysMet: 1.885 ± 0.246
1.75LysAsn: 1.75 ± 0.244
3.602LysPro: 3.602 ± 0.526
1.447LysGln: 1.447 ± 0.276
5.015LysArg: 5.015 ± 0.755
4.275LysSer: 4.275 ± 0.983
3.905LysThr: 3.905 ± 0.36
3.433LysVal: 3.433 ± 0.281
0.572LysTrp: 0.572 ± 0.128
1.717LysTyr: 1.717 ± 0.204
0.0LysXaa: 0.0 ± 0.0
Leu
6.328LeuAla: 6.328 ± 0.505
1.986LeuCys: 1.986 ± 0.326
5.184LeuAsp: 5.184 ± 0.431
5.083LeuGlu: 5.083 ± 0.49
2.794LeuPhe: 2.794 ± 0.294
5.588LeuGly: 5.588 ± 0.541
1.851LeuHis: 1.851 ± 0.348
2.592LeuIle: 2.592 ± 0.313
4.813LeuLys: 4.813 ± 0.353
5.924LeuLeu: 5.924 ± 0.494
2.323LeuMet: 2.323 ± 0.282
2.457LeuAsn: 2.457 ± 0.256
4.275LeuPro: 4.275 ± 0.488
1.447LeuGln: 1.447 ± 0.211
6.395LeuArg: 6.395 ± 0.608
5.958LeuSer: 5.958 ± 0.546
5.049LeuThr: 5.049 ± 0.4
5.857LeuVal: 5.857 ± 0.394
1.043LeuTrp: 1.043 ± 0.169
2.154LeuTyr: 2.154 ± 0.231
0.0LeuXaa: 0.0 ± 0.0
Met
3.332MetAla: 3.332 ± 0.411
0.841MetCys: 0.841 ± 0.224
2.188MetAsp: 2.188 ± 0.236
1.952MetGlu: 1.952 ± 0.225
1.178MetPhe: 1.178 ± 0.19
2.524MetGly: 2.524 ± 0.308
0.741MetHis: 0.741 ± 0.181
0.539MetIle: 0.539 ± 0.127
0.774MetLys: 0.774 ± 0.186
2.121MetLeu: 2.121 ± 0.287
0.808MetMet: 0.808 ± 0.211
0.438MetAsn: 0.438 ± 0.118
1.38MetPro: 1.38 ± 0.166
0.707MetGln: 0.707 ± 0.168
2.188MetArg: 2.188 ± 0.283
3.13MetSer: 3.13 ± 0.384
2.323MetThr: 2.323 ± 0.231
2.255MetVal: 2.255 ± 0.328
0.471MetTrp: 0.471 ± 0.169
0.741MetTyr: 0.741 ± 0.186
0.0MetXaa: 0.0 ± 0.0
Asn
2.39AsnAla: 2.39 ± 0.243
0.471AsnCys: 0.471 ± 0.155
0.909AsnAsp: 0.909 ± 0.167
0.808AsnGlu: 0.808 ± 0.157
0.808AsnPhe: 0.808 ± 0.159
1.75AsnGly: 1.75 ± 0.227
0.438AsnHis: 0.438 ± 0.111
1.313AsnIle: 1.313 ± 0.236
1.01AsnLys: 1.01 ± 0.205
2.895AsnLeu: 2.895 ± 0.375
1.01AsnMet: 1.01 ± 0.206
0.707AsnAsn: 0.707 ± 0.18
2.188AsnPro: 2.188 ± 0.363
0.64AsnGln: 0.64 ± 0.108
1.616AsnArg: 1.616 ± 0.218
1.717AsnSer: 1.717 ± 0.256
1.346AsnThr: 1.346 ± 0.265
2.996AsnVal: 2.996 ± 0.344
0.471AsnTrp: 0.471 ± 0.118
0.909AsnTyr: 0.909 ± 0.144
0.0AsnXaa: 0.0 ± 0.0
Pro
7.237ProAla: 7.237 ± 1.351
1.077ProCys: 1.077 ± 0.227
3.703ProAsp: 3.703 ± 0.362
4.376ProGlu: 4.376 ± 0.491
2.255ProPhe: 2.255 ± 0.309
4.376ProGly: 4.376 ± 0.509
1.784ProHis: 1.784 ± 0.253
1.986ProIle: 1.986 ± 0.25
3.635ProLys: 3.635 ± 0.62
4.006ProLeu: 4.006 ± 0.529
1.346ProMet: 1.346 ± 0.236
1.346ProAsn: 1.346 ± 0.222
4.275ProPro: 4.275 ± 0.63
1.986ProGln: 1.986 ± 0.319
3.804ProArg: 3.804 ± 0.482
4.813ProSer: 4.813 ± 0.501
3.164ProThr: 3.164 ± 0.614
7.17ProVal: 7.17 ± 1.187
1.144ProTrp: 1.144 ± 0.264
1.717ProTyr: 1.717 ± 0.254
0.0ProXaa: 0.0 ± 0.0
Gln
2.491GlnAla: 2.491 ± 0.256
0.707GlnCys: 0.707 ± 0.157
1.784GlnAsp: 1.784 ± 0.304
2.053GlnGlu: 2.053 ± 0.438
0.707GlnPhe: 0.707 ± 0.167
1.851GlnGly: 1.851 ± 0.3
0.505GlnHis: 0.505 ± 0.153
1.043GlnIle: 1.043 ± 0.177
1.279GlnLys: 1.279 ± 0.219
1.851GlnLeu: 1.851 ± 0.328
0.673GlnMet: 0.673 ± 0.163
0.707GlnAsn: 0.707 ± 0.13
1.649GlnPro: 1.649 ± 0.314
2.154GlnGln: 2.154 ± 0.823
2.154GlnArg: 2.154 ± 0.439
2.323GlnSer: 2.323 ± 0.389
1.919GlnThr: 1.919 ± 0.316
2.053GlnVal: 2.053 ± 0.337
0.337GlnTrp: 0.337 ± 0.117
0.606GlnTyr: 0.606 ± 0.129
0.0GlnXaa: 0.0 ± 0.0
Arg
5.251ArgAla: 5.251 ± 0.484
0.976ArgCys: 0.976 ± 0.199
4.611ArgAsp: 4.611 ± 0.501
4.645ArgGlu: 4.645 ± 0.419
1.885ArgPhe: 1.885 ± 0.25
5.419ArgGly: 5.419 ± 0.578
1.717ArgHis: 1.717 ± 0.256
1.952ArgIle: 1.952 ± 0.282
4.712ArgLys: 4.712 ± 0.752
6.025ArgLeu: 6.025 ± 0.432
2.289ArgMet: 2.289 ± 0.292
2.222ArgAsn: 2.222 ± 0.37
4.679ArgPro: 4.679 ± 0.603
2.154ArgGln: 2.154 ± 0.308
6.261ArgArg: 6.261 ± 0.575
3.871ArgSer: 3.871 ± 0.406
3.804ArgThr: 3.804 ± 0.441
5.789ArgVal: 5.789 ± 0.499
0.909ArgTrp: 0.909 ± 0.189
2.289ArgTyr: 2.289 ± 0.263
0.0ArgXaa: 0.0 ± 0.0
Ser
6.429SerAla: 6.429 ± 0.577
1.515SerCys: 1.515 ± 0.267
5.184SerAsp: 5.184 ± 0.377
3.837SerGlu: 3.837 ± 0.389
2.861SerPhe: 2.861 ± 0.284
5.655SerGly: 5.655 ± 0.445
1.582SerHis: 1.582 ± 0.261
2.154SerIle: 2.154 ± 0.254
3.366SerLys: 3.366 ± 0.403
6.059SerLeu: 6.059 ± 0.633
2.053SerMet: 2.053 ± 0.345
1.75SerAsn: 1.75 ± 0.227
6.328SerPro: 6.328 ± 1.194
2.053SerGln: 2.053 ± 0.271
4.073SerArg: 4.073 ± 0.439
5.924SerSer: 5.924 ± 0.728
3.198SerThr: 3.198 ± 0.324
6.395SerVal: 6.395 ± 0.609
1.279SerTrp: 1.279 ± 0.208
1.582SerTyr: 1.582 ± 0.234
0.0SerXaa: 0.0 ± 0.0
Thr
5.554ThrAla: 5.554 ± 0.537
1.077ThrCys: 1.077 ± 0.192
3.635ThrAsp: 3.635 ± 0.363
2.558ThrGlu: 2.558 ± 0.253
2.289ThrPhe: 2.289 ± 0.246
4.914ThrGly: 4.914 ± 0.474
0.64ThrHis: 0.64 ± 0.133
2.053ThrIle: 2.053 ± 0.234
2.693ThrLys: 2.693 ± 0.338
4.881ThrLeu: 4.881 ± 0.49
1.784ThrMet: 1.784 ± 0.207
1.212ThrAsn: 1.212 ± 0.19
4.207ThrPro: 4.207 ± 0.521
1.919ThrGln: 1.919 ± 0.359
3.467ThrArg: 3.467 ± 0.294
3.063ThrSer: 3.063 ± 0.415
2.491ThrThr: 2.491 ± 0.642
6.463ThrVal: 6.463 ± 0.469
0.471ThrTrp: 0.471 ± 0.175
1.212ThrTyr: 1.212 ± 0.225
0.0ThrXaa: 0.0 ± 0.0
Val
6.059ValAla: 6.059 ± 0.503
2.02ValCys: 2.02 ± 0.292
4.712ValAsp: 4.712 ± 0.378
4.308ValGlu: 4.308 ± 0.384
2.76ValPhe: 2.76 ± 0.291
4.679ValGly: 4.679 ± 0.551
2.255ValHis: 2.255 ± 0.349
2.289ValIle: 2.289 ± 0.271
6.631ValLys: 6.631 ± 1.148
7.035ValLeu: 7.035 ± 0.527
2.76ValMet: 2.76 ± 0.344
2.659ValAsn: 2.659 ± 0.314
5.083ValPro: 5.083 ± 0.56
2.289ValGln: 2.289 ± 0.315
7.136ValArg: 7.136 ± 0.601
6.429ValSer: 6.429 ± 0.514
4.881ValThr: 4.881 ± 0.529
6.934ValVal: 6.934 ± 0.648
1.111ValTrp: 1.111 ± 0.159
2.524ValTyr: 2.524 ± 0.269
0.0ValXaa: 0.0 ± 0.0
Trp
0.909TrpAla: 0.909 ± 0.234
0.303TrpCys: 0.303 ± 0.09
1.178TrpAsp: 1.178 ± 0.208
0.808TrpGlu: 0.808 ± 0.171
0.539TrpPhe: 0.539 ± 0.153
0.808TrpGly: 0.808 ± 0.154
0.236TrpHis: 0.236 ± 0.097
0.572TrpIle: 0.572 ± 0.153
0.942TrpLys: 0.942 ± 0.163
1.38TrpLeu: 1.38 ± 0.211
0.37TrpMet: 0.37 ± 0.094
0.539TrpAsn: 0.539 ± 0.152
0.673TrpPro: 0.673 ± 0.156
0.269TrpGln: 0.269 ± 0.095
0.976TrpArg: 0.976 ± 0.195
0.741TrpSer: 0.741 ± 0.141
1.481TrpThr: 1.481 ± 0.24
0.774TrpVal: 0.774 ± 0.166
0.168TrpTrp: 0.168 ± 0.072
0.539TrpTyr: 0.539 ± 0.14
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.356TyrAla: 2.356 ± 0.314
0.606TyrCys: 0.606 ± 0.15
2.154TyrAsp: 2.154 ± 0.279
1.548TyrGlu: 1.548 ± 0.234
0.841TyrPhe: 0.841 ± 0.164
2.255TyrGly: 2.255 ± 0.246
0.404TyrHis: 0.404 ± 0.104
1.38TyrIle: 1.38 ± 0.203
1.481TyrLys: 1.481 ± 0.199
2.154TyrLeu: 2.154 ± 0.255
0.774TyrMet: 0.774 ± 0.151
0.841TyrAsn: 0.841 ± 0.203
2.053TyrPro: 2.053 ± 0.241
0.875TyrGln: 0.875 ± 0.159
1.75TyrArg: 1.75 ± 0.223
2.491TyrSer: 2.491 ± 0.259
1.885TyrThr: 1.885 ± 0.254
2.794TyrVal: 2.794 ± 0.334
0.236TyrTrp: 0.236 ± 0.095
0.808TyrTyr: 0.808 ± 0.174
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 101 proteins (29710 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
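A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's actual code: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each of 100 resamples, and the reported error is taken to be the standard deviation of those estimates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter dipeptide in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille over all overlapping pairs."""
    total = sum(max(len(s) - 1, 0) for s in proteins)
    hits = sum(count_pair(s, pair) for s in proteins)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the error is the sample standard deviation of the
    bootstrap estimates; the source does not state which statistic is used.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real proteome data.
toy = ["MAAK", "MK", "MAAAAK", "MGGK"]
print(pair_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling at the protein level rather than the residue level keeps each protein's composition intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome.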

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski