Amino acid dipeptide frequency for Singapore grouper iridovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random and with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
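As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 2.5‰. The function name `dipeptide_per_mille`, the one-letter alphabet, the mapping of unknown residues to `X`, and the toy sequences are assumptions made for this example; the database's exact counting and normalization procedure is not described on this page.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: any character outside the standard 20 is treated as X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those never observed.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Uniform expectation for the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400

# Made-up toy sequences, not taken from the proteome.
toy = ["MAAGPLAA", "MKAAL"]
freqs = dipeptide_per_mille(toy)
over = {dp: f for dp, f in freqs.items() if f > UNIFORM_EXPECTATION}
under = {dp: f for dp, f in freqs.items() if f < UNIFORM_EXPECTATION}
```

With real input, `over` and `under` would correspond to the dipeptides highlighted as overrepresented and underrepresented, assuming the highlighting uses the 2.5‰ threshold.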

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
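The unit conversion can be sketched in a few lines of Python. The helper names `per_mille_to_fraction` and `fold_over_uniform` are hypothetical, and comparing against the uniform expectation of 1/400 is an assumption about how over- and underrepresentation would be judged; the two input values are the AlaAla and TrpTrp entries from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def fold_over_uniform(value_per_mille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation (2.5 per mille)."""
    expected_per_mille = 1000.0 / n_dipeptides
    return value_per_mille / expected_per_mille


ala_ala = 8.053  # AlaAla, per mille, from the table
trp_trp = 0.173  # TrpTrp, per mille, from the table

ala_ala_fraction = per_mille_to_fraction(ala_ala)  # about 0.008053
ala_ala_percent = ala_ala_fraction * 100.0         # about 0.8053 %
ala_ala_fold = fold_over_uniform(ala_ala)          # above 1: more than expected
trp_trp_fold = fold_over_uniform(trp_trp)          # below 1: less than expected
```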

For more information, see the sequence space article on Wikipedia.

Residues (first and second position): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Format: dipeptide: frequency (‰) ± error

Ala
AlaAla: 8.053 ± 0.548
AlaCys: 1.667 ± 0.213
AlaAsp: 4.351 ± 0.655
AlaGlu: 5.499 ± 0.746
AlaPhe: 3.853 ± 1.39
AlaGly: 5.217 ± 0.641
AlaHis: 1.559 ± 0.187
AlaIle: 3.55 ± 0.266
AlaLys: 4.395 ± 0.392
AlaLeu: 6.754 ± 0.502
AlaMet: 1.905 ± 0.231
AlaAsn: 3.291 ± 0.331
AlaPro: 3.918 ± 0.343
AlaGln: 2.944 ± 0.283
AlaArg: 3.745 ± 0.343
AlaSer: 6.754 ± 1.523
AlaThr: 4.979 ± 0.809
AlaVal: 6.646 ± 0.433
AlaTrp: 1.147 ± 0.268
AlaTyr: 2.706 ± 0.232
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 2.251 ± 0.253
CysCys: 0.779 ± 0.138
CysAsp: 1.277 ± 0.186
CysGlu: 1.472 ± 0.194
CysPhe: 0.974 ± 0.131
CysGly: 1.862 ± 0.211
CysHis: 0.541 ± 0.102
CysIle: 0.779 ± 0.131
CysLys: 1.927 ± 0.203
CysLeu: 1.667 ± 0.208
CysMet: 0.585 ± 0.104
CysAsn: 0.866 ± 0.124
CysPro: 1.364 ± 0.172
CysGln: 0.433 ± 0.09
CysArg: 0.953 ± 0.132
CysSer: 1.364 ± 0.217
CysThr: 1.321 ± 0.189
CysVal: 2.013 ± 0.226
CysTrp: 0.433 ± 0.119
CysTyr: 0.736 ± 0.142
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.633 ± 0.351
AspCys: 1.256 ± 0.189
AspAsp: 2.598 ± 0.282
AspGlu: 3.594 ± 0.353
AspPhe: 2.273 ± 0.203
AspGly: 3.291 ± 0.265
AspHis: 0.758 ± 0.146
AspIle: 2.619 ± 0.191
AspLys: 2.728 ± 0.315
AspLeu: 4.46 ± 0.504
AspMet: 1.667 ± 0.178
AspAsn: 1.818 ± 0.19
AspPro: 2.684 ± 0.264
AspGln: 1.818 ± 0.526
AspArg: 2.619 ± 0.214
AspSer: 3.745 ± 0.449
AspThr: 2.425 ± 0.217
AspVal: 3.139 ± 0.278
AspTrp: 0.628 ± 0.15
AspTyr: 1.84 ± 0.223
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.109 ± 1.17
GluCys: 1.234 ± 0.24
GluAsp: 3.355 ± 0.389
GluGlu: 4.871 ± 0.709
GluPhe: 1.97 ± 0.25
GluGly: 3.637 ± 0.42
GluHis: 1.256 ± 0.159
GluIle: 2.944 ± 0.29
GluLys: 3.832 ± 0.428
GluLeu: 4.676 ± 0.337
GluMet: 1.97 ± 0.241
GluAsn: 2.771 ± 0.236
GluPro: 3.096 ± 0.41
GluGln: 1.862 ± 0.262
GluArg: 3.962 ± 0.555
GluSer: 3.442 ± 0.395
GluThr: 4.503 ± 0.308
GluVal: 2.901 ± 0.466
GluTrp: 0.888 ± 0.134
GluTyr: 1.992 ± 0.233
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.533 ± 0.219
PheCys: 0.671 ± 0.12
PheAsp: 1.754 ± 0.178
PheGlu: 1.818 ± 0.376
PhePhe: 1.494 ± 0.2
PheGly: 2.511 ± 0.342
PheHis: 0.671 ± 0.13
PheIle: 1.689 ± 0.21
PheLys: 2.533 ± 0.205
PheLeu: 2.901 ± 0.289
PheMet: 1.342 ± 0.165
PheAsn: 1.559 ± 0.182
PhePro: 1.84 ± 0.199
PheGln: 0.563 ± 0.104
PheArg: 1.84 ± 0.225
PheSer: 3.464 ± 0.331
PheThr: 2.706 ± 0.284
PheVal: 2.533 ± 0.22
PheTrp: 1.385 ± 0.994
PheTyr: 1.147 ± 0.166
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.546 ± 0.381
GlyCys: 0.996 ± 0.151
GlyAsp: 4.156 ± 0.458
GlyGlu: 3.832 ± 0.487
GlyPhe: 2.706 ± 0.31
GlyGly: 5.434 ± 0.833
GlyHis: 1.104 ± 0.142
GlyIle: 2.944 ± 0.263
GlyLys: 3.962 ± 0.316
GlyLeu: 5.239 ± 0.654
GlyMet: 1.559 ± 0.183
GlyAsn: 2.208 ± 0.321
GlyPro: 8.356 ± 2.061
GlyGln: 2.36 ± 0.505
GlyArg: 3.81 ± 0.645
GlySer: 4.957 ± 0.481
GlyThr: 3.853 ± 0.395
GlyVal: 4.351 ± 0.444
GlyTrp: 0.866 ± 0.144
GlyTyr: 2.078 ± 0.256
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.905 ± 0.299
HisCys: 0.476 ± 0.118
HisAsp: 0.844 ± 0.15
HisGlu: 0.996 ± 0.162
HisPhe: 0.736 ± 0.136
HisGly: 1.299 ± 0.207
HisHis: 0.628 ± 0.124
HisIle: 0.974 ± 0.166
HisLys: 1.126 ± 0.146
HisLeu: 1.818 ± 0.235
HisMet: 0.476 ± 0.12
HisAsn: 0.671 ± 0.136
HisPro: 1.45 ± 0.168
HisGln: 0.671 ± 0.116
HisArg: 0.974 ± 0.133
HisSer: 1.342 ± 0.191
HisThr: 1.364 ± 0.168
HisVal: 1.732 ± 0.211
HisTrp: 0.433 ± 0.102
HisTyr: 0.628 ± 0.098
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.399 ± 0.286
IleCys: 0.909 ± 0.153
IleAsp: 2.381 ± 0.238
IleGlu: 2.208 ± 0.281
IlePhe: 2.013 ± 0.244
IleGly: 2.403 ± 0.244
IleHis: 1.082 ± 0.159
IleIle: 2.23 ± 0.256
IleLys: 3.182 ± 0.367
IleLeu: 4.005 ± 0.294
IleMet: 1.515 ± 0.185
IleAsn: 2.23 ± 0.209
IlePro: 3.074 ± 0.261
IleGln: 1.061 ± 0.172
IleArg: 2.576 ± 0.24
IleSer: 3.507 ± 0.263
IleThr: 3.074 ± 0.283
IleVal: 3.962 ± 0.278
IleTrp: 0.649 ± 0.13
IleTyr: 1.277 ± 0.184
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.217 ± 1.257
LysCys: 1.624 ± 0.239
LysAsp: 3.269 ± 0.294
LysGlu: 2.966 ± 0.298
LysPhe: 1.754 ± 0.209
LysGly: 3.572 ± 0.396
LysHis: 1.407 ± 0.163
LysIle: 3.745 ± 0.26
LysLys: 3.226 ± 0.327
LysLeu: 5.477 ± 0.455
LysMet: 2.1 ± 0.201
LysAsn: 2.987 ± 0.299
LysPro: 2.901 ± 0.35
LysGln: 1.97 ± 0.294
LysArg: 3.853 ± 0.4
LysSer: 4.156 ± 0.417
LysThr: 4.654 ± 0.383
LysVal: 2.36 ± 0.271
LysTrp: 1.061 ± 0.157
LysTyr: 2.23 ± 0.24
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.43 ± 0.442
LeuCys: 1.883 ± 0.271
LeuAsp: 4.005 ± 0.349
LeuGlu: 4.763 ± 0.46
LeuPhe: 2.987 ± 0.335
LeuGly: 4.957 ± 0.928
LeuHis: 1.602 ± 0.19
LeuIle: 3.832 ± 0.267
LeuLys: 5.564 ± 0.541
LeuLeu: 7.166 ± 0.587
LeuMet: 2.273 ± 0.219
LeuAsn: 3.637 ± 0.282
LeuPro: 4.2 ± 0.399
LeuGln: 2.446 ± 0.286
LeuArg: 5.044 ± 0.431
LeuSer: 5.672 ± 0.357
LeuThr: 6.148 ± 0.416
LeuVal: 5.715 ± 0.423
LeuTrp: 1.082 ± 0.316
LeuTyr: 2.273 ± 0.211
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.879 ± 0.238
MetCys: 0.779 ± 0.132
MetAsp: 1.537 ± 0.2
MetGlu: 1.818 ± 0.226
MetPhe: 0.931 ± 0.138
MetGly: 2.122 ± 0.267
MetHis: 0.628 ± 0.109
MetIle: 1.017 ± 0.131
MetLys: 1.342 ± 0.186
MetLeu: 2.381 ± 0.209
MetMet: 0.801 ± 0.149
MetAsn: 0.779 ± 0.126
MetPro: 0.974 ± 0.146
MetGln: 0.758 ± 0.119
MetArg: 1.494 ± 0.182
MetSer: 2.078 ± 0.21
MetThr: 2.316 ± 0.257
MetVal: 1.689 ± 0.177
MetTrp: 0.433 ± 0.114
MetTyr: 1.061 ± 0.153
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.641 ± 0.271
AsnCys: 1.321 ± 0.198
AsnAsp: 1.883 ± 0.183
AsnGlu: 1.97 ± 0.251
AsnPhe: 1.537 ± 0.163
AsnGly: 2.987 ± 0.342
AsnHis: 0.801 ± 0.153
AsnIle: 2.035 ± 0.269
AsnLys: 1.927 ± 0.24
AsnLeu: 3.355 ± 0.309
AsnMet: 1.104 ± 0.172
AsnAsn: 2.035 ± 0.231
AsnPro: 2.598 ± 0.353
AsnGln: 0.844 ± 0.161
AsnArg: 2.1 ± 0.222
AsnSer: 2.706 ± 0.292
AsnThr: 2.078 ± 0.285
AsnVal: 3.507 ± 0.307
AsnTrp: 0.476 ± 0.101
AsnTyr: 1.407 ± 0.186
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 5.434 ± 0.508
ProCys: 1.061 ± 0.147
ProAsp: 2.468 ± 0.267
ProGlu: 4.676 ± 1.008
ProPhe: 1.775 ± 0.245
ProGly: 4.979 ± 1.11
ProHis: 1.472 ± 0.26
ProIle: 2.814 ± 0.262
ProLys: 3.832 ± 0.426
ProLeu: 3.875 ± 0.376
ProMet: 1.147 ± 0.151
ProAsn: 1.97 ± 0.204
ProPro: 5.239 ± 0.762
ProGln: 1.58 ± 0.308
ProArg: 3.226 ± 0.433
ProSer: 4.763 ± 0.575
ProThr: 3.897 ± 0.553
ProVal: 4.914 ± 0.457
ProTrp: 0.476 ± 0.109
ProTyr: 1.472 ± 0.178
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.71 ± 0.185
GlnCys: 0.563 ± 0.111
GlnAsp: 1.407 ± 0.202
GlnGlu: 2.035 ± 0.34
GlnPhe: 1.104 ± 0.172
GlnGly: 2.533 ± 0.544
GlnHis: 0.823 ± 0.138
GlnIle: 1.321 ± 0.156
GlnLys: 2.468 ± 0.917
GlnLeu: 2.533 ± 0.267
GlnMet: 1.017 ± 0.143
GlnAsn: 1.407 ± 0.21
GlnPro: 1.277 ± 0.215
GlnGln: 1.104 ± 0.167
GlnArg: 1.602 ± 0.185
GlnSer: 1.667 ± 0.196
GlnThr: 2.619 ± 0.288
GlnVal: 1.126 ± 0.142
GlnTrp: 0.498 ± 0.1
GlnTyr: 1.082 ± 0.158
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.178 ± 0.439
ArgCys: 0.953 ± 0.136
ArgAsp: 3.009 ± 0.405
ArgGlu: 3.031 ± 0.314
ArgPhe: 1.905 ± 0.244
ArgGly: 5.109 ± 0.919
ArgHis: 1.104 ± 0.155
ArgIle: 2.619 ± 0.237
ArgLys: 3.355 ± 0.446
ArgLeu: 4.828 ± 0.393
ArgMet: 1.732 ± 0.169
ArgAsn: 1.97 ± 0.223
ArgPro: 3.139 ± 0.355
ArgGln: 2.143 ± 0.259
ArgArg: 4.113 ± 0.472
ArgSer: 3.096 ± 0.286
ArgThr: 2.814 ± 0.275
ArgVal: 3.485 ± 0.327
ArgTrp: 0.649 ± 0.133
ArgTyr: 1.71 ± 0.206
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.949 ± 1.193
SerCys: 1.775 ± 0.238
SerAsp: 3.55 ± 0.317
SerGlu: 3.788 ± 0.335
SerPhe: 2.663 ± 0.274
SerGly: 5.455 ± 1.133
SerHis: 1.58 ± 0.218
SerIle: 2.901 ± 0.273
SerLys: 3.399 ± 0.406
SerLeu: 5.823 ± 0.427
SerMet: 1.775 ± 0.184
SerAsn: 2.554 ± 0.25
SerPro: 4.763 ± 0.81
SerGln: 1.84 ± 0.234
SerArg: 3.55 ± 0.343
SerSer: 7.339 ± 0.949
SerThr: 3.507 ± 0.382
SerVal: 6.776 ± 1.052
SerTrp: 0.953 ± 0.273
SerTyr: 2.122 ± 0.199
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.607 ± 0.36
ThrCys: 1.754 ± 0.243
ThrAsp: 3.204 ± 0.309
ThrGlu: 3.983 ± 0.693
ThrPhe: 2.23 ± 0.216
ThrGly: 4.741 ± 0.532
ThrHis: 1.407 ± 0.177
ThrIle: 3.247 ± 0.256
ThrLys: 3.334 ± 0.279
ThrLeu: 4.914 ± 0.378
ThrMet: 1.689 ± 0.231
ThrAsn: 2.035 ± 0.239
ThrPro: 4.005 ± 0.459
ThrGln: 1.992 ± 0.303
ThrArg: 3.485 ± 0.32
ThrSer: 3.962 ± 0.44
ThrThr: 3.031 ± 0.291
ThrVal: 4.979 ± 0.386
ThrTrp: 0.931 ± 0.16
ThrTyr: 2.381 ± 0.278
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.953 ± 0.92
ValCys: 2.186 ± 0.266
ValAsp: 2.987 ± 0.266
ValGlu: 4.178 ± 0.419
ValPhe: 2.641 ± 0.232
ValGly: 3.983 ± 0.46
ValHis: 1.407 ± 0.194
ValIle: 3.485 ± 0.296
ValLys: 5.196 ± 0.391
ValLeu: 5.997 ± 0.362
ValMet: 1.97 ± 0.201
ValAsn: 2.749 ± 0.258
ValPro: 3.637 ± 0.373
ValGln: 2.23 ± 0.234
ValArg: 3.81 ± 0.304
ValSer: 5.282 ± 0.396
ValThr: 4.676 ± 0.38
ValVal: 5.261 ± 0.439
ValTrp: 0.931 ± 0.166
ValTyr: 2.576 ± 0.23
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.234 ± 0.188
TrpCys: 0.606 ± 0.145
TrpAsp: 0.498 ± 0.107
TrpGlu: 0.866 ± 0.149
TrpPhe: 0.476 ± 0.112
TrpGly: 0.844 ± 0.253
TrpHis: 0.238 ± 0.074
TrpIle: 0.563 ± 0.099
TrpLys: 0.888 ± 0.118
TrpLeu: 1.602 ± 0.359
TrpMet: 0.26 ± 0.077
TrpAsn: 0.498 ± 0.105
TrpPro: 1.169 ± 0.419
TrpGln: 0.52 ± 0.105
TrpArg: 0.498 ± 0.089
TrpSer: 1.299 ± 0.563
TrpThr: 0.779 ± 0.132
TrpVal: 0.909 ± 0.274
TrpTrp: 0.173 ± 0.06
TrpTyr: 0.498 ± 0.11
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.273 ± 0.234
TyrCys: 1.104 ± 0.16
TyrAsp: 2.057 ± 0.288
TyrGlu: 1.927 ± 0.226
TyrPhe: 0.931 ± 0.111
TyrGly: 2.381 ± 0.253
TyrHis: 0.476 ± 0.119
TyrIle: 1.429 ± 0.171
TyrLys: 2.316 ± 0.237
TyrLeu: 2.208 ± 0.218
TyrMet: 0.909 ± 0.142
TyrAsn: 1.277 ± 0.185
TyrPro: 1.385 ± 0.177
TyrGln: 0.758 ± 0.121
TyrArg: 1.689 ± 0.209
TyrSer: 2.468 ± 0.221
TyrThr: 2.165 ± 0.243
TyrVal: 3.139 ± 0.283
TyrTrp: 0.325 ± 0.073
TyrTyr: 1.061 ± 0.155
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 162 proteins (46194 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
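To make the note concrete, here is a rough Python sketch of a protein-level bootstrap: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values serves as the error. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the error measure are all assumptions made for this example; the page does not state which estimator the database uses.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Resampling is done at the protein level: each replicate draws
    len(sequences) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the spread across replicates.
    return statistics.mean(estimates), statistics.stdev(estimates)


# Made-up toy sequences, not taken from the proteome.
toy = ["MAAGPLAA", "MKAAL", "MWWKAA"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so a single protein with an unusual composition contributes to the error as a unit.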

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski