Amino acid dipeptide frequency for Red sea bream iridovirus (RSIV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
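As an illustration of the idea, here is a minimal Python sketch that counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the choice of denominator (the total number of dipeptide windows) are assumptions made for this example; the database may normalise differently, for instance by the total residue count.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the number of dipeptide windows.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the RSIV proteome).
freqs = dipeptide_frequencies(["MAAL", "AAV"])
```

With the two toy sequences there are five windows (MA, AA, AL, AA, AV), so `freqs["AA"]` is 2 out of 5.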

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
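The arithmetic behind the expected value, and the comparison against it, can be sketched as follows. The helper names `classify` and `to_fraction` are hypothetical, and the plain greater-than or less-than comparison is an assumption: the page does not state the exact rule used for colouring (for example, whether the error estimate is taken into account). The four observed values are copied from the table.

```python
N_STANDARD = 20

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
expected_permille = 1000.0 / (N_STANDARD ** 2)


def to_fraction(permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return permille * 1e-3


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# A few values taken from the table (in per mille).
observed = {"AlaAla": 8.87, "LeuLeu": 8.589, "CysCys": 0.845, "TrpTrp": 0.197}
labels = {pair: classify(value) for pair, value in observed.items()}
```

Under this simple rule AlaAla and LeuLeu come out as overrepresented, while CysCys and TrpTrp are underrepresented.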

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 8.87 ± 0.658 | 2.084 ± 0.283 | 4.28 ± 0.35 | 3.633 ± 0.346 | 3.295 ± 0.316 | 3.604 ± 0.361 | 3.154 ± 0.412 | 3.689 ± 0.276 | 2.9 ± 0.284 | 7.547 ± 0.629 | 3.604 ± 0.359 | 3.267 ± 0.249 | 4.759 ± 0.603 | 3.858 ± 0.383 | 5.407 ± 0.393 | 4.252 ± 0.326 | 5.942 ± 0.399 | 7.603 ± 0.55 | 0.986 ± 0.172 | 3.604 ± 0.502 | 0.0 ± 0.0 |
| Cys | 2.675 ± 0.308 | 0.845 ± 0.135 | 2.309 ± 0.309 | 1.352 ± 0.26 | 0.732 ± 0.122 | 1.915 ± 0.262 | 1.211 ± 0.279 | 1.211 ± 0.165 | 1.38 ± 0.274 | 1.915 ± 0.305 | 1.239 ± 0.202 | 1.267 ± 0.21 | 1.718 ± 0.49 | 0.873 ± 0.14 | 1.859 ± 0.253 | 1.915 ± 0.21 | 1.971 ± 0.247 | 2.9 ± 0.304 | 0.169 ± 0.063 | 0.957 ± 0.16 | 0.0 ± 0.0 |
| Asp | 5.35 ± 0.781 | 1.661 ± 0.303 | 4.224 ± 0.401 | 2.506 ± 0.295 | 1.549 ± 0.232 | 4.196 ± 0.419 | 1.549 ± 0.212 | 4.139 ± 0.402 | 2.196 ± 0.261 | 3.267 ± 0.298 | 3.379 ± 0.291 | 2.647 ± 0.238 | 2.816 ± 0.324 | 1.521 ± 0.226 | 3.604 ± 0.267 | 3.464 ± 0.318 | 4.618 ± 0.378 | 5.913 ± 0.478 | 0.591 ± 0.122 | 2.365 ± 0.282 | 0.0 ± 0.0 |
| Glu | 3.295 ± 0.331 | 1.549 ± 0.27 | 2.816 ± 0.414 | 2.225 ± 0.526 | 0.957 ± 0.169 | 2.253 ± 0.254 | 1.633 ± 0.195 | 1.408 ± 0.185 | 0.929 ± 0.2 | 3.126 ± 0.329 | 1.464 ± 0.25 | 1.155 ± 0.189 | 2.225 ± 0.417 | 2.14 ± 0.302 | 2.985 ± 0.285 | 1.746 ± 0.242 | 2.563 ± 0.208 | 2.675 ± 0.342 | 0.676 ± 0.167 | 2.394 ± 0.259 | 0.0 ± 0.0 |
| Phe | 2.563 ± 0.263 | 0.732 ± 0.134 | 2.591 ± 0.298 | 1.859 ± 0.196 | 1.098 ± 0.162 | 1.408 ± 0.198 | 0.788 ± 0.133 | 1.352 ± 0.173 | 1.577 ± 0.218 | 1.69 ± 0.184 | 1.492 ± 0.217 | 1.38 ± 0.164 | 1.155 ± 0.168 | 0.957 ± 0.137 | 1.408 ± 0.195 | 1.887 ± 0.25 | 2.253 ± 0.269 | 2.957 ± 0.338 | 0.282 ± 0.082 | 0.76 ± 0.134 | 0.0 ± 0.0 |
| Gly | 4.421 ± 0.378 | 1.267 ± 0.191 | 3.013 ± 0.267 | 1.661 ± 0.237 | 1.774 ± 0.26 | 3.886 ± 0.351 | 2.45 ± 0.266 | 2.45 ± 0.24 | 1.887 ± 0.216 | 3.83 ± 0.34 | 2.478 ± 0.28 | 2.225 ± 0.25 | 2.929 ± 0.345 | 2.309 ± 0.317 | 3.21 ± 0.285 | 2.816 ± 0.345 | 4.111 ± 0.393 | 5.21 ± 0.35 | 0.535 ± 0.137 | 2.619 ± 0.251 | 0.0 ± 0.0 |
| His | 2.9 ± 0.288 | 1.126 ± 0.175 | 1.943 ± 0.246 | 1.211 ± 0.198 | 0.732 ± 0.135 | 2.281 ± 0.238 | 1.211 ± 0.333 | 2.703 ± 0.315 | 1.267 ± 0.171 | 2.647 ± 0.277 | 1.661 ± 0.241 | 2.056 ± 0.255 | 1.042 ± 0.192 | 0.957 ± 0.147 | 2.872 ± 0.856 | 1.943 ± 0.283 | 2.957 ± 0.293 | 3.717 ± 0.318 | 0.394 ± 0.087 | 1.126 ± 0.169 | 0.0 ± 0.0 |
| Ile | 3.802 ± 0.307 | 1.267 ± 0.174 | 3.098 ± 0.269 | 2.196 ± 0.255 | 1.239 ± 0.17 | 2.563 ± 0.268 | 1.464 ± 0.233 | 2.675 ± 0.314 | 2.112 ± 0.283 | 3.52 ± 0.363 | 2.084 ± 0.218 | 1.999 ± 0.25 | 2.225 ± 0.248 | 1.774 ± 0.201 | 2.534 ± 0.298 | 2.985 ± 0.313 | 2.591 ± 0.23 | 4.139 ± 0.41 | 0.366 ± 0.114 | 1.38 ± 0.188 | 0.0 ± 0.0 |
| Lys | 2.788 ± 0.296 | 1.042 ± 0.171 | 1.83 ± 0.201 | 1.549 ± 0.218 | 0.929 ± 0.162 | 1.661 ± 0.256 | 1.352 ± 0.165 | 1.352 ± 0.212 | 1.183 ± 0.31 | 2.816 ± 0.307 | 1.267 ± 0.21 | 1.183 ± 0.202 | 2.253 ± 0.358 | 1.774 ± 0.283 | 2.788 ± 0.361 | 1.464 ± 0.256 | 2.309 ± 0.277 | 2.563 ± 0.302 | 0.479 ± 0.103 | 1.492 ± 0.234 | 0.0 ± 0.0 |
| Leu | 7.209 ± 0.506 | 2.929 ± 0.28 | 4.59 ± 0.318 | 3.098 ± 0.306 | 2.816 ± 0.273 | 3.773 ± 0.342 | 3.126 ± 0.288 | 3.21 ± 0.353 | 2.844 ± 0.304 | 8.589 ± 0.524 | 3.295 ± 0.318 | 2.563 ± 0.244 | 4.421 ± 0.347 | 3.633 ± 0.328 | 5.604 ± 0.383 | 4.787 ± 0.339 | 5.097 ± 0.453 | 5.407 ± 0.5 | 1.155 ± 0.186 | 3.126 ± 0.301 | 0.0 ± 0.0 |
| Met | 4.562 ± 0.407 | 1.915 ± 0.272 | 2.534 ± 0.308 | 1.605 ± 0.264 | 1.323 ± 0.188 | 1.549 ± 0.202 | 1.295 ± 0.176 | 1.436 ± 0.241 | 0.901 ± 0.153 | 3.942 ± 0.348 | 1.323 ± 0.182 | 0.873 ± 0.142 | 2.731 ± 0.227 | 1.577 ± 0.249 | 2.619 ± 0.246 | 3.182 ± 0.318 | 2.788 ± 0.336 | 2.394 ± 0.226 | 0.451 ± 0.093 | 2.196 ± 0.273 | 0.0 ± 0.0 |
| Asn | 3.069 ± 0.354 | 0.648 ± 0.147 | 1.577 ± 0.219 | 1.408 ± 0.184 | 0.845 ± 0.171 | 2.534 ± 0.286 | 0.873 ± 0.179 | 2.788 ± 0.27 | 1.746 ± 0.195 | 2.844 ± 0.303 | 1.436 ± 0.217 | 1.802 ± 0.231 | 2.196 ± 0.296 | 0.76 ± 0.122 | 1.971 ± 0.271 | 2.365 ± 0.249 | 3.52 ± 0.352 | 3.576 ± 0.352 | 0.169 ± 0.07 | 1.042 ± 0.19 | 0.0 ± 0.0 |
| Pro | 3.83 ± 0.394 | 1.774 ± 0.369 | 4.646 ± 1.104 | 2.675 ± 0.653 | 1.774 ± 0.207 | 3.069 ± 0.33 | 2.056 ± 0.267 | 2.14 ± 0.284 | 1.577 ± 0.288 | 4.731 ± 0.546 | 1.802 ± 0.226 | 1.746 ± 0.211 | 4.365 ± 0.826 | 2.506 ± 0.271 | 2.647 ± 0.314 | 2.957 ± 0.301 | 3.604 ± 0.478 | 5.322 ± 0.394 | 0.535 ± 0.125 | 2.056 ± 0.296 | 0.0 ± 0.0 |
| Gln | 3.041 ± 0.335 | 1.774 ± 0.363 | 2.112 ± 0.384 | 1.408 ± 0.285 | 1.38 ± 0.2 | 1.633 ± 0.216 | 2.056 ± 0.253 | 1.014 ± 0.176 | 0.873 ± 0.146 | 3.97 ± 0.331 | 1.69 ± 0.188 | 0.817 ± 0.169 | 2.027 ± 0.283 | 2.45 ± 0.336 | 2.816 ± 0.282 | 1.971 ± 0.238 | 2.084 ± 0.334 | 2.647 ± 0.263 | 0.676 ± 0.149 | 2.112 ± 0.231 | 0.0 ± 0.0 |
| Arg | 5.041 ± 0.448 | 1.943 ± 0.244 | 3.689 ± 0.326 | 2.422 ± 0.297 | 1.774 ± 0.211 | 2.675 ± 0.29 | 2.675 ± 0.265 | 2.9 ± 0.314 | 1.605 ± 0.223 | 5.012 ± 0.369 | 2.394 ± 0.286 | 2.422 ± 0.204 | 4.449 ± 1.068 | 2.394 ± 0.241 | 5.435 ± 0.557 | 3.464 ± 0.37 | 3.717 ± 0.356 | 5.745 ± 0.392 | 0.901 ± 0.165 | 2.309 ± 0.205 | 0.0 ± 0.0 |
| Ser | 4.787 ± 0.35 | 1.69 ± 0.241 | 3.689 ± 0.299 | 1.802 ± 0.226 | 1.352 ± 0.185 | 4.055 ± 0.387 | 1.887 ± 0.233 | 2.534 ± 0.237 | 2.196 ± 0.294 | 4.787 ± 0.388 | 2.394 ± 0.259 | 1.915 ± 0.257 | 3.238 ± 0.44 | 1.718 ± 0.223 | 3.464 ± 0.308 | 3.886 ± 0.444 | 3.83 ± 0.283 | 5.181 ± 0.422 | 0.394 ± 0.092 | 1.802 ± 0.2 | 0.0 ± 0.0 |
| Thr | 6.449 ± 0.483 | 1.943 ± 0.255 | 3.914 ± 0.326 | 3.098 ± 0.405 | 2.027 ± 0.252 | 4.674 ± 0.319 | 2.422 ± 0.355 | 2.563 ± 0.265 | 2.281 ± 0.282 | 6.533 ± 0.462 | 2.76 ± 0.274 | 2.14 ± 0.261 | 4.139 ± 0.344 | 2.45 ± 0.277 | 3.323 ± 0.399 | 3.717 ± 0.372 | 5.322 ± 1.167 | 6.533 ± 0.477 | 0.901 ± 0.146 | 2.478 ± 0.276 | 0.0 ± 0.0 |
| Val | 7.575 ± 0.515 | 2.816 ± 0.311 | 5.294 ± 0.334 | 2.337 ± 0.257 | 3.041 ± 0.295 | 4.421 ± 0.335 | 3.802 ± 0.416 | 3.633 ± 0.262 | 2.619 ± 0.31 | 7.519 ± 0.583 | 3.041 ± 0.341 | 2.788 ± 0.364 | 5.378 ± 0.341 | 3.069 ± 0.385 | 5.632 ± 0.4 | 5.097 ± 0.451 | 6.195 ± 0.476 | 6.336 ± 0.543 | 0.901 ± 0.161 | 3.83 ± 0.445 | 0.0 ± 0.0 |
| Trp | 0.873 ± 0.17 | 0.451 ± 0.093 | 0.563 ± 0.141 | 0.479 ± 0.133 | 0.422 ± 0.119 | 0.535 ± 0.133 | 0.394 ± 0.122 | 0.394 ± 0.115 | 0.282 ± 0.093 | 1.098 ± 0.174 | 0.338 ± 0.099 | 0.479 ± 0.109 | 0.648 ± 0.149 | 0.845 ± 0.155 | 0.591 ± 0.118 | 0.535 ± 0.133 | 0.817 ± 0.173 | 0.62 ± 0.127 | 0.197 ± 0.083 | 0.563 ± 0.117 | 0.0 ± 0.0 |
| Tyr | 3.238 ± 0.3 | 0.957 ± 0.131 | 2.844 ± 0.322 | 1.69 ± 0.195 | 1.126 ± 0.194 | 2.394 ± 0.298 | 1.352 ± 0.607 | 2.422 ± 0.271 | 1.746 ± 0.21 | 1.887 ± 0.2 | 2.027 ± 0.268 | 2.45 ± 0.225 | 1.098 ± 0.17 | 0.986 ± 0.14 | 2.196 ± 0.228 | 2.168 ± 0.18 | 3.379 ± 0.259 | 3.886 ± 0.361 | 0.366 ± 0.1 | 1.492 ± 0.2 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 118 proteins (35513 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
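One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is only a sketch under assumptions. The function names, the fixed seed, the use of the sample standard deviation as the error, and the normalisation by the number of dipeptide windows are not stated in the source.

```python
import random
from statistics import stdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    hits = sum(
        1 for seq in proteins for i in range(len(seq) - 1) if seq[i:i + 2] == pair
    )
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates.
    return stdev(estimates)


# Toy input (hypothetical sequences, not taken from the RSIV proteome).
toy = ["AAAA", "VVVV", "AAVV"]
error = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps the dipeptides within each protein together, so the error reflects variation between proteins.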

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski